MPI-INF Logo
Campus Event Calendar

Event Entry

What and Who

Challenges in deep text understanding in professional domains

Vijay Saraswat
Goldman Sachs
MPI Colloquium Series Distinguished Speaker

Vijay Saraswat is a Technology Fellow at Goldman Sachs, where he helped establishing the corporate Research & Development Engineering group, and leads the CoreAI research group. Previously, he was a member of the Research Staff at Xerox PARC, a technology consultant at AT&T Research, and a Distinguished Research Staff member and Chief Scientist at IBM TJ Watson Research Center. He has worked broadly across many areas of Computer Science, primarily in logic, programming languages, distributed systems and AI. He earned a B Tech in Electrical Engineering from IIT Kanpur in 1982 and a PhD in Computer Science from Carnegie Mellon University in 1989.
AG 1, AG 2, AG 3, INET, AG 4, AG 5, SWS, RG1, MMCI  
Public Audience
English

Date, Time and Location

Tuesday, 30 October 2018
10:00
60 Minutes
E1 4
024
Saarbrücken

Abstract

NLP research has entered a golden period. Most current work, though, is focused on “mass content” -- content from the web, social media, news sources. Here, relatively shallow meaning extraction techniques have worked reasonably well. But professional (legal, financial) text has enormously rich structure, different from mass content. Here documents such as regulations, contracts, agreements, financial prospectuses, company and analyst reports must be addressed. A contract (e.g. commercial line of credit) may involve multiple documents with amendments. References are used at multiple semantic levels, and written using genre-specific conventions. Sentences may be highly complex, spread over multiple paragraphs. Documents may contain technical terms with specialized meaning, and nested definitional scopes. They may talk of abstract roles and potential events, rather than just real-world events. We outline a long-term research agenda to computationalize such documents. We think of language processors as compilers that operate on the input at varying levels of abstraction (abstract syntax tree, intermediate representation) and use many techniques (partial evaluation, abstract interpretation) to generate meaning representations for use with reasoners. hey must combine deep learning with linguistically rigorous analyses, leveraging logical representations.

Contact

Johannes Hoffart
9325-5012
--email hidden
passcode not visible
logged in users only

Tags, Category, Keywords and additional notes

If you would like to meet with Vijay Saraswat before the talk on Monday, Oct 29, please contact Johannes Hoffart (jhoffart@mpi-inf.mpg.de) to schedule a meeting.

Petra Schaaf, 10/24/2018 13:15
Petra Schaaf, 10/24/2018 09:55 -- Created document.