MPI-INF Logo
Campus Event Calendar

Event Entry

What and Who

Exploiting Weak Supervision in NLP tasks: Application to Sentiment Summarization

Ivan Titov
University of Illinois
Talk
AG 1, AG 3, AG 4, AG 5, SWS, RG1, MMCI  
Public Audience
English

Date, Time and Location

Thursday, 12 March 2009
11:15
45 Minutes
E1 4
024
Saarbrücken

Abstract

In recent years, most of the research in structured prediction in NLP (e.g., parsing, segmentation and extraction problems) has been focused on supervised methods requiring large amounts of labeled data.


Constructing such datasets is very expensive and time consuming. However, for many tasks it is possible to obtain abundant amounts of unlabeled content annotated with labels correlated with required structured predictions. Examples of such correlated labels include titles of documents and topic tags for text segmentation, sentiment scores and helpfulness ratings for summarization. Abundance of such weakly supervised data opens an interesting line of research: designing models leveraging these labelings to tackle a wide variety of NLP problems.

In this talk I will be considering the sentiment summarization problem. I will present statistical models which exploit user generated numerical aspect ratings to discover corresponding topics and are therefore able to extract fragments of text discussing these aspects without the need of annotated data. I will also discuss implications to other NLP problems, generalization performance of the proposed methods, and important open research questions.

Contact

Conny Liegl
302-70150
--email hidden
passcode not visible
logged in users only

Conny Liegl, 03/09/2009 14:14 -- Created document.