Campus Event Calendar: Sarath Kumar Kondreddi (05/06/2014 in E1 4/024)

Event Entry

What and Who

Human Computing and Crowdsourcing Methods for Knowledge Acquisition

Sarath Kumar Kondreddi

Max-Planck-Institut für Informatik - D5

Promotionskolloquium

AG 1, AG 2, AG 3, AG 4, AG 5, SWS, RG1, MMCI

Public Audience

English

Date, Time and Location

Tuesday, 6 May 2014

12:15

60 Minutes

E1 4

024

Saarbrücken

Abstract

Ambiguity, complexity, and diversity in natural language textual
expressions are major hindrances to automated knowledge extraction. As a
result, state-of-the-art methods for extracting entities and
relationships from unstructured data make incorrect extractions or
produce noise. With the advent of human computing, computationally hard
tasks have been addressed through human inputs. While text-based
knowledge acquisition can benefit from this approach, humans alone
cannot bear the burden of extracting knowledge from the vast textual
resources that exist today. Even paid crowdsourced
acquisition can quickly become prohibitively expensive.

In this thesis we present principled methods that effectively garner
human computing inputs for improving the extraction of knowledge-base
facts from natural language texts. Our methods complement automatic
extraction techniques with human computing to reap the benefits of both
while offsetting the limitations of each. We present the architecture
and implementation of HIGGINS, a system that combines an information
extraction (IE) engine with a human computing (HC) engine to produce
high-quality facts. The IE engine combines statistics derived from large
Web corpora with semantic resources like WordNet and ConceptNet to
construct a large dictionary of entity and relational phrases. It
employs specifically designed statistical language models for phrase
relatedness to come up with questions and relevant candidate answers
that are presented to human workers. Through extensive experiments we
establish the superiority of this approach in extracting
relation-centric facts from text. In our experiments we extract facts
about fictional characters in narrative text, where the issues of
diversity and complexity in expressing relations are far more
pronounced. Finally, we also demonstrate how interesting human computing
games can be designed for knowledge acquisition tasks.

Contact

Petra Schaaf

5000

Petra Schaaf, 04/28/2014 11:50 -- Created document.