Max-Planck-Institut für Informatik
max planck institut
mpii logo Minerva of the Max Planck Society

MPI-INF or MPI-SWS or Local Campus Event Calendar

<< Previous Entry Next Entry >> New Event Entry Edit this Entry Login to DB (to update, delete)
What and Who
Title:Efficient knowledge management for named entities from text
Speaker:Sourav Dutta
coming from:Max-Planck-Institut für Informatik - D5
Speakers Bio:
Event Type:Promotionskolloquium
Visibility:D1, D2, D3, D4, D5, SWS, RG1, MMCI
We use this to send out email in the morning.
Level:Public Audience
Date, Time and Location
Date:Thursday, 9 March 2017
Duration:60 Minutes
Building:E1 4
The evolution of search from keywords to entities has necessitated the efficient harvesting and management

of entity-centric information for constructing knowledge bases catering to various applications such as semantic
search, question answering, and information retrieval. The vast amounts of natural language texts available
across diverse domains on the Web provide rich sources for discovering facts about named entities such as
people, places, and organizations.
A key challenge, in this regard, entails the need for precise identification and disambiguation of entities across
documents for extraction of attributes/relations and their proper representation in knowledge bases. Additionally,
the applicability of such repositories not only involves the quality and accuracy of the stored information,
but also storage management and query processing efficiency.
This dissertation aims to tackle the above problems by presenting efficient approaches for entity-centric
knowledge acquisition from texts and its representation in knowledge repositories.
This dissertation presents a robust approach for identifying text phrases pertaining to the same named entity
across huge corpora, and their disambiguation to canonical entities present in a knowledge base, by using
enriched semantic contexts and link validation encapsulated in a hierarchical clustering framework.
This work further presents language and consistency features for classification models to compute the
credibility of obtained textual facts, ensuring quality of the extracted information.
Finally, an encoding algorithm, using frequent term detection and improved data locality, to represent entities
for enhanced knowledge base storage and query performance is presented.

Name(s):Petra Schaaf
EMail:--email address not disclosed on the web
Video Broadcast
Video Broadcast:NoTo Location:
Tags, Category, Keywords and additional notes
Attachments, File(s):

Petra Schaaf/AG5/MPII/DE, 03/01/2017 10:35 AM
Last modified:
halma/MPII/DE, 11/07/2018 04:52 PM
  • Petra Schaaf, 03/01/2017 10:38 AM -- Created document.