Thesis - Doctoral dissertation | @PhdThesis | Doktorarbeit

Author(s)*:de Melo, Gerard
BibTeX citekey*:DeMelo2010

Title, School
Title*:Graph-based Methods for Large-Scale Multilingual Knowledge Integration
School:Universität des Saarlandes
Type of Thesis*:Doctoral dissertation

Publishers Name:Universität des Saarlandes
Publishers Address:Saarbrücken

Note, Abstract, Copyright
LaTeX Abstract:Given that much of our knowledge is expressed in textual form, information systems are increasingly dependent on knowledge about words and the entities they represent. This thesis investigates novel methods for automatically building large repositories of knowledge that capture semantic relationships between words, names, and entities, in many different languages. Three major contributions are made, each involving graph algorithms and statistical techniques that combine evidence from multiple sources of information.
The lexical integration method involves learning models that disambiguate word meanings based on contextual information in a graph, thereby providing a means to connect words to the entities that they denote. The entity integration method combines semantic items from different sources into a single unified registry of entities by reconciling equivalence and distinctness information and solving a combinatorial optimization problem. Finally, the taxonomic integration method adds a comprehensive and coherent taxonomic hierarchy on top of this registry, capturing how different entities relate to each other.
Together, these methods can be used to produce a large-scale multilingual knowledge base semantically describing over 5 million entities and over 16 million natural language words and names in more than 200 different languages.

Referees, Status, Dates
1. Referee:Prof. Hans Uszkoreit, PhD, Universität des Saarlandes, DFKI
2. Referee:Prof. Hinrich Schütze, PhD, Universität Stuttgart
Supervisor:Prof. Dr.-Ing. Gerhard Weikum
Date of Colloquium:15 December 2010
Chair of Colloquium:Prof. Dr.rer.nat. Gert Smolka

MPG Unit:Max-Planck-Institut für Informatik
