Max-Planck-Institut für Informatik

MPI-I-2010-5-008

Query relaxation for entity-relationship search

Elbassuoni, Shady and Ramanath, Maya and Weikum, Gerhard

MPI-I-2010-5-008. December 2010, 40 pages. | Status: available - back from printing

Abstract in LaTeX format:
Entity-relationship-structured data is becoming more important on the
Web. For example, large knowledge bases have been automatically constructed
by information extraction from Wikipedia and other Web sources.
Entities and relationships can be represented by subject-property-object
triples in the RDF model, and can then be precisely searched by structured
query languages like SPARQL. Because of their Boolean-match semantics,
such queries often return too few or even no results. To improve recall, it
is thus desirable to support users by automatically relaxing or reformulating
queries in such a way that the intention of the original user query is preserved
while returning a sufficient number of ranked results.
In this paper we describe comprehensive methods to relax SPARQL-like
triple-pattern queries, possibly augmented with keywords, in a fully automated
manner. Our framework produces a set of relaxations by means of
statistical language models for structured RDF data and queries. The query
processing algorithms merge the results of different relaxations into a unified
result list, with ranking based again on language models. Our experimental
evaluation, with two different datasets about movies and books, shows the
effectiveness of the automatically generated relaxations and the improved
quality of query results based on assessments collected on the Amazon Mechanical
Turk platform.

To download this research report, please select the type of document that best fits your needs.
Attachment size(s):
MPI-I-2010-5-008.pdf, 279 KBytes
URL to this document: http://domino.mpi-inf.mpg.de/internet/reports.nsf/NumberView/2010-5-008
BibTeX:
@TECHREPORT{Elbassuoni2010,
  AUTHOR = {Elbassuoni, Shady and Ramanath, Maya and Weikum, Gerhard},
  TITLE = {Query relaxation for entity-relationship search},
  TYPE = {Research Report},
  INSTITUTION = {Max-Planck-Institut f{\"u}r Informatik},
  ADDRESS = {Stuhlsatzenhausweg 85, 66123 Saarbr{\"u}cken, Germany},
  NUMBER = {MPI-I-2010-5-008},
  MONTH = {December},
  YEAR = {2010},
  ISSN = {0946-011X},
}