Journal Article
@Article
Artikel in Fachzeitschrift


Show entries of:

this year (2017) | last year (2016) | two years ago (2015) | Notes URL

Action:

login to update

Options:








Author, Editor(s)

Author(s):

Theobald, Martin
Bast, Holger
Majumdar, Debapriyo
Schenkel, Ralf
Weikum, Gerhard

dblp
dblp
dblp
dblp
dblp



BibTeX cite key*:

TheobaldBMSW_VLDBJ

Title

Title*:

TopX: Efficient and Versatile Top-k Query Processing for Semistructured Data

Journal

Journal Title*:

The VLDB Journal

Journal's URL:

http://www.springerlink.com/openurl.asp?genre=journal&issn=1066-8888

Download URL
for the article:

http://www.springerlink.com/content/y34h4h0741378kl6/

Language:

English

Publisher

Publisher's
Name:

Springer

Publisher's URL:


Publisher's
Address:

Heidelberg, Germany

ISSN:


Vol, No, pp, Date

Volume*:

17

Number:

2

Publishing Date:

January 2008

Pages*:

81-115

Number of
VG Pages:


Page Start:


Page End:


Sequence Number:


DOI:

10.1007/s00778-007-0072-z

Note, Abstract, ©

Note:


(LaTeX) Abstract:

Recent IR extensions to XML query languages such as Xpath 1.0 Full-Text or the NEXI query language
of the INEX benchmark series reflect the emerging interest in IR-style ranked retrieval
over semistructured data.
TopX is a top-$k$ retrieval engine for text and semistructured data.
It terminates query execution as soon as it can safely determine
the $k$ top-ranked result elements according to a monotonic score aggregation function with respect to a multidimensional query.
It efficiently supports vague search on both content- and structure-oriented query conditions for dy\-namic query relaxation with controllable influence on the result ranking.
The main contributions of this paper unfold into four main points:
1) fully implemented models and algorithms for ranked XML retrieval with XPath Full-Text functionality,
2) efficient and effective top-$k$ query processing for semistructured data,
3) support for integrating thesauri and ontologies with statistically quantified relationships among concepts, leveraged for word-sense disambiguation and \linebreak query expansion, and
4) a comprehensive description of the TopX system, with performance experiments on large-scale corpora like TREC Terabyte and INEX Wikipedia.

URL for the Abstract:


Categories,
Keywords:


HyperLinks / References / URLs:


Copyright Message:


Personal Comments:


Download
Access Level:

Intranet

Correlation

MPG Unit:

Max-Planck-Institut für Informatik



MPG Subunit:

Databases and Information Systems Group

Audience:

popular

Appearance:

MPII WWW Server, MPII FTP Server, MPG publications list, university publications list, working group publication list, Fachbeirat, VG Wort


BibTeX Entry:

@ARTICLE{TheobaldBMSW_VLDBJ,
AUTHOR = {Theobald, Martin and Bast, Holger and Majumdar, Debapriyo and Schenkel, Ralf and Weikum, Gerhard},
TITLE = {{TopX}: Efficient and Versatile Top-k Query Processing for Semistructured Data},
JOURNAL = {The VLDB Journal},
PUBLISHER = {Springer},
YEAR = {2008},
NUMBER = {2},
VOLUME = {17},
PAGES = {81--115},
ADDRESS = {Heidelberg, Germany},
MONTH = {January},
DOI = {10.1007/s00778-007-0072-z},
}


Entry last modified by Martin Theobald, 04/14/2009
Show details for Edit History (please click the blue arrow to see the details)Edit History (please click the blue arrow to see the details)
Hide details for Edit History (please click the blue arrow to see the details)Edit History (please click the blue arrow to see the details)

Editor(s)
Ralf Schenkel
Created
06/25/2007 12:16:13 PM
Revisions
6.
5.
4.
3.
2.
Editor(s)
Martin Theobald
Olha Condor
Olha Condor
Martin Theobald
Martin Theobald
Edit Dates
04/14/2009 04:09:11 PM
26.01.2009 14:44:57
19.01.2009 15:32:55
11/17/2008 04:08:22 PM
04.12.2007 11:07:59