Unpublished, Draft, To Appear
@UnPublished
Unpublished, Draft


Author, Editor

Author(s):

Celikik, Marjan
Bast, Holger
Manolache, Gabriel




BibTeX citekey*:

bastcelikikmanolache2009-snippets

Title, Booktitle

Title*:

Efficient Index-Based Snippet Generation

Vol, No, pp., Year

Month:


Year:

2011

Language:

English

Pages:


Abstract, Links, ©

Note:

unpublished

LaTeX Abstract:

Ranked result lists with query-dependent snippets have become state of the art
in text search. They are typically implemented by searching, at query time,
for occurrences of the query words in the top-ranked documents. This
\emph{document-based} approach has three inherent problems: (i) when a
document is indexed by terms which it does not contain literally (e.g.,
related words or spelling variants), localization of the corresponding
snippets becomes problematic; (ii) each query operator (e.g., phrase or
proximity search) has to be implemented twice, on the index side in order to
compute the correct result set, and on the snippet generation side to generate
the appropriate snippets; and (iii) in the worst case, the whole document needs
to be scanned for occurrences of the query words, which is problematic for
very long documents.

We present a new \emph{index-based} method that localizes snippets by
information solely computed from the index, and that overcomes all three
problems. Unlike previous index-based methods, we show how to achieve this at
essentially no extra cost in query processing time, by a technique we call
\emph{query rotation}. We also show how our index-based method allows the
caching of individual segments instead of complete documents, which enables a
significantly higher cache hit ratio than the document-based
approach. We have fully integrated our implementation with the CompleteSearch
engine.
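
The contrast drawn in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and does not model query rotation or segment caching. It assumes a toy positional inverted index, and all function names (tokenize, document_based_positions, build_positional_index, index_based_positions, snippet) are hypothetical. The document-based variant scans the full text for literal query words, while the index-based variant reads match positions straight from the index.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Return lowercase word tokens in document order."""
    return re.findall(r"\w+", text.lower())


def document_based_positions(doc_text, query_words):
    """Scan the whole document for literal occurrences of the query words."""
    wanted = {w.lower() for w in query_words}
    return [i for i, tok in enumerate(tokenize(doc_text)) if tok in wanted]


def build_positional_index(docs):
    """Toy positional index (an assumption): word -> {doc_id: [positions]}."""
    index = defaultdict(lambda: defaultdict(list))
    for doc_id, text in docs.items():
        for pos, tok in enumerate(tokenize(text)):
            index[tok][doc_id].append(pos)
    return index


def index_based_positions(index, doc_id, query_words):
    """Read match positions from the index without scanning the document."""
    positions = []
    for w in query_words:
        positions.extend(index.get(w.lower(), {}).get(doc_id, []))
    return sorted(positions)


def snippet(doc_text, positions, radius=2):
    """Cut a window of tokens around the first match position."""
    tokens = tokenize(doc_text)
    if not positions:
        return ""
    p = positions[0]
    return " ".join(tokens[max(0, p - radius): p + radius + 1])


if __name__ == "__main__":
    docs = {"d1": "Ranked result lists with query-dependent snippets"}
    idx = build_positional_index(docs)
    pos = index_based_positions(idx, "d1", ["snippets"])
    print(pos, snippet(docs["d1"], pos, radius=1))
```

In this sketch both variants return the same positions for literal matches. The index-based lookup would also work if the index stored positions under non-literal terms (for example, spelling variants), which is the first problem the abstract lists for the document-based approach.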

Categories / Keywords:

Snippet generation, Advanced Search, Caching

HyperLinks / References / URLs:


Personal Comments:

no publication found 17.12.10/bc

Access Level:

Internal

Correlation

MPG Unit:

Max-Planck-Institut für Informatik



MPG Subunit:

Algorithms and Complexity Group

Audience:

popular

Appearance:

MPII WWW Server, MPII FTP Server, university publications list, working group publication list, Fachbeirat, VG Wort

BibTeX Entry:
@UNPUBLISHED{bastcelikikmanolache2009-snippets,
AUTHOR = {Celikik, Marjan and Bast, Holger and Manolache, Gabriel},
TITLE = {Efficient Index-Based Snippet Generation},
YEAR = {2011},
NOTE = {unpublished},
}


Entry last modified by Thomas Sauerwald, 04/01/2011
Edit History

Editor(s): Marjan Celikik
Created: 02/03/2009 04:05:10 PM

Revision | Editor | Edit Date
4. | Thomas Sauerwald | 04/01/2011 02:20:42 PM
3. | Anja Becker | 17.12.2010 13:10:05
2. | Anja Becker | 04.03.2010 11:16:49
1. | Marjan Celikik | 03/27/2009 02:55:31 PM
0. | Marjan Celikik | 02/03/2009 04:05:10 PM