Technical, Research Report
@TechReport
Technischer-, Forschungsbericht


Show entries of:

this year (2019) | last year (2018) | two years ago (2017) | Notes URL

Action:

login to update

Options:









Author, Editor

Author(s):

Berberich, Klaus
Bedathur, Srikanta
Neumann, Thomas
Weikum, Gerhard

dblp
dblp
dblp
dblp



Editor(s):





BibTeX Citekey*:

TechReportBBNW-2007

Language:

English

Title, Institution

Title*:

A Time Machine for Text Search

Institution*:

Max-Planck-Institut for Informatics

Publishers or Institutions Address*:

Saarbrücken, Germany

Type:

Research Report

No, Year, pp.,

Number*:

MPII-I-2007-5-02

Pages*:

39

Month:

July

VG Wort
Pages*:

60

Year*:

2007

ISBN/ISSN:

0946-011X





DOI:




Note, Abstract, ©

Note:


(LaTeX) Abstract:

Text search over temporally versioned document collections such as web archives has received little attention as a research problem. As a consequence, there is no scalable and principled solution to search such a collection as of a specified time t. In this work, we address this shortcoming and propose an efficient solution for time-travel text search by extending the inverted file index to make it ready for temporal search. We introduce approximate temporal coalescing as a tunable method to reduce the index size without significantly coalescing the quality of results. In order to further improve the performance of time-travel queries, we introduce two principled techniques to trade off index size for its performance. These techniques can be formulated as optimization problems that can be solved to near-optimality. Finally, our approach is evaluated in a comprehensive series of experiments on two largescale real-world datasets. Results unequivocally show that our methods make it possible to build an efficient "time machine" scalable to large versioned text collections.

Categories / Keywords:


Copyright Message:


HyperLinks / References / URLs:


Personal Comments:


File Upload:


Download
Access Level:

Public

Correlation

MPG Unit:

Max-Planck-Institut für Informatik



MPG Subunit:

Databases and Information Systems Group

Audience:

Expert

Appearance:

MPII WWW Server, MPII FTP Server, MPG publications list, university publications list, working group publication list, Fachbeirat, VG Wort


BibTeX Entry:
@TECHREPORT{TechReportBBNW-2007,
AUTHOR = {Berberich, Klaus and Bedathur, Srikanta and Neumann, Thomas and Weikum, Gerhard},
TITLE = {A Time Machine for Text Search},
YEAR = {2007},
TYPE = {Research Report},
INSTITUTION = {Max-Planck-Institut for Informatics},
NUMBER = {MPII-I-2007-5-02},
PAGES = {39},
ADDRESS = {Saarbr{\"u}cken, Germany},
MONTH = {July},
ISBN = {0946-011X},
}


Entry last modified by Ralf Schenkel, 04/07/2009
Show details for Edit History (please click the blue arrow to see the details)Edit History (please click the blue arrow to see the details)
Hide details for Edit History (please click the blue arrow to see the details)Edit History (please click the blue arrow to see the details)

Editor(s)
Adriana Davidescu
Created
07/20/2007 02:39:33 PM
Revisions
2.
1.
0.

Editor(s)
Ralf Schenkel
Adriana Davidescu
Adriana Davidescu

Edit Dates
07.04.2009 15:32:34
28.12.2007 12:20:47
20.07.2007 14:54:12

Show details for Attachment SectionAttachment Section
Hide details for Attachment SectionAttachment Section
TechReportBBNW-2007.pdf
View attachments here: