Max-Planck-Institut für Informatik


Diversifying Search Results Using Time

Gupta, Dhruv and Berberich, Klaus

MPI-I-2016-5-001. March 2016, ? pages. | Status: available - back from printing

Abstract in LaTeX format:
Getting an overview of a historic entity or event from search results
can be difficult, especially if important dates concerning the entity
or event are not known beforehand. For such information needs,
users would benefit if returned results covered diverse dates, thus
giving an overview of what has happened throughout
history. Diversifying search results based on important dates can be a
building block for applications, for instance, in digital
humanities. Historians would thus be able to quickly explore
longitudinal document collections by querying for entities or events
without knowing the associated important dates a priori.

In this work, we describe an approach to diversify search
results using temporal expressions (e.g., in the 1990s) from their
contents. Our approach first identifies time intervals of interest
to the given keyword query based on pseudo-relevant documents. It
then re-ranks query results so as to maximize the coverage of
identified time intervals.
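
The two steps above can be illustrated with a minimal sketch. It is not the report's implementation: the abstract does not specify how intervals are scored or how coverage is maximized, so this sketch assumes a simple frequency count over the top-k documents and a greedy maximum-coverage re-ranking. The regular expression is a stand-in for a real temporal tagger, and all names (`extract_intervals`, `interesting_intervals`, `diversify`, `k`, `top_n`) are hypothetical.

```python
import re
from collections import Counter

# Assumption: only four-digit years and decades such as "1990s" are recognized.
# A real system would use a proper temporal tagger instead of this pattern.
YEAR_OR_DECADE = re.compile(r"\b(1[0-9]{3}|20[0-9]{2})(s)?\b")


def extract_intervals(text):
    """Return the set of (begin_year, end_year) intervals mentioned in text."""
    intervals = set()
    for match in YEAR_OR_DECADE.finditer(text):
        year = int(match.group(1))
        if match.group(2) and year % 10 == 0:
            # Decade expression, e.g. "1990s" -> (1990, 1999).
            intervals.add((year, year + 9))
        else:
            intervals.add((year, year))
    return intervals


def interesting_intervals(ranked_docs, k=3, top_n=3):
    """Pick the intervals mentioned by the most of the top-k documents.

    The top-k documents of the initial ranking are treated as
    pseudo-relevant; each document counts an interval at most once.
    """
    counts = Counter()
    for doc in ranked_docs[:k]:
        counts.update(extract_intervals(doc))
    return [interval for interval, _ in counts.most_common(top_n)]


def diversify(ranked_docs, intervals):
    """Greedily re-rank documents to cover the given intervals early.

    At each step the document covering the most still-uncovered intervals
    is chosen; ties keep the original ranking order.
    """
    remaining = list(ranked_docs)
    uncovered = set(intervals)
    reranked = []
    while remaining:
        best = max(remaining, key=lambda d: len(extract_intervals(d) & uncovered))
        reranked.append(best)
        remaining.remove(best)
        uncovered -= extract_intervals(best)
    return reranked


if __name__ == "__main__":
    docs = [
        "The wall was built in 1961.",
        "Construction began in 1961 under tight security.",
        "The wall fell in 1989, and the 1990s brought reunification.",
        "Tourists visit the memorial.",
    ]
    for doc in diversify(docs, interesting_intervals(docs)):
        print(doc)
```

In this toy ranking, the third document moves to the top because it alone mentions two of the identified intervals. The near-duplicate second document drops below the first because it adds no new time interval.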

We present a novel and objective evaluation for our proposed
approach. We test the effectiveness of our methods on the
New York Times Annotated corpus and the Living Knowledge corpus, collectively
consisting of around 6 million documents. Using history-oriented queries and
encyclopedic resources, we show that our method is indeed able to present
search results diversified along time.

Attachment: MPI-I-2016-5-001.pdf (443 KBytes)
BibTeX:
  AUTHOR = {Gupta, Dhruv and Berberich, Klaus},
  TITLE = {Diversifying Search Results Using Time},
  TYPE = {Research Report},
  INSTITUTION = {Max-Planck-Institut f{\"u}r Informatik},
  ADDRESS = {Stuhlsatzenhausweg 85, 66123 Saarbr{\"u}cken, Germany},
  NUMBER = {MPI-I-2016-5-001},
  MONTH = {March},
  YEAR = {2016},
  ISSN = {0946-011X},