Proceedings Article, Paper
@InProceedings
Beitrag in Tagungsband, Workshop


Show entries of:

this year (2020) | last year (2019) | two years ago (2018) | Notes URL

Action:

login to update

Options:








Author, Editor

Author(s):

Bender, Matthias
Michel, Sebastian
Triantafillou, Peter
Weikum, Gerhard
Zimmer, Christian

dblp
dblp
dblp
dblp
dblp

Not MPG Author(s):

Triantafillou, Peter

Editor(s):

Baeza-Yates, Ricardo A.
Ziviani, Nivio
Marchionini, Gary
Moffat, Alistair
Tait, John

dblp
dblp
dblp
dblp
dblp

Not MPII Editor(s):

Baeza-Yates, Ricardo A.
Ziviani, Nivio
Marchionini, Gary
Moffat, Alistair
Tait, John

BibTeX cite key*:

SIGIR2005

Title, Booktitle

Title*:

Improving Collection Selection with Overlap-Awareness

Booktitle*:

SIGIR 2005 : Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '05)

Event, URLs

URL of the conference:

http://www.dcc.ufmg.br/eventos/sigir2005/

URL for downloading the paper:

http://doi.acm.org/10.1145/1076049

Event Address*:

Salvador, Brazil

Language:

English

Event Date*
(no longer used):


Organization:


Event Start Date:

15 August 2005

Event End Date:

19 August 2005

Publisher

Name*:

ACM

URL:

http://www.acm.org

Address*:

New York, USA

Type:


Vol, No, Year, pp.

Series:


Volume:


Number:


Month:


Pages:

67-74

Year*:

2005

VG Wort Pages:

40

ISBN/ISSN:

1-59593-034-5

Sequence Number:


DOI:




Note, Abstract, ©

Note:

Acceptance ratio 1:5

(LaTeX) Abstract:

Collection selection has been a research issue for years. Most
of the existing literature estimates the expected result quality
of a collection, typically using precomputed statistics,
and ranks the collections accordingly. We believe that this
is insufficient if the collections overlap, e.g., in the scenario of
autonomous peers crawling the web. We argue for the extension
of existing quality measures using estimators of mutual
overlap among collections and present experiments in which
this combination outperforms CORI, a popular approach
based on quality estimation. In our experiments, we use a
prototype implementation of a P2P web search engine that
allows handling large amounts of data in a distributed and
self-organizing manner. Taking overlap into account during
collection selection in this scenario can drastically decrease
the number of collections that have to be contacted in order
to reach a satisfactory level of recall, which is a great step
towards the feasibility of distributed web search.

Keywords:

Algorithms, Design, Experimentation, Peer-to-Peer, Distributed IR, query routing, overlap



Download
Access Level:

Internal

Correlation

MPG Unit:

Max-Planck-Institut für Informatik



MPG Subunit:

Databases and Information Systems Group

Appearance:

MPII WWW Server, MPII FTP Server, MPG publications list, university publications list, working group publication list, Fachbeirat, VG Wort



BibTeX Entry:

@INPROCEEDINGS{SIGIR2005,
AUTHOR = {Bender, Matthias and Michel, Sebastian and Triantafillou, Peter and Weikum, Gerhard and Zimmer, Christian},
EDITOR = {Baeza-Yates, Ricardo A. and Ziviani, Nivio and Marchionini, Gary and Moffat, Alistair and Tait, John},
TITLE = {Improving Collection Selection with Overlap-Awareness},
BOOKTITLE = {SIGIR 2005 : Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '05)},
PUBLISHER = {ACM},
YEAR = {2005},
PAGES = {67--74},
ADDRESS = {Salvador, Brazil},
ISBN = {1-59593-034-5},
NOTE = {Acceptance ratio 1:5},
}


Entry last modified by Adriana Davidescu, 07/20/2007
Show details for Edit History (please click the blue arrow to see the details)Edit History (please click the blue arrow to see the details)
Hide details for Edit History (please click the blue arrow to see the details)Edit History (please click the blue arrow to see the details)

Editor(s)
Matthias Bender
Created
04/12/2005 09:12:56 AM
Revisions
23.
22.
21.
20.
19.
Editor(s)
Adriana Davidescu
Petra Schaaf
Ralf Schenkel
Christine Kiesel
Christine Kiesel
Edit Dates
20.07.2007 15:44:20
04.04.2007 13:16:28
30.03.2007 16:27:20
17.01.2006 10:30:31
06.01.2006 10:19:06