Proceedings Article, Paper
@InProceedings
Contribution to Proceedings, Workshop


Author, Editor

Author(s):

Siersdorfer, Stefan
Sizov, Sergej


Editor(s):

Järvelin, Kalervo
Allan, James
Bruza, Peter
Sanderson, Mark

Not MPII Editor(s):

Järvelin, Kalervo
Allan, James
Bruza, Peter
Sanderson, Mark

BibTeX cite key*:

SiersClust2004

Title, Booktitle

Title*:

Restrictive Clustering and Metaclustering for self-organizing Document Collections


p84-sizov.pdf (153.63 KB)

Booktitle*:

Proceedings of SIGIR 2004: the Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval

Event, URLs

URL of the conference:

http://www.sigir.org/sigir2004/

Event Address*:

Sheffield, UK

Language:

English

Event Start Date:

25 July 2004

Event End Date:

29 July 2004

Publisher

Name*:

ACM

Address*:

New York, USA

Vol, No, Year, pp.

Pages:

226-233

Year*:

2004


Note, Abstract, ©

Note:

Acceptance ratio 1:5

(LaTeX) Abstract:

This paper addresses the problem of automatically structuring
heterogeneous document collections by using clustering
methods. In contrast to traditional clustering, we study
restrictive methods and ensemble-based meta methods that
may decide to leave out some documents rather than assigning
them to inappropriate clusters with low confidence.
These techniques yield higher cluster purity and better overall
accuracy, and they make unsupervised self-organization more
robust. Our comprehensive experimental studies on three
different real-world data collections demonstrate these benefits.
The proposed methods seem particularly suitable for
automatically substructuring personal email folders or
personal Web directories that are populated by focused crawlers,
and they can be combined with supervised classification
techniques.

Keywords:

Meta Clustering, Restrictive Clustering



Download
Access Level:

Public

Correlation

MPG Unit:

Max-Planck-Institut für Informatik



MPG Subunit:

Databases and Information Systems Group

Appearance:

MPII WWW Server, MPII FTP Server, MPG publications list, university publications list, working group publication list, Fachbeirat, VG Wort



BibTeX Entry:

@INPROCEEDINGS{SiersClust2004,
AUTHOR = {Siersdorfer, Stefan and Sizov, Sergej},
EDITOR = {J{\"a}rvelin, Kalervo and Allan, James and Bruza, Peter and Sanderson, Mark},
TITLE = {Restrictive Clustering and Metaclustering for self-organizing Document Collections},
BOOKTITLE = {Proceedings of SIGIR 2004: the Twenty-Seventh Annual International ACM SIGIR Conference on Research and Development in Information Retrieval},
PUBLISHER = {ACM},
YEAR = {2004},
PAGES = {226--233},
ADDRESS = {Sheffield, UK},
NOTE = {Acceptance ratio 1:5},
}


Entry last modified by Christine Kiesel, 05/31/2005
Edit History:

Created by Sergej Sizov, 01/26/2005 10:25:08 AM

Revision  Editor            Edit Date
4.        Christine Kiesel  31.05.2005 14:53:38
3.        Christine Kiesel  31.05.2005 14:52:57
2.        Petra Schaaf      14.04.2005 10:27:23
1.        Sabine Krott      09.02.2005 10:45:36
0.        Sabine Krott      26.01.2005 10:25:08