MPI-INF Logo
Publications

Server    domino.mpi-inf.mpg.de

Proceedings Article, Paper
@InProceedings
Beitrag in Tagungsband, Workshop

Author, Editor
Author(s):
Bast, Holger
Dupret, Georges
Majumdar, Debapriyo
Piwowarski, Benjamin
dblp
dblp
dblp
dblp
Not MPG Author(s):
Dupret, Georges
Piwowarski, Benjamin
Editor(s):
Ackermann, Markus
Berendt, Bettina
Grobelnik, Marko
Hotho, Andreas
Mladenic, Dunja
Semeraro, Giovanni
Spiliopoulou, Myra
Stumme, Gerd
Svatek, Vojtech
van Someren, Maarten W.
dblp
dblp
dblp
dblp
dblp
dblp
dblp
dblp
dblp
dblp
Not MPII Editor(s):
Ackermann, Markus
Berendt, Bettina
Grobelnik, Marko
Hotho, Andreas
Mladenic, Dunja
Semeraro, Giovanni
Spiliopoulou, Myra
Stumme, Gerd
Svatek, Vojtech
van Someren, Maarten W.
BibTeX cite key*:
BastDMP06
Title, Booktitle
Title*:
Discovering a Term Taxonomy from Term Similarities Using Principal Component Analysis
Booktitle*:
Semantics, web and mining : Joint International Workshops, EWMF 2005 and KDO 2005
Event, URLs
Conference URL::
Downloading URL:
http://www.springerlink.com/content/y3111k29gk641w7q/fulltext.pdf
Event Address*:
Porto, Portugal
Language:
English
Event Date*
(no longer used):
Organization:
Event Start Date:
3 October 2005
Event End Date:
7 October 2005
Publisher
Name*:
Springer
URL:
Address*:
Berlin, Germany
Type:
Vol, No, Year, pp.
Series:
Lecture Notes in Computer Science
Volume:
4289
Number:
Month:
Pages:
103-120
Year*:
2006
VG Wort Pages:
ISBN/ISSN:
978-3-540-47697-9; 3-540-47697-0
Sequence Number:
DOI:
Note, Abstract, ©
(LaTeX) Abstract:
We show that eigenvector decomposition can be used to extract a term taxonomy from a given collection of text documents. So far, methods based on eigenvector decomposition, such as latent semantic indexing (LSI) or principal component analysis (PCA), were only known to be useful for extracting symmetric relations between terms. We give a precise mathematical criterion for distinguishing between four kinds of relations of a given pair of terms of a given collection: unrelated (car - fruit), symmetrically related (car - automobile), asymmetrically related with the first term being more specific than the second (banana - fruit), and asymmetrically related in the other direction (fruit - banana). We give theoretical evidence for the soundness of our criterion, by showing that in a simplified mathematical model the criterion does the apparently right thing. We applied our scheme to the reconstruction of a selected part of the open directory project (ODP) hierarchy, with promising results.
URL for the Abstract:
http://www.springerlink.com/content/y3111k29gk641w7q/
Download
Access Level:
Public

Correlation
MPG Unit:
Max-Planck-Institut für Informatik
MPG Subunit:
Algorithms and Complexity Group
Appearance:
MPII WWW Server, MPII FTP Server, MPG publications list, university publications list, working group publication list, Fachbeirat, VG Wort



BibTeX Entry:

@INPROCEEDINGS{BastDMP06,
AUTHOR = {Bast, Holger and Dupret, Georges and Majumdar, Debapriyo and Piwowarski, Benjamin},
EDITOR = {Ackermann, Markus and Berendt, Bettina and Grobelnik, Marko and Hotho, Andreas and Mladenic, Dunja and Semeraro, Giovanni and Spiliopoulou, Myra and Stumme, Gerd and Svatek, Vojtech and van Someren, Maarten W.},
TITLE = {Discovering a Term Taxonomy from Term Similarities Using Principal Component Analysis},
BOOKTITLE = {Semantics, web and mining : Joint International Workshops, EWMF 2005 and KDO 2005},
PUBLISHER = {Springer},
YEAR = {2006},
VOLUME = {4289},
PAGES = {103--120},
SERIES = {Lecture Notes in Computer Science},
ADDRESS = {Porto, Portugal},
ISBN = {978-3-540-47697-9},
; ISBN = {3-540-47697-0},
}


Entry last modified by Christine Kiesel, 03/17/2007
Hide details for Edit History (please click the blue arrow to see the details)Edit History (please click the blue arrow to see the details)

Editor(s)
Holger Bast
Created
09/15/2006 10:53:42 PM
Revisions
3.
2.
1.
0.
Editor(s)
Christine Kiesel
Christine Kiesel
Holger Bast
Holger Bast
Edit Dates
17.03.2007 00:53:35
17.03.2007 00:35:26
09/15/2006 10:54:43 PM
09/15/2006 10:53:43 PM