Proceedings Article, Paper
@InProceedings
Beitrag in Tagungsband, Workshop


Show entries of:

this year (2017) | last year (2016) | two years ago (2015) | Notes URL

Action:

login to update

Options:








Author, Editor

Author(s):

Schenkel, Ralf
Suchanek, Fabian
Kasneci, Gjergji

dblp
dblp
dblp



Editor(s):

Kemper, Alfons
Schöning, Harald
Rose, Thomas
Jarke, Matthias
Seidl, Thomas
Quix, Christoph
Brochhaus, Christoph

dblp
dblp
dblp
dblp
dblp
dblp
dblp

Not MPII Editor(s):

Kemper, Alfons
Schöning, Harald
Rose, Thomas
Jarke, Matthias
Seidl, Thomas
Quix, Christoph
Brochhaus, Christoph

BibTeX cite key*:

SchenkelSK07

Title, Booktitle

Title*:

YAWN: A Semantically Annotated Wikipedia XML Corpus

Booktitle*:

12. GI-Fachtagung für Datenbanksysteme in Business, Technologie und Web (BTW 2007)

Event, URLs

URL of the conference:

http://www.btw2007.de

URL for downloading the paper:


Event Address*:

Aachen, Germany

Language:

English

Event Date*
(no longer used):


Organization:


Event Start Date:

7 March 2007

Event End Date:

9 March 2007

Publisher

Name*:

Gesellschaft für Informatik

URL:


Address*:

Bonn, Germany

Type:


Vol, No, Year, pp.

Series:

Lecture Notes in Informatics

Volume:

103

Number:


Month:


Pages:

277-291

Year*:

2007

VG Wort Pages:

28

ISBN/ISSN:

978-3-88579-197-3

Sequence Number:


DOI:




Note, Abstract, ©


(LaTeX) Abstract:

The paper presents YAWN, a system to convert the well-known and widely used Wikipedia collection into an XML corpus with semantically rich, self-explaining tags. We introduce algorithms to annotate pages and links with concepts from the WordNet thesaurus. This annotation process exploits categorical information in Wikipedia, which is a high-quality, manually assigned source of information, extracts additional information from lists, and utilizes the invocations of templates with named parameters. We give examples how such annotations can be exploited for high-precision queries.



Download
Access Level:


Correlation

MPG Unit:

Max-Planck-Institut für Informatik



MPG Subunit:

Databases and Information Systems Group

Appearance:

MPII WWW Server, MPII FTP Server, MPG publications list, university publications list, working group publication list, Fachbeirat, VG Wort



BibTeX Entry:

@INPROCEEDINGS{SchenkelSK07,
AUTHOR = {Schenkel, Ralf and Suchanek, Fabian and Kasneci, Gjergji},
EDITOR = {Kemper, Alfons and Sch{\"o}ning, Harald and Rose, Thomas and Jarke, Matthias and Seidl, Thomas and Quix, Christoph and Brochhaus, Christoph},
TITLE = {{YAWN}: A Semantically Annotated {Wikipedia} {XML} Corpus},
BOOKTITLE = {12. GI-Fachtagung f{\"u}r Datenbanksysteme in Business, Technologie und Web (BTW 2007)},
PUBLISHER = {Gesellschaft für Informatik},
YEAR = {2007},
VOLUME = {103},
PAGES = {277--291},
SERIES = {Lecture Notes in Informatics},
ADDRESS = {Aachen, Germany},
ISBN = {978-3-88579-197-3},
}


Entry last modified by Olha Condor, 11/18/2008
Show details for Edit History (please click the blue arrow to see the details)Edit History (please click the blue arrow to see the details)
Hide details for Edit History (please click the blue arrow to see the details)Edit History (please click the blue arrow to see the details)

Editor(s)
Ralf Schenkel
Created
11/27/2006 02:12:08 PM
Revisions
10.
9.
8.
7.
6.
Editor(s)
Olha Condor
Adriana Davidescu
Petra Schaaf
Ralf Schenkel
Ralf Schenkel
Edit Dates
18.11.2008 13:06:27
14.01.2008 16:34:22
04.04.2007 13:52:11
30.03.2007 16:24:50
15.03.2007 14:08:07