MPI-INF Logo
Campus Event Calendar

Event Entry

New for: D1, D2

What and Who

Chunk Tagging (Shallow Parsing with Markov Models)

Wojciech Skut
Computerlinguistisches Kolloquium
AG 1, AG 2  
AG Audience

Date, Time and Location

Thursday, 23 July 98
16:00
-- Not specified --
17.2 - Computerlinguistik
EG, Seminarraum
Saarbrücken

Abstract

In my talk, I will present the **chunk tagger**, i.e., a shallow parser based on Hidden Markov Models (HMMs). The parser assigns tree segments of limited depth to sequences of part-of-speech tags in the same way a POS tagger finds the optimal tag sequence for a sequence of words, e.g.

| . . ___ . . . . . . . . . . . . . . . . .__
| |. . . . . . . . . . _______ |
| | . . . . .__________ | |
| | . . . _ | | |
| | | . .__ | | |
| | | | | | |

chunk tagging

ART ADV CARD CARD NN ADJA NN

part-of-speech tagging

der fast 50 000 DM teure Wagen


Though ambiguity is much higher than with standard POS tagging, the chunker
achieves fairly good accuracy as far as recognition of simple as well as
complex NPs, PPs and APs is concerned (89.5% unlabelled total match). The
representation format for structures enables us to take advantage of
disambiguation hints provided by strictly local lexical contexts (trigrams of
POS tags).

The chunker is available in two versions which differ w.r.t. the techniques
used for parameter estimation (linear interpolation vs. maximum entropy). I
will contrast these two method and compare their results.

Contact

--email hidden
passcode not visible
logged in users only

Uwe Brahm, 04/12/2007 11:58 -- Created document.