MPI-INF Logo
Campus Event Calendar

Event Entry

What and Who

Efficient construction of a finite-state automaton from a set of strings

Bojana Kodric
University of Zagreb
Talk
AG 1  
AG Audience
English

Date, Time and Location

Wednesday, 28 March 2012
10:00
30 Minutes
E1 4
024
Saarbrücken

Abstract

Finite-state automata are used for certain aspects of Natural Language Processing. Of particular interest to the NLP community are deterministic, acyclic, finite-state automata, which are called dictionaries.

Traditional construction of dictionaries from strings in form of finite automata involves constructing a trie and then minimizing it. Although such method is correct, the size of the intermediate product - the trie - can be enormous. Two methods that will be presented (for sorted and unsorted input) construct a minimal automaton in a single phase by adding new strings one by one and minimizing the resulting automaton on-the-fly. The advantages of incremental construction are the minimal intermediate memory requirements and dramatical reduction in the total construction time of these minimal dictionaries in comparison to other methods.

Contact

Rob van Stee
--email hidden
passcode not visible
logged in users only

Rob van Stee, 03/26/2012 15:28 -- Created document.