An inflected form of a word in the analyser lexicon


The output consists of three types of information: base form, paradigm tags, analysis tags

The base form is the normal lexical lookup form delimited by:

<base> ... </base>

The paradigm tags are optional and they are delimited by:

<par> ... </par>

The analysis tags relate the base form to the inflected form within one paradigm and they are delimited by:

<anl> ... </anl>

Implementation Note

Ideally, this method is implemented as a lexicon transducer run with the hfst-infl software, but if the lexicographer has created a specific analyzer for some other purpose that can be reused, this method may be implemented as a language dependent shell script transforming the existing analyzer output to the above mentioned format.

-- KristerLinden - 24 Apr 2008

Topic revision: r2 - 2008-04-30 - KristerLinden
This site is powered by the TWiki collaboration platform Powered by PerlCopyright © 2008-2018 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback