This web is for holding topics deemed as old or irrelevant for KitWiki. If you think the topic doesn't belong here, please check that it's named properly (is a WikiWord) and descriptively, contains relevant data, and is put back to a relevant web.

textmorfo


TextMorfo, a shallow parser for Finnish syntax

Description

The Textmorfo program of Kielikone Oy parses and disambiguates Finnish text. The program contains

  • the morphological parser MORFO of Kielikone Oy
  • a dependency parser which disambiguates the morphological analyses produced by MORFO.

References:
  • Jšppinen, H.; Lehtola,A.; Nelimarkka, E.; and Ylilammi, N., "Knowledge Engineering Approach to Morphological Analysis", First Conference of the European Chapter of ACL, Pisa, Italy, 1983, pp. 49-51.
  • Harri Jšppinen and Matti Ylilammi, "Associative Model of Morphological Analysis: An Empirical Inquiry", Computational Linguistics, Vol. 12, No 4, 1986, pp. 257-272.
  • Valkonen, K., Jšppinen, H., and Lehtola, A., "Blackboard-based dependency parsing". In Proceedings of IJCAI'87, Tenth International Joint Conference on Artificial Intelligence, 1987, pp. 700-702.

Version and Copyright Information

version: TextMorfo v.2.0

copyright: (C) Kielikone Oy 2002

Usage

input:
[corpus.csc.fi]$ echo "Luennon lopusta oli kulunut tunti" | textmorfo

output:
----------------------------------------------
TextMorfo v.2.0 -- (C) Kielikone Oy 2002
----------------------------------------------
BaseForm=kulua tunti,SurfaceForm=tunti,Category=Adjective-Noun,Case=,Number=,BasicPart=tunti,Component=,Position=5,Tense=,Voice=,Modal=,PersonN=,PersonP=,Clitic1=,Clitic2=,Comparison= 
BaseForm=loppu,SurfaceForm=lopusta,Category=Noun,Case=El,Number=SG,BasicPart=loppu,Component=,Position=2,Tense=,Voice=,Modal=,PersonN=,PersonP=,Clitic1=,Clitic2=,Comparison= 
BaseForm=kulua,SurfaceForm=kulunut,Category=Verb,Case=Nom,Number=SG,BasicPart=kulua,Component=,Position=4,Tense=,Voice=Act,Modal=IIpartic,PersonN=,PersonP=,Clitic1=,Clitic2=,Comparison= 
BaseForm=tunti,SurfaceForm=tunti,Category=Noun,Case=Nom,Number=SG,BasicPart=tunti,Component=,Position=5,Tense=,Voice=,Modal=,PersonN=,PersonP=,Clitic1=,Clitic2=,Comparison= 
BaseForm=Luento,SurfaceForm=Luennon,Category=Noun,Case=Gen,Number=SG,BasicPart=Luento,Component=,Position=1,Tense=,Voice=,Modal=,PersonN=,PersonP=,Clitic1=,Clitic2=,Comparison= 
BaseForm=olla,SurfaceForm=oli,Category=Verb,Case=,Number=,BasicPart=olla,Component=,Position=3,Tense=Imp,Voice=Act,Modal=Ind,PersonN=S,PersonP=3P,Clitic1=,Clitic2=,Comparison= 
-----------------------------------------------------------------------

Remarks:
  • In the output, ŚšŲŇń÷ are replaced with characters {|}[\]
  • the word order is not preserved in the output
  • each word occurs on a line of its own

Help, Manuals and Documentation

help commands:

further information:

Bugs

License Text

Other Information

Field of science: Linguistics

Available:
corpus

License: LicenseTypeACSCPaysTheCopy

To be copied to: https://wwwk.csc.fi/english/research/software/textmorfo
To be seen at: http://www.csc.fi/english/research/software/textmorfo
See also: KitWiki.SuomenKielipankki:Dev:Linguistics_Software, Old.ToolResources
The users may add their own comments to: ToolResource_textmorfo_Comments

When editing, please move cursor to the form below. Do not add anything here.
Topic revision: r5 - 2008-11-21 - HennaRiikkaLaitinen
 
This site is powered by the TWiki collaboration platform Powered by PerlCopyright © 2008-2018 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback