Some Word Lists Generated from the Text Bank of Finnish

How They Were Created

See the topic Helpdesk:MakingWordLists. Due to the incoherencies in the encoding schemes of the corpora, this method has many problems, but if you are not interested in exact counts and 100 percent coverage of the material, the results may be usable.

Availability

The word lists are available to the users of the Text Bank of Finnish, that is a resource in the Language Bank of Finland at CSC.

Location

The word lists are at corpus.csc.fi at the directory /fs/kielipankki/words/sktp/.

Group Bytes Created Filename
sktp-a 38268247 May 18 16:49 sanat.aca
sktp-a 24139423 May 18 18:15 sanat.aca.w
sktp-a 38268246 May 18 17:12 sanat.aca.x
sktp-a 575399 May 18 17:31 sanat.aca.33000.txt
sktp-a 311399 May 18 17:31 sanat.aca.33000.w
sktp-a 96771260 May 18 17:23 sanat.all
sktp-a 96770702 May 18 17:31 sanat.all.x
sktp-a 580456 May 18 17:31 sanat.all.33000.txt
sktp-a 316456 May 18 17:31 sanat.all.33000.w
sktp-b 81188643 May 18 15:16 sanat.biz
sktp-b 52229875 May 18 18:02 sanat.biz.w
sktp-b 81188086 May 18 16:32 sanat.biz.x
sktp-b 584138 May 18 17:31 sanat.biz.33000.txt
sktp-b 320138 May 18 17:32 sanat.biz.33000.w
Topic revision: r7 - 2006-08-31 - AnssiYliJyra
 
This site is powered by the TWiki collaboration platform Powered by PerlCopyright © 2008-2017 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback