This web is for holding topics deemed as old or irrelevant for KitWiki. If you think the topic doesn't belong here, please check that it's named properly (is a WikiWord) and descriptively, contains relevant data, and is put back to a relevant web.


Kotus Swedish-Finnish Parallel Corpus


Research Institute for the Languages of Finland (Kotus) has collected a parallel Finnish-Swedish corpus of business news. This material has been converted to XML with assistance of Master's Innovations Ltd.

Home Page:

Version and Size

Version: The current version was prepared in 2005.

Size: According to preliminary information, the corpus would contain:

  • Finnish: 130386 sentences, 1667304 word tokens all in all.
  • Swedish: 130384 sentences, 2274475 word tokens all in all.

Content and Structure

Directory in the Corpus Server


Directory Listing



Access Rights and Conditions

Copying of this material is not allowed without permission from the distributor (Kotus).

The Group of Unix Users Having Access to the Resource: sktp-a


Making Bibliographical Reference to the Material:

Kotus Finnish-Swedish Parallel Corpus. 2005. Collected by Research Institute for the Languages of Finland.

Other References

Release Notes and Details

The collector has informal licenses for the research material. CSC has started to compile respective written licenses for wider use, but this process is still not completed.

Sending Bug Reports

To be copied to:
To be seen at:
*See also other resources: in KitWiki, in
All users may add their comments to Resource__Comments

When editing, please move cursor to the form below. Do not add anything here.
Topic revision: r8 - 2008-11-07 - HennaRiikkaLaitinen
This site is powered by the TWiki collaboration platform Powered by PerlCopyright © 2008-2019 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback