Corpora of Newspaper Texts

  • Computer corpora of newspaper texts in Finnish, Swedish, and English, with requests and relevance information used in information retrieval evaluation.
  • Size (Finnish, Swedish, and English respectively): about 142.2, 42.5, and 251 million word tokens, or 1088 MB, 281 MB, and 1530 MB.
  • Contact person: Eija Airio (at) uta.fi
  • http://www.info.uta.fi/tutkimus/databases.php

FinClarinCorpusResourceForm
Resource Name: Corpora of Newspaper Texts
Resource Type: Written Corpus
Languages: English, Finnish, Swedish
Languages (other):
Description: Computer corpora of newspaper texts in Finnish, Swedish, and English, with requests and relevance information used in information retrieval evaluation.
Country:
Institute: Department of Information Studies, University of Tampere
Contact Person: Eija Airio (at) uta.fi
Begin year of resource creation:
Finalization year:
Format:
Metadata Link:
Publications:
Reference Link:
Collection Working Languages:
Collection Long term preservation by:
Collection Location:
Collection Content Type:
Collection Format Detailed:
Collection Quality:
Collection Applications:
Collection Project:
Collection Size: Finnish, Swedish, and English respectively: about 142.2, 42.5, and 251 million word tokens, or 1088 MB, 281 MB, and 1530 MB.
Collection Distribution Form:
Collection Access:
Collection Source:
IPR Ethical Reference:
IPR Legal Reference:
IPR License Type:
IPR Description:
IPR Contact Person:

Topic revision: r2 - 2011-11-14 - KimmoKoskenniemi
 