HANCO

Department: Department of Slavonic and Baltic Languages and Literatures
Contact person: Mihail Kopotev
Email: mihail.kopotev@helsinki.fi

1. Linguistic research resource

l. Official or identificatory name and acronym: Helsinki annotated corpus of Russian language (HANCO)
m. Short description of content: Morphologically and syntactically annotated corpus of the modern Russian language.
n. Originality status: The corpus has been under development since 2001 at the Department of Slavonic and Baltic Languages and Literatures, University of Helsinki.
o. Description of size and extent: 100 000 words

Proportion automatically/manually annotated and manually verified, with respect to:
morphology: 80 % automatically annotated, 20 % manually annotated; 80 % manually verified
syntax: 60 % automatically annotated, 40 % manually annotated; 80 % manually verified
p. Storage format: Electronic
MySQL database
Access database
q. (Estimated) time invested in the collection and processing of the resource: 70 person-months
r. Contact person(s) and their contact information (e-mail and telephone); may be the same for all points (a), (b) and (c).
a) person who in practice administers the resource and grants (possibly required) usage permits
admin@ling.helsinki.fi.
b) person who has physical possession of the contracts concerning the resource, by which the resource has been acquired for use at the department
Mikhail Kopotev (mihail.kopotev@helsinki.fi; 191 22 028)
c) person(s) who originally contracted, acquired, collected, compiled and/or annotated the resource, who thus hold copyright to the material, and whose permission is (possibly) required to access the resource.
Mikhail Kopotev (mihail.kopotev@helsinki.fi; 191 22 028)
Arto Mustajoki (arto.mustajoki@helsinki.fi)
s. (Main) references to published articles or other written works describing the resource itself or research based on its use.
A. Kopotev, M. The Helsinki Annotated Corpus of Russian Texts HANCO: a tool for teaching and learning of Russian
B. Мустайоки, А., Копотев, М.В. ‘Принципы создания Хельсинкского аннотированного корпуса русских текстов (ХАНКО) в сети Интернет (Principles of the Creation of the Helsinki Annotated Corpus HANCO)’ Научно-техническая информация. Сер. 2: Информационные системы и процессы. № 6: Корпусная лингвистика в России, 2003, c. 33-37
C. Копотев, М.В. ‘Несмотря на «потому что», или Многокомпонентные единицы в аннотированном корпусе русских текстов (In Spite of potomu čto, Or Compounds in the Helsinki Annotated Corpus of Russian Texts HANCO)’ Компьютерная лингвистика и интеллектуальные технологии. Труды международной конференции Диалог-2004, Москва: Наука, 2004, с. 335–339.
D. Копотев, М.В. ‘Принципы синтаксической разметки Хельсинкского аннотированного корпуса русских текстов ХАНКО (Principles of the syntactic annotation in the Helsinki Annotated Corpus HANCO)’ Компьютерная лингвистика и интеллектуальные технологии. Труды международной конференции Диалог-2006, Москва: изд-во РГГУ, 2006, с. 280–284.
t. Link(s) to more extensive/thorough descriptions of the resource on the Internet (which may be in any language): http://www.slav.helsinki.fi/hanco/index_en.html
http://www.helsinki.fi/~kopotev/hanco.pdf
u. Physical location of resource (server and directory path or Internet address, or room/person in the case of non-electronic materials): /web/ling/projects/hanco at angarak
http://www.slav.helsinki.fi/hanco/index_en.html
v. Miscellaneous other notes  

FinClarinCorpusResourceForm
Resource Name Helsinki annotated corpus of Russian language HANCO
Resource Type Written Corpus
Languages Russian
Languages (other)

Description Morphologically and syntactically annotated corpus of the modern Russian language.
Country

Institute The Department of Slavonic and Baltic Languages and Literatures, University of Helsinki
Contact Person admin@ling.helsinki.fi, Mikhail Kopotev (mihail.kopotev@helsinki.fi), Arto Mustajoki (arto.mustajoki@helsinki.fi)
Begin year of resource creation 2001
Finalization year

Format MySQL database, Access database
Metadata Link

Publications

Reference Link

Collection Working Languages

Collection Long term preservation by

Collection Location

Collection Content Type

Collection Format Detailed

Collection Quality

Collection Applications

Collection Project

Collection Size 100 000 words
Collection Distribution Form

Collection Access

Collection Source

IPR Ethical Reference

IPR Legal Reference

IPR License Type

IPR Description

IPR Contact Person

Topic revision: r3 - 2011-11-14 - KimmoKoskenniemi
 