Back to Search Start Over

Bulgarian sense-annotated corpus – between the tradition and novelty

Authors :
Svetla Koeva
Source :
Cognitive Studies | Études cognitives, Vol 0, Iss 12, Pp 181-198 (2015)
Publication Year :
2015
Publisher :
Institute of Slavic Studies Polish Academy of Sciences, 2015.

Abstract

Bulgarian sense-annotated corpus – between the tradition and noveltyThe Bulgarian Sense-annotated Corpus (BulSemCor) is compiled according to the general methodology established by the SemCor project. It is a subset of the Brown Corpus of Bulgarian semantically annotated with a corresponding synonym set (synset) in the Bulgarian wordnet. Unlike the bulk of sense-annotated corpora where only (sets of) content words are annotated, in BulSemCor each lexical unit has been assigned a sense. The main contributions achieved in the work on BulSemCor are briefly decides in the presented paper: definition of an annotation schema, compilation of an input corpus, development of a sense-annotated corpus, Bulgarian wordnet enlargement.

Details

ISSN :
23922397
Database :
OpenAIRE
Journal :
Cognitive Studies | Études cognitives
Accession number :
edsair.doi.dedup.....b3d5ebcf06cc41f7acb90b3a15a5c83e
Full Text :
https://doi.org/10.11649/cs.2012.012