Back to Search
Start Over
Bulgarian sense-annotated corpus – between the tradition and novelty
- Source :
- Cognitive Studies | Études cognitives, Vol 0, Iss 12, Pp 181-198 (2015)
- Publication Year :
- 2015
- Publisher :
- Institute of Slavic Studies Polish Academy of Sciences, 2015.
-
Abstract
- Bulgarian sense-annotated corpus – between the tradition and noveltyThe Bulgarian Sense-annotated Corpus (BulSemCor) is compiled according to the general methodology established by the SemCor project. It is a subset of the Brown Corpus of Bulgarian semantically annotated with a corresponding synonym set (synset) in the Bulgarian wordnet. Unlike the bulk of sense-annotated corpora where only (sets of) content words are annotated, in BulSemCor each lexical unit has been assigned a sense. The main contributions achieved in the work on BulSemCor are briefly decides in the presented paper: definition of an annotation schema, compilation of an input corpus, development of a sense-annotated corpus, Bulgarian wordnet enlargement.
- Subjects :
- Text corpus
Linguistics and Language
Computer Networks and Communications
Computer science
Brown Corpus
InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL
WordNet
lcsh:P325-325.5
computer.software_genre
ComputingMethodologies_ARTIFICIALINTELLIGENCE
Lexical item
Corpus linguistics
Synonym (database)
annotation principles
Bulgarian
corpus studies
business.industry
Communication
Novelty
lcsh:P98-98.5
corpus annotation
lcsh:Lexicography
Linguistics
language.human_language
ComputingMethodologies_PATTERNRECOGNITION
ComputingMethodologies_DOCUMENTANDTEXTPROCESSING
language
Artificial intelligence
lcsh:Computational linguistics. Natural language processing
business
lcsh:P327-327.5
computer
Natural language processing
lcsh:Semantics
Subjects
Details
- ISSN :
- 23922397
- Database :
- OpenAIRE
- Journal :
- Cognitive Studies | Études cognitives
- Accession number :
- edsair.doi.dedup.....b3d5ebcf06cc41f7acb90b3a15a5c83e
- Full Text :
- https://doi.org/10.11649/cs.2012.012