Back to Search Start Over

Lexicographic Study of Synonymy: Clarifying Semantic Similarity between Words

Authors :
Vladimir V. Bochkarev
Valery D. Solovyev
Liliia Khalitova
Gulnara Gimaletdinova
Source :
Computación y Sistemas. 25
Publication Year :
2021
Publisher :
Instituto Politecnico Nacional/Centro de Investigacion en Computacion, 2021.

Abstract

The problem of determining semantic similarity between words affects the understanding of synonymy and creates obstacles to the work oflexico graphers. The study was carried out as a part ofa larger research project on expert assessment of synonymic rows in RuWordNet thesaurus (a WordNet–like thesaurus for the Russian language). The aim of this study is to analyze RuWordNet thesaurus and compare it with classical dictionaries of Russian synonyms. For this purpose, the authors singled out entry words (adjectives N = 68 and verbs N = 117) and their analogues (adjectives N = 558 and verbs N = 1410) from the New Explanatory Dictionary of Russian Synonyms by Yu. Apresyan (NEDS). An analogue is viewed as aword whose meaning essentially intersects with the general meaning of a given synonymic row, although it lacks the needed semantic similarity that could indicate the presence of synonymy or near–synonymy (Apresyan). The quantitative analysis based on the breadth–first search (BFS) algorithm estimated the distance between each pair entry word! analogue. The quantitative method revealed that the analogues described in NEDS correlate with the hyponyms and hyperonyms in RuWordNet which contributes to the study of near–synonymy. The qualitative method (observation and linguistic interpretation) was used to analyze pairs entry word! analogue which showed the longest distance; such words were 52 adjectives and15 verbs. First, the meanings of entry words and analogues were checked against two Russian language thesauri, then, their representation in the tree graph of RuWordNet was traced. The analysis revealed inaccuracies concerning the similarity between certainwords. The recommendations for further improvement of RuWordNet were given.

Details

ISSN :
20079737 and 14055546
Volume :
25
Database :
OpenAIRE
Journal :
Computación y Sistemas
Accession number :
edsair.doi...........6095da8553af62db79d87dcd94afc08b
Full Text :
https://doi.org/10.13053/cys-25-3-4028