Back to Search
Start Over
A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment
- Source :
- CIÊNCIAVITAE, Scopus-Elsevier, Universidade Nova de Lisboa, Ahmadi, S, McCrae, J, Nimb, S, Khan, F, Monachini, M, Pedersen, B S, Declerck, T, Wissik, T, Bellandi, A, Pisani, I, Troelsgård, T, Olsen, S, Krek, S, Lipp, V, Váradi, T, Simon, L, Gyorffy, A, Tiberius, C, Schoonheim, T, Moshe, Y B, Rudich, M, Abu Ahmad, R, Lonke, D, Kovalenko, K, Langemets, M, Kallas, J, Dereza, O, Fransen, T, Cillessen, D, Lindemann, D, Alonso, M, Salgado, A, Sancho, J L, Ureña-Ruiz, R-J, Zamorano, J P, Simov, K, Osenova, P, Kancheva, Z, Radev, I, Stanković, R, Perdih, A & Gabrovsek, D 2020, A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment . in N Calzolari (ed.), Proceedings of the 12th Language Resources and Evaluation Conference . European Language Resources Association, Marseille, France, pp. 3232-3242 . < http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.395.pdf >
-
Abstract
- Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment iscarried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships suchas broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide rangeof languages and resources and focuses on the more challenging task of linking general-purpose language. We believe that our data willpave the way for further advances in alignment and evaluation of word senses by creating new solutions, particularly those notoriouslyrequiring data such as neural networks. Our resources are publicly available at https://github.com/elexis-eu/MWSA. Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment iscarried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships suchas broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide rangeof languages and resources and focuses on the more challenging task of linking general-purpose language. We believe that our data willpave the way for further advances in alignment and evaluation of word senses by creating new solutions, particularly those notoriouslyrequiring data such as neural networks. Our resources are publicly available at https://github.com/elexis-eu/MWSA.
- Subjects :
- 050101 languages & linguistics
Lexicography
lexical semantic resources
Language resource
05 social sciences
language resource
02 engineering and technology
sense alignment
Lexical semantics resoruces
lexicography
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
0501 psychology and cognitive sciences
Sense alignment
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- CIÊNCIAVITAE, Scopus-Elsevier, Universidade Nova de Lisboa, Ahmadi, S, McCrae, J, Nimb, S, Khan, F, Monachini, M, Pedersen, B S, Declerck, T, Wissik, T, Bellandi, A, Pisani, I, Troelsgård, T, Olsen, S, Krek, S, Lipp, V, Váradi, T, Simon, L, Gyorffy, A, Tiberius, C, Schoonheim, T, Moshe, Y B, Rudich, M, Abu Ahmad, R, Lonke, D, Kovalenko, K, Langemets, M, Kallas, J, Dereza, O, Fransen, T, Cillessen, D, Lindemann, D, Alonso, M, Salgado, A, Sancho, J L, Ureña-Ruiz, R-J, Zamorano, J P, Simov, K, Osenova, P, Kancheva, Z, Radev, I, Stanković, R, Perdih, A & Gabrovsek, D 2020, A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment . in N Calzolari (ed.), Proceedings of the 12th Language Resources and Evaluation Conference . European Language Resources Association, Marseille, France, pp. 3232-3242 . < http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.395.pdf >
- Accession number :
- edsair.doi.dedup.....0d180d31d54235c9e1fdb9e000e84be2