Back to Search Start Over

A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment

Authors :
Ahmadi, S.
Mccrae, J. P.
Nimb, S.
Khan, F.
Monachini, M.
Pedersen, B. S.
Declerck, T.
Wissik, T.
Bellandi, A.
Pisani, I.
Troelsgård, T.
Olsen, S.
Krek, S.
Lipp, V.
Váradi, T.
Simon, L.
Gyorffy, A.
Tiberius, C.
Schoonheim, T.
Moshe, Y. B.
Rudich, M.
Ahmad, R. A.
Lonke, D.
Kovalenko, K.
Langemets, M.
Kallas, J.
Dereza, O.
Fransen, T.
Cillessen, D.
Lindemann, D.
Alonso, M.
Ana Salgado
Sancho, J. L.
Ureña-Ruiz, R. J.
Zamorano, J. P.
Simov, K.
Osenova, P.
Kancheva, Z.
Radev, I.
Stankovi, R.
Perdih, A.
Gabrovšek, D.
Horizon 2020
Bulgarian National Interdisciplinary Research e-Infrastructure for Resources and Technologies
Science Foundation Ireland
Irish Research Council
Centro de Linguística da UNL (CLUNL)
Calzolari, Nicoletta
Source :
CIÊNCIAVITAE, Scopus-Elsevier, Universidade Nova de Lisboa, Ahmadi, S, McCrae, J, Nimb, S, Khan, F, Monachini, M, Pedersen, B S, Declerck, T, Wissik, T, Bellandi, A, Pisani, I, Troelsgård, T, Olsen, S, Krek, S, Lipp, V, Váradi, T, Simon, L, Gyorffy, A, Tiberius, C, Schoonheim, T, Moshe, Y B, Rudich, M, Abu Ahmad, R, Lonke, D, Kovalenko, K, Langemets, M, Kallas, J, Dereza, O, Fransen, T, Cillessen, D, Lindemann, D, Alonso, M, Salgado, A, Sancho, J L, Ureña-Ruiz, R-J, Zamorano, J P, Simov, K, Osenova, P, Kancheva, Z, Radev, I, Stanković, R, Perdih, A & Gabrovsek, D 2020, A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment . in N Calzolari (ed.), Proceedings of the 12th Language Resources and Evaluation Conference . European Language Resources Association, Marseille, France, pp. 3232-3242 . < http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.395.pdf >

Abstract

Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment iscarried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships suchas broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide rangeof languages and resources and focuses on the more challenging task of linking general-purpose language. We believe that our data willpave the way for further advances in alignment and evaluation of word senses by creating new solutions, particularly those notoriouslyrequiring data such as neural networks. Our resources are publicly available at https://github.com/elexis-eu/MWSA. Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment iscarried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships suchas broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide rangeof languages and resources and focuses on the more challenging task of linking general-purpose language. We believe that our data willpave the way for further advances in alignment and evaluation of word senses by creating new solutions, particularly those notoriouslyrequiring data such as neural networks. Our resources are publicly available at https://github.com/elexis-eu/MWSA.

Details

Database :
OpenAIRE
Journal :
CI&#202;NCIAVITAE, Scopus-Elsevier, Universidade Nova de Lisboa, Ahmadi, S, McCrae, J, Nimb, S, Khan, F, Monachini, M, Pedersen, B S, Declerck, T, Wissik, T, Bellandi, A, Pisani, I, Troelsg&#229;rd, T, Olsen, S, Krek, S, Lipp, V, V&#225;radi, T, Simon, L, Gyorffy, A, Tiberius, C, Schoonheim, T, Moshe, Y B, Rudich, M, Abu Ahmad, R, Lonke, D, Kovalenko, K, Langemets, M, Kallas, J, Dereza, O, Fransen, T, Cillessen, D, Lindemann, D, Alonso, M, Salgado, A, Sancho, J L, Ure&#241;a-Ruiz, R-J, Zamorano, J P, Simov, K, Osenova, P, Kancheva, Z, Radev, I, Stanković, R, Perdih, A &amp; Gabrovsek, D 2020, A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment . in N Calzolari (ed.), Proceedings of the 12th Language Resources and Evaluation Conference . European Language Resources Association, Marseille, France, pp. 3232-3242 . < http://www.lrec-conf.org/proceedings/lrec2020/pdf/2020.lrec-1.395.pdf >
Accession number :
edsair.doi.dedup.....0d180d31d54235c9e1fdb9e000e84be2