Back to Search Start Over

A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment

Authors :
Calzolari, Nicoletta
Ahmadi, Sina
McCrae, John
Nimb, Sanni
Khan, Fahad
Monachini, Monica
Pedersen, Bolette Sandford
Declerck, Thierry
Wissik, Tanja
Bellandi, Andrea
Pisani, Irene
Troelsgård, Thomas
Olsen, Sussi
Krek, Simon
Lipp, Veronika
Váradi, Tamás
Simon, László
Gyorffy, András
Tiberius, Carole
Schoonheim, Tanneke
Moshe, Yifat Ben
Rudich, Maya
Abu Ahmad, Raya
Lonke, Dorielle
Kovalenko, Kira
Langemets, Margit
Kallas, Jelena
Dereza, Oksana
Fransen, Theodorus
Cillessen, David
Lindemann, David
Alonso, Mikel
Salgado, Ana
Sancho, José Luis
Ureña-Ruiz, Rafael-J
Zamorano, Jordi Porta
Simov, Kiril
Osenova, Petya
Kancheva, Zara
Radev, Ivaylo
Stanković, Ranka
Perdih, Andrej
Gabrovsek, Dejan
Calzolari, Nicoletta
Ahmadi, Sina
McCrae, John
Nimb, Sanni
Khan, Fahad
Monachini, Monica
Pedersen, Bolette Sandford
Declerck, Thierry
Wissik, Tanja
Bellandi, Andrea
Pisani, Irene
Troelsgård, Thomas
Olsen, Sussi
Krek, Simon
Lipp, Veronika
Váradi, Tamás
Simon, László
Gyorffy, András
Tiberius, Carole
Schoonheim, Tanneke
Moshe, Yifat Ben
Rudich, Maya
Abu Ahmad, Raya
Lonke, Dorielle
Kovalenko, Kira
Langemets, Margit
Kallas, Jelena
Dereza, Oksana
Fransen, Theodorus
Cillessen, David
Lindemann, David
Alonso, Mikel
Salgado, Ana
Sancho, José Luis
Ureña-Ruiz, Rafael-J
Zamorano, Jordi Porta
Simov, Kiril
Osenova, Petya
Kancheva, Zara
Radev, Ivaylo
Stanković, Ranka
Perdih, Andrej
Gabrovsek, Dejan
Source :
Ahmadi , S , McCrae , J , Nimb , S , Khan , F , Monachini , M , Pedersen , B S , Declerck , T , Wissik , T , Bellandi , A , Pisani , I , Troelsgård , T , Olsen , S , Krek , S , Lipp , V , Váradi , T , Simon , L , Gyorffy , A , Tiberius , C , Schoonheim , T , Moshe , Y B , Rudich , M , Abu Ahmad , R , Lonke , D , Kovalenko , K , Langemets , M , Kallas , J , Dereza , O , Fransen , T , Cillessen , D , Lindemann , D , Alonso , M , Salgado , A , Sancho , J L , Ureña-Ruiz , R-J , Zamorano , J P , Simov , K , Osenova , P , Kancheva , Z , Radev , I , Stanković , R , Perdih , A & Gabrovsek , D 2020 , A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment . in N Calzolari (ed.) , Proceedings of the 12th Language Resources and Evaluation Conference . European Language Resources Association , Marseille, France , pp. 3232-3242 . <
Publication Year :
2020

Abstract

Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment is carried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships such as broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide range of languages and resources and focuses on the more challenging task of linking general-purpose language. We believe that our data will pave the way for further advances in alignment and evaluation of word senses by creating new solutions, particularly those notoriously requiring data such as neural networks. Our resources are publicly available at https://github.com/elexis-eu/MWSA.&lt;br /&gt;Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography. In this paper, we describe our efforts in manually aligning monolingual dictionaries. The alignment is carried out at sense-level for various resources in 15 languages. Moreover, senses are annotated with possible semantic relationships such as broadness, narrowness, relatedness, and equivalence. In comparison to previous datasets for this task, this dataset covers a wide range of languages and resources and focuses on the more challenging task of linking general-purpose language. We believe that our data will pave the way for further advances in alignment and evaluation of word senses by creating new solutions, particularly those notoriously requiring data such as neural networks. Our resources are publicly available at https://github.com/elexis-eu/MWSA.

Details

Database :
OAIster
Journal :
Ahmadi , S , McCrae , J , Nimb , S , Khan , F , Monachini , M , Pedersen , B S , Declerck , T , Wissik , T , Bellandi , A , Pisani , I , Troelsgård , T , Olsen , S , Krek , S , Lipp , V , Váradi , T , Simon , L , Gyorffy , A , Tiberius , C , Schoonheim , T , Moshe , Y B , Rudich , M , Abu Ahmad , R , Lonke , D , Kovalenko , K , Langemets , M , Kallas , J , Dereza , O , Fransen , T , Cillessen , D , Lindemann , D , Alonso , M , Salgado , A , Sancho , J L , Ureña-Ruiz , R-J , Zamorano , J P , Simov , K , Osenova , P , Kancheva , Z , Radev , I , Stanković , R , Perdih , A &amp; Gabrovsek , D 2020 , A Multilingual Evaluation Dataset for Monolingual Word Sense Alignment . in N Calzolari (ed.) , Proceedings of the 12th Language Resources and Evaluation Conference . European Language Resources Association , Marseille, France , pp. 3232-3242 . <
Notes :
application/pdf, English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1296097094
Document Type :
Electronic Resource