Back to Search Start Over

PARSEME corpus release 1.3

Authors :
Savary, Agata
Ben Khelil, Cherifa
Ramisch, Carlos
Giouli, Voula
Barbu Mititelu, Verginica
Hadj Mohamed, Najet
Krstev, Cvetana
Liebeskind, Chaya
Xu, Hongzhi
Stymne, Sara
Güngör, Tunga
Pickard, Thomas
Guillaume, Bruno
Bejček, Eduard
Bhatia, Archna
Candito, Marie
Gantar, Polona
Iñurrieta, Uxoa
Gatt, Albert
Kovalevskaite, Jolanta
Lichte, Timm
Ljubešić, Nikola
Monti, Johanna
Parra Escartin, Carla
Shamsfard, Mehrnoush
Stoyanova, Ivelina
Vincze, Veronika
Walsh, Abigail
Savary, Agata
Ben Khelil, Cherifa
Ramisch, Carlos
Giouli, Voula
Barbu Mititelu, Verginica
Hadj Mohamed, Najet
Krstev, Cvetana
Liebeskind, Chaya
Xu, Hongzhi
Stymne, Sara
Güngör, Tunga
Pickard, Thomas
Guillaume, Bruno
Bejček, Eduard
Bhatia, Archna
Candito, Marie
Gantar, Polona
Iñurrieta, Uxoa
Gatt, Albert
Kovalevskaite, Jolanta
Lichte, Timm
Ljubešić, Nikola
Monti, Johanna
Parra Escartin, Carla
Shamsfard, Mehrnoush
Stoyanova, Ivelina
Vincze, Veronika
Walsh, Abigail
Source :
Bhatia, Archna , Evang, Kilian , Garcia, Marcos , Giouli, Voula , Han, Lifeng , Taslimipoor, Shiva (Ed.), 19th Workshop on Multiword Expressions, MWE 2023 - Proceedings, p.24-35. Dubrovnik, Croatia: Association for Computational Linguistics.
Publication Year :
2023

Abstract

We present version 1.3 of the PARSEME multilingual corpus annotated with verbal multiword expressions. Since the previous version, new languages have joined the undertaking of creating such a resource, some of the already existing corpora have been enriched with new annotated texts, while others have been enhanced in various ways. The PARSEME multilingual corpus represents 26 languages now. All monolingual corpora therein use Universal Dependencies v.2 tagset. They are (re-)split observing the PARSEME v.1.2 standard, which puts impact on unseen VMWEs. With the current iteration, the corpus release process has been detached from shared tasks; instead, a process for continuous improvement and systematic releases has been introduced.

Details

Database :
OAIster
Journal :
Bhatia, Archna , Evang, Kilian , Garcia, Marcos , Giouli, Voula , Han, Lifeng , Taslimipoor, Shiva (Ed.), 19th Workshop on Multiword Expressions, MWE 2023 - Proceedings, p.24-35. Dubrovnik, Croatia: Association for Computational Linguistics.
Notes :
English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1445830677
Document Type :
Electronic Resource