Back to Search Start Over

Subjecthood and annotation: The cases of French and Wolof

Authors :
Bondéelle, Olivier
Kahane, Sylvain
Centre d'Etudes des Relations et Contacts Linguistiques et Littéraires - UR UPJV 4283 (CERCLL)
Université de Picardie Jules Verne (UPJV)
Modèles, Dynamiques, Corpus (MoDyCo)
Université Paris Nanterre (UPN)-Centre National de la Recherche Scientifique (CNRS)
M.C. de Marneffe
M. de Lhoneux
J. Nivre
S. Schuster
Source :
COLING 2020Fourth Workshop on Universal Dependencies (UDW 2020)Proceedings of the WorkshopDecember 13, 2020 Barcelona, Spain, M.C. de Marneffe, M. de Lhoneux, J. Nivre, & S. Schuster. COLING 2020 Fourth Workshop on Universal Dependencies (UDW 2020) Proceedings of the Workshop December 13, 2020 Barcelona, Spain (Online), International Conference on Computational Linguistics (ICCL) (2020), Association for Computational Linguistics, 2020, ACL Anthology, 978-1-952148-48-4
Publication Year :
2020
Publisher :
HAL CCSD, 2020.

Abstract

International audience; This article considers the annotation of subjects in UD treebanks. The identification of the subjectposes a particular problem in Wolof, due to pronominal indices whose status as a pronoun or apronominal affix is uncertain. In the UD treebank available for Wolof (Dione, 2019), these have beenannotated depending on the construction either as true subjects, or as morphosyntactic features agreeingwith the verb. The study of this corpus of 40 000 words allows us to show that the problem is indeeddifficult to solve, especially since Wolof has a rich system of auxiliaries and several basic constructionswith different properties. Before addressing the case of Wolof, we will present the simpler, but partlycomparable, case of French, where subject clitics also tend to behave like affixes, and subjecthood canmove from the preverbal to the detached position. We will also make a several annotationrecommendations that would avoid overwriting information regarding subjecthood.

Details

Language :
English
ISBN :
978-1-952148-48-4
ISBNs :
9781952148484
Database :
OpenAIRE
Journal :
COLING 2020Fourth Workshop on Universal Dependencies (UDW 2020)Proceedings of the WorkshopDecember 13, 2020 Barcelona, Spain, M.C. de Marneffe, M. de Lhoneux, J. Nivre, & S. Schuster. COLING 2020 Fourth Workshop on Universal Dependencies (UDW 2020) Proceedings of the Workshop December 13, 2020 Barcelona, Spain (Online), International Conference on Computational Linguistics (ICCL) (2020), Association for Computational Linguistics, 2020, ACL Anthology, 978-1-952148-48-4
Accession number :
edsair.dedup.wf.001..47fbc06fbac8ea5a7df8e69da0774c1f