Building an Endangered Language Resource in the Classroom: Universal Dependencies for Kakataibo

Authors :
Zariquiey, Roberto
Alvarado, Claudia
Echevarria, Ximena
Gomez, Luisa
Gonzales, Rosa
Illescas, Mariana
Oporto, Sabina
Blum, Frederic
Oncevay, Arturo
Vera, Javier
Publication Year :
2022

Abstract

In this paper, we launch a new Universal Dependencies treebank for an endangered language from Amazonia: Kakataibo, a Panoan language spoken in Peru. We first discuss the collaborative methodology implemented, which proved effective for creating a treebank in the context of a Computational Linguistics course for undergraduates. Then, we describe the general details of the treebank and the language-specific considerations implemented for the proposed annotation. Finally, we conduct experiments on part-of-speech tagging and syntactic dependency parsing. We focus on monolingual and transfer learning settings, where we study the impact of a treebank for Shipibo-Konibo, another Panoan language.

Comment: Accepted to LREC 2022

Details

Database :
OAIster
Publication Type :
Electronic Resource
Accession number :
edsoai.on1333779534
Document Type :
Electronic Resource