Back to Search Start Over

Applying Support Vector Machines to POS tagging of the Ainu Language

Authors :
Yoshio Momouchi
Karol Nowakowski
Michal Ptaszynski
Fumito Masui
Source :
Proceedings of the Workshop on Computational Methods for Endangered Languages. 2
Publication Year :
2019
Publisher :
University of Colorado at Boulder, 2019.

Abstract

We describe our attempt to apply a state-of-the-art sequential tagger – SVMTool – in the task of automatic part-of-speech annotation of the Ainu language, a critically endangered language isolate spoken by the native inhabitants of northern Japan. Our experiments indicated that it performs better than the custom system proposed in previous research (POST-AL), especially when applied to out-of-domain data. The biggest advantage of the model trained using SVMTool over the POST-AL tagger is its ability to guess part-of-speech tags for OoV words, with the accuracy of up to 63%.

Details

Volume :
2
Database :
OpenAIRE
Journal :
Proceedings of the Workshop on Computational Methods for Endangered Languages
Accession number :
edsair.doi...........f309fa34958d7c106bc3ba4ebd69d6fa
Full Text :
https://doi.org/10.33011/computel.v2i.449