Back to Search Start Over

Probabilistic Finite-State morphological segmenter for Wixarika (huichol) language1.

Authors :
Mager, Manuel
Carrillo, Diónico
Meza, Ivan
Pinto
Singh
Villavicencio
Mayr-Schlegel
Stamatatos
Source :
Journal of Intelligent & Fuzzy Systems. 2018, Vol. 34 Issue 5, p3081-3087. 7p.
Publication Year :
2018

Abstract

In this work, we present a morphological segmenter for the Mexican indigenous language Wixarika. Segmentation is fundamental for rich morphological languages, a common aspect of the native American languages, to improve other tasks like machine translation, dialogue systems, summarization, etc. On top of the agglutinative nature of the language, the low amount of resources and the lack of an orthographic standard among dialects add to the challenge. Our proposal is based on a probabilistic finite-state approach that exploits regular agglutinative patterns and requires little linguistic knowledge. We show that our approach outperforms unsupervised and semi-supervised methods in a low-resource context. The dataset used in this work was openly released for future work by the community. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10641246
Volume :
34
Issue :
5
Database :
Academic Search Index
Journal :
Journal of Intelligent & Fuzzy Systems
Publication Type :
Academic Journal
Accession number :
129968542
Full Text :
https://doi.org/10.3233/JIFS-169492