Back to Search Start Over

An Unsupervised Method for Uncovering Morphological Chains

Authors :
Tommi S. Jaakkola
Karthik Narasimhan
Regina Barzilay
Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science
Narasimhan, Karthik Rajagopal
Barzilay, Regina
Jaakkola, Tommi S.
Source :
Association for Computational Linguistics
Publication Year :
2015
Publisher :
MIT Press - Journals, 2015.

Abstract

Most state-of-the-art systems today produce morphological analysis based only on orthographic patterns. In contrast, we propose a model for unsupervised morphological analysis that integrates orthographic and semantic views of words. We model word formation in terms of morphological chains, from base words to the observed words, breaking the chains into parent-child relations. We use log-linear models with morpheme and word-level features to predict possible parents, including their modifications, for each word. The limited set of candidate parents for each word render contrastive estimation feasible. Our model consistently matches or outperforms five state-of-the-art systems on Arabic, English and Turkish.<br />United States. Intelligence Advanced Research Projects Activity (United States. Dept. of Defense/U.S. Army Research Laboratory Contract W911NF-12-C-0013)

Details

ISSN :
2307387X
Volume :
3
Database :
OpenAIRE
Journal :
Transactions of the Association for Computational Linguistics
Accession number :
edsair.doi.dedup.....526d652f2e4855a4bc7e8fc8ae68b1fd
Full Text :
https://doi.org/10.1162/tacl_a_00130