PRICAI 2014: Trends in Artificial Intelligence

Authors :
Swakkhar Shatabda
Deborah Richards
Ilya Sinayskiy
W. Kleijn
Ira Puspitasari
Zhendong Niu
Anthony Truskinger
Naoki Fukuta
Abdul Sattar
Salma Jamoussi
Sanparith Marukatat
Ann Nicholson
César A. Astudillo
Mehul Bhatt
M A Hakim Newton
Hamed Hassanzadeh
Joao Leite
Takayuki Ito
Alan Wee-Chung Liew
Donghui Lin
Abhaya Nayak
Doan Nguyen
Jiamou Liu
Paul Compton
Matthias Knorr
Endong Tong
Michael Towsey
Pavel Surynek
Lei Pan
Fenghui Ren
Jakob Suchan
Sanjiang Li
Minjie Zhang
Toru Ishida
Gang Li
Muhammad Tahajjudi Ghifary
Ziheng Wei
Yuki Yamagishi
Mahmood Rashid
Ingrid Zukerman
Jane Hunter
Quan Bai
Zahid Islam
Federico Cerutti
Richi Nayak
Ricardo Gonçalves
Erwin Oh
B. John Oommen
Francesco Petruccione
Alexander Ferrein
Source :
Lecture Notes in Computer Science ISBN: 9783319135595
Publication Year :
2014
Publisher :
Springer International Publishing, 2014.

Abstract

High-quality bilingual dictionaries are rarely available for lower-density language pairs, especially for those that are closely related. Using a third language as a pivot to link two other languages is a well-known solution, and usually requires only two input bilingual dictionaries to automatically induce the new one. This approach, however, produces many incorrect translation pairs because dictionary entries are normally not transitive, owing to polysemy and ambiguous words in the pivot language. Utilizing the complete structures of the input bilingual dictionaries positively influences the result, since dropped meanings can be countered. Moreover, an additional input dictionary may provide more complete information for calculating the semantic distance between word senses, which is key to suppressing wrong sense matches. This paper proposes an extended constraint optimization model for inducing new dictionaries of closely related languages from multiple input dictionaries, and its formalization based on Integer Linear Programming. Evaluations indicated that the proposal not only outperforms the baseline method, but also shows improvements in performance and scalability as more dictionaries are utilized.
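
The non-transitivity problem described in the abstract can be illustrated with a minimal sketch of naive pivot-based induction. This is not the constraint optimization model the paper proposes; the function name `induce_via_pivot` and the toy Spanish-English-French entries are assumptions chosen only for illustration, with English "bank" as the polysemous pivot word.

```python
from collections import defaultdict


def induce_via_pivot(src_to_pivot, pivot_to_tgt):
    """Naive transitive induction (hypothetical helper, not the paper's method).

    Pairs every source word with every target word that shares
    at least one pivot-language translation with it.
    """
    induced = defaultdict(set)
    for src, pivots in src_to_pivot.items():
        for pivot in pivots:
            induced[src].update(pivot_to_tgt.get(pivot, ()))
    return dict(induced)


# Toy dictionaries (illustrative only). The pivot word "bank" is polysemous:
# financial institution vs. river bank.
es_en = {"banco": {"bank"}, "orilla": {"bank", "shore"}}
en_fr = {"bank": {"banque", "rive"}, "shore": {"rive"}}

pairs = induce_via_pivot(es_en, en_fr)
for src in sorted(pairs):
    print(src, "->", sorted(pairs[src]))
```

In this toy case both source words receive both target words, so the induced dictionary contains the wrong pair "orilla" (river bank) with "banque" (financial bank). Suppressing such wrong sense matches is the gap the abstract says the proposed model addresses.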

Details

ISBN :
978-3-319-13559-5
Database :
OpenAIRE
Journal :
Lecture Notes in Computer Science ISBN: 9783319135595
Accession number :
edsair.doi...........9f164fde2e1586c03801bcf51f61f667