Back to Search Start Over

Predicting ultrasound tongue image from lip images using sequence to sequence learning.

Authors :
Xu, Kele
Zhao, Jianqiao
Zhu, Boqing
Zhao, Chaojie
Source :
Journal of the Acoustical Society of America. Jun2020, Vol. 147 Issue 6, pEL441-EL446. 6p.
Publication Year :
2020

Abstract

Understanding the dynamic system that produces speech is essential to advancing speech science, and several simultaneous sensory streams can be leveraged to describe the process. As the tongue functional deformation correlates with the lip's shapes of the speaker, this paper aims to explore the association between them. The problem is formulated as a sequence to sequence learning task and a deep neural network is trained using unlabeled lip videos to predict an upcoming ultrasound tongue image sequence. Experimental results show that the machine learning model can predict the tongue's motion with satisfactory performance, which demonstrates that the learned neural network can build the association between two imaging modalities. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00014966
Volume :
147
Issue :
6
Database :
Academic Search Index
Journal :
Journal of the Acoustical Society of America
Publication Type :
Academic Journal
Accession number :
144345609
Full Text :
https://doi.org/10.1121/10.0001328