Back to Search
Start Over
Predicting ultrasound tongue image from lip images using sequence to sequence learning.
- Source :
-
Journal of the Acoustical Society of America . Jun2020, Vol. 147 Issue 6, pEL441-EL446. 6p. - Publication Year :
- 2020
-
Abstract
- Understanding the dynamic system that produces speech is essential to advancing speech science, and several simultaneous sensory streams can be leveraged to describe the process. As the tongue functional deformation correlates with the lip's shapes of the speaker, this paper aims to explore the association between them. The problem is formulated as a sequence to sequence learning task and a deep neural network is trained using unlabeled lip videos to predict an upcoming ultrasound tongue image sequence. Experimental results show that the machine learning model can predict the tongue's motion with satisfactory performance, which demonstrates that the learned neural network can build the association between two imaging modalities. [ABSTRACT FROM AUTHOR]
- Subjects :
- *ULTRASONIC imaging
*LIPS
*DYNAMICAL systems
*MACHINE learning
*DEEP learning
Subjects
Details
- Language :
- English
- ISSN :
- 00014966
- Volume :
- 147
- Issue :
- 6
- Database :
- Academic Search Index
- Journal :
- Journal of the Acoustical Society of America
- Publication Type :
- Academic Journal
- Accession number :
- 144345609
- Full Text :
- https://doi.org/10.1121/10.0001328