Back to Search Start Over

Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis

Authors :
Dines, John
Liang, Hui
Saheer, Lakshmi
Gibson, Matthew
Byrne, William
Oura, Keiichiro
Tokuda, Keiichi
Yamagishi, Junichi
King, Simon
Wester, Mirjam
Hirsimäki, Teemu
Karhila, Reima
Kurimo, Mikko
Source :
Computer Speech & Language. Feb2013, Vol. 27 Issue 2, p420-437. 18p.
Publication Year :
2013

Abstract

Abstract: In this paper we present results of unsupervised cross-lingual speaker adaptation applied to text-to-speech synthesis. The application of our research is the personalisation of speech-to-speech translation in which we employ a HMM statistical framework for both speech recognition and synthesis. This framework provides a logical mechanism to adapt synthesised speech output to the voice of the user by way of speech recognition. In this work we present results of several different unsupervised and cross-lingual adaptation approaches as well as an end-to-end speaker adaptive speech-to-speech translation system. Our experiments show that we can successfully apply speaker adaptation in both unsupervised and cross-lingual scenarios and our proposed algorithms seem to generalise well for several language pairs. We also discuss important future directions including the need for better evaluation metrics. [Copyright &y& Elsevier]

Details

Language :
English
ISSN :
08852308
Volume :
27
Issue :
2
Database :
Academic Search Index
Journal :
Computer Speech & Language
Publication Type :
Academic Journal
Accession number :
83576290
Full Text :
https://doi.org/10.1016/j.csl.2011.08.003