1. Polyglot Speech Synthesis: A Review
- Author
- Bidisha Sharma and S. R. Mahadeva Prasanna
- Subjects
business.industry, Computer science, Process (engineering), Speech recognition, 020206 networking & telecommunications, Speech synthesis, Polyglot, 02 engineering and technology, computer.software_genre, Focus (linguistics), 030507 speech-language pathology & audiology, 03 medical and health sciences, Text to speech synthesis, ComputingMethodologies_PATTERNRECOGNITION, 0202 electrical engineering, electronic engineering, information engineering, Natural (music), Artificial intelligence, Electrical and Electronic Engineering, 0305 other medical science, business, computer, Natural language processing
- Abstract
The term polyglot speech synthesis refers to the process of producing speech in multiple languages, in a single speaker's voice, from a single text-to-speech synthesis (TTS) system. This report reviews existing efforts in the literature to develop a polyglot TTS system. The methods described in this review mainly focus on developing a natural, intelligible, and cost-effective TTS system for multilingual text input. Since multilingual text is becoming very common across TTS applications, recent work has focused on developing cost-effective polyglot TTS systems rather than conventional monolingual ones. This review also discusses the pros and cons of the different methods and outlines possible directions for overcoming their limitations.
- Published
- 2016