How to compare TTS systems: a new subjective evaluation methodology focused on differences
- Source :
- Interspeech, Sep 2015, Dresden, Germany
- Publication Year :
- 2015
- Publisher :
- ISCA, 2015.
Abstract
- Subjective evaluation is a crucial problem in the speech processing community, and especially in speech synthesis, whatever system is used. When assessing the effectiveness of a proposed method, researchers usually conduct subjective evaluations on a small set of randomly chosen samples, from the same domain, taken from a baseline system and the proposed one. With random selection, many of the evaluated samples statistically show almost no difference, and the global measure is smoothed, which may lead to the improvement being judged not significant. To solve this methodological flaw, we propose to compare speech synthesis systems on thousands of generated samples from various domains and to focus subjective evaluations on the most relevant ones by computing a normalized alignment cost between sample pairs. This process has been successfully applied both in the HTS statistical framework and in the corpus-based approach. We have conducted two perceptual experiments by generating more than 27,000 samples for each system under comparison. A comparison between tests involving the most different samples and randomly chosen samples shows clearly that the proposed approach
- Subjects :
- [INFO.INFO-AI] Computer Science [cs]/Artificial Intelligence [cs.AI]
[INFO.INFO-TS] Computer Science [cs]/Signal and Image Processing
subjective evaluation
Computer science
Process (engineering)
Speech recognition
Speech synthesis
Sample (statistics)
02 engineering and technology
computer.software_genre
Machine learning
Field (computer science)
Domain (software engineering)
030507 speech-language pathology & audiology
03 medical and health sciences
0202 electrical engineering, electronic engineering, information engineering
Measure (data warehouse)
business.industry
020206 networking & telecommunications
Speech processing
[INFO.INFO-SD] Computer Science [cs]/Sound [cs.SD]
Artificial intelligence
[INFO.INFO-HC] Computer Science [cs]/Human-Computer Interaction [cs.HC]
0305 other medical science
Focus (optics)
business
computer
Details
- Database :
- OpenAIRE
- Journal :
- Interspeech 2015
- Accession number :
- edsair.doi.dedup.....aa307fd3704b05776cc1404450ddb954