1. Minimum trajectory error training for deep neural networks, combined with stacked bottleneck features
- Authors
- Zhizheng Wu and Simon King
- Subjects
Mean squared error, Estimation theory, Computer science, Speech recognition, Acoustic model, Computational linguistics and speech processing, Context (language use), Speech synthesis, Pattern recognition, Bottleneck, Sound, Artificial intelligence, Parametric statistics
- Abstract
Recently, Deep Neural Networks (DNNs) have shown promise as an acoustic model for statistical parametric speech synthesis. Their ability to learn complex mappings from linguistic features to acoustic features has significantly advanced the naturalness of synthesised speech. However, because DNN parameter estimation methods typically attempt to minimise the mean squared error of each individual frame in the training data, the dynamic and continuous nature of speech parameters is neglected. In this paper, we propose a training criterion that minimises speech parameter trajectory errors, and so takes dynamic constraints from a wide acoustic context into account during training. We combine this novel training criterion with our previously proposed stacked bottleneck features, which provide wide linguistic context. Both objective and subjective evaluation results confirm the effectiveness of the proposed training criterion in improving model accuracy and the naturalness of synthesised speech.
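The trajectory-error idea in the abstract can be sketched numerically: instead of scoring each frame's static+dynamic prediction independently, generate the most likely static trajectory from the predictions (as in maximum-likelihood parameter generation, MLPG) and measure the error of that trajectory against the natural one. The sketch below is a minimal, hypothetical NumPy illustration assuming a single-dimensional feature stream, unit variances, and a simple first-order delta window; the function names and the 0.5*(c[t+1]-c[t-1]) delta definition are my assumptions, not the paper's exact configuration.

```python
import numpy as np

def delta_window_matrix(T):
    """Build the window matrix W that maps a length-T static trajectory c
    to 2T stacked observations [c_t; delta_c_t], with
    delta_c_t = 0.5*(c[t+1] - c[t-1]) and zero padding at the ends."""
    W = np.zeros((2 * T, T))
    for t in range(T):
        W[2 * t, t] = 1.0          # static coefficient
        if t + 1 < T:
            W[2 * t + 1, t + 1] += 0.5   # delta: forward neighbour
        if t - 1 >= 0:
            W[2 * t + 1, t - 1] -= 0.5   # delta: backward neighbour
    return W

def mlpg_trajectory(o, W):
    """Most likely static trajectory given stacked static+delta
    predictions o, assuming unit variances: c = (W'W)^-1 W' o."""
    return np.linalg.solve(W.T @ W, W.T @ o)

def trajectory_error(o_pred, c_ref, W):
    """Trajectory-error criterion: mean squared error between the
    trajectory generated from the predictions and the natural
    static trajectory c_ref."""
    c_hat = mlpg_trajectory(o_pred, W)
    return float(np.mean((c_hat - c_ref) ** 2))
```

Because the generation step `(W'W)^-1 W'` is a linear map, this criterion remains differentiable with respect to the network outputs, which is what makes it usable as a DNN training objective rather than only as a post-hoc generation step.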
- Published
- 2015