Back to Search Start Over

Two-scale Auditory Feature Based Non-intrusive Speech Quality Evaluation.

Authors :
Audhkhasi, Kartik
Kumar, Arun
Source :
IETE Journal of Research. Mar/Apr2010, Vol. 56 Issue 2, p111-118. 8p. 3 Diagrams, 1 Chart, 4 Graphs.
Publication Year :
2010

Abstract

This paper proposes a novel two-scale auditory feature based algorithm for non-intrusive evaluation of speech quality. The neuron firing probabilities along the length of the basilar membrane, from an explicit auditory model, are used to extract features from the distorted speech signal. This is in contrast to previous methods, which either use standard vocal tract based features, or incorporate only some aspects of the human auditory perception mechanism. The features are extracted at two scales, namely a global scale spanning all voiced frames in an utterance, and a local scale spanning voiced frames from contiguous voiced segments in the utterance. This is followed by a simple information fusion at the score level using Gaussian Mixture Models (GMMs). The use of an explicit auditory model to extract features is based on the premise that similar processing (in a qualitative sense) happens in human speech perception. In addition, auditory feature extraction at two scales incorporates the effects of both long term and short term distortions on speech quality. The proposed algorithm is shown to perform at least as good as the ITU-T Recommendation P.563. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
03772063
Volume :
56
Issue :
2
Database :
Academic Search Index
Journal :
IETE Journal of Research
Publication Type :
Academic Journal
Accession number :
50507016
Full Text :
https://doi.org/10.4103/0377-2063.63087