1. Improvement of speaker recognition by combining residual and prosodic features with acoustic features
- Author
-
Hsiao-Chuan Wang and Shi-Han Chen
- Subjects
business.industry ,Computer science ,Speech recognition ,Speech coding ,Vector quantization ,Pattern recognition ,Speech processing ,Linear discriminant analysis ,Speaker recognition ,Residual ,ComputingMethodologies_PATTERNRECOGNITION ,Codec ,Artificial intelligence ,business ,Pitch contour - Abstract
When a speech signal is encoded in some low bit-rate coding formats, it becomes more difficult to distinguish speaker identities. The paper investigates the codec effect on acoustic and prosodic features. A new representation of prosodic features based on the piecewise fitting of the pitch contour is introduced. A method for including residual features based on the LDA (linear discriminant analysis) algorithm is suggested. By combining prosodic features with acoustic features, we can improve the performance of a speaker recognition system. A series of experiments is performed with coded speech affected by G.729A and GSM codec processes to demonstrate the effectiveness of our proposed method.
- Published
- 2004
- Full Text
- View/download PDF