201. Research on Speaker Recognition Based on Multifractal Spectrum Feature
- Author
-
Jinming Wang, Xiongwei Zhang, and Yuhuan Zhou
- Subjects
Wavelet transform modulus maxima method ,business.industry ,Speech recognition ,Feature extraction ,Wavelet transform ,Pattern recognition ,Linear predictive coding ,Speech processing ,Speaker recognition ,Feature (machine learning) ,Mel-frequency cepstrum ,Artificial intelligence ,business ,Mathematics - Abstract
In this paper, a new nonlinear feature extraction method based on the WTMM (wavelet transform modulus-maxima method) is proposed, which can greatly facilitate the extraction of the multifractal spectrum feature (MSF) from speech signals. The MSF combined with traditional linear features can obviously improve the performance of speaker recognition system. Experiment results show that 6-dimensional MSF combined with LPC make recognition accuracy increase 6.4 percentage points, and 6-dimensional MSF combined with MFCC, LPC make recognition accuracy increase 1.6 percentage points and reach 98.8% in short speech (2 seconds) speaker recognition.
- Published
- 2010