Back to Search
Start Over
Wavelet based sub-band parameters for classification of unaspirated Hindi stop consonants in initial position of CV syllables
- Source :
- International Journal of Speech Technology. 16:323-332
- Publication Year :
- 2012
- Publisher :
- Springer Science and Business Media LLC, 2012.
-
Abstract
- This paper proposes a new feature extraction technique using wavelet based sub-band parameters (WBSP) for classification of unaspirated Hindi stop consonants. The extracted acoustic parameters show marked deviation from the values reported for English and other languages, Hindi having distinguishing manner based features. Since acoustic parameters are difficult to be extracted automatically for speech recognition. Mel Frequency Cepstral Coefficient (MFCC) based features are usually used. MFCC are based on short time Fourier transform (STFT) which assumes the speech signal to be stationary over a short period. This assumption is specifically violated in case of stop consonants. In WBSP, from acoustic study, the features derived from CV syllables have different weighting factors with the middle segment having the maximum. The wavelet transform has been applied to splitting of signal into 8 sub-bands of different bandwidths and the variation of energy in different sub-bands is also taken into account. WBSP gives improved classification scores. The number of filters used (8) for feature extraction in WBSP is less compared to the number (24) used for MFCC. Its classification performance has been compared with four other techniques using linear classifier. Further, Principal components analysis (PCA) has also been applied to reduce dimensionality.
- Subjects :
- Linguistics and Language
Computer science
business.industry
Speech recognition
Feature extraction
Short-time Fourier transform
Wavelet transform
Linear classifier
Pattern recognition
Language and Linguistics
Human-Computer Interaction
Wavelet
Aspirated consonant
Principal component analysis
Computer Vision and Pattern Recognition
Artificial intelligence
Mel-frequency cepstrum
business
Software
Subjects
Details
- ISSN :
- 15728110 and 13812416
- Volume :
- 16
- Database :
- OpenAIRE
- Journal :
- International Journal of Speech Technology
- Accession number :
- edsair.doi...........d483da5684327ff9bc41da997e8e534b
- Full Text :
- https://doi.org/10.1007/s10772-012-9185-x