Back to Search
Start Over
Multidirectional Regression (MDR)-Based Features for Automatic Voice Disorder Detection
- Source :
- Journal of Voice. 26:817.e19-817.e27
- Publication Year :
- 2012
- Publisher :
- Elsevier BV, 2012.
-
Abstract
- Summary Background and Objective Objective assessment of voice pathology has a growing interest nowadays. Automatic speech/speaker recognition (ASR) systems are commonly deployed in voice pathology detection. The aim of this work was to develop a novel feature extraction method for ASR that incorporates distributions of voiced and unvoiced parts, and voice onset and offset characteristics in a time-frequency domain to detect voice pathology. Materials and Methods The speech samples of 70 dysphonic patients with six different types of voice disorders and 50 normal subjects were analyzed. The Arabic spoken digits (1–10) were taken as an input. The proposed feature extraction method was embedded into the ASR system with Gaussian mixture model (GMM) classifier to detect voice disorder. Results Accuracy of 97.48% was obtained in text independent (all digits' training) case, and over 99% accuracy was obtained in text dependent (separate digit's training) case. The proposed method outperformed the conventional Mel frequency cepstral coefficient (MFCC) features. Conclusion The results of this study revealed that incorporating voice onset and offset information leads to efficient automatic voice disordered detection.
- Subjects :
- Adult
Male
Sound Spectrography
Time Factors
Adolescent
Voice Quality
Computer science
Speech recognition
Feature extraction
Speech Acoustics
Voice Disorder
Pattern Recognition, Automated
Voice analysis
Automation
Young Adult
Speech and Hearing
Speech Production Measurement
Predictive Value of Tests
Humans
Models, Statistical
Voice Disorders
Voice activity detection
business.industry
Signal Processing, Computer-Assisted
Pattern recognition
Acoustics
Middle Aged
LPN and LVN
Mixture model
Speaker recognition
Otorhinolaryngology
Case-Control Studies
Linear Models
Female
Mel-frequency cepstrum
Artificial intelligence
business
Algorithms
Subjects
Details
- ISSN :
- 08921997
- Volume :
- 26
- Database :
- OpenAIRE
- Journal :
- Journal of Voice
- Accession number :
- edsair.doi.dedup.....84ddc7115baf07858ec53d2af0f02445
- Full Text :
- https://doi.org/10.1016/j.jvoice.2012.05.002