Back to Search
Start Over
Role of voice activity detection methods for the speakers in the wild challenge
- Source :
- NCC
- Publication Year :
- 2017
- Publisher :
- IEEE, 2017.
-
Abstract
- One of the major reasons for the performance degradation of a speaker verification (SV) system in real-world conditions is its inability to spot speech regions due to the presence of noise. This work focuses on the role of voice activity detection (VAD) methods in alleviating such shortcomings. The experiments are conducted on the core-core task of the speakers in the wild (SITW) challenge. Two VAD approaches are explored in this work. One of them is the recently proposed self-adaptive VAD and the other is based on vowel-like region (VLR) detection. For evaluating the effectiveness of these approaches, the SV systems are developed using the i-vector framework in the front-end and probabilistic linear discriminant analysis (PLDA) in the back-end. The self-adaptive VAD based system shows better performance compared to the VLR based system in high SNR condition. Under degraded conditions, the VLR based method is relatively more robust compared to self-adaptive VAD. Exploiting these complementary features, significant improvements in the SV performances are noted with the fusion of scores of the two systems.
- Subjects :
- Speaker verification
Voice activity detection
Noise measurement
Computer science
Speech recognition
020206 networking & telecommunications
02 engineering and technology
Speaker recognition
Probabilistic linear discriminant analysis
030507 speech-language pathology & audiology
03 medical and health sciences
Robustness (computer science)
0202 electrical engineering, electronic engineering, information engineering
Noise (video)
Mel-frequency cepstrum
0305 other medical science
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2017 Twenty-third National Conference on Communications (NCC)
- Accession number :
- edsair.doi...........368ad937076d0704f9ce015489c86ceb
- Full Text :
- https://doi.org/10.1109/ncc.2017.8077146