Integration of Beamforming and Uncertainty-of-Observation Techniques for Robust ASR in Multi-Source Environments
- Source :
- Computer Speech & Language, 27(3), pp. 837-850
- Publication Year :
- 2013
Abstract
- This paper presents a new approach for increasing the robustness of multi-channel automatic speech recognition (ASR) in noisy and reverberant multi-source environments. The proposed method uses uncertainty propagation techniques to dynamically compensate the speech features and the acoustic models for the observation uncertainty determined at the beamforming stage. We present and analyze two methods for integrating classical multi-channel signal processing approaches, such as delay-and-sum beamformers or Zelinski-type Wiener filters, with uncertainty-of-observation techniques such as uncertainty decoding or modified imputation. An analysis of the results on the PASCAL-CHiME task shows that this approach consistently outperforms conventional beamformers with a minimal increase in computational complexity. Dynamic compensation based on observation uncertainty also outperforms conventional static adaptation and requires no adaptation data.
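The two uncertainty-of-observation techniques named in the abstract can be illustrated with a minimal sketch. It assumes a single diagonal-covariance Gaussian as the acoustic model and takes the per-dimension observation variance as given (in the paper it comes from the beamforming stage, which is not modeled here). The function names and the textbook forms used below are illustrative assumptions, not the authors' implementation.

```python
import math


def log_gauss(x, mean, var):
    """Log density of a one-dimensional Gaussian N(x; mean, var)."""
    return -0.5 * (math.log(2.0 * math.pi * var) + (x - mean) ** 2 / var)


def uncertainty_decoding_loglik(x_hat, obs_var, mean, var):
    """Uncertainty decoding (assumed textbook form): the observation
    variance is added to the model variance in each feature dimension."""
    return sum(
        log_gauss(x, m, v + u)
        for x, u, m, v in zip(x_hat, obs_var, mean, var)
    )


def modified_imputation_loglik(x_hat, obs_var, mean, var):
    """Modified imputation (assumed textbook form): each feature is pulled
    toward the model mean in proportion to its uncertainty, then scored
    against the unmodified model Gaussian."""
    total = 0.0
    for x, u, m, v in zip(x_hat, obs_var, mean, var):
        x_imputed = (v * x + u * m) / (v + u)
        total += log_gauss(x_imputed, m, v)
    return total


# Hypothetical two-dimensional feature: the second dimension is an
# outlier that the front end has flagged as highly uncertain.
features = [0.5, 5.0]
uncertainty = [0.0, 4.0]
model_mean = [0.0, 0.0]
model_var = [1.0, 1.0]

plain = sum(log_gauss(x, m, v) for x, m, v in zip(features, model_mean, model_var))
print("no compensation     :", plain)
print("uncertainty decoding:", uncertainty_decoding_loglik(features, uncertainty, model_mean, model_var))
print("modified imputation :", modified_imputation_loglik(features, uncertainty, model_mean, model_var))
```

With zero observation variance both functions reduce to the conventional Gaussian score, so the compensation only acts on features that the front end marks as unreliable.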
- Subjects :
- Beamforming
Propagation of uncertainty
Signal processing
Computational complexity theory
BBfor2 Cohesion
Computer science
Speech recognition
020206 networking & telecommunications
02 engineering and technology
Theoretical Computer Science
Human-Computer Interaction
030507 speech-language pathology & audiology
03 medical and health sciences
Robustness (computer science)
0202 electrical engineering, electronic engineering, information engineering
Imputation (statistics)
Language & Speech Technology
0305 other medical science
Algorithm
Software
Decoding methods
Multi-source
Computer Science::Information Theory
Details
- ISSN :
- 0885-2308
- Volume :
- 27
- Database :
- OpenAIRE
- Journal :
- Computer Speech & Language
- Accession number :
- edsair.doi.dedup.....14d9e619ca1c56dd7c40b7e2f79ce797
- Full Text :
- https://doi.org/10.1016/j.csl.2012.07.009