Independent Component Analysis and Time-Frequency Masking for Speech Recognition in Multitalker Conditions
- Source :
- EURASIP Journal on Audio, Speech, and Music Processing, Vol 2010 (2010)
- Publication Year :
- 2010
- Publisher :
- Springer Science and Business Media LLC, 2010.
- Abstract :
- When a number of speakers are simultaneously active, for example in meetings or noisy public places, the sources of interest need to be separated from interfering speakers and from each other in order to be robustly recognized. Independent component analysis (ICA) has proven a valuable tool for this purpose. However, ICA outputs can still contain strong residual components of the interfering speakers whenever noise or reverberation is high. In such cases, nonlinear postprocessing can be applied to the ICA outputs to reduce the remaining interference. To improve robustness to the artefacts and loss of information caused by this process, the processed speech feature vector can be treated as a random variable with time-varying uncertainty rather than as deterministic, which can greatly enhance recognition. The aim of this paper is to show the potential to improve recognition of multiple overlapping speech signals through nonlinear postprocessing together with uncertainty-based decoding techniques.
- Subjects :
- Reverberation
Acoustics and Ultrasonics
Computer science
Feature vector
Speech recognition
lcsh:QC221-246
Residual
Speech processing
Independent component analysis
lcsh:QA75.5-76.95
Robustness (computer science)
lcsh:Acoustics. Sound
lcsh:Electronic computers. Computer science
Electrical and Electronic Engineering
Random variable
Decoding methods
Details
- ISSN :
- 1687-4722
- Volume :
- 2010
- Database :
- OpenAIRE
- Journal :
- EURASIP Journal on Audio, Speech, and Music Processing
- Accession number :
- edsair.doi.dedup.....0a9d824174eb732d7a6354a5ac6087de
- Full Text :
- https://doi.org/10.1186/1687-4722-2010-651420