Back to Search
Start Over
Variational Bayesian Inference for Multichannel Dereverberation and Noise Reduction
- Source :
- IEEE/ACM Transactions on Audio, Speech, and Language Processing. 22:1320-1335
- Publication Year :
- 2014
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2014.
-
Abstract
- Room reverberation and background noise severely degrade the quality of hands-free speech communication systems. In this work, we address the problem of combined speech dereverberation and noise reduction using a variational Bayesian (VB) inference approach. Our method relies on a multichannel state-space model for the acoustic channels that combines frame-based observation equations in the frequency domain with a first-order Markov model to describe the time-varying nature of the room impulse responses. By modeling the channels and the source signal as latent random variables, we formulate a lower bound on the log-likelihood function of the model parameters given the observed microphone signals and iteratively maximize it using an online expectation-maximization approach. Our derivation yields update equations to jointly estimate the channel and source posterior distributions and the remaining model parameters. An inspection of the resulting VB algorithmfor blind equalization and channel identification (VB-BENCH) reveals that the presented framework includes previously proposed methods as special cases. Finally, we evaluate the performance of our approach in terms of speech quality, adaptation times, and speech recognition results to demonstrate its effectiveness for a wide range of reverberation and noise conditions.
- Subjects :
- Reverberation
Acoustics and Ultrasonics
Computer science
Speech recognition
Noise reduction
Bayesian inference
Markov model
Background noise
Computational Mathematics
Noise
Computer Science::Sound
Expectation–maximization algorithm
Computer Science (miscellaneous)
Electrical and Electronic Engineering
Blind equalization
Subjects
Details
- ISSN :
- 23299304 and 23299290
- Volume :
- 22
- Database :
- OpenAIRE
- Journal :
- IEEE/ACM Transactions on Audio, Speech, and Language Processing
- Accession number :
- edsair.doi...........318d621480b0e0357aa9cbcd4a099960
- Full Text :
- https://doi.org/10.1109/taslp.2014.2329732