Back to Search
Start Over
Multichannel speech reinforcement based on binaural unmasking
- Source :
- Signal Processing. 139:165-172
- Publication Year :
- 2017
- Publisher :
- Elsevier BV, 2017.
-
Abstract
- Multichannel speech reinforcement exploiting DoA information is proposed.An empirical evidence of the binaural unmasking for monaural speech is provided.Proposed reinforcement restores perceived loudness considering binaural unmasking.The performance of the algorithm is verified through subjective listening tests. Speech reinforcement or near-end listening enhancement is a technique that modifies the far-end signal to mitigate the effect of the near-end noise, usually based on the power spectra of the far-end signal and the near-end noise. Psychoacoustic experiments have shown that the location of a noise source with respect to that of a signal source affects the amount of masking. Since conventional speech reinforcement methods obtain spectral gain based only on the power spectra, this psychoacoustic phenomenon called binaural unmasking has not been considered in those approaches. In this paper, we propose a novel speech reinforcement algorithm that modifies the far-end speech signal based on both the power spectrum and the direction-of-arrival (DoA) of the noise. Specifically, we have computed the equivalent frontal noise level from the observed noise level and the estimated DoA, and used it to compute spectral gains as in conventional partial loudness restoration-based speech reinforcement. Experimental results showed that the proposed method outperformed the conventional methods based on partial loudness restoration and speech intelligibility index (SII) optimization in terms of the overall perceived quality through subjective listening tests.
- Subjects :
- Masking (art)
Computer science
Speech recognition
Direction of arrival
Spectral density
020206 networking & telecommunications
02 engineering and technology
Monaural
Intelligibility (communication)
01 natural sciences
Signal
Loudness
Noise
Control and Systems Engineering
0103 physical sciences
Signal Processing
0202 electrical engineering, electronic engineering, information engineering
Active listening
Computer Vision and Pattern Recognition
Psychoacoustics
Electrical and Electronic Engineering
010301 acoustics
Binaural recording
Software
Subjects
Details
- ISSN :
- 01651684
- Volume :
- 139
- Database :
- OpenAIRE
- Journal :
- Signal Processing
- Accession number :
- edsair.doi...........70ab75bbc890034ea34f7af835eac86a