Back to Search Start Over

Source Separation via Spectral Masking for Speech Recognition Systems

Authors :
Gustavo Fernandes Rodrigues
Thiago de Souza Siqueira
Ana Cláudia Silva de Souza
Hani Camille Yehia
Source :
International Journal of Advances in Telecommunications, Electrotechnics, Signals and Systems, Vol 1, Iss 2-3, Pp 80-85 (2012)
Publication Year :
2012
Publisher :
International Science and Engineering Society, o.s., 2012.

Abstract

In this paper we present an insight into the use of spectral masking techniques in time-frequency domain, as a preprocessing step for the speech signal recognition. Speech recognition systems have their performance negatively affected in noisy environments or in the presence of other speech signals. The limits of these masking techniques for different levels of the signal-to-noise ratio are discussed. We show the robustness of the spectral masking techniques against four types of noise: white, pink, brown and human speech noise (bubble noise). The main contribution of this work is to analyze the performance limits of recognition systems using spectral masking. We obtain an increase of 18% on the speech hit rate, when the speech signals were corrupted by other speech signals or bubble noise, with different signal-to-noise ratio of approximately 1, 10 and 20 dB. On the other hand, applying the ideal binary masks to mixtures corrupted by white, pink and brown noise, results an average growth of 9% on the speech hit rate, with the same different signal-to-noise ratio. The experimental results suggest that the masking spectral techniques are more suitable for the case when it is applied a bubble noise, which is produced by human speech, than for the case of applying white, pink and brown noise.

Subjects

Subjects :
Telecommunication
TK5101-6720

Details

Language :
English
ISSN :
18055443
Volume :
1
Issue :
2-3
Database :
Directory of Open Access Journals
Journal :
International Journal of Advances in Telecommunications, Electrotechnics, Signals and Systems
Publication Type :
Academic Journal
Accession number :
edsdoj.0889c144867d4803b5ba997c9466e4ad
Document Type :
article
Full Text :
https://doi.org/10.11601/ijates.v1i2-3.16