Back to Search Start Over

Single-channel speech separation using combined EMD and speech-specific information.

Authors :
Prasanna Kumar, M.
Kumaraswamy, R.
Source :
International Journal of Speech Technology; Dec2017, Vol. 20 Issue 4, p1037-1047, 11p
Publication Year :
2017

Abstract

Multi-channel blind source separation (BSS) methods use more than one microphone. There is a need to develop speech separation algorithms under single microphone scenario. In this paper we propose a method for single channel speech separation (SCSS) by combining empirical mode decomposition (EMD) and speech specific information. Speech specific information is derived in the form of source-filter features. Source features are obtained using multi pitch information. Filter information is estimated using formant analysis. To track multi pitch information in the mixed signal we apply simple-inverse filtering tracking (SIFT) and histogram based pitch estimation to excitation source information. Formant estimation is done using linear predictive (LP) analysis. Pitch and formant estimation are done with and without EMD decomposition for better extraction of the individual speakers in the mixture. Combining EMD with speech specific information provides encouraging results for single-channel speech separation. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
13812416
Volume :
20
Issue :
4
Database :
Complementary Index
Journal :
International Journal of Speech Technology
Publication Type :
Academic Journal
Accession number :
125968232
Full Text :
https://doi.org/10.1007/s10772-017-9468-3