Back to Search
Start Over
Whispered Speech Detection Using Fusion of Group-Delay-Based Subband Modulation Spectrum and Correntropy Features
- Source :
- IEEE Signal Processing Letters. 23:1042-1046
- Publication Year :
- 2016
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2016.
-
Abstract
- In this letter, we propose a novel fusion feature for detection of whispered speech in noisy environment using a group-delay-based instantaneous spectrum analysis. The fusion feature involves two individual components, namely, subband modulation spectrum (SMS)-based features and subband correntropy (SCE) features, both extracted from the instantaneous spectrum. The instantaneous spectrum estimation involves zero-time windowing for improved temporal resolution and group-delay computation for improved spectral resolution, as compared to the traditional discrete-Fourier-transform-based spectrum estimation. The SMS features capture the spectral representation of the subband energy time trajectories, while the SCE features model the fluctuations in the subband energy time trajectories. The SMS captures both the short-term as well as long-term spectral characteristics of whispered speech and is known to provide good separation between speech and noise components. The correntropy features help capture the dynamics of the vocal tract system to discriminate noisy whisper from noise. Whisper speech detection experiments using support vector machine models and the proposed features indicate promising performance under low signal-to-noise conditions.
- Subjects :
- Computer science
Speech recognition
Feature extraction
02 engineering and technology
zero-time windowing
030507 speech-language pathology & audiology
03 medical and health sciences
0202 electrical engineering, electronic engineering, information engineering
Electrical and Electronic Engineering
Spectral resolution
Group delay and phase delay
Whispered speech detection
Voice activity detection
ta213
Noise measurement
business.industry
Applied Mathematics
020206 networking & telecommunications
Pattern recognition
group delay
Support vector machine
Noise
Feature (computer vision)
Temporal resolution
Signal Processing
Artificial intelligence
0305 other medical science
business
correntropy
modulation spectrum
instantaneous spectrum analysis
Vocal tract
Energy (signal processing)
Subjects
Details
- ISSN :
- 15582361 and 10709908
- Volume :
- 23
- Database :
- OpenAIRE
- Journal :
- IEEE Signal Processing Letters
- Accession number :
- edsair.doi.dedup.....4cd5f2980f5c626ebf55455a6fbd7b92
- Full Text :
- https://doi.org/10.1109/lsp.2016.2580912