Back to Search Start Over

Spectrogram based features selection using multiple kernel learning for speech/music discrimination.

Authors :
Nilufar, Sharmin
Ray, Nilanjan
Molla, M. K. Islam
Hirose, Keikichi
Source :
2012 IEEE International Conference on Acoustics, Speech & Signal Processing (ICASSP); 1/ 1/2012, p501-504, 4p
Publication Year :
2012

Abstract

This paper presents a multiple kernel learning (MKL) approach to speech/music discrimination (SMD). The time-frequency representation (spectrogram) implemented by short-time Fourier transform (STFT) of audio segment is decomposed by wavelet packet transform into different subband levels. The subbands, which contain rich texture information, are used as features for this discrimination problem. MKL technique is used to select the optimal subbands to discriminate the audio signals. The proposed MKL based algorithm is applied for SMD of a standard dataset. The experimental results show that the proposed technique yields noticeable improvements in classification accuracy and tolerance toward different noise types compared to the existing methods. [ABSTRACT FROM PUBLISHER]

Details

Language :
English
ISBNs :
9781467300452
Database :
Complementary Index
Journal :
2012 IEEE International Conference on Acoustics, Speech & Signal Processing (ICASSP)
Publication Type :
Conference
Accession number :
86551598
Full Text :
https://doi.org/10.1109/ICASSP.2012.6287926