Back to Search Start Over

ROBUST SPEAKER GENDER IDENTIFICATION USING EMPIRICAL DECOMPOSITION-BASED CEPSTRAL FEATURES

Authors :
Ghasem Alipoor
Ehsan Samadi
Source :
Asia-Pacific Journal of Information Technology and Multimedia, Vol 7, Iss 01, Pp 71-81 (2018)
Publication Year :
2018
Publisher :
UKM Press, 2018.

Abstract

Automatic gender identification is one of the appealing fields of research with numerous practical applications. However, this issue has not gained its deserved attention, in particular in the presence of environmental noises. In this paper, using the empirical mode decomposition (EMD), some new and improved mel-frequency cepstral coefficient (MFCC) features are developed to address this problem. In the proposed approach, EMD is employed as a filter bank to decompose the speech signal into its frequency bands. Furthermore, another variant is also developed in which the complete ensemble EMD (CEEMD) supersedes the EMD. Moreover, support vector machine (SVM) with radial basis function (RBF) kernel is employed for classification. Performance of these methods is examined for gender identification, in noise-free environments as well as in the presence of various Gaussian and non-Gaussian noises. Simulation results show that, although with fewer features used, utilizing the improved EMD-based cepstral features in noiseless situations leads to the same accuracy as that of the original MFCCs. However, in noisy environments the proposed methods outperform the conventional way of extracting the MFCCs.

Details

Language :
English, Malay
ISSN :
22892192
Volume :
7
Issue :
01
Database :
Directory of Open Access Journals
Journal :
Asia-Pacific Journal of Information Technology and Multimedia
Publication Type :
Academic Journal
Accession number :
edsdoj.570e381b32345579267142d2ed63278
Document Type :
article
Full Text :
https://doi.org/10.17576/apjitm-2018-0701-06