Back to Search
Start Over
Analysis of Phase Spectrum of Speech Signals Using Allpass Modeling
- Source :
- IEEE/ACM Transactions on Audio, Speech, and Language Processing. 23:2371-2383
- Publication Year :
- 2015
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2015.
-
Abstract
- The phase spectrum of Fourier transform has received lesser prominence than its magnitude counterpart in speech processing. In this paper, we propose a method for parametric modeling of the phase spectrum, and discuss its applications in speech signal processing. The phase spectrum is modeled as the response of an allpass (AP) filter, whose coefficients are estimated from the knowledge of speech production process, especially the impulse-like nature of excitation source. A signal retaining only the phase spectral component of speech signal is derived by suppressing the magnitude spectral component, and is modeled as the output of an AP filter excited with a sequence of impulses. Entropy of energy of the input signal is minimized to estimate the coefficients of the AP filter. The resulting objective function, being nonconvex in nature, is minimized using particle swarm optimization. The group delay response of estimated AP filters can be used for accurate analysis of resonances of the vocal-tract system (VTS). The error signal associated with AP modeling provides unambiguous evidence about the instants of significant excitation of the VTS. The applications of the proposed AP modeling include, but not limited to, formant tracking, extraction of glottal closure instants, speaker verification and speech synthesis.
- Subjects :
- Signal processing
Acoustics and Ultrasonics
Acoustics
Speech synthesis
Spectral component
Speech processing
computer.software_genre
Computational Mathematics
symbols.namesake
Formant
Fourier transform
Computer Science::Sound
Computer Science (miscellaneous)
symbols
Electrical and Electronic Engineering
computer
All-pass filter
Mathematics
Group delay and phase delay
Subjects
Details
- ISSN :
- 23299304 and 23299290
- Volume :
- 23
- Database :
- OpenAIRE
- Journal :
- IEEE/ACM Transactions on Audio, Speech, and Language Processing
- Accession number :
- edsair.doi...........db0c118fac49648753b1f31035f53381
- Full Text :
- https://doi.org/10.1109/taslp.2015.2479045