Long Term Modeling of Phase Trajectories within the Speech Sinusoidal Model Framework
- Source :
- Proceedings of the INTERSPEECH-8th International Conference on Spoken Language Processing (ICSLP04), Oct 2004, South Korea, pp. 2469-2472
- Publication Year :
- 2004
- Publisher :
- HAL CCSD, 2004.
Abstract
- In this paper, the problem of modeling the phase trajectory of the speech signal is addressed within the context of the sinusoidal model of speech. A global, or long-term, model of the phase trajectory of the partials is proposed for each entire voiced section of speech, in contrast to standard models, which are defined on a frame-by-frame basis. The complete analysis-modeling-synthesis process is presented. We compare two basic long-term models, namely a polynomial model and a DCT-based model, with classical (frame-by-frame) interpolation schemes, the analysis process being the same in all cases. Promising results are given, and the relevance of the presented models to speech coding and speech watermarking applications is discussed.
- 1. Introduction. Sinusoidal modeling of audio signals has been extensively studied since the eighties and successfully applied to a wide range of applications, such as coding or time- and frequency-stretching [1-5]. The signal is modeled as the sum of a small number
- Subjects :
- Audio signal
Computer science
Speech recognition
Speech coding
020206 networking & telecommunications
Sinusoidal model
02 engineering and technology
[INFO.INFO-OH] Computer Science [cs]/Other [cs.OH]
030507 speech-language pathology & audiology
03 medical and health sciences
Computer Science::Sound
0202 electrical engineering, electronic engineering, information engineering
Discrete cosine transform
0305 other medical science
Digital watermarking
ComputingMilieux_MISCELLANEOUS
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- Proceedings of the INTERSPEECH-8th International Conference on Spoken Language Processing (ICSLP04), Oct 2004, South Korea, pp. 2469-2472
- Accession number :
- edsair.doi.dedup.....34256d187a6e21d9ce1f7110f40200d6