Back to Search Start Over

Long Term Modeling of Phase Trajectories within the Speech Sinusoidal Model Framework

Authors :
Laurent Girin
Mohammad Firouzmand
Sylvain Marchand
Institut de la communication parlée (ICP)
Institut National Polytechnique de Grenoble (INPG)-Centre National de la Recherche Scientifique (CNRS)-Université Stendhal - Grenoble 3
Laboratoire Bordelais de Recherche en Informatique (LaBRI)
Université de Bordeaux (UB)-Centre National de la Recherche Scientifique (CNRS)-École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB)
Marchand, Sylvain
Source :
Proceedings of the INTERSPEECH-8th International Conference on Spoken Language Processing (ICSLP04), INTERSPEECH-8th International Conference on Spoken Language Processing (ICSLP04), INTERSPEECH-8th International Conference on Spoken Language Processing (ICSLP04), Oct 2004, South Korea. pp.2469-2472, INTERSPEECH
Publication Year :
2004
Publisher :
HAL CCSD, 2004.

Abstract

In this paper, the problem of modeling the trajectory of the phase of speech signal is addressed within the context of the sinusoidal model of speech. A global or long-term model of the trajectory of the phase of the partials is proposed for each entire voiced section of speech, contrary to standard models, which are defined on a frame-by-frame basis. The complete analysis-modeling-synthesis process is presented. We compare two basic long-term models, namely a polynomial and a DCT-based model, with classical (frame-by-frame) interpolation schemes, given that the analysis process is the same in all cases. Promising results are given and the interest of the presented models for speech coding and speech watermarking applications is discussed. 1. Introduction Sinusoidal modeling of audio signals has been extensively studied since the eighties and successfully applied to a wide range of applications, such as coding or time- and frequency-stretching [1-5]. The signal is modeled as the sum of a small number

Details

Language :
English
Database :
OpenAIRE
Journal :
Proceedings of the INTERSPEECH-8th International Conference on Spoken Language Processing (ICSLP04), INTERSPEECH-8th International Conference on Spoken Language Processing (ICSLP04), INTERSPEECH-8th International Conference on Spoken Language Processing (ICSLP04), Oct 2004, South Korea. pp.2469-2472, INTERSPEECH
Accession number :
edsair.doi.dedup.....34256d187a6e21d9ce1f7110f40200d6