Long Term Modeling of Phase Trajectories within the Speech Sinusoidal Model Framework

Authors :: Laurent Girin
Mohammad Firouzmand
Sylvain Marchand
Institut de la communication parlée (ICP)
Institut National Polytechnique de Grenoble (INPG)-Centre National de la Recherche Scientifique (CNRS)-Université Stendhal - Grenoble 3
Laboratoire Bordelais de Recherche en Informatique (LaBRI)
Université de Bordeaux (UB)-Centre National de la Recherche Scientifique (CNRS)-École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB)
Source :: Proceedings of the INTERSPEECH-8th International Conference on Spoken Language Processing (ICSLP04), Oct 2004, South Korea. pp.2469-2472
Publication Year :: 2004
Publisher :: HAL CCSD, 2004.
Abstract: In this paper, the problem of modeling the trajectory of the phase of the speech signal is addressed within the context of the sinusoidal model of speech. A global or long-term model of the trajectory of the phase of the partials is proposed for each entire voiced section of speech, contrary to standard models, which are defined on a frame-by-frame basis. The complete analysis-modeling-synthesis process is presented. We compare two basic long-term models, namely a polynomial and a DCT-based model, with classical (frame-by-frame) interpolation schemes, given that the analysis process is the same in all cases. Promising results are given and the interest of the presented models for speech coding and speech watermarking applications is discussed.

1. Introduction

Sinusoidal modeling of audio signals has been extensively studied since the eighties and successfully applied to a wide range of applications, such as coding or time- and frequency-stretching [1-5]. The signal is modeled as the sum of a small number
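
The two long-term models named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not give their fitting criterion, model orders, or analysis settings, so the function names (`fit_polynomial`, `fit_dct`), the least-squares fit, the model orders, and the synthetic trajectory below are all assumptions chosen for illustration. Each function takes the unwrapped phase of one partial over a whole voiced section and returns the trajectory rebuilt from a small number of coefficients.

```python
import numpy as np

def fit_polynomial(phase, order):
    """Rebuild a whole phase trajectory from one least-squares polynomial."""
    phase = np.asarray(phase, dtype=float)
    # Normalized time axis keeps the fit well conditioned (an assumption).
    t = np.linspace(-1.0, 1.0, len(phase))
    coeffs = np.polynomial.polynomial.polyfit(t, phase, order)
    return np.polynomial.polynomial.polyval(t, coeffs)

def fit_dct(phase, n_coeffs):
    """Rebuild the trajectory from its first n_coeffs (>= 1) DCT-II terms."""
    phase = np.asarray(phase, dtype=float)
    n = len(phase)
    k = np.arange(n_coeffs)[:, None]   # coefficient index
    m = np.arange(n)[None, :]          # frame index
    # Orthonormal DCT-II basis, one row per retained coefficient.
    basis = np.sqrt(2.0 / n) * np.cos(np.pi * (m + 0.5) * k / n)
    basis[0] /= np.sqrt(2.0)
    coeffs = basis @ phase             # analysis: project onto the basis
    return basis.T @ coeffs            # synthesis: truncated reconstruction

def rms_error(a, b):
    """Root-mean-square difference between two trajectories, in radians."""
    return float(np.sqrt(np.mean((np.asarray(a) - np.asarray(b)) ** 2)))

# Hypothetical trajectory: one partial near 200 Hz with a slow frequency
# drift, sampled every 10 ms over a 0.6 s voiced section.
hop = 0.01
frames = np.arange(60)
freq_hz = 200.0 + 15.0 * np.sin(2.0 * np.pi * 1.5 * frames * hop)
phase = 2.0 * np.pi * np.cumsum(freq_hz) * hop   # unwrapped phase, radians

print("polynomial RMS error (rad):", rms_error(fit_polynomial(phase, 6), phase))
print("DCT RMS error (rad):", rms_error(fit_dct(phase, 12), phase))
```

In both cases the whole section is described by a handful of coefficients instead of one set of interpolation parameters per frame, which is the contrast with frame-by-frame schemes that the abstract draws.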

Subjects :: Audio signal
Computer science
Speech recognition
Speech coding
[INFO.INFO-OH]Computer Science [cs]/Other [cs.OH]
020206 networking & telecommunications
Sinusoidal model
02 engineering and technology
030507 speech-language pathology & audiology
03 medical and health sciences
Computer Science::Sound
0202 electrical engineering, electronic engineering, information engineering
Discrete cosine transform
0305 other medical science
Digital watermarking
ComputingMilieux_MISCELLANEOUS

Details

Language :: English
Database :: OpenAIRE
Journal :: Proceedings of the INTERSPEECH-8th International Conference on Spoken Language Processing (ICSLP04), Oct 2004, South Korea. pp.2469-2472
Accession number :: edsair.doi.dedup.....34256d187a6e21d9ce1f7110f40200d6
