Xiaoyu Bie, Laurent Girin, Thomas Hueber, Simon Leglaive, Xavier Alameda-Pineda
Affiliations: ROBOTLEARN team (Towards socially intelligent robots through learning, perception and control), Inria Grenoble - Rhône-Alpes, Université Grenoble Alpes (UGA); GIPSA-lab, CRISSP team (Cognitive Robotics, Interactive Systems, & Speech Processing), CNRS, Grenoble INP, UGA; IETR (Institut d'Électronique et des Technologies du numéRique), CentraleSupélec (Rennes campus), Université de Rennes, INSA Rennes, CNRS.
Funding: ANR-19-P3IA-0003 (MIAI @ Grenoble Alpes, 2019); ANR-19-CE33-0008 (ML3RI, Low-level learning of multi-modal robotic interactions with multiple people, 2019); European H2020 project 871245 (SPRING, 2020).
The Variational Autoencoder (VAE) is a powerful deep generative model now widely used to represent high-dimensional, complex data via a low-dimensional latent space learned in an unsupervised manner. In the original VAE model, input data vectors are processed independently. In recent years, a series of papers have presented different extensions of the VAE for sequential data that, relying on recurrent neural networks, model not only the latent space but also the temporal dependencies within a sequence of data vectors and the corresponding latent vectors. We recently performed a comprehensive review of these models and unified them into a general class called Dynamical Variational Autoencoders (DVAEs). In the present paper, we present the results of an experimental benchmark comparing six DVAE models on the speech analysis-resynthesis task, illustrating the high potential of DVAEs for speech modeling.

Comment: Accepted to Interspeech 2021. arXiv admin note: text overlap with arXiv:2008.12595.
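To make the key structural idea of the abstract concrete, here is a minimal NumPy sketch of the generative model of a generic DVAE: unlike a plain VAE, where each latent vector is drawn independently, here the latent vector at each time step is conditioned on the past observations and latents through a recurrent state. All dimensions, weight matrices, and the simple tanh recurrence are illustrative placeholders, not the architectures benchmarked in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy dimensions (illustrative only)
x_dim, z_dim, h_dim = 4, 2, 8

# Randomly initialised weights standing in for trained networks
W_hz = rng.normal(size=(h_dim, z_dim)) * 0.1               # prior mean of z_t from h_t
W_hx = rng.normal(size=(h_dim, x_dim)) * 0.1               # decoder contribution of h_t
W_zx = rng.normal(size=(z_dim, x_dim)) * 0.1               # decoder contribution of z_t
W_h  = rng.normal(size=(h_dim + x_dim + z_dim, h_dim)) * 0.1  # recurrence

def generate(T):
    """Ancestral sampling from a toy DVAE: at each step t the latent z_t
    depends on the past (x_{1:t-1}, z_{1:t-1}) through the recurrent state
    h_t, and the observation x_t is decoded from (h_t, z_t)."""
    h = np.zeros(h_dim)
    x_prev, z_prev = np.zeros(x_dim), np.zeros(z_dim)
    xs, zs = [], []
    for _ in range(T):
        # Recurrent state summarises past observations and latents
        h = np.tanh(np.concatenate([h, x_prev, z_prev]) @ W_h)
        # Latent prior is conditioned on the past (unlike a plain VAE)
        z = h @ W_hz + rng.normal(size=z_dim)
        # Decode the observation from the state and the current latent
        x = h @ W_hx + z @ W_zx
        xs.append(x)
        zs.append(z)
        x_prev, z_prev = x, z
    return np.stack(xs), np.stack(zs)

x_seq, z_seq = generate(T=50)
```

In a real DVAE applied to speech analysis-resynthesis, `x_t` would be a short-time spectral frame, the linear maps would be deep networks, and an inference (encoder) network would approximate the posterior over the latent sequence.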