164 results on '"Laurent Girin"'
Search Results
2. Exploring the Multidimensional Representation of Unidimensional Speech Acoustic Parameters Extracted by Deep Unsupervised Models.
3. Unsupervised speech enhancement with deep dynamical generative speech and noise models.
4. Speech Modeling with a Hierarchical Transformer Dynamical VAE.
5. Fill in the Gap! Combining Self-supervised Representation Learning with Neural Audio Synthesis for Speech Inpainting.
6. Learning and controlling the source-filter representation of speech with a variational autoencoder.
7. BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model.
8. Repeat after Me: Self-Supervised Learning of Acoustic-to-Articulatory Mapping by Vocal Imitation.
9. A multimodal dynamical variational autoencoder for audiovisual speech representation learning.
10. Exploring the multidimensional representation of individual speech acoustic parameters extracted by deep unsupervised models.
11. Unsupervised Speech Enhancement Using Dynamical Variational Autoencoders.
12. Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation.
13. A Multimodal Dynamical Variational Autoencoder for Audiovisual Speech Representation Learning.
14. Speech Modeling with a Hierarchical Transformer Dynamical VAE.
15. Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation.
16. Unsupervised speech enhancement with deep dynamical generative speech and noise models.
17. A Benchmark of Dynamical Variational Autoencoders Applied to Speech Spectrogram Modeling.
18. Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input.
19. Learning Robust Speech Representation with an Articulatory-Regularized Variational Autoencoder.
20. Improved feature extraction for CRNN-based multiple sound source localization.
21. Saladnet: Self-Attentive Multisource Localization in the Ambisonics Domain.
22. What the Future Brings: Investigating the Impact of Lookahead for Incremental Neural TTS.
23. High-Resolution Speaker Counting in Reverberant Rooms Using CRNN with Ambisonics Features.
24. A Recurrent Variational Autoencoder for Speech Enhancement.
25. Make That Sound More Metallic: Towards a Perceptually Relevant Control of the Timbre of Synthesizer Sounds Using a Variational Autoencoder.
26. Dynamical Variational Autoencoders: A Comprehensive Review.
27. Variational Bayesian Inference for Audio-Visual Tracking of Multiple Speakers.
28. Learning and controlling the source-filter representation of speech with a variational autoencoder.
29. BERT, can HE predict contrastive focus? Predicting and controlling prominence in neural TTS using a language model.
30. Unsupervised Multiple-Object Tracking with a Dynamical Variational Autoencoder.
31. HiT-DVAE: Human Motion Generation via Hierarchical Transformer Dynamical VAE.
32. Repeat after me: Self-supervised learning of acoustic-to-articulatory mapping by vocal imitation.
33. Bayesian time-domain multiple sound source localization for a stochastic machine.
34. Speech Enhancement with Variational Autoencoders and Alpha-stable Distributions.
35. Semi-supervised Multichannel Speech Enhancement with Variational Autoencoders and Non-negative Matrix Factorization.
36. Evaluating the Potential Gain of Auditory and Audiovisual Speech-Predictive Coding Using Deep Learning.
37. Audio-Visual Speech Enhancement Using Conditional Variational Auto-Encoders.
38. A variance Modeling Framework based on variational Autoencoders for speech enhancement.
39. Online Localization of Multiple Moving Speakers in Reverberant Environments.
40. Multisource Mint Using Convolutive Transfer Function.
41. Accounting for Room Acoustics in Audio-Visual Multi-Speaker Tracking.
42. Audio-Visual Variational Fusion for Multi-Person Tracking with Robots.
43. A Benchmark of Dynamical Variational Autoencoders applied to Speech Spectrogram Modeling.
44. SALADnet: Self-Attentive multisource Localization in the Ambisonics Domain.
45. Alternate Endings: Improving Prosody for Incremental Neural TTS with Predicted Future Text Input.
46. Multichannel CRNN for Speaker Counting: an Analysis of Performance.
47. Improved feature extraction for CRNN-based multiple sound source localization.
48. Learning robust speech representation with an articulatory-regularized variational autoencoder.
49. Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders.
50. A Survey of Sound Source Localization with Deep Learning Methods.
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.