173 results for "Romain Serizel"
Search Results
2. Mixture of Mixups for Multi-label Classification of Rare Anuran Sounds.
3. Self-Supervised Learning for Few-Shot Bird Sound Classification.
4. Posterior Sampling Algorithms for Unsupervised Speech Enhancement with Recurrent Variational Autoencoder.
5. Unsupervised Speech Enhancement with Diffusion-Based Generative Models.
6. A Weighted-Variance Variational Autoencoder Model for Speech Enhancement.
7. Performance and Energy Balance: A Comprehensive Study of State-of-the-Art Sound Event Detection Systems.
8. Diffusion-Based Speech Enhancement with a Weighted Generative-Supervised Learning Loss.
9. RoboVox: A Single/Multi-channel Far-field Speaker Recognition Benchmark for a Mobile Robot.
10. Self-supervised learning with Diffusion-based multichannel speech enhancement for speaker verification under noisy conditions.
11. BinauRec: A dataset to test the influence of the use of room impulse responses on binaural speech enhancement.
12. Performance Above All? Energy Consumption vs. Performance, a Study on Sound Event Detection with Heterogeneous Data.
13. Audio-Visual Speech Enhancement with a Deep Kalman Filter Generative Model.
14. Spice+: Evaluation of Automatic Audio Captioning Systems with Pre-Trained Language Models.
15. Lightweight Annotation and Class Weight Training for Automatic Estimation of Alarm Audibility in Noise.
16. Fast and Efficient Speech Enhancement with Variational Autoencoders.
17. A decade of DCASE: Achievements, practices, evaluations and future challenges.
18. Angular Distance Distribution Loss for Audio Classification.
19. Energy Consumption Trends in Sound Event Detection Systems.
20. Domain-Invariant Representation Learning of Bird Sounds.
21. Diffusion-based Unsupervised Audio-visual Speech Enhancement.
22. From Computation to Consumption: Exploring the Compute-Energy Link for Training and Testing Neural Networks for SED Systems.
23. Latent Watermarking of Audio Generative Models.
24. A Phoneme-Scale Assessment of Multichannel Speech Enhancement Algorithms.
25. DCASE 2024 Task 4: Sound Event Detection with Heterogeneous Data and Missing Labels.
26. Barlow Twins self-supervised learning for robust speaker recognition.
27. A Comprehensive Exploration of Noise Robustness and Noise Compensation in ResNet and TDNN-based Speaker Recognition Systems.
28. Learning Noise Robust ResNet-Based Speaker Embedding for Speaker Recognition.
29. A Benchmark of State-of-the-Art Sound Event Detection Systems Evaluated on Synthetic Soundscapes.
30. Threshold Independent Evaluation of Sound Event Detection Scores.
31. Joint Optimization of Diffusion Probabilistic-Based Multichannel Speech Enhancement with Far-Field Speaker Verification.
32. From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion.
33. Posterior sampling algorithms for unsupervised speech enhancement with recurrent variational autoencoder.
34. From Discrete Tokens to High-Fidelity Audio Using Multi-Band Diffusion.
35. Self-Supervised Learning for Few-Shot Bird Sound Classification.
36. Pretraining Representations for Bioacoustic Few-shot Detection using Supervised Contrastive Learning.
37. Unsupervised speech enhancement with diffusion-based generative models.
38. SAMbA: Speech enhancement with Asynchronous ad-hoc Microphone Arrays.
39. Diffusion-based speech enhancement with a weighted generative-supervised learning loss.
40. Post-Processing Independent Evaluation of Sound Event Detection Systems.
41. Performance and energy balance: a comprehensive study of state-of-the-art sound event detection systems.
42. Regularized Contrastive Pre-training for Few-shot Bioacoustic Sound Detection.
43. Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes.
44. Compensate multiple distortions for speaker recognition systems.
45. Improving Sound Event Detection Metrics: Insights from DCASE 2020.
46. Sound Event Detection and Separation: A Benchmark on Desed Synthetic Soundscapes.
47. What's all the Fuss about Free Universal Sound Separation Data?
48. Distributed Speech Separation in Spatially Unconstrained Microphone Arrays.
49. The Impact of Non-Target Events in Synthetic Soundscapes for Sound Event Detection.
50. Automated Audio Captioning by Fine-Tuning BART with AudioSet Tags.
Discovery Service for Jio Institute Digital Library