Search

Your search keyword '"Hain, Thomas"' showing total 160 results

Search Constraints

Start Over You searched for: Author "Hain, Thomas" Remove constraint Author: "Hain, Thomas" Search Limiters Available in Library Collection Remove constraint Search Limiters: Available in Library Collection
160 results on '"Hain, Thomas"'

Search Results

1. Methods for Automatic Matrix Language Determination of Code-Switched Speech

2. Using Speech Foundational Models in Loss Functions for Hearing Aid Speech Enhancement

3. Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis

4. LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks

5. Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition

6. EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark

7. 1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem

8. Automatic Speech Recognition System-Independent Word Error Rate Estimation

9. Hallucination in Perceptual Metric-Driven Speech Enhancement Networks

10. Improving Acoustic Word Embeddings through Correspondence Training of Self-supervised Speech Representations

11. SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations

12. Progressive unsupervised domain adaptation for ASR using ensemble models and multi-stage training

13. Non-Intrusive Speech Intelligibility Prediction for Hearing-Impaired Users using Intermediate ASR Features and Human Memory Models

14. Multi-CMGAN+/+: Leveraging Multi-Objective Speech Quality Metric Prediction for Speech Enhancement

15. MUST: A Multilingual Student-Teacher Learning approach for low-resource speech recognition

16. Fast Word Error Rate Estimation Using Self-Supervised Representations For Speech And Text

17. On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments

18. The Effect of Spoken Language on Speech Enhancement using Self-Supervised Speech Representation Loss Functions

19. Non Intrusive Intelligibility Predictor for Hearing Impaired Individuals using Self Supervised Speech Representations

20. Empirical Interpretation of the Relationship Between Speech Acoustic Context and Emotion Recognition

21. Learning Cross-lingual Mappings for Data Augmentation to Improve Low-Resource Speech Recognition

22. On Data Sampling Strategies for Training Neural Network Speech Separation Models

23. Towards domain generalisation in ASR with elitist sampling and ensemble knowledge distillation

24. Perceive and predict: self-supervised speech representation based loss functions for speech enhancement

25. Dynamic Kernels and Channel Attention for Low Resource Speaker Verification

26. Probing Statistical Representations For End-To-End ASR

27. Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation

28. Unsupervised data selection for Speech Recognition with contrastive loss ratios

29. Investigating the Impact of Cross-lingual Acoustic-Phonetic Similarities on Multilingual Speech Recognition

30. Non-Linear Pairwise Language Mappings for Low-Resource Multilingual Acoustic Model Fusion

31. A cross-corpus study on speech emotion recognition

32. Insights on Neural Representations for End-to-End Speech Recognition

33. Utterance Weighted Multi-Dilation Temporal Convolutional Networks for Monaural Speech Dereverberation

34. Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation

35. Efficient Non-Autoregressive GAN Voice Conversion using VQWav2vec Features and Dynamic Convolution

36. MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data

37. Multiple-hypothesis CTC-based semi-supervised adaptation of end-to-end speech recognition

38. T-vectors: Weakly Supervised Speaker Identification Using Hierarchical Transformer Model

39. Towards Low-Resource StarGAN Voice Conversion using Weight Adaptive Instance Normalization

40. Improving Audio Anomalies Recognition Using Temporal Convolutional Attention Network

41. Unsupervised Acoustic Unit Representation Learning for Voice Conversion using WaveNet Auto-encoders

42. Exploration of Audio Quality Assessment and Anomaly Localisation Using Attention Models

43. Speaker Re-identification with Speaker Dependent Speech Enhancement

44. Weakly Supervised Training of Hierarchical Attention Networks for Speaker Identification

45. Supervised Speaker Embedding De-Mixing in Two-Speaker Environment

46. Robust Speaker Recognition Using Speech Enhancement And Attention Model

47. H-VECTORS: Utterance-level Speaker Embedding Using A Hierarchical Attention Model

48. Contextual Joint Factor Acoustic Embeddings

49. Improving Noise Robustness In Speaker Identification Using A Two-Stage Attention Model

50. Latent Dirichlet Allocation Based Acoustic Data Selection for Automatic Speech Recognition

Catalog

Books, media, physical & digital resources