Search

Your search keyword '"Audhkhasi, Kartik"' showing total 153 results

Search Constraints

Start Over You searched for: Author "Audhkhasi, Kartik" Remove constraint Author: "Audhkhasi, Kartik"
153 results on '"Audhkhasi, Kartik"'

Search Results

1. STAB: Speech Tokenizer Assessment Benchmark

2. O-1: Self-training with Oracle and 1-best Hypothesis

3. Large-scale Language Model Rescoring on Long-form Data

4. Robust Knowledge Distillation from RNN-T Models With Noisy Training Labels Using Full-Sum Loss

5. Modular Hybrid Autoregressive Transducer

6. Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition

7. Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems

8. End-to-End Spoken Language Understanding Without Full Transcripts

9. AVLnet: Learning Audio-Visual Language Representations from Instructional Videos

10. Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard

11. Challenging the Boundaries of Speech Recognition: The MALACH Corpus

12. Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation

13. Acoustically Grounded Word Embeddings for Improved Acoustics-to-Word Speech Recognition

15. Joint Modeling of Accents and Acoustics for Multi-Accent Speech Recognition

16. Building competitive direct acoustics-to-word models for English conversational speech recognition

17. Direct Acoustics-to-Word Models for English Conversational Speech Recognition

18. English Conversational Telephone Speech Recognition by Humans and Machines

19. End-to-End ASR-free Keyword Search from Speech

20. Invariant Representations for Noisy Speech Recognition

21. Diverse Embedding Neural Network Language Models

23. Generalized Ambiguity Decomposition for Understanding Ensemble Diversity

25. Large-Scale Language Model Rescoring on Long-Form Data

28. Modular Hybrid Autoregressive Transducer

32. Applying Machine Learning to Facilitate Autism Diagnostics: Pitfalls and Promises

35. AVLnet: Learning Audio-Visual Language Representations from Instructional Videos

Catalog

Books, media, physical & digital resources