Search for author "Mahadeokar, Jay": 109 results

Search Results

1. Efficient Streaming LLM for Speech Recognition

2. Frozen Large Language Models Can Perceive Paralinguistic Aspects of Speech

3. M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses

4. Faster Speech-LLaMA Inference with Multi-token Prediction

5. The Llama 3 Herd of Models

6. Towards scalable efficient on-device ASR with transfer learning

7. Effective internal language model training and fusion for factorized transducer model

8. AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs

9. Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model

10. TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models

11. Prompting Large Language Models with Speech Recognition Abilities

12. Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale

13. Towards Selection of Text-to-speech Data to Augment ASR Training

14. Multi-Head State Space Model for Speech Recognition

15. Improving Fast-slow Encoder based Transducer with Streaming Deliberation

16. Dynamic Speech Endpoint Detection with Regression Targets

17. Anchored Speech Recognition with Neural Transducers

18. An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition

19. Federated Domain Adaptation for ASR with Full Self-Supervision

20. Streaming parallel transducer beam search with fast-slow cascaded encoders

21. TorchAudio: Building Blocks for Audio and Speech Processing

22. Streaming Transformer Transducer Based Speech Recognition Using Non-Causal Convolution

23. Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios

24. Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion

25. Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency

26. Dissecting User-Perceived Latency of On-Device E2E Speech Recognition

27. Memory-efficient Speech Recognition on Smart Devices

28. Deep Shallow Fusion for RNN-T Personalization

29. Alignment Restricted Streaming Recurrent Neural Network Transducer

30. Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer

31. Contextual RNN-T For Open Domain ASR

32. RNN-T For Latency Controlled ASR With Improved Beam Search

33. Spatial Attention for Far-field Speech Recognition with Deep Beamforming Neural Networks

34. Transformer-Transducer: End-to-End Speech Recognition with Self-Attention

35. Transformer-based Acoustic Modeling for Hybrid Speech Recognition

46. Faster Replacement Paths Algorithm for Undirected, Positive Integer Weighted Graphs with Small Diameter
