Search

Your search keyword '"Chen, Zhehuai"' showing total 22 results

Search Constraints

Author: "Chen, Zhehuai"; Topic: computer science - sound

Search Results

1. Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits

2. Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data

3. EMMeTT: Efficient Multimodal Machine Translation Training

4. Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition

5. BESTOW: Efficient and Streamable Speech Language Model with the Best of Two Worlds in GPT and T5

6. Less is More: Accurate Speech Recognition & Translation without Web-Scale Data

7. Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition

8. GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators

9. High-precision Voice Search Query Correction via Retrievable Speech-text Embedings

10. SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation

11. Using Text Injection to Improve Recognition of Personal Identifiers in Speech

12. Understanding Shared Speech-Text Representations

13. Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

14. Accelerating RNN-T Training and Inference Using CTC guidance

15. Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech

16. Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR

17. JOIST: A Joint Speech and Text Streaming Model For ASR

18. Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data

19. MAESTRO: Matched Speech Text Representations through Modality Matching

20. Injecting Text in Self-Supervised Speech Pretraining

21. An Asynchronous WFST-Based Decoder For Automatic Speech Recognition

22. Modular End-to-end Automatic Speech Recognition Framework for Acoustic-to-word Model
