Search

Your search keyword '"Chen, Zhehuai"' showing total 112 results

Search Constraints

Start Over You searched for: Author "Chen, Zhehuai" Remove constraint Author: "Chen, Zhehuai"
112 results on '"Chen, Zhehuai"'

Search Results

1. Detecting the Undetectable: Assessing the Efficacy of Current Spoof Detection Methods Against Seamless Speech Edits

2. NeKo: Toward Post Recognition Generative Correction Large Language Models with Task-Oriented Experts

3. Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

4. Anticipating Future with Large Language Model for Simultaneous Machine Translation

5. VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning

6. Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data

7. EMMeTT: Efficient Multimodal Machine Translation Training

8. Chain-of-Thought Prompting for Speech Translation

9. Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition

10. BESTOW: Efficient and Streamable Speech Language Model with the Best of Two Worlds in GPT and T5

11. Less is More: Accurate Speech Recognition & Translation without Web-Scale Data

12. DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment

13. Instruction Data Generation and Unsupervised Adaptation for Speech Language Models

14. Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition

15. GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators

16. High-precision Voice Search Query Correction via Retrievable Speech-text Embedings

17. SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation

18. Using Text Injection to Improve Recognition of Personal Identifiers in Speech

19. Understanding Shared Speech-Text Representations

20. Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages

21. Accelerating RNN-T Training and Inference Using CTC guidance

22. Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech

23. Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR

24. JOIST: A Joint Speech and Text Streaming Model For ASR

25. Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data

26. MAESTRO: Matched Speech Text Representations through Modality Matching

27. Unsupervised Data Selection via Discrete Speech Representation for ASR

28. Injecting Text in Self-Supervised Speech Pretraining

29. An Asynchronous WFST-Based Decoder For Automatic Speech Recognition

30. Modular End-to-end Automatic Speech Recognition Framework for Acoustic-to-word Model

31. End-to-end contextual speech recognition using class language models and a token passing decoder

32. Linguistic Search Optimization for Deep Learning Based LVCSR

33. Sequence Discriminative Training for Deep Learning based Acoustic Keyword Spotting

34. A GPU-based WFST Decoder with Exact Lattice Generation

35. On Modular Training of Neural Acoustics-to-Word Model for LVCSR

36. Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition

39. Multi-view LSTM Language Model with Word-Synchronized Auxiliary Feature for LVCSR

40. A Unified Confidence Measure Framework Using Auxiliary Normalization Graph

Catalog

Books, media, physical & digital resources