Search

Your search for "Li, Xinjian" returned 26 results

Search Constraints

You searched for:
Author: "Li, Xinjian"
Publication Type: Reports

Search Results

1. Towards Robust Speech Representation Learning for Thousands of Languages

2. YODAS: Youtube-Oriented Dataset for Audio and Speech

3. Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

4. Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data

5. Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining

6. Textless Direct Speech-to-Speech Translation with Discrete Speech Representation

7. ASR2K: Speech Recognition for Around 2000 Languages without Audio

8. On Adversarial Robustness of Large-scale Audio Visual Learning

9. Multi-Faceted Hierarchical Multi-Task Learning for a Large Number of Tasks with Multi-dimensional Relations

10. On Prosody Modeling for ASR+TTS based Voice Conversion

11. Phoneme Recognition through Fine Tuning of Phonetic Representations: a Case Study on Luhya Language Varieties

12. Tusom2021: A Phonetically Transcribed Speech Dataset from an Endangered Language for Universal Phone Recognition Experiments

13. Acoustics Based Intent Recognition Using Discovered Phonetic Units for Low Resource Languages

14. Revisiting Factorizing Aggregated Posterior in Learning Disentangled Representations

15. Semi-Supervised Recognition under a Noisy and Fine-grained Dataset

16. A Summary of the First Workshop on Language Technology for Language Documentation and Revitalization

17. AlloVera: A Multilingual Allophone Database

18. Universal Phone Recognition with a Multilingual Allophone System

19. Towards Zero-shot Learning for Automatic Phonemic Transcription

20. Adversarial Music: Real World Audio Adversary Against Wake-word Detection System

21. SANTLR: Speech Annotation Toolkit for Low Resource Languages

22. Multilingual Speech Recognition with Corpus Relatedness Sampling

23. The ARIEL-CMU Systems for LoReHLT18

24. Phoneme Level Language Models for Sequence Based Low Resource ASR

25. Real-time Neural-based Input Method

26. Domain Robust Feature Extraction for Rapid Low Resource ASR Development
