Search

Your search for author "Du, Zhihao" returned 249 results

Search Results

1. UniSpeaker: A Unified Approach for Multimodality-driven Speaker Generation

2. MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

3. CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models

4. OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation

5. Enhancing Low-Resource ASR through Versatile TTS: Bridging the Data Gap

6. IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities

7. CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens

8. FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs

9. An Embarrassingly Simple Approach for LLM with Strong ASR Capacity

13. SA-Paraformer: Non-autoregressive End-to-End Speaker-Attributed ASR

14. LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT

15. The second multi-channel multi-party meeting transcription challenge (M2MeT 2.0): A benchmark for speaker-attributed ASR

16. FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech Codec

17. CASA-ASR: Context-Aware Speaker-Attributed ASR

18. FunASR: A Fundamental End-to-End Speech Recognition Toolkit

19. TOLD: A Novel Two-Stage Overlap-Aware Framework for Speaker Diarization

20. Speaker Overlap-aware Neural Diarization for Multi-party Meeting Analysis

21. A Comparative Study on Multichannel Speaker-Attributed Automatic Speech Recognition in Multi-party Meetings

22. MFCCA: Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario

23. The influence of physical activity on internet addiction among Chinese college students: the mediating role of self-esteem and the moderating role of gender

24. A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings

25. Speaker Embedding-aware Neural Diarization: an Efficient Framework for Overlapping Speech Diarization in Meeting Scenarios

26. Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge

27. Speaker Embedding-aware Neural Diarization for Flexible Number of Speakers with Textual Information

28. M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge

32. On the Relationship Between Newton’s Law and Sports: Taking Track and Field as an Example

35. A Monaural Speech Enhancement Method for Robust Small-Footprint Keyword Spotting

36. Acoustic Scene Classification by Implicitly Identifying Distinct Sound Events

40. Investigation of Monaural Front-End Processing for Robust ASR without Retraining or Joint-Training
