Search

Your search keyword '"Meng, Lingwei"' showing total 21 results

Search Constraints

Start Over You searched for: Author "Meng, Lingwei" Remove constraint Author: "Meng, Lingwei" Database arXiv Remove constraint Database: arXiv
21 results on '"Meng, Lingwei"'

Search Results

1. ARLON: Boosting Diffusion Transformers with Autoregressive Models for Long Video Generation

2. Towards Within-Class Variation in Alzheimer's Disease Detection from Spontaneous Speech

3. Disentangling Speakers in Multi-Talker Speech Recognition with Speaker-Aware CTC

4. Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions

5. LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization

6. Large Language Model-based FMRI Encoding of Language Functions for Subjects with Neurocognitive Disorder

7. Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System

8. Autoregressive Speech Synthesis without Vector Quantization

9. VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment

10. WavLLM: Towards Robust and Adaptive Speech Large Language Model

11. UNIT-DSR: Dysarthric Speech Reconstruction System Using Speech Unit Normalization

12. Cross-Speaker Encoding Network for Multi-Talker Speech Recognition

13. Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator

14. The defender's perspective on automatic speaker verification: An overview

15. A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One

16. 2D and 3D CT Radiomic Features Performance Comparison in Characterization of Gastric Cancer: A Multi-center Study

17. Exploring linguistic feature and model combination for speech recognition based automatic AD detection

18. Tackling Spoofing-Aware Speaker Verification with Multi-Model Fusion

19. Spoofing-Aware Speaker Verification by Multi-Level Fusion

20. The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge

21. PM2.5-GNN: A Domain Knowledge Enhanced Graph Neural Network For PM2.5 Forecasting

Catalog

Books, media, physical & digital resources