Search

Your search keyword '"Guo, Pengcheng"' showing total 738 results

Search Constraints

Start Over You searched for: Author "Guo, Pengcheng" Remove constraint Author: "Guo, Pengcheng"
738 results on '"Guo, Pengcheng"'

Search Results

1. Optimizing Dysarthria Wake-Up Word Spotting: An End-to-End Approach for SLT 2024 LRDWWS Challenge

2. NPU-NTU System for Voice Privacy 2024 Challenge

3. Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

4. Towards Rehearsal-Free Multilingual ASR: A LoRA-based Case Study on Whisper

5. Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models

6. CRMSP: A Semi-supervised Approach for Key Information Extraction with Class-Rebalancing and Merged Semantic Pseudo-Labeling

7. MUSA: Multi-lingual Speaker Anonymization via Serial Disentanglement

8. Distinctive and Natural Speaker Anonymization via Singular Value Transformation-assisted Matrix

9. Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets

10. Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder

11. An audio-quality-based multi-strategy approach for target speaker extraction in the MISP 2023 Challenge

12. The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023

13. ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

14. MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition

15. Automatic channel selection and spatial feature integration for multi-channel speech recognition across various array topologies

17. Decoupling and Interacting Multi-Task Learning Network for Joint Speech and Accent Recognition

18. SA-Paraformer: Non-autoregressive End-to-End Speaker-Attributed ASR

19. Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study

20. Timbre-reserved Adversarial Attack in Speaker Identification

21. TVDO: Tchebycheff Value-Decomposition Optimization for Multi-Agent Reinforcement Learning

22. Adaptive Contextual Biasing for Transducer Based Streaming Speech Recognition

26. Multimodal cell atlas of the ageing human skeletal muscle

27. A spatiotemporal atlas of cholestatic injury and repair in mice

28. A spatiotemporal atlas of mouse liver homeostasis and regeneration

29. Pseudo-Siamese Network based Timbre-reserved Black-box Adversarial Attack in Speaker Identification

30. BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR

31. TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition

32. Contextualized End-to-End Speech Recognition with Contextual Phrase Prediction Network

33. The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge

34. VE-KWS: Visual Modality Enhanced End-to-End Keyword Spotting

35. The Design of an Efficient Distributed Collaborative Scheduling Method and the Optimal Planning Strategy for Providing Photovoltaic Access Capacity

36. Controllability of Windmill Networks

37. TESSP: Text-Enhanced Self-Supervised Speech Pre-training

38. Distinguishable Speaker Anonymization based on Formant and Fundamental Frequency Scaling

39. Preserving background sound in noise-robust voice conversion via multi-task learning

40. MFCCA:Multi-Frame Cross-Channel attention for multi-speaker ASR in Multi-party meeting scenario

41. NWPU-ASLP System for the VoicePrivacy 2022 Challenge

44. Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism

45. Linguistic-Acoustic Similarity Based Accent Shift for Accent Recognition

Catalog

Books, media, physical & digital resources