Search

Your search keyword '"Zhang, Xulong"' showing total 571 results

Search Constraints

Start Over You searched for: Author "Zhang, Xulong" Remove constraint Author: "Zhang, Xulong"
571 results on '"Zhang, Xulong"'

Search Results

1. Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning

2. RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis

3. RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval

4. MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion

5. Learning Expressive Disentangled Speech Representations with Soft Speech Units and Adversarial Style Augmentation

6. QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering

7. EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning

8. EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization

9. CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition

10. Medical Speech Symptoms Classification via Disentangled Representation

12. Elimination of the confrontation between theory and experiment in flexoelectric Bi2GeO5

13. DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation

14. CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding

15. CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation

18. Stock Volatility Prediction Based on Transformer Model Using Mixed-Frequency Data

19. Research on the Impact of Executive Shareholding on New Investment in Enterprises Based on Multivariable Linear Regression Model

20. A Hierarchy-based Analysis Approach for Blended Learning: A Case Study with Chinese Students

21. An Empirical Study of Attention Networks for Semantic Segmentation

22. Contrastive Latent Space Reconstruction Learning for Audio-Text Retrieval

23. FastGraphTTS: An Ultrafast Syntax-Aware Speech Synthesis Framework

24. AOSR-Net: All-in-One Sandstorm Removal Network

25. DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks

26. Voice Conversion with Denoising Diffusion Probabilistic GAN Models

27. Machine Unlearning Methodology base on Stochastic Teacher Network

28. Symbolic & Acoustic: Multi-domain Music Emotion Modeling for Instrumental Music

29. Sparks of Large Audio Models: A Survey and Outlook

30. PMVC: Data Augmentation-Based Prosody Modeling for Expressive Voice Conversion

31. EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis

32. SAR: Self-Supervised Anti-Distortion Representation for End-To-End Speech Model

33. Work in Progress: Empowering Vocational Education with Automation Technology and PLC Integration

34. An Empirical Study of Attention Networks for Semantic Segmentation

35. A Hierarchy-Based Analysis Approach for Blended Learning: A Case Study with Chinese Students

36. Stock Volatility Prediction Based on Transformer Model Using Mixed-Frequency Data

37. Research on the Impact of Executive Shareholding on New Investment in Enterprises Based on Multivariable Linear Regression Model

39. Improving EEG-based Emotion Recognition by Fusing Time-frequency And Spatial Representations

40. Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy

41. QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis

42. Improving Music Genre Classification from Multi-Modal Properties of Music and Genre Correlations Perspective

45. Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition

46. Improving Imbalanced Text Classification with Dynamic Curriculum Learning

47. MetaSpeech: Speech Effects Switch Along with Environment for Metaverse

48. Semi-Supervised Learning Based on Reference Model for Low-resource TTS

49. Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach

50. Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data

Catalog

Books, media, physical & digital resources