Search

Your search for "Lu, Heng" returned 37 results.

Search Constraints

Author: "Lu, Heng"
Publication Type: Reports

Search Results

1. Deep Learning Meets OBIA: Tasks, Challenges, Strategies, and Perspectives

2. CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens

3. The Impact of Quantization and Pruning on Deep Reinforcement Learning Models

4. FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs

5. GMP-TL: Gender-augmented Multi-scale Pseudo-label Enhanced Transfer Learning for Speech Emotion Recognition

6. Vec-Tok Speech: speech vectorization and tokenization for neural speech generation

7. SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation

8. PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System

9. PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts

10. DiaCorrect: Error Correction Back-end For Speaker Diarization

11. MSAC: Multiple Speech Attribute Control Method for Reliable Speech Emotion Recognition

12. METTS: Multilingual Emotional Text-to-Speech by Cross-speaker and Cross-lingual Emotion Transfer

13. GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Accurate Speech Emotion Recognition

14. HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism

15. LMEC: Learnable Multiplicative Absolute Position Embedding Based Conformer for Speech Recognition

16. DiaCorrect: End-to-end error correction for speaker diarization

17. Parallel measurements of vibrational modes in a few-layer graphene nanomechanical resonator using software-defined radio dongles

18. Improving Cross-lingual Speech Synthesis with Triplet Training Scheme

19. The USTC-Ximalaya system for the ICASSP 2022 multi-channel multi-party meeting transcription (M2MeT) challenge

20. Imaging vibrations of locally gated, electromechanical few layer graphene resonators with a moving vacuum enclosure

21. Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training

22. TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis

23. FeatherTTS: Robust and Efficient attention based Neural TTS

24. Dyson's Equations for Quantum Gravity in the Hartree-Fock Approximation

25. Peking Opera Synthesis via Duration Informed Attention Network

26. DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System

27. Gravitational Fluctuations as an Alternative to Inflation III. Numerical Results

28. AdaDurIAN: Few-shot Adaptation for Neural Text-to-Speech with DurIAN

29. FeatherWave: An efficient high-fidelity neural vocoder with multi-band linear prediction

30. Synthesising Expressiveness in Peking Opera via Duration Informed Attention Network

31. Learning Singing From Speech

32. PitchNet: Unsupervised Singing Voice Conversion with Pitch Adversarial Network

33. Gravitational Fluctuations as an Alternative to Inflation II. CMB Angular Power Spectrum

34. DurIAN: Duration Informed Attention Network For Multimodal Synthesis

35. Gravitational Fluctuations as an Alternative to Inflation

36. Linear networks based speaker adaptation for speech synthesis

37. Deep Feed-forward Sequential Memory Networks for Speech Synthesis
