Search

Your search for "Liu, Jinglin" returned 37 results.

Search Constraints

Author: "Liu, Jinglin"; Database: arXiv

Search Results

1. MulliVC: Multi-lingual Voice Conversion With Cycle Consistency

2. Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis

3. C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model

4. Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis

5. Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias

6. Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis

7. Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation

8. AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation

9. CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training

10. RMSSinger: Realistic-Music-Score based Singing Voice Synthesis

11. AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment

12. GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation

13. AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

14. MUG: A General Meeting Understanding and Generation Benchmark

15. Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG)

16. GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis

17. Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models

18. DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect

19. VarietySound: Timbre-Controllable Video to Sound Generation via Unsupervised Information Disentanglement

20. ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech

21. Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech

22. TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation

23. GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech

24. Learning the Beauty in Songs: Neural Singing Voice Beautifier

25. MR-SVS: Singing Voice Synthesis with Multi-Reference Encoder

26. Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus

27. SimulSLT: End-to-End Simultaneous Sign Language Translation

28. SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation

29. PortaSpeech: Portable and High-Quality Generative Text-to-Speech

30. SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory

31. Parallel and High-Fidelity Text-to-Lip Generation

32. EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model

33. DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

34. DenoiSpeech: Denoising Text to Speech with Frame-Level Noise Modeling

35. FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire

36. Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation

37. A Study of Non-autoregressive Model for Sequence Generation
