Search

Your search keyword '"Wu, Yihan"' showing total 931 results

Search Constraints

Start Over You searched for: Author "Wu, Yihan" Remove constraint Author: "Wu, Yihan"
931 results on '"Wu, Yihan"'

Search Results

1. Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization

2. De-mark: Watermark Removal in Large Language Models

3. A Watermark for Order-Agnostic Language Models

4. ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech

5. LoVA: Long-form Video-to-Audio Generation

6. SpoofCeleb: Speech Deepfake Detection and SASV In The Wild

7. Robust Audiovisual Speech Recognition Models with Mixture-of-Experts

8. Text-To-Speech Synthesis In The Wild

9. YuLan: An Open-source Large Language Model

10. The Interspeech 2024 Challenge on Speech Processing Using Discrete Units

11. Distortion-free Watermarks are not Truly Distortion-free under Watermark Key Collisions

15. Analyzing the Impact of Tall Building Geometries on Wind Environment in a Hypothetical Urban Context: A Typological and Parametric Study

16. Alterations of electrocortical activity during hand movements induced by motor cortex glioma

17. Lambda: Learning Matchable Prior For Entity Alignment with Unlabeled Dangling Cases

18. Few-Shot Class Incremental Learning with Attention-Aware Self-Adaptive Prompt

20. Your Vision-Language Model Itself Is a Strong Filter: Towards High-Quality Instruction Tuning with Data Selection

21. SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition

27. The root canal morphology of mandibular anterior teeth and its correlation with the occurrence of three-rooted mandibular first molars

28. Local environment-based machine learning for molecular adsorption energy prediction

29. GPT-4 Vision on Medical Image Classification -- A Case Study on COVID-19 Dataset

30. A Resilient and Accessible Distribution-Preserving Watermark for Large Language Models

31. Shielding the Unseen: Privacy Protection through Poisoning NeRF with Spatial Deformation

32. Unbiased Watermark for Large Language Models

33. Markov Chain-Guided Graph Construction and Sampling Depth Optimization for EEG-Based Mental Disorder Detection

34. Characterizing normal perinatal development of the human brain structural connectivity

35. Cooperation or Competition: Avoiding Player Domination for Multi-Target Robustness via Adaptive Budgets

36. ComedicSpeech: Text To Speech For Stand-up Comedies in Low-Resource Scenarios

37. ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech

38. Adversarial Weight Perturbation Improves Generalization in Graph Neural Networks

39. VideoDubber: Machine Translation with Speech-Aware Length Control for Video Dubbing

40. PromptTTS: Controllable Text-to-Speech with Text Descriptions

41. Towards Robust Dataset Learning

42. Visual representations in the human brain are aligned with large language models

49. Schizophrenia detection based on EEG using Recurrent Auto-Encoder framework

50. Self-supervised Context-aware Style Representation for Expressive Speech Synthesis

Catalog

Books, media, physical & digital resources