Search

Your search for author "Lee, Hung-yi" returned 925 results.

Search Results

1. Fusion of Discrete Representations and Self-Augmented Representations for Multilingual Automatic Speech Recognition

2. How to Learn a New Language? An Efficient Solution for Self-Supervised Learning Models Unseen Languages Adaption in Low-Resource Scenario

3. Building a Taiwanese Mandarin Spoken Language Model: A First Attempt

4. Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

5. Align-SLM: Textless Spoken Language Models with Reinforcement Learning from AI Feedback

6. Can Large Audio-Language Models Truly Hear? Tackling Hallucinations with Multi-Task Assessment and Stepwise Audio Reasoning

7. Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration

8. Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data

9. Unveiling Narrative Reasoning Limits of Large Language Models with Trope in Movie Synopses

10. Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models

11. Improving Speech Emotion Recognition in Under-Resourced Languages via Speech-to-Speech Translation with Bootstrapping Data Selection

12. Meta-Whisper: Speech-Based Meta-ICL for ASR on Low-Resource Languages

13. DFADD: The Diffusion and Flow-Matching Based Audio Deepfake Dataset

14. Efficient Training of Self-Supervised Speech Foundation Models on a Compute Budget

15. Property Neurons in Self-Supervised Speech Transformers

16. Non-instructional Fine-tuning: Enabling Instruction-Following Capabilities in Pre-trained Language Models without Instruction-Following Data

17. SpeechCaps: Advancing Instruction-Based Universal Speech Models with Multi-Talker Speaking Style Captioning

18. SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks

19. Spoken Stereoset: On Evaluating Social Bias Toward Speaker in Speech Large Language Models

20. Decoding Biases: Automated Methods and LLM Judges for Gender Bias Detection in Language Models

21. Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models

22. EMO-Codec: An In-Depth Look at Emotion Preservation capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations

23. I Need Help! Evaluating LLM's Ability to Ask for Users' Support: A Case Study on Text-to-SQL Generation

24. Leave No Knowledge Behind During Knowledge Distillation: Towards Practical and Effective Knowledge Distillation for Code-Switching ASR Using Realistic Data

25. Speech-Copilot: Leveraging Large Language Models for Speech Processing via Task Decomposition, Modularization, and Program Generation

26. Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models

27. Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course

28. Investigating the Effects of Large-Scale Pseudo-Stereo Data and Different Speech Foundation Model on Dialogue Generative Spoken Language Model

29. DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging

30. DeSTA: Enhancing Speech Language Models through Descriptive Speech-Text Alignment

31. Can LLMs Understand the Implication of Emphasized Sentences in Dialogue?

32. Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech

33. Investigating Video Reasoning Capability of Large Language Models with Tropes in Movies

34. On the Evaluation of Speech Foundation Models for Spoken Language Understanding

35. StreamBench: Towards Benchmarking Continuous Improvement of Language Agents

36. ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets

37. Understanding Sounds, Missing the Questions: The Challenge of Object Hallucination in Large Audio-Language Models

38. CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems

39. Do Prompts Really Prompt? Exploring the Prompt Understanding Capability of Whisper

40. DAISY: Data Adaptive Self-Supervised Early Exit for Speech Representation Models

41. Emo-bias: A Large Scale Evaluation of Social Bias on Speech Emotion Recognition

42. On the social bias of speech self-supervised models

43. Neural Codec-based Adversarial Sample Detection for Speaker Verification

44. Singing Voice Graph Modeling for SingFake Detection

45. Dataset-Distillation Generative Model for Speech Emotion Recognition

46. Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition

47. InstructionCP: A fast approach to transfer Large Language Models into target language

48. LLM Discussion: Enhancing the Creativity of Large Language Models via Discussion Framework and Role-Play

49. A Large-Scale Evaluation of Speech Foundation Models

50. Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations
