Search

Your search keyword '"Lee, Hung-yi"' showing total 925 results

Search Constraints

Start Over You searched for: Author "Lee, Hung-yi" Remove constraint Author: "Lee, Hung-yi"
925 results on '"Lee, Hung-yi"'

Search Results

51. Towards audio language modeling -- an overview

52. Codec-SUPERB: An In-Depth Analysis of Sound Codec Models

53. EMO-SUPERB: An In-depth Look at Speech Emotion Recognition

54. SpeechCLIP+: Self-supervised multi-task representation learning for speech via CLIP and speech-image data

55. Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations

56. Integrating Self-supervised Speech Model with Pseudo Word-level Targets from Visually-grounded Speech Model

57. REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR

58. SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering

59. Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by Self-Supervised Representation Mixing and Embedding Initialization

60. Over-Reasoning and Redundant Calculation of Large Language Models

61. Examining Forgetting in Continual Pre-training of Aligned Large Language Models

62. Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks

63. PEFT for Speech: Unveiling Optimal Placement, Merging Strategies, and Ensemble Techniques

64. Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision

65. Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue

66. Noise robust distillation of self-supervised speech models via correlation metrics

67. GSQA: An End-to-End Model for Generative Spoken Question Answering

68. Scalable Ensemble-based Detection Method against Adversarial Attacks for speaker verification

69. Step by Step to Fairness: Attributing Societal Bias in Task-oriented Dialogue Systems

70. Exploring In-Context Learning of Textless Speech Language Model for Speech Classification Tasks

71. Learning from Red Teaming: Gender Bias Provocation and Mitigation in Large Language Models

72. A Closer Look into Automatic Evaluation Using Large Language Models

73. Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond

74. Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New Languages

75. Zero Resource Code-switched Speech Benchmark Using Speech Utterance Pairs For Multiple Spoken Languages

76. Prompting and Adapter Tuning for Self-supervised Encoder-Decoder Speech Model

77. Low-Resource Self-Supervised Learning with SSL-Enhanced TTS

78. Investigating Human-Identifiable Features Hidden in Adversarial Perturbations

79. Towards General-Purpose Text-Instruction-Guided Voice Conversion

80. AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models

81. Dynamic-SUPERB: Towards A Dynamic, Collaborative, and Comprehensive Instruction-Tuning Benchmark for Speech

82. Improving Non-autoregressive Translation Quality with Pretrained Language Model, Embedding Distillation and Upsampling Strategy for CTC

83. Revealing the Blind Spot of Sentence Encoder Evaluation by HEROS

84. SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts

85. Why We Should Report the Details in Subjective Evaluation of TTS More Rigorously

86. How to Estimate Model Transferability of Pre-Trained Speech Models?

87. MiniSUPERB: Lightweight Benchmark for Self-supervised Speech Models

88. Improving Textless Spoken Language Understanding with Discrete Units as Intermediate Target

89. The defender's perspective on automatic speaker verification: An overview

90. ML-SUPERB: Multilingual Speech Universal PERformance Benchmark

91. Improving Cascaded Unsupervised Speech Translation with Denoising Back-translation

92. Can Large Language Models Be an Alternative to Human Evaluations?

93. Personalized Lightweight Text-to-Speech: Voice Cloning with Adaptive Structured Pruning

94. Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences

95. SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks

96. Ensemble knowledge distillation of self-supervised speech models

97. Position Matters! Empirical Study of Order Effect in Knowledge-grounded Dialogue

98. Hierarchical Programmatic Reinforcement Learning via Learning to Compose Programs

99. SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks

100. Systematic Analysis for Pretrained Language Model Priming for Parameter-Efficient Fine-tuning

Catalog

Books, media, physical & digital resources