Author: "Chung-Hsien Wu" / Publication Year Range: Last 50 years - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Chung-Hsien Wu"' showing total 669 results

Start Over Author "Chung-Hsien Wu" Publication Year Range Last 50 years

669 results on '"Chung-Hsien Wu"'

1. Dynamic Sampling-Based Meta-Learning Using Multilingual Acoustic Data for Under-Resourced Speech Recognition

Author: I-Ting Hsieh, Chung-Hsien Wu, and Zhe-Hong Zhao
Subjects: Under-resourced speech recognition, dynamic sampling, model-agnostic meta-learning, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: Under-resourced automatic speech recognition (ASR) has become an active field of research and has experienced significant progress during the past decade. However, the performance of under-resourced ASR trained by existing methods is still far inferior to high-resourced ASR for practical applications. In this paper, speech data from languages that share the most phonemes with the under-resourced language are selected as supplementary resources for meta-training based on the Model-Agnostic Meta-Learning (MAML) strategy. Besides supplementary language selection, this paper proposes a dynamic sampling method instead of the original random sampling method to select support and query sets for each task in MAML to improve meta-training performance. In this study, Taiwanese is selected as the under-resourced language, and the speech corpus of five languages, including Mandarin, English, Japanese, Cantonese, and Thai, are chosen as supplementary training data for acoustic model training. The proposed dynamic sampling approach uses phonemes, pronunciation, and speech recognition models as the basis to determine the proportion of each supplementary language to select helpful utterances for MAML. For evaluation, with the selected utterances from each supplementary language for meta-training, we obtained a Word Error Rate of 20.24% and a Syllable Error Rate of 8.35% for Taiwanese ASR, which were better than the baseline model (26.18% and 13.99%) using only the Taiwanese corpus and other methods.
Published: 2024
Full Text: View/download PDF

2. Informative and Long-Term Response Generation using Multiple Suggestions and User Persona Retrieval in a Dialogue System

Author: Jia-Hao Hsu, Tsai-Yi Chen, and Chung-Hsien Wu
Subjects: Electronic computers. Computer science, QA75.5-76.95
Published: 2024
Full Text: View/download PDF

3. Editorial for Special Issue on Pre-trained Large Language Models for Information Processing

Author: Bin Wang, Tatsuya Kawahara, Haizhou Li, Helen Meng, and Chung-Hsien Wu
Subjects: Electronic computers. Computer science, QA75.5-76.95
Published: 2024
Full Text: View/download PDF

4. Speech Enhancement Using Dynamic Learning in Knowledge Distillation via Reinforcement Learning

Author: Shih-Chuan Chu, Chung-Hsien Wu, and Tsai-Wei Su
Subjects: Deep learning, speech enhancement, knowledge distillation, reinforcement learning, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: In recent years, most of the research on speech enhancement (SE) has applied different strategies to improve performance through deep neural network models. However, as the performance improves, the memory resources and computational requirements of the model also increase, making it difficult to directly apply them to edge computing. Therefore, various model compression and acceleration techniques are desired. This paper proposes a learning method that dynamically uses Knowledge Distillation (KD) to teach a small student model from a large teacher model by considering the learning ratio from the teacher’s output and the real target based on reinforcement learning (RL). During the KD learning process, RL is adopted to estimate the learning ratio by considering the reward favoring the hard target (clean speech) or the soft target (the output of the teacher model) during the training of KD. The proposed method results in a more stable training process for the resulting smaller SE model and yields improved performance. In the experiment, we used the TIMIT and CSTR VCTK datasets and evaluated two representative SE models that employ different loss functions. On the TIMIT dataset, when we reduced the number of parameters in the Wave-U-Net student model from 10.3 million to 2.6 million, our method performed better than non-KD models with improvements of 0.05 in PESQ, 0.1 in STOI, and 0.47 in the scale-invariant signal-to-distortion ratio. Moreover, by utilizing prior knowledge from the pre-trained teacher model, our method effectively guided the learning process of the student model, achieving excellent performance even under low SNR conditions. Furthermore, we use Conv-Tasnet to further validate our proposed method. Finally, for ease of comparison, we conducted a comparison on the VCTK dataset as well.
Published: 2023
Full Text: View/download PDF

5. Automatic Bipolar Disorder Assessment Using Machine Learning With Smartphone-Based Digital Phenotyping

Author: Chung-Hsien Wu, Jia-Hao Hsu, Cheng-Ray Liou, Hung-Yi Su, Esther Ching-Lan Lin, and Po-See Chen
Subjects: Bipolar disorder, digital phenotyping, HAM-D, heterogeneous data, missing data, YMRS, Electrical engineering. Electronics. Nuclear engineering, TK1-9971
Abstract: Bipolar disorder (BD) is one of the most common mental illnesses worldwide. In this study, a smartphone application was developed to collect digital phenotyping data of users, and an ensemble method combining the results from a model pool was established through heterogeneous digital phenotyping. The aim was to predict the severity of bipolar symptoms by using two clinician-administered scales, the Hamilton Depression Rating Scale (HAM-D) and the Young Mania Rating Scale (YMRS). The collected digital phenotype data included the user’s location information (GPS), self-report scales, daily mood, sleep patterns, and multimedia records (text, speech, and video). Each category of digital phenotype data was used for training models and predicting the rating scale scores (HAM-D and YMRS). Seven models were tested and compared, and different combinations of feature types were used to evaluate the performance of heterogeneous data. To address missing data, an ensemble approach was employed to increase flexibility in rating scale score prediction. This study collected heterogeneous digital phenotype data from 84 individuals with BD and 11 healthy controls. Five-fold cross-validation was employed for evaluation. The experimental results revealed that the Lasso and ElasticNet regression models were the most effective in predicting rating scale scores, and heterogeneous data performed better than homogeneous data, with a mean absolute error of 1.36 and 0.55 for HAM-D and YMRS, respectively; this margin of error meets medical requirements.
Published: 2023
Full Text: View/download PDF

6. Conditional Adversarial Learning for Empathetic Dialogue Response Generation

Author: Ming-Hsiang Su, Chung-Hsien Wu, and Chia-Yu Liao
Subjects: Electronic computers. Computer science, QA75.5-76.95
Published: 2023
Full Text: View/download PDF

7. Speaker-Specific Articulatory Feature Extraction Based on Knowledge Distillation for Speaker Recognition

Author: Qian-Bei Hong, Chung-Hsien Wu, and Hsin-Min Wang
Subjects: Electronic computers. Computer science, QA75.5-76.95
Published: 2023
Full Text: View/download PDF

8. Miscommunication handling in spoken dialog systems based on error-aware dialog state detection

Author: Chung-Hsien Wu, Ming-Hsiang Su, and Wei-Bin Liang
Subjects: Error-aware dialog act, Miscommunication, Spoken dialog systems, Acoustics. Sound, QC221-246, Electronic computers. Computer science, QA75.5-76.95
Abstract: Abstract With the exponential growth in computing power and progress in speech recognition technology, spoken dialog systems (SDSs) with which a user interacts through natural speech has been widely used in human-computer interaction. However, error-prone automatic speech recognition (ASR) results usually lead to inappropriate semantic interpretation so that miscommunication happens easily. This paper presents an approach to error-aware dialog state (DS) detection for robust miscommunication handling in an SDS. Non-understanding (Non-U) and misunderstanding (Mis-U) are considered for miscommunication handling in this study. First, understanding evidence (UE), derived from the recognition confidence, is adopted for Non-U detection followed by Non-U recovery. For Mis-U with the recognized sentence containing uncertain recognized words, the partial sentences obtained by removing potentially misrecognized words from the input utterance are organized, based on regular expressions, as a tree structure to tolerate the deletion or rejection of keywords resulting from misrecognition for Mis-U DS modeling. Latent semantic analysis is then employed to consider the verified words and their n-grams for DS detection, including Mis-U and predefined Base DSs. Historical information-based n-grams are employed to find the most likely DS for the SDS. Several experiments were performed with a dialog corpus for the restaurant reservation task. The experimental results show that the proposed approach achieved a promising performance for Non-U recovery and Mis-U repair as well as a satisfactory task success rate for the dialogs using the proposed method.
Published: 2017
Full Text: View/download PDF

9. Detection, Measurement, and Enhancement of Happiness

Author: Jhing-Fa Wang, Chung-Hsien Wu, Shulan Hsieh, Shyhnan Liou, and Bo-Wei Chen
Subjects: Technology, Medicine, Science
Published: 2014
Full Text: View/download PDF

10. Digital Phenotyping-Based Bipolar Disorder Assessment Using Multiple Correlation Data Imputation and Lasso-MLP.

Author: Jia-Hao Hsu, Chung-Hsien Wu 0001, Wei-Kai Wang, Hung-Yi Su, Esther Ching-Lan Lin, and Po See Chen
Published: 2024
Full Text: View/download PDF

11. Development of a Taiwanese Speech Synthesis System Using Hidden Markov Models and a Robust Tonal Phoneme Corpus.

Author: Yung-Ji Sher, Ming-Chun Hsu, Yu-Hsien Chiu, Yeou-Jiunn Chen, Chung-Hsien Wu 0001, and Jiunn-Liang Wu
Published: 2024

12. Scalable Audio-Content Analysis

Author: Hyoung-Gook Kim, Liming Chen, Chung-Hsien Wu, Malcolm Slaney, Bhiksha Raj, and Paris Smaragdis
Subjects: Acoustics. Sound, QC221-246, Electronic computers. Computer science, QA75.5-76.95
Published: 2010
Full Text: View/download PDF

13. Speech Emotion Recognition using Decomposed Speech via Multi-task Learning.

Author: Jia-Hao Hsu, Chung-Hsien Wu 0001, and Yu-Hung Wei
Published: 2023
Full Text: View/download PDF

14. Temporal and Type Correlation in Digital Phenotyping for Bipolar Disorder State Prediction Using Multitask Self-Supervised Learning.

Author: Jia-Hao Hsu, Hua-Wei Tseng, Chung-Hsien Wu 0001, Esther Ching-Lan Lin, and Po See Chen
Published: 2023
Full Text: View/download PDF

15. Data Selection Based on Phoneme Affinity Matrix for Electrolarynx Speech Recognition.

Author: I-Ting Hsieh, Chung-Hsien Wu 0001, and Shu-Wei Tsai
Published: 2023
Full Text: View/download PDF

16. Applying Segment-Level Attention on Bi-Modal Transformer Encoder for Audio-Visual Emotion Recognition.

Author: Jia-Hao Hsu and Chung-Hsien Wu 0001
Published: 2023
Full Text: View/download PDF

17. Generalization Ability Improvement of Speaker Representation and Anti-Interference for Speaker Verification.

Author: Qian-Bei Hong, Chung-Hsien Wu 0001, and Hsin-Min Wang
Published: 2023
Full Text: View/download PDF

18. Empathetic Response Generation Based on Plug-and-Play Mechanism With Empathy Perturbation.

Author: Jia-Hao Hsu, Jeremy Chang, Min-Hsueh Kuo, and Chung-Hsien Wu 0001
Published: 2023
Full Text: View/download PDF

19. Decomposition and Reorganization of Phonetic Information for Speaker Embedding Learning.

Author: Qian-Bei Hong, Chung-Hsien Wu 0001, and Hsin-Min Wang
Published: 2023
Full Text: View/download PDF

20. Linear-time Mixed-Cell-Height Legalization for Minimizing Maximum Displacement.

Author: Chung-Hsien Wu 0001, Wai-Kei Mak, and Chris Chu
Published: 2022
Full Text: View/download PDF

21. Memory-Efficient Multi-Step Speech Enhancement with Neural ODE.

Author: Jen-Hung Huang and Chung-Hsien Wu 0001
Published: 2022
Full Text: View/download PDF

22. Applying Emotional Keyphrase Correlation for Diversity Enhancement in Empathetic Dialogue Response Generation.

Author: Jeremy Chang and Chung-Hsien Wu 0001
Published: 2022
Full Text: View/download PDF

23. Assessment of Bipolar Disorder Using Heterogeneous Data of Smartphone-Based Digital Phenotyping.

Author: Hung-Yi Su, Chung-Hsien Wu 0001, Cheng-Ray Liou, Esther Ching-Lan Lin, and Po See Chen
Published: 2021
Full Text: View/download PDF

24. Task-Aware BERT-based Sentiment Analysis from Multiple Essences of the Text.

Author: Jia-Hao Hsu, Chung-Hsien Wu 0001, and Tsung-Hsien Yang
Published: 2021

25. Ensemble of One Model: Creating Model Variations for Transformer with Layer Permutation.

Author: Andrew Liaw, Jia-Hao Hsu, and Chung-Hsien Wu 0001
Published: 2021

26. Speech Enhancement Based on Masking Approach Considering Speech Quality and Acoustic Confidence for Noisy Speech Recognition.

Author: Shih-Chuan Chu, Chung-Hsien Wu 0001, and Yun-Wen Lin
Published: 2021

27. Improvement of Spatial Ambiguity in Multi-Channel Speech Separation Using Channel Attention.

Author: Qian-Bei Hong, Chung-Hsien Wu 0001, Thanh Binh Nguyen 0013, and Hsin-Min Wang
Published: 2021

28. Transformer-based Empathetic Response Generation Using Dialogue Situation and Advanced-Level Definition of Empathy.

Author: Yi-Hsuan Wang, Jia-Hao Hsu, Chung-Hsien Wu 0001, and Tsung-Hsien Yang
Published: 2021
Full Text: View/download PDF

29. Latent Attribute Control for Story Generation.

Author: Yu-Siou Tang and Chung-Hsien Wu 0001
Published: 2021
Full Text: View/download PDF

30. Statistics Pooling Time Delay Neural Network Based on X-Vector for Speaker Verification.

Author: Qian-Bei Hong, Chung-Hsien Wu 0001, Hsin-Min Wang, and Chien-Lin Huang
Published: 2020
Full Text: View/download PDF

31. Combining Deep Embeddings of Acoustic and Articulatory Features for Speaker Identification.

Author: Qian-Bei Hong, Chung-Hsien Wu 0001, Hsin-Min Wang, and Chien-Lin Huang
Published: 2020
Full Text: View/download PDF

32. Acoustic and Textual Data Augmentation for Code-Switching Speech Recognition in Under-Resourced Language.

Author: I-Ting Hsieh, Chung-Hsien Wu 0001, and Chun-Huang Wang
Published: 2020

33. Attentively-Coupled Long Short-Term Memory for Audio-Visual Emotion Recognition.

Author: Jia-Hao Hsu and Chung-Hsien Wu 0001
Published: 2020

34. Natural Language Processing Methods for Detection of Influenza-Like Illness from Chief Complaints.

Author: Jia-Hao Hsu, Ting-Chia Weng, Chung-Hsien Wu 0001, and Tzong-Shiann Ho
Published: 2020

35. Exploring Macroscopic and Microscopic Fluctuations of Elicited Facial Expressions for Mood Disorder Classification.

Author: Qian-Bei Hong, Chung-Hsien Wu 0001, Ming-Hsiang Su, and Chia-Cheng Chang
Published: 2021
Full Text: View/download PDF

36. Speech Emotion Recognition Considering Nonverbal Vocalization in Affective Conversations.

Author: Jia-Hao Hsu, Ming-Hsiang Su, Chung-Hsien Wu 0001, and Yi-Hsuan Chen
Published: 2021
Full Text: View/download PDF

37. Follow-Up Question Generation Using Neural Tensor Network-Based Domain Ontology Population in an Interview Coaching System.

Author: Ming-Hsiang Su, Chung-Hsien Wu 0001, and Yi Chang
Published: 2019
Full Text: View/download PDF

38. Speech Emotion Recognition Using Deep Neural Network Considering Verbal and Nonverbal Speech Sounds.

Author: Kun-Yi Huang, Chung-Hsien Wu 0001, Qian-Bei Hong, Ming-Hsiang Su, and Yi-Hsuan Chen
Published: 2019
Full Text: View/download PDF

39. Automatic Ontology Population Using Deep Learning for Triple Extraction.

Author: Ming-Hsiang Su, Chung-Hsien Wu 0001, and Po-Chen Shih
Published: 2019
Full Text: View/download PDF

40. Sequential Speaker Embedding and Transfer Learning for Text-Independent Speaker Identification.

Author: Qian-Bei Hong, Chung-Hsien Wu 0001, Ming-Hsiang Su, and Hsin-Min Wang
Published: 2019
Full Text: View/download PDF

41. Why Do People Back Crowdfunding Projects?

Author: Ying-Feng Kuo 0001, Cathy S. Lin, Chung-Hsien Wu 0003, and Tsung-Hsun Tsai
Published: 2019
Full Text: View/download PDF

42. Detecting Unipolar and Bipolar Depressive Disorders from Elicited Speech Responses Using Latent Affective Structure Model.

Author: Kun-Yi Huang, Chung-Hsien Wu 0001, Ming-Hsiang Su, and Yu-Ting Kuo
Published: 2020
Full Text: View/download PDF

43. Sound Events Recognition and Retrieval Using Multi-Convolutional-Channel Sparse Coding Convolutional Neural Networks.

Author: Chien-Yao Wang, Tzu-Chiang Tai, Jia-Ching Wang, Andri Santoso, Seksan Mathulaprangsan, Chin-Chin Chiang, and Chung-Hsien Wu 0001
Published: 2020
Full Text: View/download PDF

44. A Two-Stage Transformer-Based Approach for Variable-Length Abstractive Summarization.

Author: Ming-Hsiang Su, Chung-Hsien Wu 0001, and Hao-Tse Cheng
Published: 2020
Full Text: View/download PDF

45. Attention-Based Response Generation Using Parallel Double Q-Learning for Dialog Policy Decision in a Conversational System.

Author: Ming-Hsiang Su, Chung-Hsien Wu 0001, and Liang-Yu Chen
Published: 2020
Full Text: View/download PDF

46. Cell-Coupled Long Short-Term Memory With L-Skip Fusion Mechanism for Mood Disorder Detection Through Elicited Audiovisual Features.

Author: Ming-Hsiang Su, Chung-Hsien Wu 0001, Kun-Yi Huang, and Tsung-Hsien Yang
Published: 2020
Full Text: View/download PDF

47. CREER: A Large-Scale Corpus for Relation Extraction and Entity Recognition.

Author: Yu-Siou Tang and Chung-Hsien Wu 0001
Published: 2022
Full Text: View/download PDF

48. Follow-up Question Generation Using Pattern-based Seq2seq with a Small Corpus for Interview Coaching.

Author: Ming-Hsiang Su, Chung-Hsien Wu 0001, Kun-Yi Huang, Qian-Bei Hong, and Huai-Hung Huang
Published: 2018
Full Text: View/download PDF

49. Attention-Based Dialog State Tracking for Conversational Interview Coaching.

Author: Ming-Hsiang Su, Chung-Hsien Wu 0001, Kun-Yi Huang, and Chu-Kwang Chen
Published: 2018
Full Text: View/download PDF

50. Locality-Preserving Complex-Valued Gaussian Process Latent Variable Model for Robust Face Recognition.

Author: Sih-Huei Chen, Yuan-Shan Lee, Yu-Sheng Hsu, Chung-Hsien Wu 0001, and Jia-Ching Wang
Published: 2018
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

669 results on '"Chung-Hsien Wu"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources