Search

Your search for author "Wang, Hsin-Min" returned 572 results

Search Results

2. Effective Noise-aware Data Simulation for Domain-adaptive Speech Enhancement Leveraging Dynamic Stochastic Perturbation

3. SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models

4. Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes

5. SpeechCLIP+: Self-supervised multi-task representation learning for speech via CLIP and speech-image data

6. HAAQI-Net: A Non-intrusive Neural Music Audio Quality Assessment Model for Hearing Aids

7. Multi-objective Non-intrusive Hearing-aid Speech Assessment Model

8. AV-Lip-Sync+: Leveraging AV-HuBERT to Exploit Multimodal Inconsistency for Video Deepfake Detection

9. AVTENet: Audio-Visual Transformer-based Ensemble Network Exploiting Multiple Experts for Video Deepfake Detection

10. The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains

11. A Study on Incorporating Whisper for Robust Speech Assessment

12. Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement

13. Non-Intrusive Speech Intelligibility Prediction for Hearing Aids using Whisper and Metadata

14. Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model

15. Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features

16. Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion

17. BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm

18. A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech

19. CasNet: Investigating Channel Robustness for Speech Separation

21. Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GAN

22. NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling

23. A Study of Using Cepstrogram for Countermeasure Against Replay Attacks

24. MTI-Net: A Multi-Target Speech Intelligibility Prediction Model

25. MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids

26. Filter-based Discriminative Autoencoders for Children Speech Recognition

27. Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks

28. Generation of Speaker Representations Using Heterogeneous Training Batch Assembly

29. Multi-target Extractor and Detector for Unknown-number Speaker Diarization

30. Subspace-based Representation and Learning for Phonotactic Spoken Language Recognition

31. Speech-enhanced and Noise-aware Networks for Robust Speech Recognition

32. Chain-based Discriminative Autoencoders for Speech Recognition

33. The VoiceMOS Challenge 2022

34. Partially Fake Audio Detection by Self-attention-based Fake Span Discovery

35. EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement

36. HASA-net: A non-intrusive hearing-aid speech assessment network

37. Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features

38. Speech Enhancement-assisted Voice Conversion in Noisy Environments

39. Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion

40. SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise Contours

41. SVSNet: An End-to-end Speaker Voice Similarity Assessment Model

42. Dual-Path Filter Network: Speaker-Aware Modeling for Speech Separation

43. Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder

44. A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion

45. Sequence to General Tree: Knowledge-Guided Geometry Word Problem Solving

46. AlloST: Low-resource Speech Translation without Source Transcription

47. The AS-NU System for the M2VoC Challenge

48. Speech Recognition by Simply Fine-tuning BERT

49. Speech Enhancement with Zero-Shot Model Selection

50. STOI-Net: A Deep Learning based Non-Intrusive Speech Intelligibility Assessment Model
