389 results on '"Hsin-Min Wang"'
Search Results
2. Magnetic-Field-Assisted Electric-Field-Induced Domain Switching of a Magnetic Single Domain in a Multiferroic/Magnetoelectric Ni Nanochevron/[Pb(Mg1/3Nb2/3)O3]0.68–[PbTiO3]0.32 (PMN–PT) Layered Structure
3. BASPRO: A Balanced Script Producer for Speech Corpus Collection Based on the Genetic Algorithm
4. Speaker-Specific Articulatory Feature Extraction Based on Knowledge Distillation for Speaker Recognition
5. SpeechCLIP+: Self-Supervised Multi-Task Representation Learning for Speech Via Clip and Speech-Image Data.
6. Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model.
7. SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models.
8. HAAQI-Net: A non-intrusive neural music quality assessment model for hearing aids.
9. Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes.
10. Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion.
11. A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech.
12. Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features.
13. The Voicemos Challenge 2023: Zero-Shot Subjective Speech Quality Prediction for Multiple Domains.
14. LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models.
15. Multi-Target Extractor and Detector for Unknown-Number Speaker Diarization.
16. Generalization Ability Improvement of Speaker Representation and Anti-Interference for Speaker Verification.
17. Deep Learning-Based Non-Intrusive Multi-Objective Speech Assessment Model With Cross-Domain Features.
18. Decomposition and Reorganization of Phonetic Information for Speaker Embedding Learning.
19. MTI-Net: A Multi-Target Speech Intelligibility Prediction Model.
20. NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling.
21. Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks.
22. Chain-based Discriminative Autoencoders for Speech Recognition.
23. The VoiceMOS Challenge 2022.
24. MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids.
25. EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement.
26. Partially Fake Audio Detection by Self-Attention-Based Fake Span Discovery.
27. Speech-enhanced and Noise-aware Networks for Robust Speech Recognition.
28. Is Character Trigram Overlapping Ratio Still the Best Similarity Measure for Aligning Sentences in a Paraphrased Corpus?
29. Chinese Movie Dialogue Question Answering Dataset.
30. D4AM: A General Denoising Framework for Downstream Acoustic Models.
31. SVSNet: An End-to-End Speaker Voice Similarity Assessment Model.
32. Improved Lite Audio-Visual Speech Enhancement.
33. Utilizing Whisper to Enhance Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids.
34. Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement.
35. A Study on Incorporating Whisper for Robust Speech Assessment.
36. Multi-objective Non-intrusive Hearing-aid Speech Assessment Model.
37. Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model.
38. AV-Lip-Sync+: Leveraging AV-HuBERT to Exploit Multimodal Inconsistency for Video Deepfake Detection.
39. AVTENet: Audio-Visual Transformer-based Ensemble Network Exploiting Multiple Experts for Video Deepfake Detection.
40. Relational Data Selection for Data Augmentation of Speaker-Dependent Multi-Band MelGAN Vocoder.
41. Dual-Path Filter Network: Speaker-Aware Modeling for Speech Separation.
42. A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion.
43. AlloST: Low-Resource Speech Translation Without Source Transcription.
44. Speech Enhancement with Zero-Shot Model Selection.
45. Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling.
46. HASA-Net: A Non-Intrusive Hearing-Aid Speech Assessment Network.
47. Sequence to General Tree: Knowledge-Guided Geometry Word Problem Solving.
48. Investigation of a Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-to-End Bengali Automatic Speech Recognition Under Unseen Noisy Conditions.
49. Melody Harmonization Using Orderless Nade, Chord Balancing, and Blocked Gibbs Sampling.
50. Speech Recognition by Simply Fine-Tuning Bert.
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.