801 results on '"Hsin-Min Wang"'
Search Results
2. Magnetic-Field-Assisted Electric-Field-Induced Domain Switching of a Magnetic Single Domain in a Multiferroic/Magnetoelectric Ni Nanochevron/[Pb(Mg1/3Nb2/3)O3]0.68–[PbTiO3]0.32 (PMN–PT) Layered Structure
3. BASPRO: A Balanced Script Producer for Speech Corpus Collection Based on the Genetic Algorithm
4. Speaker-Specific Articulatory Feature Extraction Based on Knowledge Distillation for Speaker Recognition
5. SpeechCLIP+: Self-Supervised Multi-Task Representation Learning for Speech via CLIP and Speech-Image Data.
6. Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model.
7. A Model-Selection-Based Self-Splitting Gaussian Mixture Learning with Application to Speaker Identification
8. SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models.
9. HAAQI-Net: A non-intrusive neural music quality assessment model for hearing aids.
10. Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes.
11. Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion.
12. A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech.
13. Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features.
14. The VoiceMOS Challenge 2023: Zero-Shot Subjective Speech Quality Prediction for Multiple Domains.
15. LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models.
16. Multi-Target Extractor and Detector for Unknown-Number Speaker Diarization.
17. Generalization Ability Improvement of Speaker Representation and Anti-Interference for Speaker Verification.
18. Deep Learning-Based Non-Intrusive Multi-Objective Speech Assessment Model With Cross-Domain Features.
19. Decomposition and Reorganization of Phonetic Information for Speaker Embedding Learning.
20. MTI-Net: A Multi-Target Speech Intelligibility Prediction Model.
21. NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling.
22. Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks.
23. Chain-based Discriminative Autoencoders for Speech Recognition.
24. The VoiceMOS Challenge 2022.
25. MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids.
26. EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement.
27. Partially Fake Audio Detection by Self-Attention-Based Fake Span Discovery.
28. Speech-enhanced and Noise-aware Networks for Robust Speech Recognition.
29. Is Character Trigram Overlapping Ratio Still the Best Similarity Measure for Aligning Sentences in a Paraphrased Corpus?
30. Chinese Movie Dialogue Question Answering Dataset.
31. D4AM: A General Denoising Framework for Downstream Acoustic Models.
32. SVSNet: An End-to-End Speaker Voice Similarity Assessment Model.
33. Improved Lite Audio-Visual Speech Enhancement.
34. Utilizing Whisper to Enhance Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids.
35. Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement.
36. A Study on Incorporating Whisper for Robust Speech Assessment.
37. Multi-objective Non-intrusive Hearing-aid Speech Assessment Model.
38. Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model.
39. AV-Lip-Sync+: Leveraging AV-HuBERT to Exploit Multimodal Inconsistency for Video Deepfake Detection.
40. AVTENet: Audio-Visual Transformer-based Ensemble Network Exploiting Multiple Experts for Video Deepfake Detection.
41. Relational Data Selection for Data Augmentation of Speaker-Dependent Multi-Band MelGAN Vocoder.
42. Dual-Path Filter Network: Speaker-Aware Modeling for Speech Separation.
43. A Preliminary Study of a Two-Stage Paradigm for Preserving Speaker Identity in Dysarthric Voice Conversion.
44. AlloST: Low-Resource Speech Translation Without Source Transcription.
45. Speech Enhancement with Zero-Shot Model Selection.
46. Mandarin Electrolaryngeal Speech Voice Conversion with Sequence-to-Sequence Modeling.
47. HASA-Net: A Non-Intrusive Hearing-Aid Speech Assessment Network.
48. Sequence to General Tree: Knowledge-Guided Geometry Word Problem Solving.
49. Investigation of a Single-Channel Frequency-Domain Speech Enhancement Network to Improve End-to-End Bengali Automatic Speech Recognition Under Unseen Noisy Conditions.
50. Melody Harmonization Using Orderless NADE, Chord Balancing, and Blocked Gibbs Sampling.