811 results on '"Hsin-Min Wang"'
Search Results
2. Magnetic-Field-Assisted Electric-Field-Induced Domain Switching of a Magnetic Single Domain in a Multiferroic/Magnetoelectric Ni Nanochevron/[Pb(Mg1/3Nb2/3)O3]0.68–[PbTiO3]0.32 (PMN–PT) Layered Structure
3. BASPRO: A Balanced Script Producer for Speech Corpus Collection Based on the Genetic Algorithm
4. Speaker-Specific Articulatory Feature Extraction Based on Knowledge Distillation for Speaker Recognition
5. A Study On Incorporating Whisper For Robust Speech Assessment.
6. SpeechCLIP+: Self-Supervised Multi-Task Representation Learning for Speech Via Clip and Speech-Image Data.
7. Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model.
8. A Model-Selection-Based Self-Splitting Gaussian Mixture Learning with Application to Speaker Identification
9. A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models.
10. Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement.
11. Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition.
12. Robust Audio-Visual Speech Enhancement: Correcting Misassignments in Complex Environments with Advanced Post-Processing.
13. Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages.
14. Effective Noise-aware Data Simulation for Domain-adaptive Speech Enhancement Leveraging Dynamic Stochastic Perturbation.
15. The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction.
16. Deep learning for identifying personal and family history of suicidal thoughts and behaviors from EHRs.
17. SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models.
18. HAAQI-Net: A non-intrusive neural music quality assessment model for hearing aids.
19. Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes.
20. Audio-Visual Mandarin Electrolaryngeal Speech Voice Conversion.
21. A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech.
22. Mandarin Electrolaryngeal Speech Voice Conversion using Cross-domain Features.
23. The Voicemos Challenge 2023: Zero-Shot Subjective Speech Quality Prediction for Multiple Domains.
24. LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models.
25. Multi-Target Extractor and Detector for Unknown-Number Speaker Diarization.
26. Generalization Ability Improvement of Speaker Representation and Anti-Interference for Speaker Verification.
27. Deep Learning-Based Non-Intrusive Multi-Objective Speech Assessment Model With Cross-Domain Features.
28. Decomposition and Reorganization of Phonetic Information for Speaker Embedding Learning.
29. MTI-Net: A Multi-Target Speech Intelligibility Prediction Model.
30. NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling.
31. Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks.
32. Chain-based Discriminative Autoencoders for Speech Recognition.
33. The VoiceMOS Challenge 2022.
34. MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids.
35. EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement.
36. Partially Fake Audio Detection by Self-Attention-Based Fake Span Discovery.
37. Speech-enhanced and Noise-aware Networks for Robust Speech Recognition.
38. Is Character Trigram Overlapping Ratio Still the Best Similarity Measure for Aligning Sentences in a Paraphrased Corpus?
39. Chinese Movie Dialogue Question Answering Dataset.
40. D4AM: A General Denoising Framework for Downstream Acoustic Models.
41. SVSNet: An End-to-End Speaker Voice Similarity Assessment Model.
42. Improved Lite Audio-Visual Speech Enhancement.
43. Utilizing Whisper to Enhance Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids.
44. Deep Complex U-Net with Conformer for Audio-Visual Speech Enhancement.
45. A Study on Incorporating Whisper for Robust Speech Assessment.
46. Multi-objective Non-intrusive Hearing-aid Speech Assessment Model.
47. Multi-Task Pseudo-Label Learning for Non-Intrusive Speech Quality Assessment Model.
48. AV-Lip-Sync+: Leveraging AV-HuBERT to Exploit Multimodal Inconsistency for Video Deepfake Detection.
49. AVTENet: Audio-Visual Transformer-based Ensemble Network Exploiting Multiple Experts for Video Deepfake Detection.
50. Relational Data Selection for Data Augmentation of Speaker-Dependent Multi-Band MelGAN Vocoder.
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.