Search

Your search keyword '"Tan, Zheng-Hua"' showing total 648 results

Search Constraints

Start Over You searched for: Author "Tan, Zheng-Hua" Remove constraint Author: "Tan, Zheng-Hua" Search Limiters Academic (Peer-Reviewed) Journals Remove constraint Search Limiters: Academic (Peer-Reviewed) Journals
648 results on '"Tan, Zheng-Hua"'

Search Results

1. BiSSL: Bilevel Optimization for Self-Supervised Pre-Training and Fine-Tuning

2. Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion Models

3. Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder

4. Audio xLSTMs: Learning Self-Supervised Audio Representations with xLSTMs

5. Zero-Shot Audio Captioning Using Soft and Hard Prompts

6. The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems

7. Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations

8. Noise-Robust Keyword Spotting through Self-supervised Pretraining

9. How to train your ears: Auditory-model emulation for large-dynamic-range inputs and mild-to-severe hearing losses

10. Neural Networks Hear You Loud And Clear: Hearing Loss Compensation Using Deep Neural Networks

11. Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions

12. PAC-Bayes Generalisation Bounds for Dynamical Systems Including Stable RNNs

13. Investigating the Design Space of Diffusion Models for Speech Enhancement

14. Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler

15. Joint Minimum Processing Beamforming and Near-end Listening Enhancement

16. Masked Autoencoders with Multi-Window Local-Global Attention Are Better Audio Learners

17. Speech inpainting: Context-based speech synthesis guided by video

18. PAC-Bayesian bounds for learning LTI-ss systems with input from empirical loss

19. PAC-Bayesian-Like Error Bound for a Class of Linear Time-Invariant Stochastic State-Space Models

20. Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting

21. Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder

22. Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise

23. Minimum Processing Near-end Listening Enhancement

24. Improving Label-Deficient Keyword Spotting Through Self-Supervised Pretraining

25. Adversarial Multi-Task Deep Learning for Noise-Robust Voice Activity Detection with Low Algorithmic Delay

26. Floor Map Reconstruction Through Radio Sensing and Learning By a Large Intelligent Surface

27. User Localization using RF Sensing: A Performance comparison between LIS and mmWave Radars

28. Complex Recurrent Variational Autoencoder with Application to Speech Enhancement

29. Disentangled Speech Representation Learning Based on Factorized Hierarchical Variational Autoencoder with Self-Supervised Objective

30. Summary On The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Grand Challenge

32. On Training Targets and Activation Functions for Deep Representation Learning in Text-Dependent Speaker Verification

33. Deep Spoken Keyword Spotting: An Overview

34. Joint Far- and Near-End Speech Intelligibility Enhancement based on the Approximated Speech Intelligibility Index

35. Radio Sensing with Large Intelligent Surface for 6G

36. Design of AoI-Aware 5G Uplink Scheduler UsingReinforcement Learning

37. Remote Anomaly Detection in Industry 4.0 Using Resource-Constrained Devices

38. Explicit construction of the minimum error variance estimator for stochastic LTI state-space systems

39. Improvement of Noise-Robust Single-Channel Voice Activity Detection with Spatial Pre-processing

40. INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing

41. On TasNet for Low-Latency Single-Speaker Speech Enhancement

42. PAC-Bayesian theory for stochastic LTI systems

43. Data Generation Using Pass-phrase-dependent Deep Auto-encoders for Text-Dependent Speaker Verification

44. Vocal Tract Length Perturbation for Text-Dependent Speaker Verification with Autoregressive Prediction Coding

45. Assessing Wireless Sensing Potential with Large Intelligent Surfaces

46. CC-Loss: Channel Correlation Loss For Image Classification

47. Advanced Dropout: A Model-free Methodology for Bayesian Dropout Optimization

48. Audio-Visual Speech Inpainting with Deep Learning

49. An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation

50. UIAI System for Short-Duration Speaker Verification Challenge 2020

Catalog

Books, media, physical & digital resources