112 results on '"Shengkui Zhao"'
Search Results
2. MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation.
3. Are Soft Prompts Good Zero-Shot Learners for Speech Recognition?
4. Emotional Dimension Control in Language Model-Based Text-to-Speech: Spanning a Broad Spectrum of Human Emotions.
5. Towards Audio Codec-based Speech Separation.
6. Phonetic Enhanced Language Modeling for Text-to-Speech Synthesis.
7. Adapter-tuning with Effective Token-dependent Representation Shift for Automatic Speech Recognition.
8. ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention.
9. D2Former: A Fully Complex Dual-Path Dual-Decoder Conformer Network Using Joint Complex Masking and Complex Spectral Mapping for Monaural Speech Enhancement.
10. MossFormer: Pushing the Performance Limit of Monaural Speech Separation Using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions.
11. FRCRN: Boosting Feature Representation Using Frequency Recurrence for Monaural Speech Enhancement.
12. End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression.
13. SPGM: Prioritizing Local Features for enhanced speech separation performance.
14. Are Soft Prompts Good Zero-shot Learners for Speech Recognition?
15. MossFormer2: Combining Transformer and RNN-Free Recurrent Network for Enhanced Time-Domain Monaural Speech Separation.
16. Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses.
17. Towards Natural and Controllable Cross-Lingual Voice Conversion Based on Neural TTS Model and Phonetic Posteriorgram.
18. Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion.
19. Multi-Task Multi-Network Joint-Learning of Deep Residual Networks and Cycle-Consistency Generative Adversarial Networks for Robust Speech Recognition.
20. Fast Learning for Non-Parallel Many-to-Many Voice Conversion with Residual Star Generative Adversarial Networks.
21. End-to-End Complex-Valued Multidilated Convolutional Neural Network for Joint Acoustic Echo Cancellation and Noise Suppression.
22. A novel sparse model for multi-source localization using distributed microphone array.
23. On time-frequency mask estimation for MVDR beamforming with application in robust speech recognition.
24. Large region acoustic source mapping: A generalized sparse constrained deconvolution approach.
25. An expectation-maximization eigenvector clustering approach to direction of arrival estimation of multiple speech sources.
26. Learning to estimate reverberation time in noisy and reverberant rooms.
27. Robust speech recognition using beamforming with adaptive microphone gains and multichannel noise reduction.
28. Large region acoustic source mapping using movable arrays.
29. A learning-based approach to direction of arrival estimation in noisy and reverberant environments.
30. A new auxiliary-vector algorithm with conjugate orthogonality for speech enhancement.
31. Robust DOA estimation of multiple speech sources.
32. ITEM: Immersive Telepresence for Entertainment and Meetings - A Practical Approach.
33. Spatialized audio multiparty teleconferencing with commodity miniature microphone array.
34. A Fast-Converging Adaptive Frequency-Domain MVDR Beamformer for Speech Enhancement.
35. Real-time implementation and performance optimization of 3D sound localization on GPUs.
36. Nonlinear image restoration using recurrent radial basis function network.
37. Teleimmersive Audio-Visual Communication Using Commodity Hardware [Applications Corner].
38. New Variable Step-Sizes Minimizing Mean-Square Deviation for the LMS-Type Algorithms.
39. Underdetermined direction of arrival estimation using acoustic vector sensor.
40. Modified LMS and NLMS Algorithms with a New Variable Step Size.
41. Sliding Mode Control of Fuzzy Dynamic Systems.
42. Adaptive fast finite-time multiple-surface sliding control for a class of uncertain non-linear systems.
43. Variable step-size LMS algorithm with a quotient form.
44. A generalized data windowing scheme for adaptive conjugate gradient algorithms.
45. Stability and Convergence Analysis of Transform-Domain LMS Adaptive Filters With Second-Order Autoregressive Process.
46. Comments on 'Adaptive multiple-surface sliding control for non-autonomous systems with mismatched uncertainties'.
47. ITEM: Immersive Telepresence for Entertainment and Meetings - A Practical Approach.
48. Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses
49. Speech dereverberation for enhancement and recognition using dynamic features constrained deep neural networks and feature adaptation.
50. Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion
Catalog
Books, media, physical & digital resources
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.