Search

Your search keyword '"Deng, Jiajun"' showing total 479 results

Search Constraints

You searched for: Author "Deng, Jiajun"

Search Results

1. Exploring SSL Discrete Speech Features for Zipformer-based Contextual ASR

2. RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies

3. Described Spatial-Temporal Video Detection

4. Homogeneous Speaker Features for On-the-Fly Dysarthric and Elderly Speaker Adaptation

5. Self-supervised ASR Models and Features For Dysarthric and Elderly Speech Recognition

6. Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion

7. One-pass Multiple Conformer and Foundation Speech Systems Compression and Quantization Using An All-in-one Neural Model

8. Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition

9. Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask

10. End-to-End Rate-Distortion Optimized 3D Gaussian Representation

11. Agent3D-Zero: An Agent for Zero-shot 3D Understanding

12. HVDistill: Transferring Knowledge from Images to Point Clouds via Unsupervised Hybrid-View Distillation

13. PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest

14. DeepEraser: Deep Iterative Context Mining for Generic Text Eraser

15. Cycle-Consistency Learning for Captioning and Grounding

16. Towards Automatic Data Augmentation for Disordered Speech Recognition

17. FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height Plugin

18. I$^2$MD: 3D Action Representation Learning with Inter- and Intra-modal Mutual Distillation

19. Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition

20. SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning

21. Masked Motion Predictors are Strong 3D Action Representation Learners

22. Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection

23. Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition

24. Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and Dysarthric Speech Recognition

25. Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems

26. Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems

27. Bi-LRFusion: Bi-Directional LiDAR-Radar Fusion for 3D Dynamic Object Detection

28. Use of Speech Impairment Severity for Dysarthric Speech Recognition

29. Deep Unrestricted Document Image Rectification

31. Exploring Self-supervised Pre-trained ASR Models For Dysarthric and Elderly Speech Recognition

32. Confidence Score Based Speaker Adaptation of Conformer Speech Recognition Systems

33. Recurrent Generic Contour-based Instance Segmentation with Progressive Learning

34. OA-BEV: Bringing Object Awareness to Bird's-Eye-View Representation for Multi-Camera 3D Object Detection

35. 3DPPE: 3D Point Positional Encoding for Multi-Camera 3D Object Detection Transformers

36. Adversarial Data Augmentation Using VAE-GAN for Disordered Speech Recognition

37. Exploiting prompt learning with pre-trained language models for Alzheimer's Disease detection

38. Geometric Representation Learning for Document Image Rectification

39. CMD: Self-supervised 3D Action Representation Learning with Cross-modal Mutual Distillation

41. A patient with bimaxillary protrusion was treated by the extraction of four premolars and four compromised first molars: a case report

42. Conformer Based Elderly Speech Recognition System for Alzheimer's Disease Detection

43. Two-pass Decoding and Cross-adaptation Based System Combination of End-to-end Conformer and Hybrid TDNN ASR Systems

44. Confidence Score Based Conformer Speaker Adaptation for Speech Recognition

45. Exploiting Cross-domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition

46. TransVG++: End-to-End Visual Grounding with Language Conditioned Vision Transformer

47. Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition

48. Audio-visual multi-channel speech separation, dereverberation and recognition

49. Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks
