Search

Search for author "Cao Liangliang": 519 results

Search Results

1. Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention

2. MMCOMPOSITION: Revisiting the Compositionality of Pre-trained Vision-Language Models

3. Apple Intelligence Foundation Language Models

4. Diffusion Model-Based Image Editing: A Survey

5. Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models

6. Ferret: Refer and Ground Anything Anywhere at Any Granularity

7. Efficient-3DiM: Learning a Generalizable Single-image Novel-view Synthesizer in One Day

8. Instruction-Following Speech Recognition

10. RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture

11. Less is More: Removing Text-regions Improves CLIP Training Efficiency and Robustness

12. STAIR: Learning Sparse Text and Image Representation in Grounded Tokens

13. Exploiting Category Names for Few-Shot Classification with Vision-Language Models

15. PriFit: Learning to Fit Primitives Improves Few Shot Point Cloud Segmentation

16. Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition

17. Input Length Matters: Improving RNN-T and MWER Training for Long-form Telephony Speech Recognition

19. BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition

21. Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction

22. Bridging the gap between streaming and non-streaming ASR systems by distilling ensembles of CTC and RNN-T models

23. Exploring Targeted Universal Adversarial Perturbations to End-to-end ASR Models

24. Residual Energy-Based Models for End-to-End Speech Recognition

25. Learning Word-Level Confidence For Subword End-to-End ASR

28. Spatial-Temporal Alignment Network for Action Recognition and Detection

29. Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data

30. Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition

31. Zero-shot Entity Linking with Efficient Long Range Sequence Modeling

32. RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions

33. Label-Efficient Learning on Point Clouds using Approximate Convex Decompositions

36. Progressive Learning Algorithm for Efficient Person Re-Identification

37. Speech Sentiment Analysis via Pre-trained Features from End-to-end ASR Models

38. Product Image Recognition with Guidance Learning and Noisy Supervision

39. Accurate and Robust Pulmonary Nodule Detection by 3D Feature Pyramid Network with Self-supervised Feature Learning

40. 3DFPN-HS$^2$: 3D Feature Pyramid Network Based High Sensitivity and Specificity Pulmonary Nodule Detection

41. Automatic adaptation of object detectors to new domains using self-training

42. Learning Deterministic Policy with Target for Power Control in Wireless Networks

46. Matrix Factorization on GPUs with Memory Optimization and Approximate Computing

47. Focal Visual-Text Attention for Visual Question Answering

48. Pluronic F127-modified BaTiO3 for ceramic/polymer nanocomposite dielectric capacitor with enhanced energy storage performance
