Search

Your search keyword '"Wang, Gaoang"' showing total 207 results

Search Constraints

Start Over You searched for: Author "Wang, Gaoang" Remove constraint Author: "Wang, Gaoang"
207 results on '"Wang, Gaoang"'

Search Results

1. Frame Order Matters: A Temporal Sequence-Aware Model for Few-Shot Action Recognition

2. STEVE Series: Step-by-Step Construction of Agent Systems in Minecraft

3. BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection

4. CityCraft: A Real Crafter for 3D City Generation

5. S4Fusion: Saliency-aware Selective State Space Model for Infrared Visible Image Fusion

6. FlexiFilm: Long Video Generation with Flexible Conditions

7. MovieChat+: Question-aware Sparse Memory for Long Video Question Answering

8. Do We Really Need a Complex Agent System? Distill Embodied Agent into a Single Model

9. VersaT2I: Improving Text-to-Image Models with Versatile Reward

10. Hierarchical Auto-Organizing System for Open-Ended Multi-Agent Navigation

11. MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant

12. UniDCP: Unifying Multiple Medical Vision-language Tasks via Dynamic Cross-modal Learnable Prompts

13. User-Aware Prefix-Tuning is a Good Learner for Personalized Image Captioning

14. CityGen: Infinite and Controllable 3D City Layout Generation

15. See and Think: Embodied Agent in Virtual Environment

16. Vision meets mmWave Radar: 3D Object Perception Benchmark for Autonomous Driving

17. Sam-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning

19. Devil in the Number: Towards Robust Multi-modality Data Filter

20. FrameRS: A Video Frame Compression Model Composed by Self supervised Video Frame Reconstructor and Key Frame Selector

21. Chasing Consistency in Text-to-3D Generation from a Single Image

22. Bridging Cross-task Protocol Inconsistency for Distillation in Dense Object Detection

23. UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning

24. PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation

25. StableVideo: Text-driven Consistency-aware Diffusion Video Editing

26. MovieChat: From Dense Token to Sparse Memory for Long Video Understanding

27. A Survey of Deep Learning in Sports Applications: Perception, Comprehension, and Decision

28. MPM: A Unified 2D-3D Human Pose Representation via Masked Pose Modeling

29. Language Adaptive Weight Generation for Multi-task Visual Grounding

30. SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation

31. User-Aware Prefix-Tuning Is a Good Learner for Personalized Image Captioning

32. Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation

33. DDMM-Synth: A Denoising Diffusion Model for Cross-modal Medical Image Synthesis with Sparse-view Measurement Embedding

34. Blind Inpainting with Object-aware Discrimination for Artificial Marker Removal

35. Deep Learning Methods for Small Molecule Drug Discovery: A Survey

36. DIVOTrack: A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes

37. DiffFashion: Reference-based Fashion Design with Structure-aware Transfer by Diffusion Models

38. STSC-SNN: Spatio-Temporal Synaptic Connection with Temporal Convolution and Attention for Spiking Neural Networks

39. Missing Modality meets Meta Sampling (M3S): An Efficient Universal Approach for Multimodal Sentiment Analysis with Missing Modality

40. Hierarchical Semi-Supervised Contrastive Learning for Contamination-Resistant Anomaly Detection

42. Recent Advances in Embedding Methods for Multi-Object Tracking: A Survey

43. Preserve Pre-trained Knowledge: Transfer Learning With Self-Distillation For Action Recognition

44. Human-Centered Prior-Guided and Task-Dependent Multi-Task Representation Learning for Action Recognition Pre-Training

45. Self-paced Multi-grained Cross-modal Interaction Modeling for Referring Expression Comprehension

46. MAP-SNN: Mapping Spike Activities with Multiplicity, Adaptability, and Plasticity into Bio-Plausible Spiking Neural Networks

48. Disjoint Contrastive Regression Learning for Multi-Sourced Annotations

49. ActiveMatch: End-to-end Semi-supervised Active Representation Learning

50. Track without Appearance: Learn Box and Tracklet Embedding with Local and Global Motion Patterns for Vehicle Tracking

Catalog

Books, media, physical & digital resources