Search

Your search keyword '"Wang, Jingdong"' showing total 1,373 results

Search Constraints

Start Over You searched for: Author "Wang, Jingdong" Remove constraint Author: "Wang, Jingdong"
1,373 results on '"Wang, Jingdong"'

Search Results

1. Schedule Your Edit: A Simple yet Effective Diffusion Noise Schedule for Image Editing

2. Improving Multi-modal Large Language Model through Boosting Vision Capabilities

3. TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model

4. MGMapNet: Multi-Granularity Representation Learning for End-to-End Vectorized HD Map Construction

5. Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation

6. Flipped Classroom: Aligning Teacher Attention with Student in Generalized Category Discovery

7. MonoFormer: One Transformer for Both Diffusion and Autoregression

8. Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous Driving

9. FullAnno: A Data Engine for Enhancing Image Comprehension of MLLMs

10. SpotActor: Training-Free Layout-Controlled Consistent Image Generation

11. Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression

12. EasyChauffeur: A Baseline Advancing Simplicity and Efficiency on Waymax

13. Disentangled Noisy Correspondence Learning

14. ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer

15. Add-SD: Rational Generation without Manual Reference

16. LION: Linear Group RNN for 3D Object Detection in Point Clouds

17. Explore the LiDAR-Camera Dynamic Adjustment Fusion for 3D Object Detection

18. Surfel-based Gaussian Inverse Rendering for Fast and Relightable Dynamic Human Reconstruction from Monocular Video

19. LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction

20. OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection

21. SEED: A Simple and Effective 3D DETR in Point Clouds

22. OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer

23. Timestep-Aware Correction for Quantized Diffusion Models

24. Evaluation of Text-to-Video Generation Models: A Dynamics Perspective

25. XLD: A Cross-Lane Dataset for Benchmarking Novel Driving View Synthesis

26. VDG: Vision-Only Dynamic Gaussian for Driving Simulation

27. Assessing Model Generalization in Vicinity

28. Skim then Focus: Integrating Contextual and Fine-grained Views for Repetitive Action Counting

29. Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

30. BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection

31. LW-DETR: A Transformer Replacement to YOLO for Real-Time Detection

32. OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding

33. StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond

34. Towards Unified Multi-granularity Text Detection with Interactive Attention

35. Are Images Indistinguishable to Humans Also Indistinguishable to Classifiers?

36. Dense Connector for MLLMs

37. Unsupervised Pre-training with Language-Vision Prompts for Low-Data Instance Segmentation

38. Automated Multi-level Preference for MLLMs

39. RTG-SLAM: Real-time 3D Reconstruction at Scale using Gaussian Splatting

40. Training-Free Unsupervised Prompt for Vision-Language Models

41. CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and View-consistent 3D Semantic Understanding

42. TryOn-Adapter: Efficient Fine-Grained Clothing Identity Adaptation for High-Fidelity Virtual Try-On

43. DreamSalon: A Staged Diffusion Framework for Preserving Identity-Context in Editable Face Generation

44. Decoupled Pseudo-labeling for Semi-Supervised Monocular 3D Object Detection

45. Gradient-based Sampling for Class Imbalanced Semi-supervised Object Detection

46. TexRO: Generating Delicate Textures of 3D Models by Recursive Optimization

47. GGRt: Towards Pose-free Generalizable 3D Gaussian Splatting in Real-time

48. Learning to Rematch Mismatched Pairs for Robust Cross-Modal Retrieval

Catalog

Books, media, physical & digital resources