Your search for author "Chen, Qifeng" in the arXiv database returned 151 results in total

Search Results

1. LiDAR-GS: Real-time LiDAR Re-Simulation using Gaussian Splatting

2. SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality

3. HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts

4. Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation

5. Diffusion-Based Visual Art Creation: A Survey and New Perspectives

6. SkillMimic: Learning Reusable Basketball Skills from Demonstrations

7. TALE: Training-free Cross-domain Image Composition via Adaptive Latent Manipulation and Energy-guided Optimization

8. MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions

9. Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection

10. GIC: Gaussian-Informed Continuum for Physical Property Identification and Simulation

11. VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling

12. Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation

13. LLMs Meet Multimodal Generation and Editing: A Survey

14. Hawk: Learning to Understand Open-World Video Anomalies

15. HR Human: Modeling Human Avatars with Triangular Mesh and High-Resolution Textures from Videos

16. 4D Panoptic Scene Graph Generation

17. PLUTO: Pushing the Limit of Imitation Learning-based Planning for Autonomous Driving

18. Latent Guard: a Safety Framework for Text-to-image Generation

19. Automatic Controllable Colorization via Imagination

20. Robust Depth Enhancement via Polarization Prompt Fusion Tuning

21. OV9D: Open-Vocabulary Category-Level 9D Object Pose and Size Estimation

22. Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts

23. Cross-Cluster Shifting for Efficient and Effective 3D Object Detection in Autonomous Driving

24. Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

25. Real-time 3D-aware Portrait Editing from a Single Image

26. Using Left and Right Brains Together: Towards Vision and Language Planning

27. Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation

28. DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation

29. SPIRE: Semantic Prompt-Driven Image Restoration

30. MagicScroll: Nontypical Aspect-Ratio Image Generation for Visual Storytelling via Multi-Layered Semantic-Aware Denoising

31. HeadArtist: Text-conditioned 3D Head Generation with Self Score Distillation

32. Learning Naturally Aggregated Appearance for Efficient 3D Editing

33. MagicStick: Controllable Video Editing via Control Handle Transformations

34. LDM-ISP: Enhancing Neural ISP for Low Light with Latent Diffusion Models

35. PPAD: Iterative Interactions of Prediction and Planning for End-to-end Autonomous Driving

36. VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

37. ControlLLM: Augment Language Models with Tools by Searching on Graphs

38. ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models

39. In-Domain GAN Inversion for Faithful Reconstruction and Editability

40. AniPortraitGAN: Animatable 3D Portrait Generation from 2D Image Collections

41. Online Overexposed Pixels Hallucination in Videos with Adaptive Reference Frame Selection

42. CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

43. Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation

44. CMDFusion: Bidirectional Fusion Network with Cross-modality Knowledge Distillation for LIDAR Semantic Segmentation

45. SAD: Segment Any RGBD

46. TextDiffuser: Diffusion Models as Text Painters

47. Flipbot: Learning Continuous Paper Flipping via Coarse-to-Fine Exteroceptive-Proprioceptive Exploration

48. Learn to Grasp via Intention Discovery and its Application to Challenging Clutter

49. ERRA: An Embodied Representation and Reasoning Architecture for Long-horizon Language-conditioned Manipulation Tasks

50. Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos
