Search

Your search for "Lu, Jiwen" returned 1,535 results.

Search Constraints

Author: "Lu, Jiwen"

Search Results

1. FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner

2. Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution

3. DC-Solver: Improving Predictor-Corrector Diffusion Sampler via Dynamic Compensation

4. EmbodiedSAM: Online Segment Any 3D Thing in Real Time

5. Temporal Feature Matters: A Framework for Diffusion Model Quantization

6. Efficient Inference of Vision Instruction-Following Models with Elastic Cache

7. GeoLRM: Geometry-Aware Large Reconstruction Model for High-Quality 3D Gaussian Generation

8. Embodied Instruction Following in Unknown Environments

9. FlowIE: Efficient Image Enhancement via Rectified Flow

10. OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving

11. GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction

12. Hardness-Aware Scene Synthesis for Semi-Supervised 3D Object Detection

13. Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond

14. X-3D: Explicit 3D Structure Modeling for Point Cloud Recognition

15. Narrative Action Evaluation with Prompt-Guided Multimodal Interaction

16. LOGO: A Long-Form Video Dataset for Group Action Quality Assessment

17. DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery

18. Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression

19. Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models

20. Learning Dual-Level Deformable Implicit Representation for Real-World Scale Arbitrary Super-Resolution

21. ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation

22. Memory-based Adapters for Online 3D Scene Perception

23. MADTP: Multimodal Alignment-Guided Dynamic Token Pruning for Accelerating Vision-Language Transformer

24. Path Choice Matters for Clear Attribution in Path Methods

25. OccNeRF: Advancing 3D Occupancy Prediction in LiDAR-Free Environments

26. ThinkBot: Embodied Instruction Following with Thought Chain Reasoning

27. Segment and Caption Anything

28. SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction

29. Fast Shapley Value Estimation: A Unified Approach

30. ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation

33. MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory

34. Anyview: Generalizable Indoor 3D Object Detection with Variable Frames

35. Skip-Plan: Procedure Planning in Instructional Videos via Condensed Action Space Learning

36. TCOVIS: Temporally Consistent Online Video Instance Segmentation

37. DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving

38. Introspective Deep Metric Learning

39. PointOcc: Cylindrical Tri-Perspective View for Point-based 3D Semantic Occupancy Prediction

40. Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models

41. Embodied Task Planning with Large Language Models

42. Towards Accurate Post-training Quantization for Diffusion Models

43. 3D Small Object Detection with Dynamic Spatial Pruning

44. Dense Hybrid Proposal Modulation for Lane Detection

45. Learning Accurate Performance Predictors for Ultrafast Automated Model Compression

46. LRRNet: A Novel Representation Learning Guided Fusion Network for Infrared and Visible Images

47. Binarizing Sparse Convolutional Networks for Efficient Point Cloud Analysis

48. Efficient Meshy Neural Fields for Animatable Human Avatars

49. SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving

50. OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception
