Search

Your search keyword '"ZHANG, ZHAOXIANG"' showing total 1,418 results

Search Constraints

Start Over You searched for: Author "ZHANG, ZHAOXIANG" Remove constraint Author: "ZHANG, ZHAOXIANG"
1,418 results on '"ZHANG, ZHAOXIANG"'

Search Results

1. MIO: A Foundation Model on Multimodal Tokens

2. HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models

3. OmniBench: Towards The Future of Universal Omni-Language Models

4. SimMAT: Exploring Transferability from Vision Foundation Models to Any Image Modality

5. Enhancing Sound Source Localization via False Negative Elimination

6. CityX: Controllable Procedural Content Generation for Unbounded 3D Cities

7. Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation

8. General Geometry-aware Weakly Supervised 3D Object Detection

9. Monocular Occupancy Prediction for Scalable Indoor Scenes

10. Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection

11. Enhancing End-to-End Autonomous Driving with Latent World Model

12. Trim 3D Gaussian Splatting for Accurate Geometry Representation

13. Towards Flexible Interactive Reflection Removal with Human Guidance

14. Learning Object-Centric Representation via Reverse Hierarchy Guidance

15. StableMoFusion: Towards Robust and Efficient Diffusion-based Motion Generation Framework

16. A Survey on Personalized Content Synthesis with Diffusion Models

17. Dual-Image Enhanced CLIP for Zero-Shot Anomaly Detection

18. Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond

19. MaterialSeg3D: Segmenting Dense Materials from 2D Priors for 3D Assets

20. Robust Depth Enhancement via Polarization Prompt Fusion Tuning

21. CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians

22. Reusable Architecture Growth for Continual Stereo Matching

23. Enhancing Visual Continual Learning with Language-Guided Supervision

24. SceneX:Procedural Controllable Large-scale Scene Generation via Large-language Models

25. Generative Active Learning for Image Synthesis Personalization

26. Continual Forgetting for Pre-trained Vision Models

27. Open-world Machine Learning: A Review and New Outlooks

28. MemoNav: Working Memory Model for Visual Navigation

29. DiffSpeaker: Speech-Driven 3D Facial Animation with Diffusion Transformer

33. SAGD: Boundary-Enhanced Segment Anything in 3D Gaussian via Gaussian Decomposition

34. MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object Detection

35. Seek for Incantations: Towards Accurate Text-to-Image Diffusion Synthesis through Prompt Engineering

36. FurniScene: A Large-scale 3D Room Dataset with Intricate Furnishing Scenes

37. Pareto-based Multi-Objective Recommender System with Forgetting Curve

38. Bootstrap Masked Visual Modeling via Hard Patches Mining

39. Compositional Inversion for Stable Diffusion Models

40. GPT4SGG: Synthesizing Scene Graphs from Holistic and Region-specific Narratives

41. Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention

42. Visual Commonsense based Heterogeneous Graph Contrastive Learning

43. OneTrack: Demystifying the Conflict Between Detection and Tracking in End-to-End 3D Trackers

46. RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models

47. Informative Data Mining for One-Shot Cross-Domain Semantic Segmentation

48. DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions

49. Bootstrap Fine-Grained Vision-Language Alignment for Unified Zero-Shot Anomaly Localization

50. FSD V2: Improving Fully Sparse 3D Object Detection with Virtual Voxels

Catalog

Books, media, physical & digital resources