Search

Your search for author "TOMBARI, FEDERICO" returned 1,095 results

Search Results

1. Towards Real-Time Open-Vocabulary Video Instance Segmentation

2. LayoutVLM: Differentiable Optimization of 3D Layout via Vision-Language Models

3. ESCAPE: Equivariant Shape Completion via Anchor Point Encoding

4. Gaussians-to-Life: Text-Driven Animation of 3D Gaussian Splatting Scenes

5. SmileSplat: Generalizable Gaussian Splats for Unconstrained Sparse Images

6. Active Data Curation Effectively Distills Large-Scale Multimodal Models

7. UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image

8. G2SDF: Surface Reconstruction from Explicit Gaussians with Implicit SDFs

9. TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters

10. Neural Semantic Map-Learning for Autonomous Vehicles

11. Search3D: Hierarchical Open-Vocabulary 3D Segmentation

12. Self-supervised Shape Completion via Involution and Implicit Correspondences

13. LiLoc: Lifelong Localization using Adaptive Submap Joining and Egocentric Factor Graph

14. P2P-Bridge: Diffusion Bridges for 3D Point Cloud Denoising

15. Extracting Training Data from Document-Based VQA Models

16. Toward a Diffusion-Based Generalist for Dense Vision Tasks

17. Dynamic Gaussian Marbles for Novel View Synthesis of Casual Monocular Videos

18. Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models

19. RaNeuS: Ray-adaptive Neural Surface Reconstruction

20. Mixed Diffusion for 3D Indoor Scene Synthesis

21. Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians

22. How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs

23. EchoScene: Indoor Scene Generation via Information Echo over Scene Graph Diffusion

24. BRAVE: Broadening the visual encoding of vision-language models

25. PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations

26. Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning

27. OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views

28. 3D scene generation from scene graphs and self-attention

29. Few-shot point cloud reconstruction and denoising via learned Guassian splats renderings and fine-tuned diffusion features

30. SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs

31. CLoRA: A Contrastive Approach to Compose Multiple LoRA Models

32. Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation

33. RadSplat: Radiance Field-Informed Gaussian Splatting for Robust Real-Time Rendering with 900+ FPS

34. GeoGaussian: Geometry-aware Gaussian Splatting for Scene Rendering

35. KP-RED: Exploiting Semantic Keypoints for Joint 3D Shape Retrieval and Deformation

36. Human Pose Descriptions and Subject-Focused Attention for Improved Zero-Shot Transfer in Human-Centric Classification Tasks

37. HyperSDFusion: Bridging Hierarchical Structures in Language and Geometry for Enhanced 3D Text2Shape Generation

38. OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding

39. Physics-Encoded Graph Neural Networks for Deformation Prediction under Contact

40. Denoising Diffusion via Image-Based Rendering

41. InseRF: Text-Driven Generative Object Insertion in Neural 3D Scenes

42. Learning to Prompt with Text Only Supervision for Vision-Language Models

43. PhysAvatar: Learning the Physics of Dressed 3D Avatars from Visual Observations

44. Self-supervised Shape Completion via Involution and Implicit Correspondences

45. SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs

46. BRAVE: Broadening the Visual Encoding of Vision-Language Models

47. Text-Conditioned Resampler For Long Form Video Understanding

48. SILC: Improving Vision Language Pretraining with Self-distillation

49. EchoScene: Indoor Scene Generation via Information Echo Over Scene Graph Diffusion

50. P2P-Bridge: Diffusion Bridges for 3D Point Cloud Denoising
