48 results for "Silvio Savarese"
Search Results
2. HIVE: Harnessing Human Feedback for Instructional Visual Editing.
3. ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding.
4. Procedure-Aware Pretraining for Instructional Video Understanding.
5. JRDB-Act: A Large-scale Dataset for Spatio-temporal Action, Social Group and Activity Detection.
6. Topological Planning With Transformers for Vision-and-Language Navigation.
7. TopNet: Structural Point Cloud Decoder.
8. Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks.
9. 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks.
10. Generalized Intersection Over Union: A Metric and a Loss for Bounding Box Regression.
11. DenseFusion: 6D Object Pose Estimation by Iterative Dense Fusion.
12. SoPhie: An Attentive GAN for Predicting Paths Compliant to Social and Physical Constraints.
13. Neural Task Graphs: Generalizing to Unseen Tasks From a Single Video Demonstration.
14. Taskonomy: Disentangling Task Transfer Learning.
15. Adversarial Feature Augmentation for Unsupervised Domain Adaptation.
16. Demo2Vec: Reasoning Object Affordances From Online Videos.
17. Gibson Env: Real-World Perception for Embodied Agents.
18. Deep Learning Under Privileged Information Using Heteroscedastic Dropout.
19. Social GAN: Socially Acceptable Trajectories With Generative Adversarial Networks.
20. Im2Pano3D: Extrapolating 360° Structure and Semantics Beyond the Field of View.
21. Feedback Networks.
22. Social Scene Understanding: End-to-End Multi-person Action Localization and Collective Activity Recognition.
23. Deep View Morphing.
24. Deep Metric Learning via Lifted Structured Feature Embedding.
25. DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes.
26. Structural-RNN: Deep Learning on Spatio-Temporal Graphs.
27. 3D Semantic Parsing of Large-Scale Indoor Spaces.
28. Social LSTM: Human Trajectory Prediction in Crowded Spaces.
29. Enriching object detection with 2D-3D registration and continuous viewpoint estimation.
30. A coarse-to-fine model for 3D pose estimation and sub-category recognition.
31. Watch-n-patch: Unsupervised understanding of actions and relations.
32. Data-driven 3D Voxel Patterns for object category recognition.
33. Learning an Image-Based Motion Context for Multiple People Tracking.
34. Accurate Localization of 3D Objects from RGB-D Data Using Segmentation Hypotheses.
35. Understanding Indoor Scenes Using 3D Geometric Phrases.
36. Weakly Supervised Learning of Mid-Level Features with Beta-Bernoulli Process Restricted Boltzmann Machines.
37. Dense Object Reconstruction with Semantic Priors.
38. Mobile object detection through client-server based vote transfer.
39. Semantic structure from motion with points, regions, and objects.
40. Estimating the aspect layout of object categories.
41. An efficient branch-and-bound algorithm for optimal human pose estimation.
42. Recognizing human actions by attributes.
43. Cross-view action recognition via view knowledge transfer.
44. Learning context for collective activity recognition.
45. Semantic structure from motion.
46. Toward coherent object detection and scene layout understanding.
47. A multi-view probabilistic model for 3D object classes.
48. Detecting Specular Surfaces on Natural Images.
Discovery Service for Jio Institute Digital Library