Search

Your search keyword '"Shrivastava, Abhinav"' showing total 577 results

Search Constraints

Start Over You searched for: Author "Shrivastava, Abhinav" Remove constraint Author: "Shrivastava, Abhinav"
577 results on '"Shrivastava, Abhinav"'

Search Results

1. VeriGraph: Scene Graphs for Execution Verifiable Robot Planning

2. NeRF-Aug: Data Augmentation for Robotics with Neural Radiance Fields

3. LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior

4. Fast Encoding and Decoding for Implicit Video Representation

5. LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation

6. Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics

7. Trajectory-aligned Space-time Tokens for Few-shot Action Recognition

8. WayEx: Waypoint Exploration using a Single Demonstration

9. InVi: Object Insertion In Videos Using Off-the-Shelf Diffusion Models

10. V-VIPE: Variational View Invariant Pose Embedding

11. ARDuP: Active Region Video Diffusion for Universal Policies

12. Composing Object Relations and Attributes for Image-Text Matching

13. AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language Models

14. UVIS: Unsupervised Video Instance Segmentation

15. MaGGIe: Masked Guided Gradual Human Instance Matting

16. MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

17. What is Point Supervision Worth in Video Instance Segmentation?

18. Measuring Style Similarity in Diffusion Models

19. LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors

20. Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models

21. Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions

22. Video Dynamics Prior: An Internal Learning Approach for Robust Video Enhancements

23. EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS

24. Gen2Det: Generate to Detect

25. Multimodality-guided Image Style Transfer using Cross-modal GAN Inversion

26. Latent-INR: A Flexible Framework for Implicit Representations of Videos with Discriminative Semantics

27. EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS

28. LEIA: Latent View-Invariant Embeddings for Implicit 3D Articulation

29. Trajectory-Aligned Space-Time Tokens for Few-Shot Action Recognition

30. Fast Encoding and Decoding for Implicit Video Representation

31. LiFT: A Surprisingly Simple Lightweight Feature Transform for Dense ViT Descriptors

33. A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval

34. Do text-free diffusion models learn discriminative visual representations?

35. Multi-entity Video Transformers for Fine-Grained Video Representation Learning

36. SHACIRA: Scalable HAsh-grid Compression for Implicit Neural Representations

37. Chop & Learn: Recognizing and Generating Object-State Compositions

39. Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization

40. Diffusion Models Beat GANs on Image Classification

41. SimpSON: Simplifying Photo Cleanup with Single-Click Distracting Object Segmentation Network

42. MOST: Multiple Object localization with Self-supervised Transformers for object discovery

43. HNeRV: A Hybrid Neural Representation for Videos

44. ASIC: Aligning Sparse in-the-wild Image Collections

45. FlexNeRF: Photorealistic Free-viewpoint Rendering of Moving Humans from Sparse Views

46. Towards Scalable Neural Representation for Diverse Videos

47. Align and Attend: Multimodal Summarization with Dual Contrastive Losses

48. COVID-VTS: Fact Extraction and Verification on Short Video Platforms

50. NIRVANA: Neural Implicit Representations of Videos with Adaptive Networks and Autoregressive Patch-wise Modeling

Catalog

Books, media, physical & digital resources