Search

Your search keyword '"Toshev, Alexander"' showing total 163 results

Search Constraints

Start Over You searched for: Author "Toshev, Alexander" Remove constraint Author: "Toshev, Alexander"
163 results on '"Toshev, Alexander"'

Search Results

1. World-consistent Video Diffusion with Explicit 3D Modeling

2. Multimodal Autoregressive Pre-training of Large Vision Encoders

3. On the Modeling Capabilities of Large Language Models for Sequential Decision Making

4. DataComp-LM: In search of the next generation of training sets for language models

5. Grounding Multimodal Large Language Models in Actions

6. MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

7. Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation

8. Large Language Models as Generalizable Policies for Embodied Tasks

9. Data Filtering Networks

10. Mobile V-MoEs: Scaling Down Vision Transformers via Sparse Mixture-of-Experts

11. Principles and Guidelines for Evaluating Social Robot Navigation Algorithms

12. Value function estimation using conditional diffusion models for control

13. On Robustness in Multimodal Learning

14. STAIR: Learning Sparse Text and Image Representation in Grounded Tokens

15. Perceptual Grouping in Contrastive Vision-Language Models

16. Retrospectives on the Embodied AI Workshop

17. Gesture2Path: Imitation Learning for Gesture-aware Navigation

18. GAUDI: A Neural Architect for Immersive 3D Scene Generation

19. A Protocol for Validating Social Navigation Policies

20. Do As I Can, Not As I Say: Grounding Language in Robotic Affordances

21. Socially Compliant Navigation Dataset (SCAND): A Large-Scale Dataset of Demonstrations for Social Navigation

22. Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning

23. ReLMoGen: Leveraging Motion Generation in Reinforcement Learning for Mobile Manipulation

24. Adversarial Generative Grammars for Human Activity Prediction

25. Learning Object-conditioned Exploration using Distributed Soft Actor Critic

26. ObjectNav Revisited: On Evaluation of Embodied Agents Navigating to Objects

27. Modeling Long-horizon Tasks as Sequential Interaction Landscapes

28. Interactive Gibson Benchmark (iGibson 0.5): A Benchmark for Interactive Navigation in Cluttered Environments

29. Long Range Neural Navigation Policies for the Real World

30. Scene Memory Transformer for Embodied Agents in Long-Horizon Tasks

31. Evolving Space-Time Neural Architectures for Videos

32. Self-supervisory Signals for Object Discovery and Detection

33. Visual Representations for Semantic Target Driven Navigation

34. Sim2Real View Invariant Visual Servoing by Recurrent Control

35. No Fuss Distance Metric Learning using Proxies

36. Towards Accurate Multi-person Pose Estimation in the Wild

37. Adversarial Generative Grammars for Human Activity Prediction

38. Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge

39. Chained Predictions Using Convolutional Neural Networks

40. Scalable Pre-training of Large Autoregressive Image Models

41. The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition

42. Generation and Comprehension of Unambiguous Object Descriptions

43. Pose Embeddings: A Deep Architecture for Learning to Match Human Poses

44. Show and Tell: A Neural Image Caption Generator

45. Deep Convolutional Ranking for Multilabel Image Annotation

46. DeepPose: Human Pose Estimation via Deep Neural Networks

47. Scalable Object Detection using Deep Neural Networks

49. The Unreasonable Effectiveness of Noisy Data for Fine-Grained Recognition

Catalog

Books, media, physical & digital resources