Search

Your search keyword '"Guez, Arthur"' showing total 39 results

Search Constraints

Start Over You searched for: Author "Guez, Arthur" Remove constraint Author: "Guez, Arthur"
39 results on '"Guez, Arthur"'

Search Results

1. A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning

2. Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

3. Gemini: A Family of Highly Capable Multimodal Models

4. Acceleration in Policy Optimization

5. Large-Scale Retrieval for Reinforcement Learning

6. COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation

7. Retrieval-Augmented Reinforcement Learning

8. Muesli: Combining Improvements in Policy Optimization

9. Counterfactual Credit Assignment in Model-Free Reinforcement Learning

10. On the role of planning in model-based deep reinforcement learning

11. Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban

12. Physically Embedded Planning Problems: New Challenges for Reinforcement Learning

13. Value-driven Hindsight Modelling

14. Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model

15. Augmenting learning using symmetry in a biologically-inspired domain

16. An investigation of model-free planning

17. Woulda, Coulda, Shoulda: Counterfactually-Guided Policy Search

18. Learning to Search with MCTSnets

19. Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

20. Imagination-Augmented Agents for Deep Reinforcement Learning

21. The Predictron: End-To-End Learning and Planning

22. Learning values across many orders of magnitude

23. Adaptive planning in human search

24. Increasing the Action Gap: New Operators for Reinforcement Learning

25. Deep Reinforcement Learning with Double Q-learning

26. Better Optimism By Bayes: Adaptive Planning with Rich Models

27. Mastering Atari, Go, chess and shogi by planning with a learned model

28. Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search

30. Mastering the game of Go without human knowledge

31. Optimism and Adaptivity in Policy Optimization

33. Mastering the game of Go with deep neural networks and tree search

35. Deep Reinforcement Learning with Double Q-Learning

36. Increasing the Action Gap: New Operators for Reinforcement Learning

Catalog

Books, media, physical & digital resources