Search

Results for author "Orseau, Laurent": 153 total

Search Results

1. Super-Exponential Regret for UCT, AlphaGo and Variants

2. Learning Universal Predictors

3. Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search

4. Language Modeling Is Compression

5. Line Search for Convex Minimization

6. Levin Tree Search with Context Models

7. Memory-Based Meta-Learning on Non-Stationary Distributions

8. Isotuning With Applications To Scale-Free Online Learning

9. Proving Theorems using Incremental Learning and Hindsight Experience Replay

10. Goal Misgeneralization in Deep Reinforcement Learning

11. Policy-Guided Heuristic Search with Guarantees

12. Training a First-Order Theorem Prover from Synthetic Data

13. Avoiding Side Effects By Considering Future Tasks

14. Logarithmic Pruning is All You Need

15. Learning to Prove from Synthetic Theorems

16. Pitfalls of learning a reward function online

17. Iterative Budgeted Exponential Search

18. Zooming Cautiously: Linear-Memory Heuristic Search With Node Expansion Guarantees

19. An investigation of model-free planning

20. Soft-Bayes: Prod for Mixtures of Experts with Log-Loss

21. Single-Agent Policy Tree Search With Guarantees

22. Penalizing side effects using stepwise relative reachability

23. Agents and Devices: A Relative Definition of Agency

24. AI Safety Gridworlds

25. Reinforcement Learning with a Corrupted Reward Channel

26. Thompson Sampling is Asymptotically Optimal in General Environments

28. Teleporting Universal Intelligent Agents

29. The Multi-slot Framework: A Formal Model for Multiple, Copiable AIs

30. Universal Knowledge-Seeking Agents for Stochastic Environments

31. Space-Time Embedded Intelligence

32. Memory Issues of Intelligent Agents

33. Universal Knowledge-Seeking Agents

34. Delusion, Survival, and Intelligent Agents

35. Self-Modification and Mortality in Artificial Agents

36. Optimality Issues of Universal Greedy Agents with Static Priors

38. Short Term Memories and Forcing the Re-use of Knowledge for Generalization

40. Proving Theorems using Incremental Learning and Hindsight Experience Replay

44. Learning to Prove from Synthetic Theorems
