Your search for "Legg, Shane" returned 202 results in total.

Search Results

1. Scaling Instructable Agents Across Many Simulated Worlds

2. Levels of AGI for Operationalizing Progress on the Path to AGI

3. The Hydra Effect: Emergent Self-repair in Language Model Computations

4. Randomized Positional Encodings Boost Length Generalization of Transformers

5. Beyond Bayes-optimality: meta-learning what you know you don't know

6. Neural Networks and the Chomsky Hierarchy

7. Your Policy Regularizer is Secretly an Adversary

8. Safe Deep RL in 3D Environments using Human Feedback

9. Model-Free Risk-Sensitive Reinforcement Learning

10. Shaking the foundations: delusions in sequence models for interaction and control

11. Causal Analysis of Agent Behavior for AI Safety

12. Agent Incentives: A Causal Perspective

13. Avoiding Tampering Incentives in Deep RL via Decoupled Approval

14. REALab: An Embedded Perspective on Tampering

15. Algorithms for Causal Reasoning in Probability Trees

16. Meta-trained agents implement Bayes-optimal agents

17. Avoiding Side Effects By Considering Future Tasks

18. Quantifying Differences in Reward Functions

19. Pitfalls of learning a reward function online

20. The Incentives that Shape Behaviour

21. Learning Human Objectives by Evaluating Hypothetical Behavior

22. Modeling AGI Safety Frameworks with Causal Influence Diagrams

23. Meta-learning of Sequential Strategies

24. Understanding Agent Incentives using Causal Influence Diagrams. Part I: Single Action Settings

25. Soft-Bayes: Prod for Mixtures of Experts with Log-Loss

26. Scaling shared model governance via model splitting

27. Scalable agent alignment via reward modeling: a research direction

28. Reward learning from human preferences and demonstrations in Atari

29. Modeling Friends and Foes

30. Penalizing side effects using stepwise relative reachability

31. Agents and Devices: A Relative Definition of Agency

32. IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

33. Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents

34. AI Safety Gridworlds

35. Symmetric Decomposition of Asymmetric Games

36. Noisy Networks for Exploration

37. Deep reinforcement learning from human preferences

38. Reinforcement Learning with a Corrupted Reward Channel

39. DeepMind Lab

40. Massively Parallel Methods for Deep Reinforcement Learning

41. An Approximation of the Universal Intelligence Measure

42. Temporal Difference Updating without a Learning Rate

43. Tests of Machine Intelligence

44. Universal Intelligence: A Definition of Machine Intelligence

45. A Collection of Definitions of Intelligence

46. Fitness Uniform Optimization

47. Is there an Elegant Universal Theory of Prediction?

48. A Formal Measure of Machine Intelligence

49. Fitness Uniform Deletion: A Simple Way to Preserve Diversity

50. Tournament versus Fitness Uniform Selection
