Search

Author search for "Evans, Owain": 134 results

Search Results

1. Looking Inward: Language Models Can Learn About Themselves by Introspection

2. Me, Myself, and AI: The Situational Awareness Dataset (SAD) for LLMs

3. Connecting the Dots: LLMs can Infer and Verbalize Latent Structure from Disparate Training Data

4. Can Language Models Explain Their Own Classification Behavior?

5. Tell, don't show: Declarative facts influence how LLMs generalize

6. How to Catch an AI Liar: Lie Detection in Black-Box LLMs by Asking Unrelated Questions

7. The Reversal Curse: LLMs trained on 'A is B' fail to learn 'B is A'

8. Taken out of context: On measuring situational awareness in LLMs

9. Forecasting Future World Events with Neural Networks

10. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

11. Teaching Models to Express Their Uncertainty in Words

12. Truthful AI: Developing and governing AI that does not lie

13. TruthfulQA: Measuring How Models Mimic Human Falsehoods

15. Active Reinforcement Learning: Observing Rewards at a Cost

16. Sensory Optimization: Neural Networks as a Model for Understanding and Creating Art

17. Generalizing from a few environments in safety-critical reinforcement learning

18. Active Reinforcement Learning with Monte-Carlo Tree Search

19. The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation

20. Trial without Error: Towards Safe Reinforcement Learning via Human Intervention

21. When Will AI Exceed Human Performance? Evidence from AI Experts

22. Agent-Agnostic Human-in-the-Loop Reinforcement Learning

23. Learning the Preferences of Ignorant, Inconsistent Agents

24. Benign neglect : the activities and relationship of London yearly meeting of the Religious Society of Friends (Quakers) to Wales, c.1860-c.1918

28. Paediatric orthopaedics in lockdown

34. Modelling the health and economic impacts of different testing and tracing strategies for COVID-19 in the UK

38. Modelling the Health and Economic Impacts of Population-Wide Testing, Contact Tracing and Isolation (PTTI) Strategies for COVID-19 in the UK

39. 24.118 Paradox & Infinity, Spring 2013

43. Learning Structured Preferences

44. Learning the Preferences of Ignorant, Inconsistent Agents

46. Book Reviews

47. Bayesian computational models for inferring preferences

48. Help or hinder: Bayesian models of social goal inference
