Search

Your search keyword '"Amodei, Dario"' showing total 32 results

Search Constraints

Start Over You searched for: Author "Amodei, Dario" Remove constraint Author: "Amodei, Dario" Database OpenAIRE Remove constraint Database: OpenAIRE
32 results on '"Amodei, Dario"'

Search Results

1. The Capacity for Moral Self-Correction in Large Language Models

2. Constitutional AI: Harmlessness from AI Feedback

3. Measuring Progress on Scalable Oversight for Large Language Models

4. In-context Learning and Induction Heads

5. Toy Models of Superposition

6. Language Models (Mostly) Know What They Know

7. Predictability and Surprise in Large Generative Models

8. Scaling Laws and Interpretability of Learning from Repeated Data

9. Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

10. Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned

11. Discovering Language Model Behaviors with Model-Written Evaluations

12. A General Language Assistant as a Laboratory for Alignment

13. Evaluating Large Language Models Trained on Code

14. Learning to summarize from human feedback

15. Scaling Laws for Autoregressive Generative Modeling

16. Language Models are Few-Shot Learners

17. Scaling Laws for Neural Language Models

18. Fine-Tuning Language Models from Human Preferences

19. Reward learning from human preferences and demonstrations in Atari

20. Supervising strong learners by amplifying weak experts

21. Variational Option Discovery Algorithms

22. AI safety via debate

23. The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation

24. The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation

25. An Empirical Model of Large-Batch Training

26. Deep reinforcement learning from human preferences

27. Learning a Natural Language Interface with Neural Programmer

28. Concrete Problems in AI Safety

29. Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

30. Thermodynamics for a network of neurons: Signatures of criticality

31. Physical principles for scalable neural recoding

32. Searching for collective behavior in a network of real neurons

Catalog

Books, media, physical & digital resources