Search

Your search for author "Amodei, Dario" returned 156 results.


Search Results

1. The Capacity for Moral Self-Correction in Large Language Models

2. Discovering Language Model Behaviors with Model-Written Evaluations

3. Constitutional AI: Harmlessness from AI Feedback

4. Measuring Progress on Scalable Oversight for Large Language Models

5. In-context Learning and Induction Heads

6. Toy Models of Superposition

7. Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned

8. Language Models (Mostly) Know What They Know

9. Scaling Laws and Interpretability of Learning from Repeated Data

10. Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

11. Predictability and Surprise in Large Generative Models

12. A General Language Assistant as a Laboratory for Alignment

13. Evaluating Large Language Models Trained on Code

14. Scaling Laws for Autoregressive Generative Modeling

15. Learning to summarize from human feedback

16. Language Models are Few-Shot Learners

17. Scaling Laws for Neural Language Models

18. Fine-Tuning Language Models from Human Preferences

19. An Empirical Model of Large-Batch Training

20. Reward learning from human preferences and demonstrations in Atari

21. Supervising strong learners by amplifying weak experts

22. Variational Option Discovery Algorithms

23. AI safety via debate

24. The Malicious Use of Artificial Intelligence: Forecasting, Prevention, and Mitigation

25. Deep reinforcement learning from human preferences

26. Learning a Natural Language Interface with Neural Programmer

27. Concrete Problems in AI Safety

28. Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

29. Thermodynamics for a network of neurons: Signatures of criticality

30. Physical Principles for Scalable Neural Recording

31. Searching for collective behavior in a network of real neurons

32. The simplest maximum entropy model for collective behavior in a neural network

36. Discovering Language Model Behaviors with Model-Written Evaluations

38. Predictability and Surprise in Large Generative Models

39. Mirrors with regular hexagonal segments

40. Trump Can Keep America's AI Advantage.

41. Building high-quality assay libraries for targeted analysis of SWATH MS data

43. Characterizing deformability and surface friction of cancer cells

44. Physical principles for scalable neural recording

45. A cross-platform toolkit for mass spectrometry and proteomics

48. Physical principles for scalable neural recording

50. A cross-platform toolkit for mass spectrometry and proteomics
