Search

Author search for "Henighan, Tom": 37 results

Search Results

2. Specific versus General Principles for Constitutional AI

3. The Capacity for Moral Self-Correction in Large Language Models

4. Discovering Language Model Behaviors with Model-Written Evaluations

5. Influence of local symmetry on lattice dynamics coupled to topological surface states

6. Constitutional AI: Harmlessness from AI Feedback

7. Measuring Progress on Scalable Oversight for Large Language Models

8. In-context Learning and Induction Heads

9. Toy Models of Superposition

10. Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned

11. Language Models (Mostly) Know What They Know

12. Scaling Laws and Interpretability of Learning from Repeated Data

13. Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

14. Predictability and Surprise in Large Generative Models

15. A General Language Assistant as a Laboratory for Alignment

16. Scaling Laws for Transfer

17. Scaling Laws for Autoregressive Generative Modeling

18. Language Models are Few-Shot Learners

19. Scaling Laws for Neural Language Models

20. Direct Measurement of Anharmonic Decay Channels of a Coherent Phonon

21. Phonon Spectroscopy with Sub-meV Resolution by Femtosecond X-ray Diffuse Scattering

22. Influence of local symmetry on lattice dynamics coupled to topological surface states

23. Discovering Language Model Behaviors with Model-Written Evaluations

24. Predictability and Surprise in Large Generative Models

25. The Cyclopean city: a fantasy image of decadence

26. Nanny

28. Lensless Imaging of Nano- and Meso-Scale Dynamics with X-rays

30. Opera no longer a 'museum culture'

31. Letters to the Editor

35. Jungle Epitaph.

36. Home at Grasmere.
