Search

Your search keyword '"Henderson, Peter"' showing total 2,467 results

Search Constraints

Start Over You searched for: Author "Henderson, Peter" Remove constraint Author: "Henderson, Peter"
2,467 results on '"Henderson, Peter"'

Search Results

1. Preface

2. Cover

3. Prologue. Empowerment

4. Index

5. Notes

6. 4. Courage

7. 1. Vision

8. 6. Hope

9. 5. Passion

10. 3. Resilience

11. Epilogue. Transition

12. 2. Openness

16. On Evaluating the Durability of Safeguards for Open-Weight LLMs

17. The Mirage of Artificial Intelligence Terms of Use Restrictions

18. Index

19. Notes

22. 5. Grit and Greatness

23. 7. Pillars of Success

24. 8. An Honors University

25. Part II

28. 6. At the Crossroads

30. 1. And Then We Did It

31. Cover

36. Preface. It’s about Us

38. An Adversarial Perspective on Machine Unlearning for AI Safety

39. Evaluating Copyright Takedown Methods for Language Models

40. The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

41. SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors

42. Fantastic Copyrighted Beasts and How (Not) to Generate Them

43. Safety Alignment Should Be Made More Than Just a Few Tokens Deep

44. JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits

45. AI Risk Management Should Incorporate Both Safety and Security

46. FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning

47. What is in Your Safe Data? Identifying Benign Data that Breaks Safety

48. A Safe Harbor for AI Evaluation and Red Teaming

49. On the Societal Impact of Open Foundation Models

50. Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

Catalog

Books, media, physical & digital resources