Search

Your search keyword '"Henderson, Peter"' showing total 818 results

Search Constraints

Start Over You searched for: Author "Henderson, Peter" Remove constraint Author: "Henderson, Peter" Search Limiters Full Text Remove constraint Search Limiters: Full Text
818 results on '"Henderson, Peter"'

Search Results

1. On Evaluating the Durability of Safeguards for Open-Weight LLMs

2. The Mirage of Artificial Intelligence Terms of Use Restrictions

3. An Adversarial Perspective on Machine Unlearning for AI Safety

4. Evaluating Copyright Takedown Methods for Language Models

5. The Responsible Foundation Model Development Cheatsheet: A Review of Tools & Resources

6. SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors

7. Fantastic Copyrighted Beasts and How (Not) to Generate Them

8. Safety Alignment Should Be Made More Than Just a Few Tokens Deep

9. JIGMARK: A Black-Box Approach for Enhancing Image Watermarks against Diffusion Model Edits

10. AI Risk Management Should Incorporate Both Safety and Security

11. FLawN-T5: An Empirical Examination of Effective Instruction-Tuning Data Mixtures for Legal Reasoning

12. What is in Your Safe Data? Identifying Benign Data that Breaks Safety

13. A Safe Harbor for AI Evaluation and Red Teaming

14. On the Societal Impact of Open Foundation Models

15. Assessing the Brittleness of Safety Alignment via Pruning and Low-Rank Modifications

16. Promises and pitfalls of artificial intelligence for legal applications

17. Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To!

18. Improving Mathematics in Key Stages 2 and 3. Guidance Report

19. LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

20. Freedom of Speech and AI Output

21. Where's the Liability in Harmful AI Speech?

22. Visual Adversarial Examples Jailbreak Aligned Large Language Models

23. Cheaply Evaluating Inference Efficiency Metrics for Autoregressive Transformer APIs

24. Foundation Models and Fair Use

25. Self-Destructing Models: Increasing the Costs of Harmful Dual Uses of Foundation Models

26. Holistic Evaluation of Language Models

27. BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

28. Text Characterization Toolkit

29. Entropy Regularization for Population Estimation

30. Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset

31. α1-Adrenergic receptor–PKC–Pyk2–Src signaling boosts L-type Ca2+ channel CaV1.2 activity and long-term potentiation in rodents

32. Data Governance in the Age of Large-Scale Data-Driven Language Technology

33. Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection

34. Special Educational Needs in Mainstream Schools. Guidance Report

35. Improving Mathematics in the Early Years and Key Stage 1. Guidance Report

36. Beyond Ads: Sequential Decision-Making Algorithms in Law and Public Policy

38. On the Opportunities and Risks of Foundation Models

39. When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset

40. SCIENCE INSTITUTIONS IN A NEW WORLD

41. An Information-Theoretic Perspective on Credit Assignment in Reinforcement Learning

42. With Little Power Comes Great Responsibility

43. Sea ice and methane

44. Ideas for Improving the Field of Machine Learning: Summarizing Discussion from the NeurIPS 2019 Retrospectives Workshop

45. TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?

46. Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims

47. Towards the Systematic Reporting of the Energy and Carbon Footprints of Machine Learning

48. ALGORITHMIC RULEMAKING VS. ALGORITHMIC GUIDANCE.

49. Separating value functions across time-scales

50. α‐Actinin‐1 promotes activity of the L‐type Ca2+ channel Cav1.2

Catalog

Books, media, physical & digital resources