Search

Your search keyword '"Lambert, Nathan"' showing total 172 results

Search Constraints

Start Over You searched for: Author "Lambert, Nathan" Remove constraint Author: "Lambert, Nathan"
172 results on '"Lambert, Nathan"'

Search Results

1. Self-Directed Synthetic Dialogues and Revisions Technical Report

2. WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

3. Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback

4. Towards a Framework for Openness in Foundation Models: Proceedings from the Columbia Convening on Openness in Artificial Intelligence

5. D2PO: Discriminator-Guided DPO with Response Evaluation Models

6. Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback

7. RewardBench: Evaluating Reward Models for Language Modeling

8. A Survey on Data Selection for Language Models

9. OLMo: Accelerating the Science of Language Models

10. Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

11. Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

12. The Alignment Ceiling: Objective Mismatch in Reinforcement Learning from Human Feedback

13. Zephyr: Direct Distillation of LM Alignment

14. Entangled Preferences: The History and Risks of Reinforcement Learning and Human Feedback

15. A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning

16. Confidence-Building Measures for Artificial Intelligence: Workshop Proceedings

17. BLISS: Interplanetary Exploration with Swarms of Low-Cost Spacecraft

18. Measuring Data

19. Reward Reports for Reinforcement Learning

20. Investigating Compounding Prediction Errors in Learned Dynamics Models

21. Choices, Risks, and Reward Reports: Charting Public Policy for Reinforcement Learning Systems

22. The Challenges of Exploration for Offline Reinforcement Learning

23. BotNet: A Simulator for Studying the Effects of Accurate Communication Models on Multi-agent and Swarm Control

24. Axes for Sociotechnical Inquiry in AI Research

25. MBRL-Lib: A Modular Library for Model-based Reinforcement Learning

27. On the Importance of Hyperparameter Optimization for Model-based Reinforcement Learning

28. AI Development for the Public Interest: From Abstraction Traps to Sociotechnical Risks

29. Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning

30. Nonholonomic Yaw Control of an Underactuated Flying Robot with Model-based Reinforcement Learning

31. Learning for Microrobot Exploration: Model-based Locomotion, Sparse-robust Navigation, and Low-power Deep Classification

32. Objective Mismatch in Model-based Reinforcement Learning

33. Learning Generalizable Locomotion Skills with Hierarchical Reinforcement Learning

34. Low Level Control of a Quadrotor with Deep Model-Based Reinforcement Learning

35. Femoral Artery Closure Devices vs Manual Compression During Cardiac Catheterization and Percutaneous Coronary Intervention

39. Synergy of Prediction and Control in Model-based Reinforcement Learning

47. The Temporal Stability and Predictive Validity of Pupils' Causal Attributions for Difficult Classroom Behaviour

48. Young People's Views about Their Involvement in Decision-Making

Catalog

Books, media, physical & digital resources