Your search for "Weston, Jason" returned 624 results in total.

Search Results

1. Byte Latent Transformer: Patches Scale Better Than Tokens

2. Training Large Language Models to Reason in a Continuous Latent Space

3. ALMA: Alignment with Minimal Annotation

4. Adaptive Decoding via Latent Preference Optimization

5. Self-Consistency Preference Optimization

6. Thinking LLMs: General Instruction Following with Thought Generation

7. Backtracking Improves Generation Safety

8. Source2Synth: Synthetic Data Generation and Curation Grounded in Real Data Sources

9. Better Alignment with Instruction Back-and-Forth Translation

10. Self-Taught Evaluators

11. Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

12. Distilling System 2 into System 1

13. Following Length Constraints in Instructions

14. Contextual Position Encoding: Learning to Count What's Important

15. Iterative Reasoning Preference Optimization

16. Reverse Training to Nurse the Reversal Curse

17. Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

18. TOOLVERIFIER: Generalization to New Tools via Self-Verification

19. Self-Rewarding Language Models

20. Some things are more CRINGE than others: Iterative Preference Optimization with the Pairwise Cringe Loss

21. System 2 Attention (is something you might need too)

22. The ART of LLM Refinement: Ask, Refine, and Trust

23. Branch-Solve-Merge Improves Large Language Model Evaluation and Generation

24. Walking Down the Memory Maze: Beyond Context Limit through Interactive Reading

25. Chain-of-Verification Reduces Hallucination in Large Language Models

26. Self-Alignment with Instruction Backtranslation

27. Leveraging Implicit Feedback from Deployment Data in Dialogue

28. System-Level Natural Language Feedback

29. The HCI Aspects of Public Deployment of Research Chatbots: A User Study, Design Recommendations, and Open Challenges

30. Improving Open Language Models by Learning from Organic Interactions

31. Large Language Model Programs

32. Learning to Reason and Memorize with Self-Notes

33. Multi-Party Chat: Conversational Agents in Group Settings with Humans and Models

34. The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis and Algorithm for Robust Natural Language Generation

35. Infusing Commonsense World Models with Graph Knowledge

36. The CRINGE Loss: Learning what language not to model

37. When Life Gives You Lemons, Make Cherryade: Converting Feedback from Bad Responses into Good Labels

38. Learning New Skills after Deployment: Improving open-domain internet-driven dialogue with human feedback

39. BlenderBot 3: a deployed conversational agent that continually learns to responsibly engage

40. Learning from data in the mixed adversarial non-adversarial case: Finding the helpers and ignoring the trolls

41. DIRECTOR: Generator-Classifiers For Supervised Language Modeling

42. Language Models that Seek for Knowledge: Modular Search & Generation for Dialogue and Prompt Completion

43. Human Evaluation of Conversations is an Open Problem: comparing the sensitivity of various methods for evaluating dialogue agents

44. Am I Me or You? State-of-the-Art Dialogue Models Cannot Maintain an Identity

45. Reason first, then respond: Modular Generation for Knowledge-infused Dialogue

46. NormFormer: Improved Transformer Pretraining with Extra Normalization

47. Beyond Goldfish Memory: Long-Term Open-Domain Conversation

48. Internet-Augmented Dialogue Generation

49. Staircase Attention for Recurrent Processing of Sequences

50. Hash Layers For Large Sparse Models
