Search

Author search for "Richardson, Kyle": 355 results in total

Search Results

1. Understanding the Logic of Direct Preference Alignment through Logic

2. SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories

3. SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals

4. TimeArena: Shaping Efficient Multitasking Language Agents in a Time-Aware Simulation

5. OLMo: Accelerating the Science of Language Models

6. Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

7. Paloma: A Benchmark for Evaluating Language Model Fit

8. Catwalk: A Unified Language Model Evaluation Framework for Many Datasets

9. Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction Arena

10. Language Models with Rationality

12. DISCO: Distilling Counterfactuals with Large Language Models

13. Breakpoint Transformers for Modeling and Tracking Intermediate Beliefs

14. Learning to Decompose: Hypothetical Question Decomposition Based on Comparable Texts

15. Decomposed Prompting: A Modular Approach for Solving Complex Tasks

16. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

17. What Makes Instruction Learning Hard? An Investigation and a New Challenge in a Synthetic Environment

18. Pushing the Limits of Rule Reasoning in Transformers through Natural Language Satisfiability

19. Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts

20. Dyna-bAbI: unlocking bAbI's potential with dynamic synthetic benchmarking

21. Hey AI, Can You Solve Complex Tasks by Talking to Agents?

22. DeepA2: A Modular Framework for Deep Argument Analysis with Pretrained Neural Text2Text Language Models

23. Investigating Transfer Learning in Multilingual Pre-trained Language Models through Chinese Natural Language Inference

24. Thinking Aloud: Dynamic Context Generation Improves Zero-Shot Reasoning Performance of GPT-2

25. Think you have Solved Direct-Answer Question Answering? Try ARC-DA, the Direct-Answer AI2 Reasoning Challenge

26. A Dataset for Tracking Entities in Open Domain Procedural Text

27. Temporal Reasoning on Implicit Events from Distant Supervision

28. OCNLI: Original Chinese Natural Language Inference

29. Critical Thinking for Language Models

30. Text Modular Networks: Learning to Decompose Tasks in the Language of Existing Models

31. Do Dogs have Whiskers? A New Knowledge Base of hasPart Relations

32. Neural Natural Language Inference Models Partially Embed Theories of Lexical Entailment and Negation

33. CLUE: A Chinese Language Understanding Evaluation Benchmark

34. Transformers as Soft Reasoners over Language

35. The admixed brushtail possum genome reveals invasion history in New Zealand and novel imprinted genes

36. What Does My QA Model Know? Devising Controlled Probes using Expert Knowledge

37. MonaLog: a Lightweight System for Natural Language Inference Based on Monotonicity

38. Probing Natural Language Inference Models through Semantic Fragments

39. From 'F' to 'A' on the N.Y. Regents Science Exams: An Overview of the Aristo Project

40. A Language for Function Signature Representations

41. Polyglot Semantic Parsing in APIs

42. The Code2Text Challenge: Text Generation in Source Code Libraries

43. Function Assistant: A Tool for NL Querying of APIs

44. Learning Semantic Correspondences in Technical Documentation

45. Spread of Negative Affect via Social Media: The Affective Consequences of Viewing Others' Fortunate and Unfortunate Social Media Posts
