Search

Your search keyword '"Fried, Daniel"' showing total 1,580 results

Search Constraints

Start Over You searched for: Author "Fried, Daniel" Remove constraint Author: "Fried, Daniel"
1,580 results on '"Fried, Daniel"'

Search Results

1. Improving Model Factuality with Fine-grained Critique-based Evaluator

2. Human-aligned Chess with a Bit of Search

3. CRScore: Grounding Automated Evaluation of Code Review Comments in Code Claims and Smells

4. Agent Workflow Memory

5. ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness?

6. Tree Search for Language Model Agents

7. BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

8. CodeRAG-Bench: Can Retrieval Augment Code Generation?

9. Adversarial Attacks on Multimodal Agents

10. Amortizing Pragmatic Program Synthesis with Rankings

11. Evaluating Large Language Model Biases in Persona-Steered Generation

12. Human-Agent Cooperation in Games under Incomplete Information through Natural Language Communication

13. Is the Pope Catholic? Yes, the Pope is Catholic. Generative Evaluation of Non-Literal Intent Resolution in LLMs

14. Dialogue with Robots: Proposals for Broadening Participation and Research in the SLIVAR Community

15. CodeBenchGen: Creating Scalable Execution-based Code Generation Benchmarks

16. What Are Tools Anyway? A Survey from the Language Model Perspective

17. Repetition Improves Language Model Embeddings

18. VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks

19. TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks

20. Asking More Informative Questions for Grounded Retrieval

21. Generating Pragmatic Examples to Train Neural Program Synthesizers

22. Comparative Knowledge Distillation

23. Data Augmentation for Code Translation with Comparable Corpora and Multiple References

24. Symbolic Planning and Code Generation for Grounded Dialogue

25. API-Assisted Code Generation for Question Answering on Varied Table Structures

26. SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents

27. Amortizing Pragmatic Program Synthesis with Rankings

28. Clinical SWIR and CP-OCT imaging of interproximal lesions

29. Short-Wavelength Infrared Imaging of Infected and Affected Dentin

31. WebArena: A Realistic Web Environment for Building Autonomous Agents

32. Pragmatic Inference with a CLIP Listener for Contrastive Captioning

33. Time‐resolved SWIR imaging for the assessment of the activity of occlusal caries lesions

34. Assessment of the activity of secondary caries lesions with short-wavelength infrared, thermal, and optical coherence tomographic imaging

36. Generating Images with Multimodal Language Models

37. StarCoder: may the source be with you!

40. Exploratory Analysis of Objective Outcome Measures for the Clinical Assessment of Erosive Tooth Wear.

41. Monitoring lesion activity on primary teeth with CP‐OCT and SWIR reflectance imaging

42. Grounding Language Models to Images for Multimodal Inputs and Outputs

43. SantaCoder: don't reach for the stars!

45. Execution-Based Evaluation for Open-Domain Code Generation

46. Coder Reviewer Reranking for Code Generation

47. G^3: Geolocation via Guidebook Grounding

48. AutoReply: Detecting Nonsense in Dialogue Introspectively with Discriminative Replies

49. DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation

50. Pragmatics in Language Grounding: Phenomena, Tasks, and Modeling Approaches

Catalog

Books, media, physical & digital resources