Search

Your search keyword '"Narasimhan, Karthik"' showing total 246 results

Search Constraints

Start Over You searched for: Author "Narasimhan, Karthik" Remove constraint Author: "Narasimhan, Karthik"
246 results on '"Narasimhan, Karthik"'

Search Results

1. LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks

2. An Annotated Dataset of Errors in Premodern Greek and Baselines for Detecting Them

3. SWE-bench Multimodal: Do AI Systems Generalize to Visual Software Domains?

4. EnIGMA: Enhanced Interactive Generative Model Agent for CTF Challenges

5. LLMs are Superior Feedback Providers: Bootstrapping Reasoning for Lie Detection with Self-Generated Feedback

6. ShieldGemma: Generative AI Content Moderation Based on Gemma

7. PersonaGym: Evaluating Persona Agents and LLMs

8. $\tau$-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains

9. SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

10. Can Language Models Solve Olympiad Programming?

11. RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

12. Language-Guided World Models: A Model-Based Approach to AI Control

13. GEO: Generative Engine Optimization

14. QualEval: Qualitative Evaluation for Model Improvement

15. Progressively Efficient Learning

16. SWE-bench: Can Language Models Resolve Real-World GitHub Issues?

17. FireAct: Toward Language Agent Fine-tuning

18. Cognitive Architectures for Language Agents

19. Scaling Laws for Imitation Learning in Single-Agent Games

20. COLLIE: Systematic Construction of Constrained Text Generation Tasks

21. InstructEval: Systematic Evaluation of Instruction Selection Methods

22. InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback

23. Anthropomorphization of AI: Opportunities and Risks

24. PruMUX: Augmenting Data Multiplexing with Model Compression

25. Referral Augmentation for Zero-Shot Information Retrieval

26. C-STS: Conditional Semantic Textual Similarity

27. Tree of Thoughts: Deliberate Problem Solving with Large Language Models

28. Toxicity in ChatGPT: Analyzing Persona-assigned Language Models

29. Reflexion: Language Agents with Verbal Reinforcement Learning

30. MUX-PLMs: Data Multiplexing for High-throughput Language Models

31. SemSup-XC: Semantic Supervision for Zero and Few-shot Extreme Classification

32. Building Scalable Video Understanding Benchmarks through Sports

33. Controllable Text Generation with Language Constraints

34. SPARTAN: Sparse Hierarchical Memory for Parameter-Efficient Transformers

35. ALIGN-MLM: Word Embedding Alignment is Crucial for Multilingual Pre-training

36. ReAct: Synergizing Reasoning and Acting in Language Models

37. WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

38. Leveraging Language for Accelerated Learning of Tool Manipulation

39. Using Natural Language and Program Abstractions to Instill Human Inductive Biases in Machines

40. Can Rationalization Improve Robustness?

41. Linking Emergent and Natural Languages via Corpus Transfer

42. CARETS: A Consistency And Robustness Evaluative Test Suite for VQA

43. SemSup: Semantic Supervision for Simple and Scalable Zero-shot Generalization

44. DataMUX: Data Multiplexing for Neural Networks

45. Multi-Query Video Retrieval

46. Multi-Stage Episodic Control for Strategic Exploration in Text Games

47. When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer

48. SILG: The Multi-environment Symbolic Interactive Language Grounding Benchmark

49. Revelio: ML-Generated Debugging Queries for Distributed Systems

50. Self-Attention Networks Can Process Bounded Hierarchical Languages

Catalog

Books, media, physical & digital resources