Search

Your search keyword '"Narasimhan, Karthik"' showing total 43 results

Search Constraints

Start Over You searched for: Author "Narasimhan, Karthik" Remove constraint Author: "Narasimhan, Karthik" Topic computer science - machine learning Remove constraint Topic: computer science - machine learning
43 results on '"Narasimhan, Karthik"'

Search Results

1. LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks

2. ShieldGemma: Generative AI Content Moderation Based on Gemma

3. PersonaGym: Evaluating Persona Agents and LLMs

4. SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering

5. RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

6. Language-Guided World Models: A Model-Based Approach to AI Control

7. GEO: Generative Engine Optimization

8. QualEval: Qualitative Evaluation for Model Improvement

9. Progressively Efficient Learning

10. FireAct: Toward Language Agent Fine-tuning

11. Cognitive Architectures for Language Agents

12. Scaling Laws for Imitation Learning in Single-Agent Games

13. COLLIE: Systematic Construction of Constrained Text Generation Tasks

14. InterCode: Standardizing and Benchmarking Interactive Coding with Execution Feedback

15. Anthropomorphization of AI: Opportunities and Risks

16. PruMUX: Augmenting Data Multiplexing with Model Compression

17. C-STS: Conditional Semantic Textual Similarity

18. Tree of Thoughts: Deliberate Problem Solving with Large Language Models

19. Toxicity in ChatGPT: Analyzing Persona-assigned Language Models

20. Reflexion: Language Agents with Verbal Reinforcement Learning

21. MUX-PLMs: Data Multiplexing for High-throughput Language Models

22. SPARTAN: Sparse Hierarchical Memory for Parameter-Efficient Transformers

23. ALIGN-MLM: Word Embedding Alignment is Crucial for Multilingual Pre-training

24. ReAct: Synergizing Reasoning and Acting in Language Models

25. WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents

26. Leveraging Language for Accelerated Learning of Tool Manipulation

27. Can Rationalization Improve Robustness?

28. Linking Emergent and Natural Languages via Corpus Transfer

29. SemSup: Semantic Supervision for Simple and Scalable Zero-shot Generalization

30. DataMUX: Data Multiplexing for Neural Networks

31. When is BERT Multilingual? Isolating Crucial Ingredients for Cross-lingual Transfer

32. SILG: The Multi-environment Symbolic Interactive Language Grounding Benchmark

33. Revelio: ML-Generated Debugging Queries for Distributed Systems

34. Grounding Language to Entities and Dynamics for Generalization in Reinforcement Learning

35. Improving Dialog Systems for Negotiation with Personality Modeling

36. Safe Reinforcement Learning with Natural Language Constraints

37. Projection-Based Constrained Policy Optimization

38. Accelerating Safe Reinforcement Learning with Constraint-mismatched Policies

39. A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation

40. Calibration, Entropy Rates, and Memory in Language Models

41. Task-Agnostic Dynamics Priors for Deep Reinforcement Learning

42. Grounding Language for Transfer in Deep Reinforcement Learning

43. CSTS: Conditional Semantic Textual Similarity

Catalog

Books, media, physical & digital resources