Search

Your search for "Chandar, Sarath" returned 75 results

Search Constraints

Author: "Chandar, Sarath"
Database: OAIster

Search Results

1. Mastering Memory Tasks with World Models

2. Are self-explanations from Large Language Models faithful?

3. BindGPT: A Scalable Framework for 3D Molecular Design via Language Modeling and Reinforcement Learning

4. Predicting the Impact of Model Expansion through the Minima Manifold: A Loss Landscape Perspective

5. Towards Practical Tool Usage for Continually Learning LLMs

6. Interpretability Needs a New Paradigm

7. Sub-goal Distillation: A Method to Improve Small Language Agents

8. Intelligent Switching for Reset-Free RL

9. Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning

10. Replay Buffer with Local Forgetting for Adapting to Local Environment Changes in Deep Model-Based Reinforcement Learning

11. Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning

12. Measuring the Knowledge Acquisition-Utilization Gap in Pretrained Language Models

13. Should We Attend More or Less? Modulating Attention for Fairness

14. On the Costs and Benefits of Adopting Lifelong Learning for Software Analytics -- Empirical Study on Brown Build and Risk Prediction

15. Fairness-Aware Structured Pruning in Transformers

16. Language Model-In-The-Loop: Data Optimal Approach to Learn-To-Recommend Actions in Text Games

17. Self-Influence Guided Data Reweighting for Language Model Pre-training

18. EpiK-Eval: Evaluation for Language Models as Epistemic Models

19. Faithfulness Measurable Masked Language Models

20. Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi

21. Lookbehind-SAM: k steps back, 1 step forward

22. Promoting Exploration in Memory-Augmented Adam using Critical Momenta

23. Thompson sampling for improved exploration in GFlowNets

24. PatchBlender: A Motion Prior for Video Transformers

25. Deep Learning on a Healthy Data Diet: Finding Important Examples for Fairness

26. SAMSON: Sharpness-Aware Minimization Scaled by Outlier Normalization for Improving DNN Generalization and Robustness

27. Detecting Languages Unintelligible to Multilingual Models through Local Structure Probes

28. Local Structure Matters Most in Most Languages

29. Segmentation of Multiple Sclerosis Lesions across Hospitals: Learn Continually or Train from Scratch?

30. Improving Meta-Learning Generalization with Activation-Based Early-Stopping

31. An Introduction to Lifelong Supervised Learning

32. Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods

33. Improving Sample Efficiency of Value Based Models Using Attention and Vision Transformers

34. Combining Reinforcement Learning and Constraint Programming for Sequence-Generation Tasks with Hard Constraints

35. An Empirical Investigation of the Role of Pre-training in Lifelong Learning

36. Scaling Laws for the Few-Shot Adaptation of Pre-trained Image Classifiers

37. Post-hoc Interpretability for Neural NLP: A Survey

38. Local Structure Matters Most: Perturbation Study in NLU

39. A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic loss

40. Memory Augmented Optimizers for Deep Learning

41. Do Encoder Representations of Generative Dialogue Models Encode Sufficient Information about the Task ?

42. TAG: Task-based Accumulated Gradients for Lifelong learning

43. A Survey of Data Augmentation Approaches for NLP

44. Continuous Coordination As a Realistic Scenario for Lifelong Learning

45. Slot Contrastive Networks: A Contrastive Approach for Representing Objects

46. The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning

47. PatchUp: A Feature-Space Block-Level Regularization Technique for Convolutional Neural Networks

48. Learning To Navigate The Synthetically Accessible Chemical Space Using Reinforcement Learning

49. IIRC: Incremental Implicitly-Refined Classification

50. Maximum Reward Formulation In Reinforcement Learning
