Search

Search for author "Krishnamurthy, Akshay": 267 results in total

Search Results

1. Reinforcement Learning under Latent Dynamics: Toward Statistical and Algorithmic Modularity

2. Correcting the Mythos of KL-Regularization: Direct Alignment without Overoptimization via Chi-Squared Preference Optimization

3. Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics

4. Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF

5. Rich-Observation Reinforcement Learning with Continuous Latent Dynamics

6. Can large language models explore in-context?

7. Scalable Online Exploration via Coverability

9. Mitigating Covariate Shift in Misspecified Regression with Applications to Reinforcement Learning

10. Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression

11. Oracle-Efficient Pessimism: Offline Policy Optimization in Contextual Bandits

12. Exposing Attention Glitches with Flip-Flop Language Modeling

13. Streaming Active Learning with Deep Neural Networks

14. Learning Hidden Markov Models Using Conditional Samples

15. Statistical Learning under Heterogeneous Distribution Shift

16. Transformers Learn Shortcuts to Automata

17. Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient

18. Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models

19. On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL

20. Sample-Efficient Reinforcement Learning in the Presence of Exogenous Information

21. A Complete Characterization of Linear Estimators for Offline Policy Evaluation

22. Understanding Contrastive Learning Requires Incorporating Inductive Biases

23. Provable Reinforcement Learning with a Short-Term Memory

24. Efficient and Optimal Algorithms for Contextual Dueling Bandits under Realizability

25. Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation

26. Universal and data-adaptive algorithms for model selection in linear contextual bandits

27. Anti-Concentrated Confidence Bonuses for Scalable Exploration

28. Provable RL with Exogenous Distractors via Multistep Inverse Dynamics

29. Sparsity in Partially Controllable Linear Systems

30. Efficient First-Order Contextual Bandits: Prediction, Allocation, and Triangular Discrimination

31. Bayesian decision-making under misspecified priors with applications to meta-learning

32. Investigating the Role of Negatives in Contrastive Representation Learning

33. Gone Fishing: Neural Active Learning with Fisher Embeddings

34. Model-free Representation Learning and Exploration in Low-rank MDPs

35. Learning the Linear Quadratic Regulator from Nonlinear Observations

36. Private Reinforcement Learning with PAC and Regret Guarantees

37. Contrastive learning, multi-view redundancy, and linear models

38. Information Theoretic Regret Bounds for Online Nonlinear Control

39. Sample-Efficient Reinforcement Learning of Undercomplete POMDPs

40. Open Problem: Model Selection for Contextual Bandits

41. Provably adaptive reinforcement learning in metric spaces

42. FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs

43. Efficient Contextual Bandits with Continuous Actions

44. Contrastive estimation reveals topic posterior information to linear models

45. Contextual Search in the Presence of Adversarial Corruptions

46. Adaptive Estimator Selection for Off-Policy Evaluation

47. Reward-Free Exploration for Reinforcement Learning

48. Algebraic and Analytic Approaches for Parameter Learning in Mixture Models

49. Scalable Hierarchical Clustering with Tree Grafting

50. Optimism in Reinforcement Learning with Generalized Linear Function Approximation
