Search

Your search keyword '"Lazaric, A."' showing total 885 results

Search Constraints

Start Over You searched for: Author "Lazaric, A." Remove constraint Author: "Lazaric, A."
885 results on '"Lazaric, A."'

Search Results

1. System-2 Recommenders: Disentangling Utility and Engagement in Recommendation Systems via Temporal Point-Processes

2. Simple Ingredients for Offline Reinforcement Learning

3. Reinforcement Learning with Options and State Representation

4. Layered State Discovery for Incremental Autonomous Exploration

5. Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping

6. On the Complexity of Representation Learning in Contextual Linear Bandits

7. Improved Adaptive Algorithm for Scalable Active Learning with Weak Labeler

10. Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees

11. Contextual bandits with concave rewards, and an application to fair ranking

12. Reaching Goals is Hard: Settling the Sample Complexity of the Stochastic Shortest Path

13. Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies

14. Temporal Abstractions-Augmented Temporally Contrastive Learning: An Alternative to the Laplacian in RL

15. Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning

16. Scaling Gaussian Process Optimization by Evaluating a Few Unique Candidates Multiple Times

17. Top $K$ Ranking for Multi-Armed Bandit with Noisy Evaluations

18. Differentially Private Exploration in Reinforcement Learning with Linear Representation

19. Adaptive Multi-Goal Exploration

22. Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection

23. Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching

26. A general sample complexity analysis of vanilla policy gradient

27. Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning

28. A Fully Problem-Dependent Regret Lower Bound for Finite-Horizon MDPs

29. A Reduction-Based Framework for Conservative Bandits and Reinforcement Learning

30. Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret

31. Leveraging Good Representations in Linear Contextual Bandits

32. Reinforcement Learning with Prototypical Representations

34. Improved Sample Complexity for Incremental Autonomous Exploration in MDPs

38. An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits

39. Provably Efficient Reward-Agnostic Navigation with Linear Value Iteration

40. Efficient Optimistic Exploration in Linear-Quadratic Regulators via Lagrangian Relaxation

41. A Provably Efficient Sample Collection Strategy for Reinforcement Learning

42. Improved Analysis of UCRL2 with Empirical Bernstein Inequality

43. Sketched Newton-Raphson

44. A Novel Confidence-Based Algorithm for Structured Bandits

45. Meta-learning with Stochastic Linear Bandits

46. Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization

47. Active Model Estimation in Markov Decision Processes

48. Learning Near Optimal Policies with Low Inherent Bellman Error

49. Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification

50. Adversarial Attacks on Linear Contextual Bandits

Catalog

Books, media, physical & digital resources