Search for author "Lazaric, A.": 98 results in total

Search Constraints

Author: "Lazaric, A."
Language: undetermined
Search Results

5. Learning Goal-Conditioned Policies Offline with Self-Supervised Reward Shaping

6. Solvent‐mediated forces in protein dielectrophoresis

7. Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies

8. Reaching Goals is Hard: Settling the Sample Complexity of the Stochastic Shortest Path

9. Contextual bandits with concave rewards, and an application to fair ranking

10. On the Complexity of Representation Learning in Contextual Linear Bandits

11. Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees

12. Scaling Gaussian Process Optimization by Evaluating a Few Unique Candidates Multiple Times

13. Determinants of sustainable consumption in France: the importance of social influence and environmental values

14. Cognition and Routine Dynamics

16. Editorial

17. Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret

18. A Fully Problem-Dependent Regret Lower Bound for Finite-Horizon MDPs

19. Direct then Diffuse: Incremental Unsupervised Skill Discovery for State Covering and Goal Reaching

20. A general sample complexity analysis of vanilla policy gradient

21. Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning

22. Top $K$ Ranking for Multi-Armed Bandit with Noisy Evaluations

24. Meta-learning with Stochastic Linear Bandits

25. A Provably Efficient Sample Collection Strategy for Reinforcement Learning

26. Learning Near Optimal Policies with Low Inherent Bellman Error

27. Sketched Newton-Raphson

28. Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization

29. Concentration Inequalities for Multinoulli Random Variables

30. Improved Algorithms for Conservative Exploration in Bandits

31. Active Model Estimation in Markov Decision Processes

32. Improved Analysis of UCRL2 with Empirical Bernstein Inequality

33. Chapter 8 Learning a New Ecology of Space and Looking for New Routines: Experimenting Robotics in a Surgical Team

34. A Structured Prediction Approach for Generalization in Cooperative Multi-Agent Reinforcement Learning

35. Frequentist Regret Bounds for Randomized Least-Squares Value Iteration

36. Gaussian Process Optimization with Adaptive Sketching: Scalable and No Regret

37. Determinants of energy tracking application use at the city level: Evidence from France

38. Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes

39. Thompson Sampling for Linear-Quadratic Control Problems

40. The new challenges of organizing intellectual property in complex industries: A discussion based on the case of Thales

41. Reinforcement Learning of POMDPs using Spectral Methods

43. Analysis of Kelner and Levin graph sparsification algorithm for a streaming setting

44. LQG for Portfolio Optimization

45. Overcoming inertia: insights from evolutionary economics into improved energy and climate policies

46. La nouvelle architecture de l’industrie de la Défense en France [The new architecture of the defense industry in France]

47. Gatekeepers of Knowledge versus Platforms of Knowledge: From Potential to Realized Absorptive Capacity

48. Capacités d’absorption et d’interaction : une étude de la coopération dans les PME françaises [Absorptive and interaction capacities: a study of cooperation in French SMEs]

49. Obituary: Steven Klepper

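50. Nouveaux enjeux d'organisation de la propriété intellectuelle dans les industries complexes: une discussion à partir du cas Thales [New challenges of organizing intellectual property in complex industries: a discussion based on the Thales case]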
