Search

Your search keyword '"Dann, Christoph"' showing total 46 results

Search Constraints

Start Over You searched for: Author "Dann, Christoph" Remove constraint Author: "Dann, Christoph"
46 results on '"Dann, Christoph"'

Search Results

1. Preserving Expert-Level Privacy in Offline Reinforcement Learning

2. Conditional Language Policy: A General Framework for Steerable Multi-Objective Finetuning

3. Rate-Preserving Reductions for Blackwell Approachability

4. A Minimaximalist Approach to Reinforcement Learning from Human Feedback

5. Data-Driven Online Model Selection With Regret Guarantees

6. A Blackbox Approach to Best of Both Worlds in Bandits and Beyond

7. Best of Both Worlds Policy Optimization

8. Pseudonorm Approachability and Applications to Regret Minimization

9. Learning in POMDPs is Sample-Efficient with Hindsight Observability

10. A Unified Algorithm for Stochastic Path Problems

11. A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning

12. Best of Both Worlds Model Selection

13. Guarantees for Epsilon-Greedy Reinforcement Learning with Function Approximation

14. Same Cause; Different Effects in the Brain

15. A Model Selection Approach for Corruption Robust Reinforcement Learning

16. Beyond Value-Function Gaps: Improved Instance-Dependent Regret Bounds for Episodic Reinforcement Learning

17. Agnostic Reinforcement Learning with Low-Rank MDPs and Rich Observations

18. Neural Active Learning with Performance Guarantees

19. Regret Bound Balancing and Elimination for Model Selection in Bandits and RL

20. Reinforcement Learning with Feedback Graphs

21. Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy

22. Policy Certificates: Towards Accountable Reinforcement Learning

23. On Oracle-Efficient PAC RL with Rich Observations

24. Decoupling Learning Rules from Representations

25. Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning

26. Sample Efficient Policy Search for Optimal Stopping Domains

28. Memory Lens: How Much Memory Does an Agent Use?

29. Thoughts on Massively Scalable Gaussian Processes

30. Sample Complexity of Episodic Fixed-Horizon Reinforcement Learning

31. The Human Kernel

32. Bayesian Time-of-Flight for Realtime Shape, Illumination and Albedo

34. Pottics – The Potts Topic Model for Semantic Image Segmentation

35. A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning

36. Scaling up behavioral science interventions in online education

37. Strategic Exploration in Reinforcement Learning - New Algorithms and Learning Guarantees

41. The Human Kernel

42. Rlpy: A Value-Function-Based Reinforcement Learning Framework for Education and Research

44. RLPy: A Value-Function-Based Reinforcement Learning Framework for Education and Research.

45. Policy Evaluation with Temporal Differences: A Survey and Comparison.

46. Neural Active Learning with Performance Guarantees

Catalog

Books, media, physical & digital resources