Search

Your search for "Michal Valko" returned 225 results.

Search Results

2. Decoding-time Realignment of Language Models.

4. Nash Learning from Human Feedback.

6. Demonstration-Regularized RL.

8. Identification of Microbial and Proteomic Biomarkers in Early Childhood Caries.

19. Fast Rates for Maximum Entropy Exploration.

20. Understanding Self-Predictive Learning for Reinforcement Learning.

22. Quantile Credit Assignment.

24. Adaptive Multi-Goal Exploration.

26. Retrieval-Augmented Reinforcement Learning.

30. Nash Learning from Human Feedback.

41. Adaptive Reward-Free Exploration.
