Author: "Michal Valko" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Michal Valko"' showing total 225 results

Start Over Author "Michal Valko"

225 results on '"Michal Valko"'

1. A General Theoretical Paradigm to Understand Learning from Human Preferences.

Author: Mohammad Gheshlaghi Azar, Zhaohan Daniel Guo, Bilal Piot, Rémi Munos, Mark Rowland 0001, Michal Valko, and Daniele Calandriello
Published: 2024

2. Decoding-time Realignment of Language Models.

Author: Tianlin Liu, Shangmin Guo, Leonardo Bianco, Daniele Calandriello, Quentin Berthet, Felipe Llinares-López, Jessica Hoffmann, Lucas Dixon, Michal Valko, and Mathieu Blondel
Published: 2024

3. Generalized Preference Optimization: A Unified Approach to Offline Alignment.

Author: Yunhao Tang, Zhaohan Daniel Guo, Zeyu Zheng, Daniele Calandriello, Rémi Munos, Mark Rowland 0001, Pierre Harvey Richemond, Michal Valko, Bernardo ávila Pires, and Bilal Piot
Published: 2024

4. Nash Learning from Human Feedback.

Author: Rémi Munos, Michal Valko, Daniele Calandriello, Mohammad Gheshlaghi Azar, Mark Rowland 0001, Zhaohan Daniel Guo, Yunhao Tang, Matthieu Geist, Thomas Mesnard, Côme Fiegel, Andrea Michi, Marco Selvi, Sertan Girgin, Nikola Momchev, Olivier Bachem, Daniel J. Mankowitz, Doina Precup, and Bilal Piot
Published: 2024

5. Human Alignment of Large Language Models through Online Preference Optimisation.

Author: Daniele Calandriello, Zhaohan Daniel Guo, Rémi Munos, Mark Rowland 0001, Yunhao Tang, Bernardo ávila Pires, Pierre Harvey Richemond, Charline Le Lan, Michal Valko, Tianqi Liu 0002, Rishabh Joshi, Zeyu Zheng, and Bilal Piot
Published: 2024

6. Demonstration-Regularized RL.

Author: Daniil Tiapkin, Denis Belomestny, Daniele Calandriello, Eric Moulines, Alexey Naumov, Pierre Perrault, Michal Valko, and Pierre Ménard
Published: 2024

7. Unlocking the Power of Representations in Long-term Novelty-based Exploration.

Author: Alaa Saade, Steven Kapturowski, Daniele Calandriello, Charles Blundell, Pablo Sprechmann, Leopoldo Sarra, Oliver Groth, Michal Valko, and Bilal Piot
Published: 2024

8. Identification of Microbial and Proteomic Biomarkers in Early Childhood Caries

Author: Thomas C. Hart, Patricia M. Corby, Milos Hauskrecht, Ok Hee Ryu, Richard Pelikan, Michal Valko, Maria B. Oliveira, Gerald T. Hoehn, and Walter A. Bretz
Subjects: Dentistry, RK1-715
Abstract: The purpose of this study was to provide a univariate and multivariate analysis of genomic microbial data and salivary mass-spectrometry proteomic profiles for dental caries outcomes. In order to determine potential useful biomarkers for dental caries, a multivariate classification analysis was employed to build predictive models capable of classifying microbial and salivary sample profiles with generalization performance. We used high-throughput methodologies including multiplexed microbial arrays and SELDI-TOF-MS profiling to characterize the oral flora and salivary proteome in 204 children aged 1–8 years (n=118 caries-free, n=86 caries-active). The population received little dental care and was deemed at high risk for childhood caries. Findings of the study indicate that models incorporating both microbial and proteomic data are superior to models of only microbial or salivary data alone. Comparison of results for the combined and independent data suggests that the combination of proteomic and microbial sources is beneficial for the classification accuracy and that combined data lead to improved predictive models for caries-active and caries-free patients. The best predictive model had a 6% test error, >92% sensitivity, and >95% specificity. These findings suggest that further characterization of the oral microflora and the salivary proteome associated with health and caries may provide clinically useful biomarkers to better predict future caries experience.
Published: 2011
Full Text: View/download PDF

9. Preference Optimization with Multi-Sample Comparisons.

Author: Chaoqi Wang, Zhuokai Zhao, Chen Zhu, Karthik Abinav Sankararaman, Michal Valko, Xuefei Cao, Zhaorun Chen, Madian Khabsa, Yuxin Chen, Hao Ma 0001, and Sinong Wang
Published: 2024
Full Text: View/download PDF

10. A New Bound on the Cumulant Generating Function of Dirichlet Processes.

Author: Pierre Perrault, Denis Belomestny, Pierre Ménard, éric Moulines, Alexey Naumov, Daniil Tiapkin, and Michal Valko
Published: 2024
Full Text: View/download PDF

11. Optimal Design for Reward Modeling in RLHF.

Author: Antoine Scheid, Etienne Boursier, Alain Durmus, Michael I. Jordan, Pierre Ménard, Eric Moulines, and Michal Valko
Published: 2024
Full Text: View/download PDF

12. Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving.

Author: Aniket Didolkar, Anirudh Goyal, Nan Rosemary Ke, Siyuan Guo, Michal Valko, Timothy P. Lillicrap, Danilo J. Rezende, Yoshua Bengio, Michael Mozer, and Sanjeev Arora
Published: 2024
Full Text: View/download PDF

13. Understanding the performance gap between online and offline alignment algorithms.

Author: Yunhao Tang, Zhaohan Daniel Guo, Zeyu Zheng, Daniele Calandriello, Yuan Cao, Eugene Tarassov, Rémi Munos, Bernardo ávila Pires, Michal Valko, Yong Cheng, and Will Dabney
Published: 2024
Full Text: View/download PDF

14. Adapting to game trees in zero-sum imperfect information games.

Author: Côme Fiegel, Pierre Ménard, Tadashi Kozuno, Rémi Munos, Vianney Perchet, and Michal Valko
Published: 2023

15. Curiosity in Hindsight: Intrinsic Exploration in Stochastic Environments.

Author: Daniel Jarrett, Corentin Tallec, Florent Altché, Thomas Mesnard, Rémi Munos, and Michal Valko
Published: 2023

16. Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice.

Author: Toshinori Kitamura, Tadashi Kozuno, Yunhao Tang, Nino Vieillard, Michal Valko, Wenhao Yang, Jincheng Mei, Pierre Ménard, Mohammad Gheshlaghi Azar, Rémi Munos, Olivier Pietquin, Matthieu Geist, Csaba Szepesvári, Wataru Kumagai, and Yutaka Matsuo
Published: 2023

17. DoMo-AC: Doubly Multi-step Off-policy Actor-Critic Algorithm.

Author: Yunhao Tang, Tadashi Kozuno, Mark Rowland 0001, Anna Harutyunyan, Rémi Munos, Bernardo ávila Pires, and Michal Valko
Published: 2023

18. VA-learning as a more efficient alternative to Q-learning.

Author: Yunhao Tang, Rémi Munos, Mark Rowland 0001, and Michal Valko
Published: 2023

19. Fast Rates for Maximum Entropy Exploration.

Author: Daniil Tiapkin, Denis Belomestny, Daniele Calandriello, Eric Moulines, Rémi Munos, Alexey Naumov, Pierre Perrault, Yunhao Tang, Michal Valko, and Pierre Ménard
Published: 2023

20. Understanding Self-Predictive Learning for Reinforcement Learning.

Author: Yunhao Tang, Zhaohan Daniel Guo, Pierre Harvey Richemond, Bernardo ávila Pires, Yash Chandak, Rémi Munos, Mark Rowland 0001, Mohammad Gheshlaghi Azar, Charline Le Lan, Clare Lyle, András György 0001, Shantanu Thakoor, Will Dabney, Bilal Piot, Daniele Calandriello, and Michal Valko
Published: 2023

21. Half-Hop: A graph upsampling approach for slowing down message passing.

Author: Mehdi Azabou, Venkataramana Ganesh, Shantanu Thakoor, Chi-Heng Lin, Lakshmi Sathidevi, Ran Liu, Michal Valko, Petar Velickovic, and Eva L. Dyer
Published: 2023

22. Quantile Credit Assignment.

Author: Thomas Mesnard, Wenqi Chen, Alaa Saade, Yunhao Tang, Mark Rowland 0001, Theophane Weber, Clare Lyle, Audrunas Gruslys, Michal Valko, Will Dabney, Georg Ostrovski, Eric Moulines, and Rémi Munos
Published: 2023

23. Marginalized Operators for Off-policy Reinforcement Learning.

Author: Yunhao Tang, Mark Rowland 0001, Rémi Munos, and Michal Valko
Published: 2022

24. Adaptive Multi-Goal Exploration.

Author: Jean Tarbouriech, Omar Darwiche Domingues, Pierre Ménard, Matteo Pirotta, Michal Valko, and Alessandro Lazaric
Published: 2022

25. From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses.

Author: Daniil Tiapkin, Denis Belomestny, Eric Moulines, Alexey Naumov, Sergey Samsonov, Yunhao Tang, Michal Valko, and Pierre Ménard
Published: 2022

26. Retrieval-Augmented Reinforcement Learning.

Author: Anirudh Goyal, Abram L. Friesen, Andrea Banino, Theophane Weber, Nan Rosemary Ke, Adrià Puigdomènech Badia, Arthur Guez, Mehdi Mirza, Peter Conway Humphreys, Ksenia Konyushkova, Michal Valko, Simon Osindero, Timothy P. Lillicrap, Nicolas Heess, and Charles Blundell
Published: 2022

27. Scaling Gaussian Process Optimization by Evaluating a Few Unique Candidates Multiple Times.

Author: Daniele Calandriello, Luigi Carratino, Alessandro Lazaric, Michal Valko, and Lorenzo Rosasco
Published: 2022

28. Model-free Posterior Sampling via Learning Rate Randomization.

Author: Daniil Tiapkin, Denis Belomestny, Daniele Calandriello, Eric Moulines, Rémi Munos, Alexey Naumov, Pierre Perrault, Michal Valko, and Pierre Ménard
Published: 2023

29. Local and adaptive mirror descents in extensive-form games.

Author: Côme Fiegel, Pierre Ménard, Tadashi Kozuno, Rémi Munos, Vianney Perchet, and Michal Valko
Published: 2023
Full Text: View/download PDF

30. Nash Learning from Human Feedback.

Author: Rémi Munos, Michal Valko, Daniele Calandriello, Mohammad Gheshlaghi Azar, Mark Rowland 0001, Zhaohan Daniel Guo, Yunhao Tang, Matthieu Geist, Thomas Mesnard, Andrea Michi, Marco Selvi, Sertan Girgin, Nikola Momchev, Olivier Bachem, Daniel J. Mankowitz, Doina Precup, and Bilal Piot
Published: 2023
Full Text: View/download PDF

31. Unlocking the Power of Representations in Long-term Novelty-based Exploration.

Author: Alaa Saade, Steven Kapturowski, Daniele Calandriello, Charles Blundell, Pablo Sprechmann, Leopoldo Sarra, Oliver Groth, Michal Valko, and Bilal Piot
Published: 2023
Full Text: View/download PDF

32. A General Theoretical Paradigm to Understand Learning from Human Preferences.

Author: Mohammad Gheshlaghi Azar, Mark Rowland 0001, Bilal Piot, Daniel Guo, Daniele Calandriello, Michal Valko, and Rémi Munos
Published: 2023
Full Text: View/download PDF

33. Demonstration-Regularized RL.

Author: Daniil Tiapkin, Denis Belomestny, Daniele Calandriello, Eric Moulines, Alexey Naumov, Pierre Perrault, Michal Valko, and Pierre Ménard
Published: 2023
Full Text: View/download PDF

34. Broaden Your Views for Self-Supervised Video Learning.

Author: Adrià Recasens, Pauline Luc, Jean-Baptiste Alayrac, Luyu Wang, Florian Strub, Corentin Tallec, Mateusz Malinowski, Viorica Patraucean, Florent Altché, Michal Valko, Jean-Bastien Grill, Aäron van den Oord, and Andrew Zisserman
Published: 2021
Full Text: View/download PDF

35. Learning in two-player zero-sum partially observable Markov games with perfect recall.

Author: Tadashi Kozuno, Pierre Ménard, Rémi Munos, and Michal Valko
Published: 2021

36. A Provably Efficient Sample Collection Strategy for Reinforcement Learning.

Author: Jean Tarbouriech, Matteo Pirotta, Michal Valko, and Alessandro Lazaric
Published: 2021

37. Drop, Swap, and Generate: A Self-Supervised Approach for Generating Neural Activity.

Author: Ran Liu, Mehdi Azabou, Max Dabagia, Chi-Heng Lin, Mohammad Gheshlaghi Azar, Keith B. Hengen, Michal Valko, and Eva L. Dyer
Published: 2021

38. Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret.

Author: Jean Tarbouriech, Runlong Zhou, Simon S. Du, Matteo Pirotta, Michal Valko, and Alessandro Lazaric
Published: 2021

39. Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation.

Author: Yunhao Tang, Tadashi Kozuno, Mark Rowland 0001, Rémi Munos, and Michal Valko
Published: 2021

40. Sample Complexity Bounds for Stochastic Shortest Path with a Generative Model.

Author: Jean Tarbouriech, Matteo Pirotta, Michal Valko, and Alessandro Lazaric
Published: 2021

41. Adaptive Reward-Free Exploration.

Author: Emilie Kaufmann, Pierre Ménard, Omar Darwiche Domingues, Anders Jonsson 0001, Edouard Leurent, and Michal Valko
Published: 2021

42. Episodic Reinforcement Learning in Finite MDPs: Minimax Lower Bounds Revisited.

Author: Omar Darwiche Domingues, Pierre Ménard, Emilie Kaufmann, and Michal Valko
Published: 2021

43. A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces.

Author: Omar Darwiche Domingues, Pierre Ménard, Matteo Pirotta, Emilie Kaufmann, and Michal Valko
Published: 2021

44. Fast active learning for pure exploration in reinforcement learning.

Author: Pierre Ménard, Omar Darwiche Domingues, Anders Jonsson 0001, Emilie Kaufmann, Edouard Leurent, and Michal Valko
Published: 2021

45. Revisiting Peng's Q(λ) for Modern Reinforcement Learning.

Author: Tadashi Kozuno, Yunhao Tang, Mark Rowland 0001, Rémi Munos, Steven Kapturowski, Will Dabney, Michal Valko, and David Abel
Published: 2021