Search results for author "Uehara, Masatoshi": 138 results in total

Search Results

1. Derivative-Free Guidance in Continuous and Discrete Diffusion Models with Soft Value-Based Decoding

2. Understanding Reinforcement Learning-Based Fine-Tuning of Diffusion Models: A Tutorial and Review

3. Adding Conditional Control to Diffusion Models with Reinforcement Learning

4. Bridging Model-Based Optimization and Generative Modeling via Conservative Fine-Tuning of Diffusion Models

5. Regularized DeepIV with Model Selection

6. Feedback Efficient Online Fine-Tuning of Diffusion Models

7. Fine-Tuning of Continuous-Time Diffusion Models as Entropy-Regularized Control

8. Functional Graphical Models: Structure Enables Offline Data-Driven Optimization

9. Source Condition Double Robust Inference on Functionals of Inverse Problems

10. Off-Policy Evaluation of Ranking Policies under Diverse User Behavior

11. Provable Reward-Agnostic Preference-Based Reinforcement Learning

12. Provable Offline Preference-Based Reinforcement Learning

13. Distributional Offline Policy Evaluation with Predictive Error Guarantees

14. Minimax Instrumental Variable Regression and $L_2$ Convergence Guarantees without Identification or Closedness

15. Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage

16. A Review of Off-Policy Evaluation in Reinforcement Learning

17. Inference on Strongly Identified Functionals of Weakly Identified Functions

18. Future-Dependent Value-Based Off-Policy Evaluation in POMDPs

19. PAC Reinforcement Learning for Predictive State Representations

20. Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings

21. Provably Efficient Reinforcement Learning in Partially Observable Dynamical Systems

22. Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach

23. A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes

24. Representation Learning for Online and Offline RL in Low-rank MDPs

25. Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage

26. Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage

27. Causal Inference Under Unmeasured Confounding With Negative Controls: A Minimax Learning Approach

28. Finite Sample Analysis of Minimax Offline Reinforcement Learning: Completeness, Fast Rates and First-Order Efficiency

29. Fast Rates for the Regret of Offline Reinforcement Learning

30. Optimal Off-Policy Evaluation from Multiple Logging Policies

31. Doubly Robust Off-Policy Value and Gradient Estimation for Deterministic Policies

32. Efficient Evaluation of Natural Stochastic Policies in Offline Reinforcement Learning

33. Off-Policy Evaluation and Learning for External Validity under a Covariate Shift

34. Statistically Efficient Off-Policy Policy Gradients

35. Localized Debiased Machine Learning: Efficient Inference on Quantile Treatment Effects and Beyond

36. Minimax Weight and Q-Function Learning for Off-Policy Evaluation

37. Efficiently Breaking the Curse of Horizon in Off-Policy Evaluation with Double Reinforcement Learning

38. Double Reinforcement Learning for Efficient Off-Policy Evaluation in Markov Decision Processes

39. Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning

40. Information criteria for non-normalized models

41. Imputation estimators for unnormalized models with missing data

42. Unified estimation framework for unnormalized models with statistical efficiency

43. Semiparametric response model with nonignorable nonresponse

44. Analysis of Noise Contrastive Estimation from the Perspective of Asymptotic Variance

45. Generative Adversarial Nets from a Density Ratio Estimation Perspective

46. Efficient evaluation of natural stochastic policies in off-line reinforcement learning.

50. Refined Value-Based Offline RL under Realizability and Partial Coverage
