Search

Your search for author "Yang, Zhuoran" returned 647 results

Search Results

1. Unveiling Induction Heads: Provable Training Dynamics and Feature Learning in Transformers

2. Unveiling the Statistical Foundations of Chain-of-Thought Prompting Methods

3. Provable Statistical Rates for Consistency Diffusion Models

4. From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems

5. STRIDE: A Tool-Assisted LLM Agent Framework for Strategic and Interactive Decision-Making

6. Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning

7. Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation

8. A Mean-Field Analysis of Neural Stochastic Gradient Descent-Ascent for Functional Minimax Optimization

9. Unveil Conditional Diffusion Models with Classifier-free Guidance: A Sharp Statistical Theory

10. On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games

11. Training Dynamics of Multi-Head Softmax Attention for In-Context Learning: Emergence, Convergence, and Optimality

12. Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning

13. Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF

14. Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems

15. Empowering Autonomous Driving with Large Language Models: A Safety Perspective

16. Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks

17. Posterior Sampling for Competitive RL: Function Approximation and Partial Observation

18. Learning Regularized Graphon Mean-Field Games with Unknown Graphons

19. Learning Regularized Monotone Graphon Mean-Field Games

20. Sample-Efficient Multi-Agent RL: An Optimization Perspective

22. Actions Speak What You Want: Provably Sample-Efficient Reinforcement Learning of the Quantal Stackelberg Equilibrium from Strategic Feedbacks

23. Contextual Dynamic Pricing with Strategic Buyers

25. A General Framework for Sequential Decision-Making under Adaptivity Constraints

26. Provably Efficient Representation Learning with Tractable Planning in Low-Rank POMDP

27. Provably Efficient Generalized Lagrangian Policy Optimization for Safe Multi-Agent Reinforcement Learning

28. What and How does In-Context Learning Learn? Bayesian Model Averaging, Parameterization, and Generalization

29. Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning

30. Maximize to Explore: One Objective Function Fusing Estimation, Planning, and Exploration

31. Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism

32. Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning

33. Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization

34. A Unified Framework of Policy Learning for Contextual Bandit with Confounding Bias and Missing Observations

35. Learning to Incentivize Information Acquisition: Proper Scoring Rules Meet Principal-Agent Model

36. Can We Find Nash Equilibria at a Linear Rate in Markov Games?

37. Finding Regularized Competitive Equilibria of Heterogeneous Agent Macroeconomic Models with Reinforcement Learning

38. Offline Policy Optimization in RL with Variance Regularization

39. Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information

40. Policy learning 'without' overlap: Pessimism and generalized empirical Bernstein's inequality

41. The Sample Complexity of Online Contract Design

42. GEC: A Unified Framework for Interactive Decision Making in MDP, POMDP, and Beyond

43. A Reinforcement Learning Approach in Multi-Phase Second-Price Auction Design

44. Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments

45. Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL

46. Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes

47. Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments

48. Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning

49. Provably Efficient Fictitious Play Policy Optimization for Zero-Sum Markov Games with Structured Transitions

50. Decentralized Optimistic Hyperpolicy Mirror Descent: Provably No-Regret Learning in Markov Games
