Search

Your search keyword '"Gu, Quanquan"' showing total 134 results

Search Constraints

Start Over You searched for: Author "Gu, Quanquan" Remove constraint Author: "Gu, Quanquan" Topic mathematics - optimization and control Remove constraint Topic: mathematics - optimization and control
134 results on '"Gu, Quanquan"'

Search Results

1. Matching the Statistical Query Lower Bound for k-sparse Parity Problems with Stochastic Gradient Descent

2. Feel-Good Thompson Sampling for Contextual Dueling Bandits

3. Reinforcement Learning from Human Feedback with Active Queries

4. Risk Bounds of Accelerated SGD for Overparameterized Linear Regression

5. Implicit Bias of Gradient Descent for Two-layer ReLU and Leaky ReLU Networks on Nearly-orthogonal Data

6. Why Does Sharpness-Aware Minimization Generalize Better Than SGD?

7. Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning

8. Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits

9. The Implicit Bias of Batch Normalization in Linear Models and Two-layer Linear Convolutional Neural Networks

10. Horizon-free Reinforcement Learning in Adversarial Linear Mixture MDPs

11. Uniform-PAC Guarantees for Model-Based RL with Bounded Eluder Dimension

12. Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs

13. The Benefits of Mixup for Feature Learning

14. Benign Overfitting for Two-layer ReLU Convolutional Neural Networks

15. Finite-Sample Analysis of Learning High-Dimensional Single ReLU Neuron

16. Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency

17. Nearly Minimax Optimal Reinforcement Learning for Linear Markov Decision Processes

18. Nesterov Meets Optimism: Rate-Optimal Separable Minimax Optimization

19. Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium

20. The Power and Limitation of Pretraining-Finetuning for Linear Regression under Covariate Shift

21. Computationally Efficient Horizon-Free Reinforcement Learning for Linear Mixture MDPs

22. Risk Bounds of Multi-Pass SGD for Least Squares in the Interpolation Regime

23. Optimal Online Generalized Linear Regression with Stochastic Noise and Its Application to Heteroscedastic Bandits

24. Benign Overfitting in Two-layer Convolutional Neural Networks

25. Benign Overfitting in Adversarially Robust Linear Classification

26. Learning Stochastic Shortest Path with Linear Function Approximation

27. Faster Perturbed Stochastic Gradient Methods for Finding Local Minima

28. Locally Differentially Private Reinforcement Learning for Linear Mixture Markov Decision Processes

29. Adaptive Differentially Private Empirical Risk Minimization

30. Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation

31. Last Iterate Risk Bounds of SGD with Decaying Stepsize for Overparameterized Linear Regression

32. Understanding the Generalization of Adam in Learning Neural Networks with Proper Regularization

33. The Benefits of Implicit Regularization from SGD in Least Squares Problems

34. Proxy Convexity: A Unified Framework for the Analysis of Neural Networks Trained by Gradient Descent

35. Self-training Converts Weak Learners to Strong Learners in Mixture Models

36. Variance-Aware Off-Policy Evaluation with Linear Function Approximation

37. Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL

38. Uniform-PAC Bounds for Reinforcement Learning with Linear Function Approximation

39. Provable Robustness of Adversarial Training for Learning Halfspaces with Noise

40. Benign Overfitting of Constant-Stepsize SGD for Linear Regression

41. Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs

42. Almost Optimal Algorithms for Two-player Zero-Sum Linear Mixture Markov Games

43. Nearly Minimax Optimal Regret for Learning Infinite-horizon Average-reward MDPs with Linear Function Approximation

44. Provably Efficient Reinforcement Learning with Linear Function Approximation Under Adaptivity Constraints

45. Provable Generalization of SGD-trained Neural Networks of Any Width in the Presence of Adversarial Label Noise

46. Nearly Minimax Optimal Reinforcement Learning for Linear Mixture Markov Decision Processes

47. Logarithmic Regret for Reinforcement Learning with Linear Function Approximation

48. Provable Multi-Objective Reinforcement Learning with Generative Models

49. Nearly Minimax Optimal Reinforcement Learning for Discounted MDPs

50. Agnostic Learning of Halfspaces with Gradient Descent via Soft Margins

Catalog

Books, media, physical & digital resources