Search

Showing total 202 results

Search Constraints

Start Over You searched for: Topic computer science - machine learning Remove constraint Topic: computer science - machine learning Topic mathematics - optimization and control Remove constraint Topic: mathematics - optimization and control Topic mathematics - statistics theory Remove constraint Topic: mathematics - statistics theory Publication Type Reports Remove constraint Publication Type: Reports
202 results

Search Results

1. Non-convex matrix sensing: Breaking the quadratic rank barrier in the sample complexity

2. Metric Entropy-Free Sample Complexity Bounds for Sample Average Approximation in Convex Stochastic Programming

3. Learning the Uncertainty Sets for Control Dynamics via Set Membership: A Non-Asymptotic Analysis

4. Certified Multi-Fidelity Zeroth-Order Optimization

5. On Regularization via Early Stopping for Least Squares Regression

6. Gaussian Approximation and Multiplier Bootstrap for Polyak-Ruppert Averaged Linear Stochastic Approximation with Applications to TD Learning

7. DPO: Differential reinforcement learning with application to optimal configuration search

8. Convergence of coordinate ascent variational inference for log-concave measures via optimal transport

9. High-probability sample complexities for policy evaluation with linear function approximation

10. Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism

11. Analysing heavy-tail properties of Stochastic Gradient Descent by means of Stochastic Recurrence Equations

12. An Improved Analysis of Langevin Algorithms with Prior Diffusion for Non-Log-Concave Sampling

13. Shifted Interpolation for Differential Privacy

14. Probabilistic Guarantees of Stochastic Recursive Gradient in Non-Convex Finite Sum Problems

15. A New Perspective On Denoising Based On Optimal Transport

16. Solution-Set Geometry and Regularization Path of a Nonconvexly Regularized Convex Sparse Model

17. Sketching for Convex and Nonconvex Regularized Least Squares with Sharp Guarantees

18. Convergence of flow-based generative models via proximal gradient descent in Wasserstein space

19. Robust Matrix Completion with Heavy-tailed Noise

20. RIGID: Robust Linear Regression with Missing Data

21. Some notes concerning a generalized KMM-type optimization method for density ratio estimation

22. RandALO: Out-of-sample risk estimation in no time flat

23. The Star Geometry of Critic-Based Regularizer Learning

24. Fast convergence of the Expectation Maximization algorithm under a logarithmic Sobolev inequality

25. Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning

26. $K$-Nearest-Neighbor Resampling for Off-Policy Evaluation in Stochastic Control

27. The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning

28. Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model

29. Sample-Efficient Reinforcement Learning Is Feasible for Linearly Realizable MDPs with Limited Revisiting

30. Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR

31. Tight Guarantees for Interactive Decision Making with the Decision-Estimation Coefficient

32. The Collusion of Memory and Nonlinearity in Stochastic Approximation With Constant Stepsize

33. 4+3 Phases of Compute-Optimal Neural Scaling Laws

34. Progressive Feedforward Collapse of ResNet Training

35. Quickest Change Detection with Confusing Change

36. High dimensional analysis reveals conservative sharpening and a stochastic edge of stability

37. Linear Convergence of ISTA and FISTA

38. Blessings and Curses of Covariate Shifts: Adversarial Learning Dynamics, Directional Convergence, and Equilibria

39. Model-Free Reinforcement Learning with the Decision-Estimation Coefficient

40. Proximal Subgradient Norm Minimization of ISTA and FISTA

41. Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian

42. Stochastic Mirror Descent for Large-Scale Sparse Recovery

43. Distributionally Robust Batch Contextual Bandits

44. The Heavy-Tail Phenomenon in SGD

45. Breaking the Sample Size Barrier in Model-Based Reinforcement Learning with a Generative Model

46. Bridging Convex and Nonconvex Optimization in Robust PCA: Noise, Outliers, and Missing Data

47. Training Dynamics of Multi-Head Softmax Attention for In-Context Learning: Emergence, Convergence, and Optimality

48. Inferring Dynamic Networks from Marginals with Iterative Proportional Fitting

49. Robustly Learning Single-Index Models via Alignment Sharpness

50. Geometry-induced Implicit Regularization in Deep ReLU Neural Networks