Search

Your search keyword '"Sun, Ruoyu"' showing total 747 results

Search Constraints

Start Over You searched for: Author "Sun, Ruoyu" Remove constraint Author: "Sun, Ruoyu"
747 results on '"Sun, Ruoyu"'

Search Results

1. Adam-mini: Use Fewer Learning Rates To Gain More

2. Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization

3. PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear Programming

4. On the Convergence of Adam under Non-uniform Smoothness: Separability from SGDM and Beyond

5. Why Transformers Need Adam: A Hessian Perspective

6. Combining Transformer based Deep Reinforcement Learning with Black-Litterman Model for Portfolio Optimization

9. ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

10. LEMON: Lossless model expansion

11. PAC-Bayesian Spectrally-Normalized Bounds for Adversarially Robust Generalization

12. How Graph Neural Networks Learn: Lessons from Training Dynamics

13. AceGPT, Localizing Large Language Models in Arabic

14. Restricted Generative Projection for One-Class Classification and Anomaly Detection

15. Deglacial volcanism and reoxygenation in the aftermath of the Sturtian Snowball Earth.

16. NTK-SAP: Improving neural network pruning by aligning training dynamics

17. Balanced Training for Sparse GANs

18. Invariant Layers for Graphs with Nodes of Different Types

19. A GNN-Guided Predict-and-Search Framework for Mixed-Integer Linear Programming

22. Adversarial Rademacher Complexity of Deep Neural Networks

23. DigGAN: Discriminator gradIent Gap Regularization for GAN Training with Limited Data

24. When Expressivity Meets Trainability: Fewer than $n$ Neurons Can Work

25. Stability Analysis and Generalization Bounds of Adversarial Training

26. Provable Adaptivity of Adam under Non-uniform Smoothness

27. Adam Can Converge Without Any Modification On Update Rules

28. Global Convergence of MAML and Theory-Inspired Neural Architecture Search for Few-Shot Learning

32. Towards Understanding the Impact of Model Size on Differential Private Classification

33. Portfolio analysis with mean-CVaR and mean-CVaR-skewness criteria based on mean-variance mixture models

38. Federated Semi-Supervised Learning with Class Distribution Mismatch

39. Does Momentum Change the Implicit Regularization on Separable Data?

42. Achieving Small Test Error in Mildly Overparameterized Neural Networks

43. On a Faster $R$-Linear Convergence Rate of the Barzilai-Borwein Method

44. Towards a Better Global Loss Landscape of GANs

47. A Single-Loop Smoothed Gradient Descent-Ascent Algorithm for Nonconvex-Concave Min-Max Problems

48. On the Landscape of One-hidden-layer Sparse Networks and Beyond

49. The Global Landscape of Neural Networks: An Overview

50. Global Convergence and Generalization Bound of Gradient-Based Meta-Learning with Deep Neural Nets

Catalog

Books, media, physical & digital resources