Search

Your search for "Sun, Ruoyu" returned 351 results

Search Constraints

Author: "Sun, Ruoyu"; Search Limiters: Full Text

Search Results

1. Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity

2. MoFO: Momentum-Filtered Optimizer for Mitigating Forgetting in LLM Fine-Tuning

3. Adam-mini: Use Fewer Learning Rates To Gain More

4. Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization

5. PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear Programming

6. On the Convergence of Adam under Non-uniform Smoothness: Separability from SGDM and Beyond

7. Why Transformers Need Adam: A Hessian Perspective

8. Combining Transformer based Deep Reinforcement Learning with Black-Litterman Model for Portfolio Optimization

10. ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models

11. LEMON: Lossless model expansion

12. PAC-Bayesian Spectrally-Normalized Bounds for Adversarially Robust Generalization

13. How Graph Neural Networks Learn: Lessons from Training Dynamics

14. AceGPT, Localizing Large Language Models in Arabic

15. Restricted Generative Projection for One-Class Classification and Anomaly Detection

16. Deglacial volcanism and reoxygenation in the aftermath of the Sturtian Snowball Earth.

17. NTK-SAP: Improving neural network pruning by aligning training dynamics

18. Balanced Training for Sparse GANs

19. Invariant Layers for Graphs with Nodes of Different Types

20. A GNN-Guided Predict-and-Search Framework for Mixed-Integer Linear Programming

21. Adversarial Rademacher Complexity of Deep Neural Networks

22. DigGAN: Discriminator gradIent Gap Regularization for GAN Training with Limited Data

23. When Expressivity Meets Trainability: Fewer than $n$ Neurons Can Work

24. Stability Analysis and Generalization Bounds of Adversarial Training

25. Provable Adaptivity of Adam under Non-uniform Smoothness

26. Adam Can Converge Without Any Modification On Update Rules

27. Global Convergence of MAML and Theory-Inspired Neural Architecture Search for Few-Shot Learning

28. Towards Understanding the Impact of Model Size on Differentially Private Classification

29. Portfolio analysis with mean-CVaR and mean-CVaR-skewness criteria based on mean-variance mixture models

31. Federated Semi-Supervised Learning with Class Distribution Mismatch

32. Does Momentum Change the Implicit Regularization on Separable Data?

35. Achieving Small Test Error in Mildly Overparameterized Neural Networks

36. On a Faster $R$-Linear Convergence Rate of the Barzilai-Borwein Method

37. Towards a Better Global Loss Landscape of GANs

38. A Single-Loop Smoothed Gradient Descent-Ascent Algorithm for Nonconvex-Concave Min-Max Problems

39. On the Landscape of One-hidden-layer Sparse Networks and Beyond

40. The Global Landscape of Neural Networks: An Overview

41. Global Convergence and Generalization Bound of Gradient-Based Meta-Learning with Deep Neural Nets

42. Distilling Object Detectors with Task Adaptive Regularization

43. DEED: A General Quantization Scheme for Communication Efficiency in Bits

44. Revisiting Landscape Analysis in Deep Neural Networks: Eliminating Decreasing Paths to Infinity

45. Optimization for deep learning: theory and algorithms

46. Sub-Optimal Local Minima Exist for Neural Networks with Almost All Non-Linear Activations

48. Understanding Limitation of Two Symmetrized Orders by Worst-case Complexity

49. Off-road Autonomous Vehicles Traversability Analysis and Trajectory Planning Based on Deep Inverse Reinforcement Learning

50. Max-Sliced Wasserstein Distance and its use for GANs
