Search

Your search keyword '"Huang, Longbo"' showing total 390 results

Search Constraints

Start Over You searched for: Author "Huang, Longbo" Remove constraint Author: "Huang, Longbo"
390 results on '"Huang, Longbo"'

Search Results

1. Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration

2. uniINF: Best-of-Both-Worlds Algorithm for Parameter-Free Heavy-Tailed MABs

3. Beyond Squared Error: Exploring Loss Design for Enhanced Training of Generative Flow Networks

4. Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training

5. Adversarial Network Optimization under Bandit Feedback: Maximizing Utility in Non-Stationary Multi-Hop Networks

6. Mixed Sparsity Training: Achieving 4$\times$ FLOP Reduction for Transformer Pretraining

7. RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning

8. Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation

9. Provably Efficient Partially Observable Risk-Sensitive Reinforcement Learning with Hindsight Observation

10. LCM-LoRA: A Universal Stable-Diffusion Acceleration Module

11. A Quadratic Synchronization Rule for Distributed Deep Learning

12. One is More: Diverse Perspectives within a Single Network for Efficient DRL

13. Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

14. Provably Efficient Iterated CVaR Reinforcement Learning with Function Approximation and Human Feedback

15. Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning

16. Queue Scheduling with Adversarial Bandit Learning

17. Why (and When) does Local SGD Generalize Better than SGD?

18. RePreM: Representation Pre-training with Masked Model for Reinforcement Learning

19. Stochastic Generative Flow Networks

20. Provably Safe Reinforcement Learning with Step-wise Violation Constraints

21. Multi-task Representation Learning for Pure Exploration in Linear Bandits

22. Banker Online Mirror Descent: A Universal Approach for Delayed Online Bandit Learning

23. Dueling Bandits: From Two-dueling to Multi-dueling

24. Generative Augmented Flow Networks

25. Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learning

26. Nearly Minimax Optimal Reinforcement Learning with Linear Function Approximation

27. Provable Generalization of Overparameterized Meta-learning Trained with SGD

28. Provably Efficient Risk-Sensitive Reinforcement Learning: Iterated CVaR and Worst Path

29. RLx2: Training a Sparse Deep Reinforcement Learning Model from Scratch

30. Network Topology Optimization via Deep Reinforcement Learning

31. Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably)

32. Adaptive Best-of-Both-Worlds Algorithm for Heavy-Tailed Multi-Armed Bandits

37. Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification

38. Simultaneously Achieving Sublinear Regret and Constraint Violations for Online Convex Optimization with Time-varying Constraints

39. Collaborative Pure Exploration in Kernel Bandit

40. Scale-Free Adversarial Multi-Armed Bandit with Arbitrary Feedback Delays

41. Banker Online Mirror Descent

42. What Makes Multi-modal Learning Better than Single (Provably)

43. Fast Federated Learning in the Presence of Arbitrary Device Unavailability

44. The best of both worlds: stochastic and adversarial episodic MDPs with unknown transition

45. Regularized Softmax Deep Multi-Agent $Q$-Learning

47. Continuous Mean-Covariance Bandits

48. A One-Size-Fits-All Solution to Conservative Bandit Problems

49. Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback

50. Restless-UCB, an Efficient and Low-complexity Algorithm for Online Restless Bandits

Catalog

Books, media, physical & digital resources