Search

Your search for "Wang, Weixun" returned 213 results.

Search Results

1. OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

2. The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

3. MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library

4. Off-Beat Multi-Agent Reinforcement Learning

5. A2C is a special case of PPO

6. Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents

7. Breaking the Curse of Dimensionality in Multiagent State Space: A Unified Agent Permutation Framework

8. Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization

10. Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment

11. Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping

12. Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising

13. Efficient Deep Reinforcement Learning via Adaptive Policy Transfer

14. An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning

15. KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge

16. Multi-Agent Game Abstraction via Graph Attention Neural Network

17. Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems

18. From Few to More: Large-scale Dynamic Multiagent Curriculum Learning

19. Action Semantics Network: Considering the Effects of Actions in Multiagent Systems

21. Learning Adaptive Display Exposure for Real-Time Advertising

22. Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach

24. A Novel Approach for Handling Misbehaving Nodes in Behavior-Aware Mobile Networking

25. Investigating Performance of the SLIM-Based High Resolution Ion Mobility Platform for Separation of Isomeric Phosphatidylcholine Species

27. Cooperative Multiagent Transfer Learning With Coalition Pattern Decomposition

36. Introduction

37. Conclusions

38. MARLlib: A Scalable Multi-agent Reinforcement Learning Library

41. Discovery of Insulin Receptor Partial Agonists MK-5160 and MK-1092 as Novel Basal Insulins with Potential to Improve Therapeutic Index

42. Development of ProTx-II Analogues as Highly Selective Peptide Blockers of Nav1.7 for the Treatment of Pain

43. Guiding Chemically Synthesized Peptide Drug Lead Optimization by Derisking Mast Cell Degranulation-Related Toxicities of a NaV1.7 Peptide Inhibitor

44. A Series of Novel, Highly Potent, and Orally Bioavailable Next-Generation Tricyclic Peptide PCSK9 Inhibitors

48. Introduction
