Search

Your search keyword '"Wang, Weixun"' showing total 109 results

Search Constraints

Start Over You searched for: Author "Wang, Weixun" Remove constraint Author: "Wang, Weixun" Publication Year Range Last 10 years Remove constraint Publication Year Range: Last 10 years
109 results on '"Wang, Weixun"'

Search Results

1. 2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision

2. OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework

3. The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

4. MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library

5. Off-Beat Multi-Agent Reinforcement Learning

6. A2C is a special case of PPO

7. Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents

8. Breaking the Curse of Dimensionality in Multiagent State Space: A Unified Agent Permutation Framework

9. Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization

11. Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment

12. Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping

13. Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising

14. Efficient Deep Reinforcement Learning via Adaptive Policy Transfer

15. An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning

16. KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge

17. Multi-Agent Game Abstraction via Graph Attention Neural Network

18. Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems

19. From Few to More: Large-scale Dynamic Multiagent Curriculum Learning

20. Action Semantics Network: Considering the Effects of Actions in Multiagent Systems

21. Learning Adaptive Display Exposure for Real-Time Advertising

22. Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach

25. Investigating Performance of the SLIM-Based High Resolution Ion Mobility Platform for Separation of Isomeric Phosphatidylcholine Species

27. Cooperative Multiagent Transfer Learning With Coalition Pattern Decomposition

28. MARLlib: A Scalable Multi-agent Reinforcement Learning Library

31. Discovery of Insulin Receptor Partial Agonists MK-5160 and MK-1092 as Novel Basal Insulins with Potential to Improve Therapeutic Index

32. Development of ProTx-II Analogues as Highly Selective Peptide Blockers of Nav1.7 for the Treatment of Pain

33. Guiding Chemically Synthesized Peptide Drug Lead Optimization by Derisking Mast Cell Degranulation-Related Toxicities of a NaV1.7 Peptide Inhibitor

34. A Series of Novel, Highly Potent, and Orally Bioavailable Next-Generation Tricyclic Peptide PCSK9 Inhibitors

36. Efficient Deep Reinforcement Learning via Adaptive Policy Transfer

41. Guiding Chemically Synthesized Peptide Drug Lead Optimization by Derisking Mast Cell Degranulation-Related Toxicities of a NaV1.7 Peptide Inhibitor.

42. Learning Adaptive Display Exposure for Real-Time Advertising

46. Erratum. Dipeptidyl Peptidase 4 Inhibition Stimulates Distal Tubular Natriuresis and Increases in Circulating SDF-1α 1-67 in Patients With Type 2 Diabetes. Diabetes Care 2017;40:1073–1081

47. Dipeptidyl Peptidase 4 Inhibition Stimulates Distal Tubular Natriuresis and Increases in Circulating SDF-1α1-67 in Patients With Type 2 Diabetes

48. Dipeptidyl Peptidase 4 Inhibition Stimulates Distal Tubular Natriuresis and Increases in Circulating SDF-1α1-67 in Patients With Type 2 Diabetes.

49. Leveraging High-Resolution Ion Mobility-Mass Spectrometry for Cyclic Peptide Soft Spot Identification.

50. Development of ProTx-II Analogues as Highly Selective Peptide Blockers of Na v 1.7 for the Treatment of Pain.

Catalog

Books, media, physical & digital resources