Search

Your search for "Pan, Gang" returned 23 results.

Search Constraints

Author: "Pan, Gang"
Topic: computer science - machine learning

Search Results

1. Toward Large-scale Spiking Neural Networks: A Comprehensive Survey and Future Directions

2. Resisting Stochastic Risks in Diffusion Planners with the Trajectory Aggregation Tree

3. Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline

4. Generalizable Sleep Staging via Multi-Level Domain Alignment

5. FP3O: Enabling Proximal Policy Optimization in Multi-Agent Cooperation with Parameter-Sharing Versatility

6. Mitigating Communication Costs in Neural Networks: The Role of Dendritic Nonlinearity

7. Constrained Update Projection Approach to Safe Policy Optimization

8. TinyLight: Adaptive Traffic Signal Control on Devices with Extremely Limited Resources

9. Dynamic Ensemble Bayesian Filter for Robust Control of a Human Brain-machine Interface

10. CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning

11. Thompson Sampling for Unimodal Bandits

12. On Convergence of Gradient Expected Sarsa($\lambda$)

13. Sample Complexity of Policy Gradient Finding Second-Order Stationary Points

14. Dynamic Ensemble Modeling Approach to Nonstationary Neural Decoding in Brain-Computer Interfaces

15. Gradient Q$(\sigma, \lambda)$: A Unified Algorithm with Function Approximation for Reinforcement Learning

16. FiDi-RL: Incorporating Deep Reinforcement Learning with Finite-Difference Policy Search for Efficient Learning of Continuous Control

17. Expected Sarsa($\lambda$) with Control Variate for Variance Reduction

18. Policy Optimization with Stochastic Mirror Descent

19. Brain Network Construction and Classification Toolbox (BrainNetClass)

20. TBQ($\sigma$): Improving Efficiency of Trace Utilization for Off-Policy Reinforcement Learning

21. Field-aware Neural Factorization Machine for Click-Through Rate Prediction

22. Qualitative Measurements of Policy Discrepancy for Return-Based Deep Q-Network

23. Spiking Deep Residual Network
