Search

Your search for author "Bai, Chenjia" returned 110 results.

Search Results

1. ODRL: A Benchmark for Off-Dynamics Reinforcement Learning

2. Preference Aligned Diffusion Planner for Quadrupedal Locomotion Control

3. Task-agnostic Pre-training and Task-guided Fine-tuning for Versatile Diffusion Planner

4. Forward KL Regularized Preference Optimization for Aligning Diffusion Policies

5. SelfBC: Self Behavior Cloning for Offline Reinforcement Learning

6. Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models

7. SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation

8. Constrained Ensemble Exploration for Unsupervised Skill Discovery

9. Cross-Domain Policy Adaptation by Capturing Representation Mismatch

10. Towards Efficient LLM Grounding for Embodied Multi-Agent Collaboration

11. Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning

12. Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning

13. Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning

14. Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning

15. Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning

16. Regularized Conditional Diffusion Model for Multi-Task Preference Alignment

17. Learning an Actionable Discrete Diffusion Policy via Large-Scale Actionless Video Pre-Training

18. OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments

19. Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness

20. Robust Quadrupedal Locomotion via Risk-Averse Policy Learning

21. Bridging the Sim-to-Real Gap from the Information Bottleneck Perspective

22. Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning

23. Cross-Domain Policy Adaptation via Value-Guided Data Filtering

24. On the Value of Myopic Behavior in Policy Reuse

25. Behavior Contrastive Learning for Unsupervised Skill Discovery

26. Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning

27. RORL: Robust Offline Reinforcement Learning via Conservative Smoothing

29. Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning

30. False Correlation Reduction for Offline Reinforcement Learning

31. Dynamic Bottleneck for Robust Self-Supervised Exploration

32. Exploration in Deep Reinforcement Learning: From Single-Agent to Multiagent Domain

33. Principled Exploration via Optimistic Bootstrapping and Backward Induction

34. Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning

36. Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning

43. Privileged Knowledge Distillation for Sim-to-Real Policy Generalization

45. Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning
