Hello! I am Yunpeng Qing, a third-year Ph.D. student in the College of Computer Science & the State Key Laboratory of CAD&CG at Zhejiang University, advised by Professor Changqing Zou. Previously, I worked closely with Shunyu Liu in the Visual Intelligence and Pattern Analysis (VIPA) Lab. I received my B.E. degree from the College of Computer Science and Technology, Zhejiang University in June 2023. I am currently an intern student at ACE ROBOTICS.

My research interests lie in Reinforcement Learning, especially Reinforcement Learning-enhanced Embodied AI. My research goal is to create robust and reliable general frameworks for agents to effectively use sequential decision-making information for behavior improvement. Towards this goal, my prior work has focused on methods that facilitate the application of reinforcement learning algorithms on a variety of static datasets.

Please feel free to contact me via yunpeng.qing.cs@gmail.com if you have any questions, are interested in collaborating, or just want to chat.

[Google Scholar]    [Email]    [GitHub]    [CV]

🔥 News

  • 2026.05: Two papers are accepted by ICML 2026.
  • 2025.01: One paper is accepted by IEEE TITS.
  • 2024.12: One paper is accepted by AAMAS 2025.
  • 2024.11: One paper is accepted by KDD 2025 ADS Track.
  • 2024.09: One paper is accepted by NeurIPS 2024.
  • 2024.05: One paper is accepted by KDD 2024.

📝 Selected Publications

ICML 2026
BiTrajDiff preview

BiTrajDiff: Bidirectional Trajectory Generation with Diffusion Models for Offline Reinforcement Learning

Yunpeng Qing, Yixiao Chi, Shuo Chen, Shunyu Liu, Kexuan Zhou, Sixu Lin, Litao Liu, Changqing Zou

[Paper] [Code]

NeurIPS 2024
A2PO preview

A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective

Yunpeng Qing, Shunyu Liu, Jingyuan Cong, Kaixuan Chen, Yihe Zhou, Mingli Song

[Paper] [Code]

IEEE TITS 2025
CSIRL preview

Curricular Subgoals for Inverse Reinforcement Learning

Shunyu Liu, Yunpeng Qing, Shuqi Xu, Hongyan Wu, Jiangtao Zhang, Jingyuan Cong, Tianhao Chen, Yunfu Liu, Mingli Song

[Paper] [Code]

ICML 2026
CADP preview

DyGRO-VLA: Cross-Task Scaling of Vision-Language-Action Models via Dynamic Grouped Residual Optimization

Sixu Lin, Yunpeng Qing, Litao Liu, Ming Zhou, Ruixing Jin, Xiaoyi Fan, Guiliang Liu

[Paper] [Project]

AAMAS 2024
CADP preview

Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?

Yihe Zhou, Shunyu Liu, Yunpeng Qing, Kaixuan Chen, Tongya Zheng, Yanhao Huang, Jie Song, Mingli Song

[Paper] [Code]

arXiv 2023
Explainable RL survey preview

A survey on explainable reinforcement learning: Concepts, algorithms, challenges

Yunpeng Qing, Shunyu Liu, Jie Song, Huiqiong Wang, Mingli Song

[Paper] [Project]

🎖 Honors and Awards

  • 2023.06: Excellent Graduation Thesis.
  • 2021.09: Third-class Scholarship.
  • 2020.09: Third-class Scholarship.

📖 Education

  • 2023.09 - Present: Ph.D. student, College of Computer Science and Technology, Zhejiang University.
  • 2019.09 - 2023.06: B.E., College of Computer Science and Technology, Zhejiang University.

💻 Internship

  • Present: Intern student, ACE ROBOTICS.

🤝 Service

  • Conference Reviewer: NeurIPS, ICML (Silver Reviewer), ICLR, CVPR, ECCV, KDD, WWW.