Hello! I am Yunpeng Qing, a third-year Ph.D. student in the College of Computer Science & the State Key Laboratory of CAD&CG at Zhejiang University, advised by Professor Changqing Zou. Previously, I worked closely with Shunyu Liu in the Visual Intelligence and Pattern Analysis (VIPA) Lab. I received my B.E. degree from the College of Computer Science and Technology, Zhejiang University in June 2023. I am currently an intern student at ACE ROBOTICS.

My research interests lie in Reinforcement Learning, especially Reinforcement Learning-enhanced Embodied AI. My research goal is to create robust and reliable general frameworks for agents to effectively use sequential decision-making information for behavior improvement. Towards this goal, my prior work has focused on methods that facilitate the application of reinforcement learning algorithms on a variety of static datasets.

Please feel free to contact me via yunpeng.qing.cs@gmail.com if you have any questions, are interested in collaborating, or just want to chat.

🔥 News

2026.05: Two papers are accepted by ICML 2026.
2025.01: One paper is accepted by IEEE TITS.
2024.12: One paper is accepted by AAMAS 2025.
2024.11: One paper is accepted by KDD 2025 ADS Track.
2024.09: One paper is accepted by NeurIPS 2024.
2024.05: One paper is accepted by KDD 2024.

📝 Selected Publications

ICML 2026

BiTrajDiff: Bidirectional Trajectory Generation with Diffusion Models for Offline Reinforcement Learning

Yunpeng Qing, Yixiao Chi, Shuo Chen, Shunyu Liu, Kexuan Zhou, Sixu Lin, Litao Liu, Changqing Zou

[Paper] [Code]

NeurIPS 2024

A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective

Yunpeng Qing, Shunyu Liu, Jingyuan Cong, Kaixuan Chen, Yihe Zhou, Mingli Song

[Paper] [Code]

IEEE TITS 2025

Curricular Subgoals for Inverse Reinforcement Learning

Shunyu Liu, Yunpeng Qing, Shuqi Xu, Hongyan Wu, Jiangtao Zhang, Jingyuan Cong, Tianhao Chen, Yunfu Liu, Mingli Song

[Paper] [Code]

ICML 2026

DyGRO-VLA: Cross-Task Scaling of Vision-Language-Action Models via Dynamic Grouped Residual Optimization

Sixu Lin, Yunpeng Qing, Litao Liu, Ming Zhou, Ruixing Jin, Xiaoyi Fan, Guiliang Liu

[Paper] [Project]

AAMAS 2024

Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL?

Yihe Zhou, Shunyu Liu, Yunpeng Qing, Kaixuan Chen, Tongya Zheng, Yanhao Huang, Jie Song, Mingli Song

[Paper] [Code]

arXiv 2023

A survey on explainable reinforcement learning: Concepts, algorithms, challenges

Yunpeng Qing, Shunyu Liu, Jie Song, Huiqiong Wang, Mingli Song

[Paper] [Project]

🎖 Honors and Awards

2023.06: Excellent Graduation Thesis.
2021.09: Third-class Scholarship.
2020.09: Third-class Scholarship.

📖 Education

2023.09 - Present: Ph.D. student, College of Computer Science and Technology, Zhejiang University.
2019.09 - 2023.06: B.E., College of Computer Science and Technology, Zhejiang University.

💻 Internship

Present: Intern student, ACE ROBOTICS.

🤝 Service

Conference Reviewer: NeurIPS, ICML (Silver Reviewer), ICLR, CVPR, ECCV, KDD, WWW.