About me
I am currently a Technical Director at Tencent, Shenzhen, China.
I graduated with a bachelor’s degree from Sun Yat-sen University in 2006 and a master’s degree from the Institute of Computing Technology, Chinese Academy of Sciences in 2009.
I has accumulated rich practical experience in two key areas: AI Infrastructure optimization and multi-agent reinforcement learning.
My core achievement is leading the development of Tencent Kai Wu Platform, China’s first self-developed open platform for multi-agent reinforcement learning. This platform has significantly improved the R&D efficiency of decision making AI. Meanwhile, I has promoted Reinforcement Learning Systems ecosystem, organizing the Tencent AI Arena Global Open Challenge for many consecutive years. The challenge has attracted hundreds of universities worldwide and cultivated tens of thousands of students. I has also released China’s first industry standard “Reinforcement Learning Systems”.
My research interests include LLM agents, deep reinforcement learning, high performance computing and their commercial applications.
Publications
please visit my google scholar
- Second Author, T/CCF 0006-2025 Reinforcement Learning system part 1: General requirements, China Computer Federation (CCF) Group Standard
- First Author, T/CCF 0007-2025 Reinforcement Learning system part 2: Technical requirements for reinforcement learning environment, China Computer Federation (CCF) Group Standard
Talks
Multi agent Reinforcement Learning for Gaming Industry
2024 Conference on Multi-Agent Applications In China, chengdu china, 2024/11/16The Continuous Path of Tencent Kai Wu Platform
China National Computer Congress 2022, Online, 2022/12/8Challenges and Opportunities of Tencent Game AI Platform
China National Computer Congress 2021, shenzhen china, 2021/10/30
Awards
- 2024 Edge-Device Large Language Model Competition 1st Place, NeurIPS 2024 Competition Track, supervisor of The Team Tinytron, leaderboard, code, technical report
- 2024 Tencent Sustainable Social Value Award - Tencent Kai Wu Platform Serving National AI Talent Cultivation, Tencent, Project Leader
- 2022 AutoML Decathlon 2nd Place, NeurIPS 2022 Competition Track, Team TEG-AutoML, technical report
- 2021 Tencent Business Breakthrough Award - Tencent Kai Wu Platform Team, Tencent, Project Leader
- 2019 Tencent Business Breakthrough Award - Tencent Jue wu AI Project Team, Tencent, Training-Infrastructure Leader
- 2016 Tencent Technological Breakthrough Award - Wechat search Project, Tencent, Senior developer
- 2014 Tencent Outstanding R&D Award - Tencent Video search Project, Tencent, Senior developer
- 2010 Tencent Outstanding R&D Award - Tencent Web search Project, Tencent, Senior developer
Honors
- Expert member of the AI-Enabled Talent Cultivation Exploration Group at Sun Yat-sen University, 2025
- Visiting Researcher at the Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, 2021-2023
Education
M.S. in Computer Science, 2006 - 2009
Institute of Computing Technology, Chinese Academy of Sciences, ChinaB.S. in Computer Science, 2002 - 2006
Sun Yat-sen University, China
