
Reflection of Episodes: Learning to Play Game from Expert and Self Experiences

About

StarCraft II is a complex and dynamic real-time strategy (RTS) game environment that is well suited to artificial intelligence and reinforcement learning research. To address the problem of Large Language Model (LLM) learning in complex environments through self-reflection, we propose a Reflection of Episodes (ROE) framework based on expert experience and self-experience. The framework first extracts key information from the game through a keyframe selection method, then makes decisions based on expert experience and self-experience. After a game is completed, it reflects on the previous experience to obtain new self-experience. In experiments, our method beat the bot at the Very Hard difficulty in TextStarCraft II. We analyze the LLM's in-game data in detail and verify the method's effectiveness.
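
The loop described above (select keyframes, decide using expert and self-experience, reflect after the game) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: `Frame`, `select_keyframes`, `build_prompt`, `play_episode`, and `reflect` are hypothetical names, the keyframe rule (keep a frame when its observation changed) is a placeholder for the unspecified selection method, and `llm` is assumed to be any callable that maps a prompt string to a response string.

```python
from dataclasses import dataclass


@dataclass
class Frame:
    step: int
    observation: dict  # hypothetical game summary, e.g. {"minerals": 50}


def select_keyframes(frames, threshold=1):
    """Keep a frame when at least `threshold` observation fields differ from
    the last kept frame. Placeholder rule, not the paper's selection method."""
    kept = []
    for frame in frames:
        if not kept:
            kept.append(frame)
            continue
        last = kept[-1].observation
        changed = sum(1 for k, v in frame.observation.items() if last.get(k) != v)
        if changed >= threshold:
            kept.append(frame)
    return kept


def build_prompt(frame, expert_experience, self_experience):
    """Combine both experience pools with the current state (assumed layout)."""
    lines = ["Expert experience:"]
    lines += [f"- {e}" for e in expert_experience]
    lines.append("Self-experience:")
    lines += [f"- {e}" for e in self_experience]
    lines.append(f"Current state (step {frame.step}): {frame.observation}")
    lines.append("Choose the next action.")
    return "\n".join(lines)


def play_episode(frames, llm, expert_experience, self_experience):
    """Query the LLM once per keyframe and record (step, action) pairs."""
    trajectory = []
    for frame in select_keyframes(frames):
        action = llm(build_prompt(frame, expert_experience, self_experience))
        trajectory.append((frame.step, action))
    return trajectory


def reflect(trajectory, outcome, llm, self_experience):
    """Ask the LLM for one lesson from the finished game and return an
    extended self-experience list (the input list is left unchanged)."""
    summary = "\n".join(f"step {s}: {a}" for s, a in trajectory)
    lesson = llm(
        f"The game ended in a {outcome}.\nActions taken:\n{summary}\n"
        "State one lesson for the next game."
    )
    return self_experience + [lesson]
```

In this sketch the self-experience list grows by one entry per game, so calling `play_episode` and then `reflect` in a loop carries lessons from earlier games into later prompts.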

Xiaojie Xu, Zongyuan Li, Chang Lu, Runnan Qi, Yanan Ni, Lumin Jiang, Xiangbei Liu, Xuebo Zhang, Yongchun Fang, Kuihua Huang, Xian Guo, Zhanghua Wu, Zhenya Li• 2025

Related benchmarks

Task | Dataset | Result | Rank
Game Playing | TextStarCraft II Harder Level-5 | Winning Rate: 10 | 2
Game Playing | TextStarCraft II VeryHard Level-6 | Winning Rate: 2 | 2
Game Playing | TextStarCraft II Hard Level-4 | Winning Rate: 10 | 2
Game Playing | TextStarCraft II Elite Level-7 | Winning Rate: 0 | 1
