Planning on Commonly solved planning tasks F5-3 vs lm

Runtime Ratio: 0.33

DeepSeek

Updated 4mo ago

Evaluation Results

Method	Links	Runtime Ratio			
DeepSeek 2025.08		0.33	98	1.43	34
Qwen 2025.08		0.36	98	1.65	32
Llama 2025.08		0.37	98	1.77	39
GPT 2025.08		0.43	97	2.19	28