Share your thoughts, 1 month free Claude Pro on usSee more

Reasoning & General on HLE Full

0.502Score (%)

Kimi K2.5

Updated 3mo ago

Evaluation Results

Method	Links
Kimi K2.5 2026.02		0.502
Gemini 3 Pro 2026.02		0.458
GPT-5.2 (xhigh) 2026.02		0.455
Claude Opus 4.5 2026.02		0.432
DeepSeek-V3.2 2026.02		0.408
Gemini 3 Pro 2026.02		0.375
GPT-5.2 (xhigh) 2026.02		0.345
Claude Opus 4.5 2026.02		0.308
Kimi K2.5 2026.02		0.301
DeepSeek-V3.2 2026.02		0.251