Share your thoughts, 1 month free Claude Pro on usSee more

Advanced Reasoning on ruMATH-500

97.2Accuracy

DeepSeek-R1

Updated 4mo ago

Evaluation Results

Method	Links
DeepSeek-R1 2025.12		97.2
o4-mini (medium) 2025.12		95.8
T-pro 2.0 2025.12		94
Qwen3-32B 2025.12		93.8
DeepSeek-R1-Distill-Qwen-32B 2025.12		89.8
DeepSeek-V3 2025.12		88.2
Gemma 3 27B 2025.12		86
GPT-4o 2025.12		76.6
GigaChat 2 Max 2025.12		70.2
YandexGPT5-Pro 2025.12		68.2
RuadaptQwen3-32B-Instruct 2025.12		45