Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Reasoning on Minerva (Avg@2)

57.7Avg@2

DARL

Updated 4mo ago

Evaluation Results

Method	Links
DARL 2026.01		57.7
RLPR 2026.01		56.5
RLVR 2026.01		54.9
TTRL 2026.01		52.8
Oat-Zero 2026.01		52.1
General Reasoner 2026.01		51.7
SimpleRL-Zoo 2026.01		51
Qwen2.5-7B-Inst 2026.01		49.4
SimpleRL-Zoo 2026.01		49.2
VeriFree 2026.01		49
PRIME 2026.01		45.5
RLPR 2026.01		39
DARL 2026.01		37.9
Qwen2.5-7B 2026.01		37.6
RLVR 2026.01		35.2
Llama3.1-8B-Inst 2026.01		32.7