Share your thoughts, 1 month free Claude Pro on usSee more

Mathematical Computation on MathQA

52.34Exact Match (EM)

Prompt-R1

Updated 4mo ago

Evaluation Results

Method	Links
Prompt-R1 2025.11		52.34	61.59
CoT Reasoning 2025.11		49.22	57.03
GRPO 2025.11		46.88	54.43
Baseline 2025.11		46.09	54.04
TextGrad 2025.11		44.53	61.46
OPRO 2025.11		43.75	60.08
GEPA 2025.11		40.63	61.59
Baseline 2025.11		28.91	32.29
CoT Reasoning 2025.11		27.34	30.6
SFT 2025.11		17.97	22.66