Formula Reasoning on Formula
[Chart: Task Score. Highlighted point: GEPA-C, 73.5 (May 20, 2026). Alternate view: Brier Score.]
Updated 12d ago
Evaluation Results
| Method | Configuration | Date | Task Score | Brier Score |
| --- | --- | --- | --- | --- |
| GEPA-C | Optimizer LLM=GPT-5 | 2026.05 | 73.5 | 0.262 |
| GEPA-C | Optimizer LLM=GPT-5-mini | 2026.05 | 73.5 | 0.262 |
| GEPA-C | Optimizer LLM=Gemini-3... | 2026.05 | 73.5 | 0.262 |
| GEPA-C | Optimizer LLM=Gemini-3... | 2026.05 | 73.5 | 0.262 |
| RPT | Optimizer LLM=GPT-5 | 2026.05 | 72.3 | 0.272 |
| RPT | Optimizer LLM=GPT-5-mini | 2026.05 | 72.3 | 0.272 |
| RPT | Optimizer LLM=Gemini-3... | 2026.05 | 72.3 | 0.272 |
| RPT | Optimizer LLM=Gemini-3... | 2026.05 | 72.3 | 0.272 |
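
The results above report a Brier Score next to each Task Score. As a rough aid for reading that column, here is a minimal sketch of the standard binary Brier score (mean squared difference between predicted probabilities and 0/1 outcomes, lower is better). This is an assumption about the general metric only: the page does not say how this benchmark computes or aggregates it, and the function name and sample data below are hypothetical.

```python
from typing import Sequence


def brier_score(probs: Sequence[float], outcomes: Sequence[int]) -> float:
    """Standard binary Brier score: mean of (p - o)^2 over all predictions.

    Assumption: this is the textbook definition, not necessarily the exact
    variant used by the leaderboard above.
    """
    if len(probs) != len(outcomes):
        raise ValueError("probs and outcomes must have the same length")
    if len(probs) == 0:
        raise ValueError("need at least one prediction")
    return sum((p - o) ** 2 for p, o in zip(probs, outcomes)) / len(probs)


# Hypothetical data for illustration only (not taken from the benchmark).
example_probs = [0.9, 0.2, 0.6, 0.5]
example_outcomes = [1, 0, 1, 0]
example_score = brier_score(example_probs, example_outcomes)

# A forecaster that always answers 0.5 scores exactly 0.25 on binary outcomes.
uninformed_score = brier_score([0.5, 0.5], [0, 1])
```

Under this definition a constant 0.5 forecast scores 0.25, so lower values indicate better-calibrated predictions.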