| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| FrontierScience-Olympiad | ESC sweep | Token Efficiency Ratio (B_method/BMV) | 5.59 | 27 | 23d ago |
| SciBench-107 | Claude-Opus-4.5 | Atkins Score | 62.5 | 24 | 3mo ago |
| SciBench | Entropy-Tree | Pass@20 | 77.46 | 17 | 3mo ago |
| SciBench | Min-p Sampling | Accuracy | 4 | 12 | 21d ago |
| SCIREAS Suite | - | GPQA | 55.8 | 9 | 5d ago |
| SciBench | Meta-reasoner | Diff Accuracy | 65.47 | 6 | 26d ago |
| FrontierScience Olympiad N=20 (test) | - | - | - | 0 | 3mo ago |