| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| SLR-Bench Hard tier 250 tasks 1 | DF + CC + QO | Accuracy58.4 | 13 | 26d ago | |
| SLR-Bench Medium tier 250 tasks 1 | DF + CC | Accuracy88.8 | 13 | 26d ago | |
| SLR-Bench Easy tier 1 (250 tasks) | Accuracy100 | 13 | 26d ago | ||
| SLR-Bench Basic tier 250 tasks 1 | gpt-5† | Accuracy100 | 13 | 26d ago | |
| SLR-Bench Overall 1,000 tasks (full) | DF + CC + QO | Accuracy (%)86.7 | 13 | 26d ago |