| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Logical Reasoning | SLR-BENCH Extended Leaderboard | LRL Score15.5 | 54 | |
| Logical Reasoning | SLR-BENCH (test) | LRL11.3 | 27 | |
| Logical Reasoning | SLR-BENCH | Overall LRL Score15.5 | 14 | |
| inductive Prolog rule synthesis | SLR-Bench Hard tier 250 tasks 1 | Accuracy58.4 | 13 | |
| inductive Prolog rule synthesis | SLR-Bench Medium tier 250 tasks 1 | Accuracy88.8 | 13 | |
| inductive Prolog rule synthesis | SLR-Bench Easy tier 1 (250 tasks) | Accuracy100 | 13 | |
| inductive Prolog rule synthesis | SLR-Bench Basic tier 250 tasks 1 | Accuracy100 | 13 | |
| inductive Prolog rule synthesis | SLR-Bench Overall 1,000 tasks (full) | Accuracy (%)86.7 | 13 |