| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Classification | LD UCI Repository (test) | Accuracy68.1 | 6 | |
| MCQ Classification | LD 3 v1 (Eva) | Accuracy100 | 6 | |
| MCQ Classification | LD 3 v1 (infer) | Accuracy100 | 6 | |
| Logical Reasoning | LD (val) | Accuracy78.33 | 5 | |
| Language Modeling | LD-S | Perplexity8.66 | 4 |