| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Science Question Answering | ARC-C non-EU languages (test) | Accuracy 91.5 | 16 |
| Science Question Answering | ARC-C German (test) | Accuracy 91 | 15 |
| Knowledge recall | ARC-c (test) | Accuracy (ARC-c test) 83.73 | 13 |
| Scientific Reasoning | ARC-C 0-shot (test) | Pass@1 Acc 83.87 | 8 |
| Explanation self-consistency | ARC-C (test) | Accuracy 77.57 | 4 |
| Knowledge Injection | ARC-C | Learned Accuracy - | 0 |