| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multiple-choice Question Answering | Text-only Adaptive Benchmark (ALL) | Pass@1 Acc81 | 5 | |
| Multiple-choice Question Answering | Text-only Adaptive Benchmark L5 | Pass@1 Accuracy65 | 5 | |
| Multiple-choice Question Answering | Text-only Adaptive Benchmark L4 | Pass@1 Accuracy81 | 5 | |
| Multiple-choice Question Answering | Text-only Adaptive Benchmark L3 | Pass@1 Accuracy88 | 5 | |
| Multiple-choice Question Answering | Text-only Adaptive Benchmark L1 | Pass@1 Accuracy93 | 5 |