| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Membership Inference Attack | OLMo near-IID Dolma 3 (test) | AUC0.723 | 13 | |
| Training Data Attribution | Olmo-7B | Tail-patch (%)98.6 | 5 | |
| General Language Evaluation | OLMo-2 Held-out Evals | AGIEval Score24.4 | 2 | |
| Question Answering | OLMo Benchmarks 2 (dev) | NQ Score16.1 | 2 | |
| Language Modeling | OLMo (val) | Base CE2.24 | 1 |