
Olmo

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Membership Inference Attack | OLMo near-IID Dolma 3 (test) | AUC: 0.723 | 13 |
| Training Data Attribution | Olmo-7B | Tail-patch (%): 98.6 | 5 |
| General Language Evaluation | OLMo-2 Held-out Evals | AGIEval Score: 24.4 | 2 |
| Question Answering | OLMo Benchmarks 2 (dev) | NQ Score: 16.1 | 2 |
| Language Modeling | OLMo (val) | Base CE: 2.24 | 1 |