General Model Performance Evaluation on Aggregate Model Evaluation (16 benchmarks)

Average Score: 45.8

OLMo-2-0425-1B

[Chart: y-axis ticks 42.888, 43.644, 44.4, 45.156; date: Sep 27, 2025]
Updated 1mo ago

Evaluation Results

Method    Links
2025.09   45.8
2025.09   43