Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

General Language Model Evaluation on OlmoBaseEval HeldOut

33.7LBPP Score

Nemo. 3 Nano

-0.7248.21317.1526.087Apr 3, 2026
Updated 12d ago

Evaluation Results

MethodLinks
2026.04
33.778.253.532.8
2026.04
31.168.650.740.8
2026.04
30.275.55034.3
2026.04
26.869.944.426.1
2026.04
26.276.550.147.6
2026.04
17.76437.223.6
2026.04
16.865.241.723.4
2026.04
5.853.327.617.3
2026.04
5.742.824.515.5
2026.04
0.624.611.47.5