DCLM

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Zero-shot Evaluation | DCLM CORE V2 | CORE_V2 Score: 48 | 17 |
| Language Model Evaluation | DCLM Core | DCLM Core Score: 49.3 | 12 |
| Language Modeling | DCLM | Loss: 1.863 | 11 |
| Natural Language Understanding | DCLM | DCLM Core Score: 48.9 | 9 |
| Language Modeling | DCLM benchmark | Macro Avg Value: 14.58 | 9 |
| Language Understanding | DCLM evaluation suite (test) | HellaSwag Accuracy: 62.9 | 7 |
| Zero-shot Language Evaluation | DCLM Pro | WinoGrande: 57.93 | 2 |