Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

General-R

Benchmarks

Task NameDataset NameSOTA ResultTrend
General ReasoningGeneral-R MMLU-stem, ARC-challenge (test)
Accuracy61.8
24
Showing 1 of 1 rows