
Instruction-Following, Mathematics, and Commonsense Reasoning

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
General LLM Evaluation | Instruction-Following, Mathematics, and Commonsense Reasoning Combined | Average Score: 57 |
18