We-Math

Benchmarks

Task Name	Dataset Name	SOTA Result
Mathematical Reasoning	We-Math mini (test)	Accuracy86.4	31
Math Reasoning	We-Math	Avg Pass@885.6	26
Multi-step mathematical reasoning	We-Math (test)	S1 Score72.8	20
Mathematical & Geometric Reasoning	We-Math	Accuracy@877.7	16

Showing 4 of 4 rows