Reasoning Benchmark Suite

Benchmarks

Task Name   Dataset Name                          SOTA Result           Trend
Reasoning   Reasoning Benchmark Suite Aggregate   Average Score: 59.44  26