
StratQA

Benchmarks

Task Name          Dataset Name  SOTA Result         Trend
General Reasoning  StratQA       Accuracy: 87.8      91
Reason             StratQA       Accuracy (%): 92.2  2