
SINGLEOP

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Arithmetic Reasoning | SingleOP | Accuracy 97.3 | 9 |
| Math Reasoning | SingleOp (test) | Accuracy 97.86 | 8 |
| Mathematical Reasoning | SINGLEOP | Solve Rate 94.6 | 4 |
| Online Out-of-Distribution Detection | SingleOp Near-shift OOD | Accuracy 95.75 | 3 |