Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SingleEq

Benchmarks

Task NameDataset NameSOTA ResultTrend
Arithmetic ReasoningSingleEq
Accuracy98.8
43
Math reasoningSingleeq
EM0.8307
10
Mathematical ReasoningSingleEQ (test)
Accuracy99.01
4
Mathematical ReasoningSINGLEEQ
Solve Rate96.1
4
Arithmetic ReasoningSingleEq (test)
Accuracy0.795
4
Online Out-of-Distribution DetectionSingleEq Near-shift OOD
Accuracy93.15
3
Showing 6 of 6 rows