Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

AddSub

Benchmarks

Task NameDataset NameSOTA ResultTrend
Arithmetic ReasoningAddSub
Accuracy98.2
76
Mathematical ReasoningADDSUB
Solve Rate93.1
22
Arithmetic ReasoningAddSub (test)
Accuracy96.71
8
Online Out-of-Distribution DetectionAddSub Near-shift OOD
Accuracy79.16
3
Showing 4 of 4 rows