Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AddSub

Benchmarks

Task NameDataset NameSOTA ResultTrend
Arithmetic ReasoningAddSub
Accuracy99
149
Mathematical ReasoningADDSUB
Solve Rate93.1
25
Arithmetic ReasoningAddSub (test)
Accuracy96.71
8
Mathematical ReasoningAddSub
Accuracy85.6
7
Online Out-of-Distribution DetectionAddSub Near-shift OOD
Accuracy79.16
3
Showing 5 of 5 rows