Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Common Numeracy Benchmarks

Benchmarks

Task NameDataset NameSOTA ResultTrend
AdditionCommon Numeracy Benchmarks
RMSE [0, 10^2]0.64
5
DecodingCommon Numeracy Benchmarks
RMSE ([0, 10^2])0.3
5
List MaximumCommon Numeracy Benchmarks
Accuracy (Range $10^2$)98
5
Showing 3 of 3 rows