Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Long Multiplication

Benchmarks

Task NameDataset NameSOTA ResultTrend
Arithmetic ReasoningLong Multiplication 2,3,4,5-digit (OOD)
Accuracy37.1
10
Showing 1 of 1 rows