NUPA

Benchmarks

Task Name             Dataset Name        SOTA Result         Trend
Numerical Generation  NUPA                Exact Match: 83.7   28
Numerical Reasoning   NUPA (aggregated)   Exact Match: 72.4   4