Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AQUA-RAT

Benchmarks

Task NameDataset NameSOTA ResultTrend
Mathematical ReasoningAQUA-RAT
Accuracy91.73
120
Mathematical ReasoningAQuA-RAT (test)
Accuracy83
40
Mathematical CalculationAQuA-RAT
Accuracy (AQuA-RAT)83.1
20
Algebraic ReasoningAQuA-RAT
Accuracy66.8
16
Algebraic Question AnsweringAQUA-RAT Synthetic NIID 1.0 (test)
Accuracy28
7
Algebraic Question AnsweringAQUA-RAT Synthetic IID 1.0 (test)
Accuracy29.9
7
Mathematical ReasoningAQUA-RAT STREET
Answer Accuracy78
3
Mathematical ReasoningAQUA-RAT standard (test)
ACC (%)40.16
3
Algebraic ReasoningAQuA-RAT
PTR Advance Rate100
2
Showing 9 of 9 rows