Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Numerical Reasoning on DROP (dev)
Loading...
85.2
EM
POET-SQL_T5
74.7792
77.4846
80.19
82.8954
Jan 27, 2022
Oct 3, 2022
Jun 10, 2023
Feb 15, 2024
Oct 21, 2024
Jun 28, 2025
Mar 5, 2026
EM
F1
Updated 1mo ago
Evaluation Results
Method
Method
Links
EM
F1
POET-SQL_T5
Model size=11B, Base m...
2022.01
85.2
87.6
AeNER
2026.03
83.72
86.56
CONE
2026.03
83.71
86.7
T5-11B
Model size=11B
2022.01
83.5
85.9
NumNet
2026.03
83.34
86
NC-BERT
2026.03
75.18
77.68
Feedback
Search any
task
Search any
task