Question Answering on SDQA
[Chart: Accuracy over time. Leading entry: APin, 37.79 Accuracy; x-axis date: Apr 8, 2026.]
Updated 9d ago
Evaluation Results
| Method  | Configuration             | Date    | Accuracy |
|---------|---------------------------|---------|----------|
| APin    | Setting=Aggressive (τi... | 2026.04 | 37.79    |
| APdeep  | Setting=Aggressive (τi... | 2026.04 | 37.79    |
| DAP     | Setting=Aggressive (τi... | 2026.04 | 37.61    |
| APdeep  | Setting=Conservative (... | 2026.04 | 37.61    |
| DAP     | Setting=Conservative (... | 2026.04 | 36.89    |
| Vanilla | lin=-, ldeep=-, FRR=10... | 2026.04 | 36.71    |
| APin    | Setting=Conservative (... | 2026.04 | 36.71    |
| DAP     | lin=✓, ldeep=✓, FRR=14... | 2026.04 | 29.66    |
| DAP     | lin=✓, ldeep=✓, FRR=33... | 2026.04 | 27.85    |
| APin    | lin=✓, ldeep=-, FRR=93... | 2026.04 | 27.67    |
| APdeep  | lin=-, ldeep=✓, FRR=14... | 2026.04 | 27.49    |
| Vanilla | lin=-, ldeep=-, FRR=10... | 2026.04 | 27.31    |
| APdeep  | lin=-, ldeep=✓, FRR=33... | 2026.04 | 26.94    |
| APin    | lin=✓, ldeep=-, FRR=78... | 2026.04 | 26.29    |