Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Confidence Calibration in Retrieval-Augmented Generation on NQ k=5 OOD (test)

0.248ECE

NAACL

0.239760.295380.3510.40662Jan 16, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
0.2480.75
2026.01
0.2650.674
2026.01
0.2730.653
2026.01
0.2760.628
2026.01
0.2890.667
2026.01
0.290.605
2026.01
0.3040.641
2026.01
0.3130.696
2026.01
0.3220.706
2026.01
0.3250.694
2026.01
0.3290.604
2026.01
0.3340.693
2026.01
0.3350.667
2026.01
0.3350.64
2026.01
0.3510.59
2026.01
0.3520.67
2026.01
0.3710.645
2026.01
0.3730.621
2026.01
0.3760.625
2026.01
0.3760.477
2026.01
0.3980.597
2026.01
0.4020.686
2026.01
0.4350.608
2026.01
0.4540.627