Question Answering on HotpotQA (test) (Comprehensive Metrics)

0.692 EM

HGN

[Chart: results over time, Jan 27, 2022 to Apr 14, 2025]
Updated 2d ago

Evaluation Results

Method  Links
2022.01  0.692  0.822  -  -  -  -  -  -  -
2022.01  0.687  0.816  -  -  -  -  -  -  -
2022.01  0.68   0.812  -  -  -  -  -  -  -
2022.01  0.677  0.808  -  -  -  -  -  -  -
2022.01  0.674  0.812  -  -  -  -  -  -  -
2022.01  0.665  0.797  -  -  -  -  -  -  -
2022.01  0.648  0.792  -  -  -  -  -  -  -
2022.01  0.591  0.734  -  -  -  -  -  -  -
2022.01  0.557  0.693  -  -  -  -  -  -  -
2025.04  0.1630.25480.26050.30522.5242.5531.263194.39512.273
2025.04  0.1240.22360.22610.30233.0243.9851.765200.39813.562
2025.04  0.1110.19510.19310.30032.9044.3862.094302.89419.146