Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Fact-driven Question Answering on HotpotQA (F1, EM)

60.78F1 Score

w/t BoT

49.568852.479455.3958.3006May 20, 2026
Updated 12d ago

Evaluation Results

MethodLinks
2026.05
60.7847.27
2026.05
59.9746.8
2026.05
57.6244.31
2026.05
57.3344.18
2026.05
57.0343.75
2026.05
5037.06