Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Question Answering on NarrativeQA Helmet benchmark

49.5F1 Score

MiA-Emb-8B + MiA-Gen-14B

34.21238.18142.1546.119Dec 19, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
49.529.8
2025.12
48.728.9
2025.12
46.5-
2025.12
43.1-
2025.12
42.8-
2025.12
39.120.4
2025.12
38.921.9
2025.12
36.718.2
2025.12
34.817.7