Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-hop Question Answering on Average (MuSiQue, 2WikiMQA, HotpotQA)

43.3EM

Graph + LLM

10.5419.04527.5536.055Oct 22, 2024Jan 26, 2025May 2, 2025Aug 7, 2025Nov 11, 2025Feb 15, 2026May 23, 2026
Updated 8d ago

Evaluation Results

MethodLinks
2024.10
43.356.9----
2024.10
41.354.8----
2024.10
40.354.6----
2024.10
39.453.6----
2024.10
3949.1----
2024.10
38.451.3----
2026.05
37.1--44.251.315.8
2024.10
36.748.5----
2024.10
36.350.5----
2024.10
35.449----
2026.05
35.4--39.141.526.7
2024.10
34.745.9----
2026.05
34.6--37.840.525.5
2026.05
33.6--38.247.814.9
2026.05
33.4--36.63924.6
2026.05
31--3741.414.6
2024.10
30.842.5----
2024.10
30.341.5----
2026.05
29.8--38.634.616.2
2026.05
29.8--38.634.616.2
2026.05
28.2--34.836.313.4
2024.10
28.140.2----
2024.10
23.836.5----
2026.05
19.7--29.923.55.8
2024.10
19.333.3----
2026.05
18.1--21.725.96.6
2026.05
11.8--13.314.97.2
2026.03
-12.9723.88---
2026.03
-14.5123.35---
2026.03
-53.5156.71---
2026.03
-24.1530.16---
2026.03
-44.746.97---
2026.03
-50.7352.94---
2026.03
-25.5627.17---
2026.03
-47.7449.92---
2026.03
-60.564.4---
2026.03
-43.2948.65---
2026.03
-63.0266.63---