Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multi-hop reasoning on 2WikiMultihopQA

48.44Exact Match (EM)

Prompt-R1

17.562425.578733.59541.6113Nov 2, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.11
48.4454.41
2025.11
43.7549.13
2025.11
41.4142.62
2025.11
41.4146.27
2025.11
34.3835.05
2025.11
33.5936.57
2025.11
28.1329.32
2025.11
2535.96
2025.11
21.8824.17
2025.11
18.7527.5