Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multi-hop Question Answering on Musique 2-hop (dev)
Loading...
15.2
Exact Match Accuracy
Self-ask + Search
0.952
4.651
8.35
12.049
Oct 7, 2022
Exact Match Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Exact Match Accuracy
Self-ask + Search
Model=Davinci-002
2022.10
15.2
Self-ask
Model=Davinci-002
2022.10
13.8
Chain of Thought
Model=Davinci-002
2022.10
12.6
Search + postproc.
Model=Davinci-002
2022.10
6.5
Direct prompting
Model=Davinci-002
2022.10
5.6
Search
Model=Davinci-002
2022.10
1.5
Feedback
Search any
task
Search any
task