Question Answering on EXPERTQA (test)
[Chart: Claim Recall. Highlighted result: ChatGPT, 19.27 (Feb 6, 2024). Metrics available: Claim Recall, Citation Recall, Citation Precision, MAUVE.]
Evaluation Results
Method                Details                    Date     Claim Recall  Citation Recall  Citation Precision  MAUVE
ChatGPT               Learning Mode=ICL          2024.02  19.27         47.79            47.3                48.68
M_dist + f.g.RL       Refinement=Fine-graine...  2024.02  15.53         49.73            51.11               45.92
M_dist + f.g.RS       Refinement=Fine-graine...  2024.02  15.48         59.46            57.58               44.67
M_dist                Type=Distilled baseline    2024.02  15.28         49.03            46.22               40.63
M_dist + f.g.(RS+RL)  Refinement=Combined RS...  2024.02  15.23         58.94            59.8                42.13
LLAMA-2-7B            Learning Mode=ICL          2024.02  10.32         10.09            7.79                34.27