Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Open-domain question answering on NQ (val)
Loading...
30.6
EM
Llama2-7b CBQA w. Mixed Training + Recitation
15.728
19.589
23.45
27.311
Mar 12, 2024
EM
F1
Rec-BLEU
Updated 4d ago
Evaluation Results
Method
Method
Links
EM
F1
Rec-BLEU
Llama2-7b CBQA w. Mixed Training + Recitation
Model=Llama2-7b, Setup...
2024.03
30.6
39.8
17.4
Llama2-7b + Recitation
Model=Llama2-7b, Setup...
2024.03
30.3
40.1
15.7
Llama2-7b CBQA w. Mixed Training
Model=Llama2-7b, Setup...
2024.03
27.6
40.2
-
Llama2-7b Closed-book QA
Model=Llama2-7b, Setup...
2024.03
26.4
39
-
Qwen1.5-4b CBQA w. Mixed Training
Model=Qwen1.5-4b, Setu...
2024.03
21.2
29.4
-
Qwen1.5-4b Closed-book QA
Model=Qwen1.5-4b, Setu...
2024.03
19.7
27.8
-
Qwen1.5-4b CBQA w. Mixed Training + Recitation
Model=Qwen1.5-4b, Setu...
2024.03
17.4
24.1
13.6
Qwen1.5-4b + Recitation
Model=Qwen1.5-4b, Setu...
2024.03
16.3
24.1
11.4
Feedback
Search any
task
Search any
task