Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Open-domain QA on HotpotQA
Loading...
47
QA-F1
UnifiedQA-3b
0.616
12.658
24.7
36.742
Jul 19, 2023
QA-F1
Updated 4d ago
Evaluation Results
Method
Method
Links
QA-F1
UnifiedQA-3b
parameters=3B, Protoco...
2023.07
47
UnifiedQA-large
parameters=770M, Proto...
2023.07
42.7
UnifiedQA-base
parameters=220M, Proto...
2023.07
40.3
UnifiedQA-3b
parameters=3B, Protoco...
2023.07
17
UnifiedQA-large
parameters=770M, Proto...
2023.07
14.2
UnifiedQA-base
parameters=220M, Proto...
2023.07
13.1
OPT-30b
parameters=30B, Protoc...
2023.07
11.4
T5-base
parameters=220M, Proto...
2023.07
9.1
T5-3b
parameters=3B, Protoco...
2023.07
6.8
T5-large
parameters=770M, Proto...
2023.07
6.6
T5-base
parameters=220M, Proto...
2023.07
6
GPT-J
parameters=6B, Protoco...
2023.07
5.9
T5-large
parameters=770M, Proto...
2023.07
5.1
GPT-J
parameters=6B, Protoco...
2023.07
5.1
T5-3b
parameters=3B, Protoco...
2023.07
4.9
OPT-30b
parameters=30B, Protoc...
2023.07
2.4
Feedback
Search any
task
Search any
task