Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Question Answering on ACE-whQA
Loading...
79.02
EM
GPT-3.5
24.1808
38.4179
52.655
66.8921
Jul 6, 2023
EM
F1
AUC
Updated 4d ago
Evaluation Results
Method
Method
Links
EM
F1
AUC
GPT-3.5
Verifier=true
2023.07
79.02
80.91
0.84
GPT-3.5 + ALIGN Verifier
Verifier=ALIGN, Unansw...
2023.07
79.02
80.91
0.84
FLAN T5
Verifier=true
2023.07
75.75
77.6
0.9
FLAN T5 + ALIGN Verifier
Verifier=ALIGN, Unansw...
2023.07
75.75
77.6
0.9
GPT-3.5
Verifier=false
2023.07
67.98
71.98
0.77
GPT-3.5
mode=Zero-shot
2023.07
67.98
71.98
0.77
Electra
Verifier=false
2023.07
52.32
55.59
0.87
Flan T5
Verifier=false
2023.07
26.29
29.24
0.51
FLAN T5
mode=Zero-shot
2023.07
26.29
29.24
0.51
Feedback
Search any
task
Search any
task