Long-form Question Answering on KILT ELI5 (dev test)
Chart: RL Score and F1 Score over time. Best RL Score: 26.3 (KID, May 24, 2023).
Updated 4d ago
Evaluation Results
Method     Model Size   Date      RL Score   F1 Score
KID        406M         2023.05   26.3       -
c-REALM    596M         2023.05   23.2       22.9
EMAT       446M         2023.05   20.91      19.03