Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Long-form Question Answering refinement on ELI5 (test)
Loading...
0.0381
Error Rate
EIR
0.030452
0.082076
0.1337
0.185324
Jul 16, 2024
Error Rate
Error Score
Precision
Recall
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Error Rate
Error Score
Precision
Recall
F1 Score
EIR
Feedback=Fine-grained...
2024.07
0.0381
0.13
0.88
0.96
0.92
Generic
Feedback=Coarse-graine...
2024.07
0.0606
0.22
0.84
0.91
0.87
Zero-shot
Model=LLaMA2-13B-chat,...
2024.07
0.0961
0.27
0.74
0.89
0.81
Improve
Feedback=Coarse-graine...
2024.07
0.1005
0.36
0.75
0.86
0.8
Baseline
Type=Original dataset...
2024.07
0.2293
0.82
-
-
-
Feedback
Search any
task
Search any
task