Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Long-form Question Answering refinement on ASQA (test)
Loading...
16.63
Error Rate (%)
EIR
15.8944
20.8597
25.825
30.7903
Jul 16, 2024
Error Rate (%)
Error Score
Precision
Recall
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Error Rate (%)
Error Score
Precision
Recall
F1 Score
EIR
Feedback=Fine-grained...
2024.07
16.63
0.51
0.73
0.82
0.77
Generic
Feedback=Coarse-graine...
2024.07
18.67
0.61
0.72
0.75
0.74
Improve
Feedback=Coarse-graine...
2024.07
20.85
0.68
0.7
0.71
0.7
Baseline
Type=Original dataset...
2024.07
34.81
1.2
-
-
-
Zero-shot
Model=LLaMA2-13B-chat,...
2024.07
35.02
1.08
0.5
0.62
0.55
Feedback
Search any
task
Search any
task