Long-form Question Answering on HQ2A
[Chart: Comprehensiveness by date. Plotted method: Error-Informed Refinement (EIR), Jul 16, 2024. Metrics: Comprehensiveness, Overall Score.]
Updated 4d ago
Evaluation Results

Method                           Setting                    Date     Comprehensiveness  Overall Score
Error-Informed Refinement (EIR)  Preference Category=Re...  2024.07  100                92.16
Baseline                         Preference Category=Ba...  2024.07  0                  7.84
Tie                              Preference Category=Ti...  2024.07  0                  0