Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Long-form Question Answering on ALCE LFQA
Loading...
38.6
ROUGE-L
ATTR. FIRST_CoT
24.144
27.897
31.65
35.403
Mar 25, 2024
ROUGE-L
BERTScore
AutoAIS
Output Length
No Attention (%)
Updated 4d ago
Evaluation Results
Method
Method
Links
ROUGE-L
BERTScore
AutoAIS
Output Length
No Attention (%)
ATTR. FIRST_CoT
Evaluation Protocol=IC...
2024.03
38.6
90.7
89.3
48.2
0
ATTR. FIRST
Evaluation Protocol=ICL
2024.03
35.8
90.5
78.7
65.2
0
ALCE
Evaluation Protocol=ICL
2024.03
35.2
89.9
49.8
2,153.3
26.9
GEMINI
Evaluation Protocol=ICL
2024.03
33.1
89.6
-
-
-
PRIMERA
Evaluation Protocol=FT
2024.03
32.2
88.8
-
-
-
ATTR. FIRST_joint
Evaluation Protocol=FT...
2024.03
27
88.1
44.9
33.1
0
ATTR. FIRST
Evaluation Protocol=FT
2024.03
24.7
88
52.7
20.9
0
Feedback
Search any
task
Search any
task