
Factuality Evaluation on Long-form summarization factuality dataset (test)

66.2 Balanced Accuracy

FENICE

[Chart: Balanced Accuracy over time, with a data point at Mar 4, 2024]
Updated 4d ago

Evaluation Results

Date       Balanced Accuracy
2024.03    66.2
2024.03    65.7
2024.03    61.7
2024.03    61.3
2024.03    51.4