Short-form QA

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Short-form Question Answering | Short-form QA Aggregate (Avg.) (test) | EM: 35.93 | 5 |
| Faithfulness Evaluation | Short-Form QA | Faithfulness Correlation: 0.82 | 2 |