
HQ2A

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Long-form Question Answering | HQ2A | Comprehensiveness: 100 | 3
Sentence-level Error Detection | HQ2A 1.0 (test) | Exact Accuracy: 25.49 | 1