
BIOS

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Text Classification | BIOS | Task Accuracy: 84.6 | 32 |
| Factuality | BIOS | Factuality: 56 | 28 |
| Long-form generation factuality and uncertainty estimation | Bios (test) | FA: 71.4 | 14 |
| Factual Precision Evaluation | Bios | FACTSCORE: 83 | 10 |
| Attribute-conditional generation | BIOS | Control Accuracy: 99.2 | 5 |