Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

TRUE

Benchmarks

Task NameDataset NameSOTA ResultTrend
Factual Consistency EvaluationTRUE benchmark
PAWS (AUC-ROC)98.4
37
Factual Consistency EvaluationTRUE 1.0 (test)
Frank AUC91.5
20
Showing 2 of 2 rows