
LLM-AGGREFACT

Benchmarks

Task Name                            | Dataset Name          | SOTA Result   | Trend
Faithfulness Hallucination Detection | LLM-AggreFact Refined | Agg-CNN: 86.8 | 14
Factuality Evaluation                | LLM-AggreFact (test)  | CNN Score: 69.9 | 13
Fact-Checking                        | LLM-AGGREFACT (test)  | Cost ($): 0.2 | 10