Fact-Checking on LLM-AGGREFACT (test)
[Chart: Cost ($) by date. Data point shown: AlignScore, 0.2, Apr 16, 2024.]
Updated 4d ago
Evaluation Results
Method          Details                      Date      Cost ($)
AlignScore      Backbone Model=ROBERTa...    2024.04   0.2
MiniCheck-RBTA  Backbone Model=AlignSc...    2024.04   0.2
MiniCheck-DBTA  Backbone Model=DeBERTa...    2024.04   0.2
FT5-ANLI-L      Backbone Model=Flan-T5...    2024.04   0.24
MiniCheck-FT5   Backbone Model=Flan-T5...    2024.04   0.24
DAE             Backbone Model=ELECTRA...    2024.04   0.26
SummaC-ZS       Backbone Model=ALBERT-...    2024.04   0.85
SummaC-CV       Backbone Model=ALBERT-...    2024.04   0.85
QAFactEval      Backbone Model=multipl...    2024.04   1.87
T5-NLI-Mixed    Backbone Model=T5-XXL,...    2024.04   7.39