Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Fact Verification on Creak
Loading...
0.956
Accuracy
TOG
0.87904
0.89902
0.919
0.93898
Apr 11, 2024
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
TOG
Knowledge Usage=With K...
2024.04
0.956
TOG-R
Knowledge Usage=With K...
2024.04
0.954
ODA
Knowledge Usage=With K...
2024.04
0.9519
Direct answering (GPT4)
Knowledge Usage=Withou...
2024.04
0.9452
Self-Consistency (GPT3.5)
Knowledge Usage=Withou...
2024.04
0.908
COT (GPT3.5)
Knowledge Usage=Withou...
2024.04
0.901
Direct answering (GPT3.5)
Knowledge Usage=Withou...
2024.04
0.9
RACo
Knowledge Usage=With K...
2024.04
0.882
Feedback
Search any
task
Search any
task