Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Claim Verification on FEVEROUS
Loading...
0.9567
Accuracy
InfoRE + CoT
0.620988
0.708144
0.7953
0.882456
Apr 22, 2024
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
InfoRE + CoT
Backbone=GPT-4, Zero-s...
2024.04
0.9567
InfoRE
Backbone=GPT-4, Zero-s...
2024.04
0.9562
CoT
Backbone=GPT-4, Zero-s...
2024.04
0.9267
Standard
Backbone=GPT-4, Zero-s...
2024.04
0.9233
InfoRE + CoT
Backbone=GPT-3.5, Zero...
2024.04
0.9153
InfoRE
Backbone=GPT-3.5, Zero...
2024.04
0.9131
CoT
Backbone=GPT-3.5, Zero...
2024.04
0.8867
Standard
Backbone=GPT-3.5, Zero...
2024.04
0.8767
InfoRE + CoT
Backbone=LLAMA2-70B, Z...
2024.04
0.6812
InfoRE
Backbone=LLAMA2-70B, Z...
2024.04
0.6784
CoT
Backbone=LLAMA2-70B, Z...
2024.04
0.6453
Standard
Backbone=LLAMA2-70B, Z...
2024.04
0.6339
Feedback
Search any
task
Search any
task