Claim Verification on HOVER 2-hop
Best reported accuracy: 76.69 (InfoRE + CoT), Apr 22, 2024
Updated 4d ago
Evaluation Results
Method         Details                      Date      Accuracy
InfoRE + CoT   Backbone=GPT-4, Zero-s...    2024.04   76.69
InfoRE         Backbone=GPT-4, Zero-s...    2024.04   75.87
CoT            Backbone=GPT-4, Zero-s...    2024.04   73.82
Standard       Backbone=GPT-4, Zero-s...    2024.04   72.4
InfoRE + CoT   Backbone=GPT-3.5, Zero...    2024.04   69.02
InfoRE         Backbone=GPT-3.5, Zero...    2024.04   68.21
CoT            Backbone=GPT-3.5, Zero...    2024.04   66.7
Standard       Backbone=GPT-3.5, Zero...    2024.04   64.74
InfoRE + CoT   Backbone=LLAMA2-70B, Z...    2024.04   53.2
InfoRE         Backbone=LLAMA2-70B, Z...    2024.04   52.83
CoT            Backbone=LLAMA2-70B, Z...    2024.04   50.02
Standard       Backbone=LLAMA2-70B, Z...    2024.04   49.41