Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Fact Verification on FACTKG 1.0 (test)
Loading...
72.7
Accuracy
LLME (KG-GPT)
53.876
58.763
63.65
68.537
Jun 19, 2024
Accuracy
Std Dev
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Std Dev
LLME (KG-GPT)
few-shot=12-shot
2024.06
72.7
6.66
KELP
few-shot=12-shot
2024.06
69.2
0.38
KELP
few-shot=8-shot
2024.06
68.6
0.38
KELP
few-shot=4-shot
2024.06
68.5
0.38
LLME (KG-GPT)
few-shot=8-shot
2024.06
67.7
6.66
GPT (gpt-3.5-turbo-0613)
few-shot=12-shot
2024.06
64
5.26
LLME (KG-GPT)
few-shot=4-shot
2024.06
59.5
6.66
GPT (gpt-3.5-turbo-0613)
few-shot=8-shot
2024.06
55.2
5.26
GPT (gpt-3.5-turbo-0613)
few-shot=4-shot
2024.06
54.6
5.26
Feedback
Search any
task
Search any
task