Factual Claim Detection on PoliClaim Inconsistent samples S_inc (test)
[Chart: Agreement by date. Highlighted point: Experts, 63.6 Agreement, Feb 16, 2024. Selectable metrics: Agreement, Accuracy.]
Updated 4d ago
Evaluation Results
| Method | Setting | Date | Agreement | Accuracy |
|---|---|---|---|---|
| Experts | Subset Reference=GPT-3... | 2024.02 | 63.6 | 91.99 |
| Experts | Subset Reference=GPT-4... | 2024.02 | 62.9 | 90.79 |
| AFaCTA | Backbone=GPT-4 | 2024.02 | 41.8 | 74.64 |
| AFaCTA | Backbone=GPT-3.5 | 2024.02 | 33.1 | 65.8 |