Factual Claim Detection on PoliClaim: perfectly consistent samples S_con (test)
[Chart: Agreement on S_con (test). Top entry: AFaCTA, 83.3 (Feb 16, 2024).]
Updated 4d ago
Evaluation Results
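| Method  | Setting                   | Date    | Agreement | Accuracy |
|---------|---------------------------|---------|-----------|----------|
| AFaCTA  | Backbone=GPT-4            | 2024.02 | 83.3      | 98.49    |
| AFaCTA  | Backbone=GPT-3.5          | 2024.02 | 75.4      | 90.4     |
| Experts | Subset Reference=GPT-3... | 2024.02 | 74.6      | 93.79    |
| Experts | Subset Reference=GPT-4... | 2024.02 | 74.3      | 94.85    |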