Automated Fact Checking on Factors
[Chart: Macro F1 over time. Top entry: CAAFC, 70.8 Macro F1, May 12, 2026. Metrics shown: Macro F1, Accuracy.]
Updated 21d ago
Evaluation Results
Method  Configuration               Date     Macro F1  Accuracy
CAAFC   Backbone Model=LLAMA3....   2026.05  70.8      93.1
CAAFC   Backbone Model=GEMMA3-27B   2026.05  69.3      91.9
CAAFC   Backbone Model=GPT-OSS...   2026.05  54        77.4