Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Automated Fact Checking on Coverbench
Loading...
85.8
Macro F1
CAAFC
82.368
83.259
84.15
85.041
May 12, 2026
Macro F1
Accuracy
Updated 21d ago
Evaluation Results
Method
Method
Links
Macro F1
Accuracy
CAAFC
Backbone Model=GPT-OSS...
2026.05
85.8
85.9
CAAFC
Backbone Model=LLAMA3....
2026.05
85.4
85.5
CAAFC
Backbone Model=GEMMA3-27B
2026.05
82.5
82.9
Feedback
Search any
task
Search any
task