Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Knowledge Conflict Resolution on CounterFact 500
Loading...
96.6
Accuracy
Conflict-Aware
92.648
93.674
94.7
95.726
Apr 26, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Conflict-Aware
Backbone=Mistral-7B-In...
2026.04
96.6
SLB
Backbone=Mistral-7B-In...
2026.04
95.4
Baseline
Backbone=Mistral-7B-In...
2026.04
92.8
Feedback
Search any
task
Search any
task