Knowledge Conflict Resolution on KID-Bench Category C v2
[Chart: Accuracy (C-Light) by date. Top entry: Conflict-Aware, 78.1 (Apr 26, 2026). Available metrics: Accuracy (C-Light), Accuracy (C-Medium), Accuracy (C-Deep).]
Updated 1mo ago
Evaluation Results
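| Method         | Details                    | Date    | Accuracy (C-Light) | Accuracy (C-Medium) | Accuracy (C-Deep) |
|----------------|----------------------------|---------|--------------------|---------------------|-------------------|
| Conflict-Aware | Backbone=Gemma-2B, Dec...  | 2026.04 | 78.1               | 83.6                | 71                |
| SLB            | Backbone=Gemma-2B, Dec...  | 2026.04 | 75                 | 73.8                | 60.9              |
| D2L Baseline   | Backbone=Gemma-2B, Dec...  | 2026.04 | 57.8               | 55.7                | 46.4              |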