Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Knowledge Conflict Resolution on KID-Bench v2
Loading...
97.6
Performance (Difficulty A)
RAG
95.416
95.983
96.55
97.117
Apr 26, 2026
Performance (Difficulty A)
Performance (Difficulty B)
Performance (Difficulty C-Light)
Performance (Difficulty C-Medium)
Performance (Difficulty C-Deep)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Performance (Difficulty A)
Performance (Difficulty B)
Performance (Difficulty C-Light)
Performance (Difficulty C-Medium)
Performance (Difficulty C-Deep)
RAG
Knowledge internalizat...
2026.04
97.6
74
71.9
65.6
63.8
Conflict-Aware
Knowledge internalizat...
2026.04
97.1
64
78.1
83.6
71
D2L Baseline
Knowledge internalizat...
2026.04
96.7
68
57.8
55.7
46.4
SLB
Knowledge internalizat...
2026.04
95.5
68
75
73.8
60.9
Feedback
Search any
task
Search any
task