Closed-book recall on KGFACT SMALL (train)
Top result: MIXSD, 100 Recall (%)
[Figure: Recall (%) over time; x-axis date shown: May 16, 2026]
Updated 15d ago
Evaluation Results
Method  Model                      Date     Recall (%)
MIXSD   Model backbone=Qwen3-1...  2026.05  100
SFT     Model backbone=Qwen3-4...  2026.05  100
OPSD    Model backbone=Qwen3-4...  2026.05  100
SFT     Model backbone=Qwen3-8...  2026.05  100
MIXSD   Model backbone=Qwen3-8...  2026.05  100
SFT     Model backbone=Qwen3-1...  2026.05  99
OPSD    Model backbone=Qwen3-1...  2026.05  99
MIXSD   Model backbone=Qwen3-4...  2026.05  99
OPSD    Model backbone=Qwen3-8...  2026.05  99
MIXSD   Model backbone=Qwen3-8...  2026.05  99
MIXSD   Model backbone=Qwen3-1...  2026.05  97
MIXSD   Model backbone=Qwen3-8...  2026.05  97
MIXSD   Model backbone=Qwen3-1...  2026.05  96
MIXSD   Model backbone=Qwen3-4...  2026.05  94
MIXSD   Model backbone=Qwen3-4...  2026.05  89
MIXSD   Model backbone=Qwen3-4...  2026.05  85
MIXSD   Model backbone=Qwen3-8...  2026.05  73
MIXSD   Model backbone=Qwen3-1...  2026.05  71
Base    Model backbone=Qwen3-1...  2026.05  0
Base    Model backbone=Qwen3-4...  2026.05  0
Base    Model backbone=Qwen3-8...  2026.05  0