Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Knowledge Injection on EVOKE
Loading...
5.33
CEM
Vanilla
4.2276
11.6688
19.11
26.5512
Oct 22, 2025
CEM
F1 Score
Updated 26d ago
Evaluation Results
Method
Method
Links
CEM
F1 Score
Vanilla
Backbone=LLaVA-v1.5 (13B)
2025.10
5.33
10.07
Vanilla
Backbone=Qwen2.5-VL (7B)
2025.10
9.34
15.33
Replay
Backbone=Qwen2.5-VL (7B)
2025.10
11.73
18.51
Replay
Backbone=LLaVA-v1.5 (13B)
2025.10
12.05
20.21
LoRA
Backbone=Qwen2.5-VL (7B)
2025.10
14.56
14.01
LoRA
Backbone=LLaVA-v1.5 (13B)
2025.10
16.26
22.83
KORE
Backbone=Qwen2.5-VL (7B)
2025.10
22.91
31.36
KORE
Backbone=LLaVA-v1.5 (13B)
2025.10
32.89
44.47
Feedback
Search any
task
Search any
task