Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Knowledge Editing on E-VQA
Loading...
98.12
Reliability
DSCA
81.8856
86.1003
90.315
94.5297
Apr 9, 2026
Reliability
Textual Generality
Visual Generality
Textual Locality
Multimodal Locality
Average Score
Updated 8d ago
Evaluation Results
Method
Method
Links
Reliability
Textual Generality
Visual Generality
Textual Locality
Multimodal Locality
Average Score
DSCA
Base Model=LLaVA-1.5-7B
2026.04
98.12
97.3
97.25
100
99.83
98.5
DualEdit
Base Model=LLaVA-1.5-7B
2026.04
96.94
96.43
96.2
100
99.61
97.84
VisEdit
Base Model=LLaVA-1.5-7B
2026.04
95.78
94.21
94.37
100
91.11
95.09
LTE
Base Model=LLaVA-1.5-7B
2026.04
94.16
93.54
93.06
83.76
81.65
89.23
MEND
Base Model=LLaVA-1.5-7B
2026.04
92.3
92.16
92.1
90.3
81.13
89.6
SERAC
Base Model=LLaVA-1.5-7B
2026.04
82.51
81.6
80.05
100
57.48
80.33
Feedback
Search any
task
Search any
task