Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

E-VQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Visual Question AnsweringE-VQA (test)
Accuracy84.03
85
Lifelong Knowledge EditingE-VQA Lifelong Sequential
Rel. Score96.67
72
Knowledge-Based Visual Question AnsweringE-VQA Single-Hop
Accuracy59.9
52
Knowledge-Intensive Visual Question AnsweringE-VQA (test)
Accuracy (All)41.4
34
Online Multimodal Model EditingE-VQA
Reliability100
32
Knowledge EditingE-VQA MMEdit 1.0 (test)
Reliability100
24
Visual Question AnsweringE-VQA All
Accuracy39.5
23
Visual Question AnsweringE-VQA
Accuracy (All)36.3
19
Knowledge-Based Visual Question AnsweringE-VQA
Final Fidelity Rate12.5
18
Knowledge-based VQAE-VQA
Single-Hop Accuracy70.27
16
Knowledge-Based Visual Question AnsweringE-VQA All
Accuracy45.8
15
Visual Question AnsweringE-VQA
Accuracy53.7
15
Conflict DiscriminationE-VQA (test)
MCC93.4
12
Model EditingE-VQA 5
Reliability Score98.12
11
RetrievalE-VQA
Recall@144.9
11
Lifelong EditingE-VQA Lifelong Editing 5
Relational Score96.85
10
Knowledge EditingE-VQA
Reliability100
10
Re-rankingE-VQA
CondR@190
10
Re-rankingE-VQA
Recall@144.7
10
Visual Question AnsweringE-VQA M2KR setup (test)
BEM70.3
8
RetrievalE-VQA standard (test)
R@144.1
8
Entity RetrievalE-VQA (test)
Recall@10.428
7
Open-domain visual recognitionE-VQA (Overall)
Top-1 Accuracy47.4
6
Knowledge-based Visual RetrievalE-VQA 1.0 (test)
MRR@50.4492
6
Knowledge EditingE-VQA 1,000 sequential edits
Reliability96.85
5
Showing 25 of 33 rows