E-VQA

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Visual Question Answering | E-VQA (test) | Accuracy 84.03 | 85 |
| Lifelong Knowledge Editing | E-VQA Lifelong Sequential | Rel. Score 96.67 | 72 |
| Knowledge-Intensive Visual Question Answering | E-VQA (test) | Accuracy (All) 41.4 | 34 |
| Knowledge-Based Visual Question Answering | E-VQA Single-Hop | Accuracy 59.9 | 27 |
| Knowledge Editing | E-VQA MMEdit 1.0 (test) | Reliability 100 | 24 |
| Knowledge-based VQA | E-VQA | Single-Hop Accuracy 70.27 | 16 |
| Knowledge-Based Visual Question Answering | E-VQA All | Accuracy 45.8 | 15 |
| Visual Question Answering | E-VQA | Accuracy 53.7 | 15 |
| Conflict Discrimination | E-VQA (test) | MCC 93.4 | 12 |
| Model Editing | E-VQA 5 | Reliability Score 98.12 | 11 |
| Retrieval | E-VQA | Recall@1 44.9 | 11 |
| Lifelong Editing | E-VQA Lifelong Editing 5 | Relational Score 96.85 | 10 |
| Re-ranking | E-VQA | CondR@1 90 | 10 |
| Re-ranking | E-VQA | Recall@1 44.7 | 10 |
| Retrieval | E-VQA standard (test) | R@1 44.1 | 8 |
| Entity Retrieval | E-VQA (test) | Recall@1 0.428 | 7 |
| Knowledge Editing | E-VQA | Reliability 98.12 | 6 |
| Open-domain visual recognition | E-VQA (Overall) | Top-1 Accuracy 47.4 | 6 |
| Knowledge-based Visual Retrieval | E-VQA 1.0 (test) | MRR@5 0.4492 | 6 |
| Knowledge Editing | E-VQA 1,000 sequential edits | Reliability 96.85 | 5 |
| Knowledge Retrieval | E-VQA (test) | R@20 58.7 | 5 |
| Retrieval | E-VQA (test) | PR@5 0.781 | 5 |
| Multimodal Multi-hop Visual Question Answering | E-VQA Two hop | Accuracy 23.3 | 4 |
| Retrieval | E-VQA M2KR | R@1 43.1 | 3 |
| Open-domain visual recognition | E-VQA (Seen) | Top-1 Accuracy 39.9 | 3 |
Showing 25 of 28 rows