Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FVQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Vision-Language Model EditingFVQA 1.0 (test)
Accuracy100
48
Fact-based Visual Question AnsweringFVQA
Accuracy74.2
46
Visual Question AnsweringFVQA (test)
Accuracy73.95
36
Visual Question AnsweringFVQA
Accuracy82.82
34
Multimodal Deep SearchFVQA
Accuracy76.67
16
Fact-based Question AnsweringFVQA (test)
Accuracy70.1
16
Fact-based Visual Question AnsweringFVQA (test)
Top-1 WUPS@0.982.47
13
Fact-based Visual Question AnsweringFVQA 1.0 (test)
WUPS@0.0 (Top-1)87.3
13
Visual Question AnsweringFVQA 2.0+
LLM-J Score (Qwen2.5-7B)59.5
8
Showing 9 of 9 rows