Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Text-rich image question-answering on TRINS-VQA Human
Loading...
58.8
Accuracy
LaRA
21.152
30.926
40.7
50.474
Jun 10, 2024
Accuracy
B@1
B@2
B@3
B@4
METEOR
ROUGE
CIDEr
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
B@1
B@2
B@3
B@4
METEOR
ROUGE
CIDEr
LaRA
Resolution=336^2
2024.06
58.8
31
23.6
18.6
15.1
26.5
38
135.6
LLaVAR (finetuned)
Resolution=336^2
2024.06
50.1
26.1
18.7
14.1
11
23.1
33.4
112.6
LLaVAR
Resolution=336^2
2024.06
44.1
27.5
19.5
14.4
10.9
21.6
33
94.7
LLaVA
Resolution=336^2
2024.06
22.6
12.3
7.5
4.9
3.4
15.3
20.2
26.2
Feedback
Search any
task
Search any
task