Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Visual Question Answering on MMBench (dev)

60.1Overall Score

LLaVA-RLHF_13B×336

4.4618.90533.3547.795Sep 25, 2023
Updated 8d ago

Evaluation Results

MethodLinks
2023.09
60.129.267.256.560.953.871.5---
2023.09
59.246.755.743.564.34972.5---
2023.09
58.825.856.758.357.257.975.8---
2023.09
57.525.865.754.857.95168.5---
2023.09
54.62967.846.5564861.9---
2023.09
52.128.363.237.453.235.966.8---
2023.09
51.432.556.753.946.838.665.4---
2023.09
51.424.263.239.150.24066.1---
2023.09
48.220.854.23347.836.667.1---
2023.09
47.523.359.731.341.438.665.8---
2023.09
4419.154.234.847.824.856.4---
2023.09
41.211.735.329.647.538.656.4---
2023.09
38.716.748.330.445.532.440.6---
2023.09
3614.246.322.63721.449---
2023.09
24.37.531.34.330.3935.6---
2023.09
6.64.215.40.98.11.45---
2025.01
-------72.4--
2025.01
-------68.8--
2025.01
-------72.5--
2025.01
-------68.3--
2025.01
-------79.5--
2025.01
-------79.5--
2025.01
-------83.3--
2025.01
-------78.1--
2026.05
--------66.158.9
2026.05
--------62.254.8
2026.05
--------55.252
2026.05
--------53.247.4
2026.05
--------5245.8
2026.05
--------52.248.5
2026.05
--------65.757.6
2026.05
--------61.453.8
2026.05
--------48.845.3
2026.05
--------63.154.5
2026.05
--------61.252.7
2026.05
--------63.856.7
2026.05
--------63.155.8
2026.05
--------65.156.8