Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Vision-Language Question Answering on Pooled MLLM Suite (GQA, LLaVA-Wild, MMMU Pro, MME-Finance) (test)

5.4Expected Calibration Error (ECE)

InternalInspector

3.8614.25524.6535.045May 11, 2026
Updated 21d ago

Evaluation Results

MethodLinks
2026.05
5.41773.382.689.779.6
2026.05
5.618.271.478.187.778.9
2026.05
7.51872.281.588.477.2
2026.05
8.917.472.978.990.380.1
2026.05
10.619.472.882.886.474.2
2026.05
10.819.968.978.287.177.3
2026.05
10.820.270.580.386.473.8
2026.05
12.823.56377.377.368.5
2026.05
13.822.365.477.881.471.9
2026.05
14.723.867.88071.658.2
2026.05
16.32267.478.87672.9
2026.05
16.52367.378.88272.5
2026.05
16.623.369.680.876.265.4
2026.05
16.725.966.68060.343.4
2026.05
19.82667.380.466.252.9
2026.05
21.524.768.480.876.870.8
2026.05
24.427.569.781.27759.5
2026.05
26.230.945.734.867.954.7
2026.05
28.745.653.954.566.244.9
2026.05
28.93065.278.181.967.3
2026.05
41.241.557.2707859.5
2026.05
43.944.254.66776.254.6