
Continual Multimodal Instruction Tuning on CoIN/VQA Composite Benchmark

68.71 Accuracy

Upper bound

[Chart: Accuracy, y-axis ticks 16.7308, 30.2254, 43.72, 57.2146; date Dec 2, 2025]
Updated 4d ago

Evaluation Results

Method  Links

2025.12  68.71  -      -      -      -      -      -      -
         57.32  -      -      -      -      -      -      -
2025.12  55.25  73.27  40.74  91.88  36.56  44.15  44.9   -
2025.12  53.02  -      -      -      -      -      -      14.48
2025.12  43.18  -      -      -      -      -      -      21.96
2025.12  37.65  -      -      -      -      -      -      27.37
2025.12  37.05  67.87  33.36  9.54   33.6   41.91  36.64  18.25
2025.12  36.96  67.32  32.82  9.12   33.29  43.2   36.22  19.12
2025.12  35.54  -      -      -      -      -      -      -
2025.12  34.14  65.89  22.36  6.71   30.22  42.55  37.13  24.08
2025.12  33.47  -      -      -      -      -      -      25.99
2025.12  33.01  -      -      -      -      -      -      30.77
2025.12  32.86  -      -      -      -      -      -      30.95
         32.62  -      -      -      -      -      -      27.66
2025.12  31.92  -      -      -      -      -      -      6.82
2025.12  29.82  38.67  33.18  15.09  34.99  39.34  18.62  24.64
2025.12  29.13  -      -      -      -      -      -      32.01
2025.12  29.09  56.06  25.82  7.1    32.62  37.32  15.66  22.97
2025.12  28.43  -      -      -      -      -      -      35.33
2025.12  25.47  52.45  24.12  3.82   31.44  27.16  13.82  9.57
2025.12  22.48  -      -      -      -      -      -      -
2025.12  18.73  51.52  23.49  16.53  14.22  6.64   0      -
2025.12  -      72.43  39.34  89.41  37.93  44.38  18.62  -
2025.12  -      74.9   40.04  78.22  37.35  43.25  15.66  -
2025.12  -      60.01  31.48  24.55  29.91  40.91  13.82  -
2025.12  -      71.79  39.62  94.75  37.66  44.33  37.13  -
2025.12  -      71.35  38.82  90.08  37.37  43.73  36.22  -
2025.12  -      71.49  39     85.88  36.73  44.45  36.64  -