Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multimodal Continual Instruction Tuning on MLLM-CTBench

49.52Math QA Accuracy

Ours (PASs-MoE)

-1.980811.389624.7638.1304Jan 19, 2026
Updated 3mo ago

Evaluation Results

MethodLinks
2026.01
49.52-5.943.2210.4344.7-6.566.13-6.0529.99-5.3821.95-1.6383.73-48.46-2.15
2026.01
44.83-7.8820.87-5.3637.4-1.2558.77-13.5123.74-0.342.35-0.0774.46-37.49-4.06
2026.01
42.98-11.4535.892.6540.84-11.6156.1-15.4729.24-6.0418.89-4.5779.57-43.36-6.64
2026.01
40.64-12.5625.44-7.9734.37-14.0960.15-11.8331.79-2.3423.68-3.2178.25-42.05-8.29
2026.01
37.89-19.755.52-32.3539.91-11.9764.42-5.7415.7-20.47.19-16.6784.07-36.39-15.27
2026.01
35.68-15.066.78-24.1833.58-17.6939.8-27.1822.67-10.8914.57-3.4173.56-32.38-15.34
2026.01
26.85-25.1211.84-22.3231.58-16.7749.29-9.088.22-20.7514.67-3.0729.97-24.63-16.16
2026.01
24.58-29.859.87-22.6130.79-20.9838.48-34.0122.82-12.8518.64-5.9282.44-32.52-19.46
2026.01
19.48-34.957.87-25.6528.48-23.6338.48-35.1121.58-13.3717.78-6.878.74-30.34-19.36
2026.01
18.98-34.716.45-27.0126.8-25.3136.51-34.6623.61-9.9417.38-8.181.68-30.2-19.96
2026.01
16.78-36.186.83-26.4225.68-26.6637.4-36.1923.63-11.2517.49-4.6683.52-30.19-20.2
2026.01
0-54.436.28-24.8127.75-22.1838.44-35.1524.7-12.5416.96-5.5383.52-28.24-22.09