Multimodal Instruction Following on CoIN

SciQA Score: 77.84

ϕ-DPO

[Chart: SciQA Score; Feb 26, 2026]
Updated 18d ago

Evaluation Results

Method  Date     SciQA
ϕ-DPO   2026.02  77.84  95.61  54.55  60.74  59.17  64.32  69.99  68.69  68.86  74.94
-       2026.02  73.2   69.28  50.76  59.18  56.92  61.33  67.12  64.76  62.82  64.7
-       2026.02  72.56  62.84  48.43  58.97  57.66  59.14  63.21  63.31  60.77  62.6
-       2026.02  70.21  23.31  44.21  43.76  56.25  58.46  62.32  64.11  52.83  53.96
-       2026.02  69.79  9.93   45.5   58.47  57.75  60.77  66.5   64.93  -      -
-       2026.02  62.02  37.21  43.32  33.22  52.05  53.12  57.92  65.75  50.58  55.24
-       2026.02  60.71  30.58  41.49  36.01  52.8   47.07  53.43  65.12  48.4   53.22
-       2026.02  59.75  31.88  42.26  34.96  51.06  51.84  55.3   64.55  48.95  53.3
-       2026.02  57.43  28.9   41.88  30.05  51.39  50.76  53.28  64.78  47.31  52.86