
Information Analysis and Processing on MARS Level 2 (Evaluation Set)

Average Ranking (AI GPT-5): 2.2 (Ours)

Updated 3mo ago

Evaluation Results

Method	Links	Average Ranking (AI GPT-5)
Ours 2025.11		2.2	1.5
Qwen3-235B-A 2025.11		2.6	4
Llama-Vision 2025.11		4.8	5.5
Llama-3.2-11B 2025.11		5.8	6.7
Llama-3.2-90B 2025.11		6.1	6.1
GPT-oss 2025.11		6.7	5.3
Qwen-VL 2025.11		6.78	7.9
Llama-4-Scout 2025.11		6.9	5.9
Qwen2-VL 2025.11		7.3	7.9
Qwen2.5-VL 2025.11		8.2	8.8
LLaVA-1.5 2025.11		8.9	6.4