Share your thoughts, 1 month free Claude Pro on usSee more

Information Analysis and Processing on MARS Overall (Evaluation Set)

1.93Average Ranking (AI GPT-5)

Ours

Updated 3mo ago

Evaluation Results

Method	Links
Ours 2025.11		1.93	1.33
Qwen3-235B-A 2025.11		2.77	3.77
Llama-Vision 2025.11		5.23	6.3
Llama-3.2-11B 2025.11		5.77	6.2
Llama-4-Scout 2025.11		5.77	6.4
Qwen-VL 2025.11		6.18	7.6
Llama-3.2-90B 2025.11		6.77	5.9
GPT-oss 2025.11		6.8	6.37
LLaVA-1.5 2025.11		8.03	6.33
Qwen2-VL 2025.11		8.12	7.2
Qwen2.5-VL 2025.11		8.43	8.33