Scientific Reasoning on Lens

56.2 Accuracy

Gemini2.5-Pro

Updated 2mo ago

Evaluation Results

Method	Links	Accuracy
Gemini2.5-Pro 2025.05		56.2
Qwen2.5-VL 2025.05		53.65
Qwen2.5-VL 2025.05		51.66
GPT-4o 2025.05		51.14
QVQ-Max 2025.05		50.8
InternVL3 2025.05		49.39
InternVL3 2025.05		47.18
Qwen2.5-VL 2025.05		46.28
InternVL3 2025.05		44.69
Deepseek-VL2 2025.05		44.58
Gemma3 2025.05		43.33
InternVL3 2025.05		40.56
Qwen2.5-VL 2025.05		40.33
Gemma3 2025.05		39.53
Deepseek-VL2-tiny 2025.05		38.97
Kimi-VL-thinking 2025.05		29.4