Share your thoughts, 1 month free Claude Pro on usSee more

Open-Ended Question Answering (with Context) on Earth Observation

86.65Judge Score

GPT-4.1

Updated 3mo ago

Evaluation Results

Method	Links
GPT-4.1 2026.03		86.65	49.22	2.83
Mistral Medium 3.1 2026.03		86.44	50.99	4.17
Qwen3 2026.03		86.1	50.09	2.17
GPT OSS 2026.03		84.8	50.7	4.83
GPT-5 nano 2026.03		84.4	48.6	5.33
MiniMax m2.5 2026.03		81.57	51.2	5.17
EVE-Instruct 2026.03		78.28	-	3.5