Open-Ended Question Answering on EO and Earth Sciences Open-Ended QA with Context
[Chart: Judge Score and EVE WR Score over time. Best Judge Score: 81.81 (Qwen3, Mar 20, 2026).]
Evaluation Results
| Method | Details | Date | Judge Score | EVE WR Score |
|---|---|---|---|---|
| Qwen3 | Size (B)=30-A3, Evalua... | 2026.03 | 81.81 | 52.12 |
| Gemma3 | Size (B)=27, Evaluatio... | 2026.03 | 78.31 | 51.58 |
| EVE-Instruct | Size (B)=24, Evaluatio... | 2026.03 | 78.28 | - |
| Mistral Small 3.2 | Size (B)=24, Evaluatio... | 2026.03 | 71.93 | 57.27 |
| Llama4 Scout | Size (B)=109-A17, Eval... | 2026.03 | 71.73 | 58.31 |