Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Open-Ended Question Answering on EO and Earth Sciences Open-Ended QA
Loading...
96.4
Judge Score
EVE-Instruct
87.0088
89.4469
91.885
94.3231
Mar 20, 2026
Judge Score
EVE WR
Updated 3d ago
Evaluation Results
Method
Method
Links
Judge Score
EVE WR
EVE-Instruct
Size (B)=24, Evaluatio...
2026.03
96.4
-
Qwen3
Size (B)=30-A3, Evalua...
2026.03
94.92
50.7
Gemma3
Size (B)=27, Evaluatio...
2026.03
94.41
50.92
Mistral Small 3.2
Size (B)=24, Evaluatio...
2026.03
91.78
51.69
Llama4 Scout
Size (B)=109-A17, Eval...
2026.03
87.37
53.95
Feedback
Search any
task
Search any
task