Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Earth Observation

Benchmarks

Task NameDataset NameSOTA ResultTrend
Open-Ended Question Answering (with Context)Earth Observation
Judge Score86.65
7
Open-Ended Question AnsweringEarth Observation
Judge Score97.05
7
Hallucination DetectionEarth Observation
F1 Score90.94
7
Multiple Choice Question Answering (Single)Earth Observation
Accuracy96.35
7
Multiple Choice Question Answering (Multiple)Earth Observation
IoU87.56
7
Showing 5 of 5 rows