Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Geo-Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Generative Engine OptimizationGEO-bench 1,000 queries (test)
Word Score11.07
12
Generative Engine OptimizationGEO-Bench Subjective Average (test)
VAR0.0116
12
Generative Engine OptimizationGEO-Bench Objective Overall (test)
VAR0.0189
12
Multispectral ClassificationGEO-Bench m-bigearthnet, m-so2sat, m-eurosat (test)
F1 Score (GB-ben)0.6215
10
Semantic SegmentationGEO-Bench SA-c
Macro mIoU28.98
10
Semantic SegmentationGeo-Bench
mIoU (nz-cattle, macro)82.98
10
Showing 6 of 6 rows