Remote Sensing Reasoning on XLRS-Bench

60.4 PASS@1

Baseline + pre-warming + ES Text QA (w/ CoT)

[Chart: score trend; x-axis date: Feb 15, 2026]
Updated 4d ago

Evaluation Results

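| Date | PASS@1 | (unlabeled metric) |
|---|---|---|
| 2026.02 | 60.4 | 96.25 |
| 2026.02 | 55 | - |
| 2026.02 | 52.39 | 91.85 |
| 2026.02 | 51.6 | - |
| 2026.02 | 51.5 | - |
| 2026.02 | 51.11 | - |
| 2026.02 | 50.02 | - |
| 2026.02 | 50.01 | 82.58 |
| 2026.02 | 49.8 | - |
| 2026.02 | 47.53 | - |
| 2026.02 | 47.4 | 85.75 |
| 2026.02 | 45.4 | - |
| | 45.2 | - |
| 2026.02 | 43.6 | - |
| | 40.5 | - |
| 2026.02 | 40.2 | - |
| 2026.02 | 39.1 | - |
| 2026.02 | 22.03 | - |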