Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Visual Question Answering on XLRS-Bench L-3 Capability (test)

41.7OC

Gemini 2.0 Flash

15.722.4529.235.95Apr 15, 2026
Updated 3d ago

Evaluation Results

MethodLinks
41.7453873.534.627.661.732738243305139.5
2026.04
40391071.544.530.86525.277823621.75040.3
2026.04
38.337107733.435.56521.6738334504338.5
2026.04
33.340317740.640.566.736.268722738.34543.8
2026.04
33.3381572.536.336.366.735.674832836.74341.1
27.622.717.468.430.529.963.627.664.878.434.527.832.635.1
2026.04
26.74056728.832.866.730697827353636
2026.04
26.740117335.934.661.731.870813546.74840.1
2026.04
26.738496941.631.66535677866505243.3
2026.04
2538269.535.935.36525.276832443.33638.1
2026.04
253215669.511.311.724.6737335202523
2026.04
23.349337442.537.466.730768140454243.5
23.3251959.540.9316523.67171296.73036.2
21.74276831.827.86.72672814136.74734.8
2026.04
21.7335055.543.533.86544.862715446.75144
2026.04
18.34213631.3316524.862714348.35033.8
2026.04
16.72922321.116.83524.2334310242121.2
2026.04
16.73022621.416.83525.642534628.32123.6