Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Predicate Grounding on Real Images Blocksworld
Loading...
100
F1 Score
Symbolizer
23.456
43.328
63.2
83.072
Apr 20, 2026
F1 Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
F1 Score
Symbolizer
Model=Gem.3.1-Pro
2026.04
100
Symbolizer
Model=Gem.3.1-FL
2026.04
100
SYMBOLIZER
Backbone=Gemini 3.1 Pro
2026.04
100
SYMBOLIZER
Backbone=Gemini 3.1 Fl...
2026.04
100
ViLaIn
Backbone=Gem. 3.1-FL,...
2026.04
100
SYMBOLIZER
Backbone=Gem.3.1-Pro
2026.04
100
SYMBOLIZER
Backbone=Gem.3.1-FL
2026.04
100
SYMBOLIZER
Backbone=Mistral Small
2026.04
95.8
Symbolizer
Model=Mistral Small
2026.04
88.3
ViLaIn
Backbone=Gem. 3.1-FL,...
2026.04
80.2
SYMBOLIZER
Backbone=Mistral Small
2026.04
78.7
ViLaIn
Backbone=Gemini 3.1 Fl...
2026.04
56
ViLaIn
Backbone=Gemini 3.1 Fl...
2026.04
47.4
ViLaIn
Model=Gem. 3.1-FL, Ret...
2026.04
32.3
ViLaIn
Model=Gem. 3.1-FL, Ret...
2026.04
26.4
Feedback
Search any
task
Search any
task