Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Predicate Grounding on PyBullet Blocks
Loading...
100
F1 Score
Symbolizer
30.424
48.487
66.55
84.613
Apr 20, 2026
F1 Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
F1 Score
Symbolizer
Model=Gem.3.1-Pro
2026.04
100
SYMBOLIZER
Backbone=Gemini 3.1 Pro
2026.04
100
ViLaIn
Backbone=Gem. 3.1-FL,...
2026.04
100
SYMBOLIZER
Backbone=Gem.3.1-Pro
2026.04
100
SYMBOLIZER
Backbone=Gem.3.1-FL
2026.04
100
Symbolizer
Model=Gem.3.1-FL
2026.04
99.2
SYMBOLIZER
Backbone=Gemini 3.1 Fl...
2026.04
98.7
SYMBOLIZER
Backbone=Mistral Small
2026.04
97.4
SYMBOLIZER
Backbone=Mistral Small
2026.04
96.4
Symbolizer
Model=Mistral Small
2026.04
88.1
ViLaIn
Backbone=Gem. 3.1-FL,...
2026.04
86.4
ViLaIn
Backbone=Gemini 3.1 Fl...
2026.04
74.4
ViLaIn
Backbone=Gemini 3.1 Fl...
2026.04
69
ViLaIn
Model=Gem. 3.1-FL, Ret...
2026.04
36.8
ViLaIn
Model=Gem. 3.1-FL, Ret...
2026.04
33.1
Feedback
Search any
task
Search any
task