Predicate Grounding on PyBullet Hanoi
[Chart: F1 Score over time. Highlighted entry: SYMBOLIZER, F1 Score 100, Apr 20, 2026. Updated 1mo ago.]
Evaluation Results
| Method | Configuration | Links | F1 Score |
|---|---|---|---|
| SYMBOLIZER | Backbone=Gemini 3.1 Pro | 2026.04 | 100 |
| SYMBOLIZER | Backbone=Gemini 3.1 Fl... | 2026.04 | 99.8 |
| SYMBOLIZER | Backbone=Mistral Small | 2026.04 | 99.4 |
| Symbolizer | Model=Gem.3.1-Pro | 2026.04 | 95 |
| Symbolizer | Model=Mistral Small | 2026.04 | 84.1 |
| Symbolizer | Model=Gem.3.1-FL | 2026.04 | 76.3 |
| ViLaIn | Backbone=Gemini 3.1 Fl... | 2026.04 | 64.3 |
| ViLaIn | Model=Gem. 3.1-FL, Ret... | 2026.04 | 29.6 |
| ViLaIn | Model=Gem. 3.1-FL, Ret... | 2026.04 | 13 |
| ViLaIn | Backbone=Gemini 3.1 Fl... | 2026.04 | 0 |