Mug Cleanup on Simulation Extrapolated Views (8 views)
[Chart: Success Rate over time. VILA: 11.85 (Jan 6, 2026).]
Updated 1mo ago
Evaluation Results
Method    Configuration               Date      Success Rate
VILA      Setting=Fine-tuned, Vi...   2026.01   11.85
Vanilla   Setting=Fine-tuned, Vi...   2026.01   2.5
CLASS     Setting=Fine-tuned, Vi...   2026.01   0.6
ReViWo    Setting=Fine-tuned, Vi...   2026.01   0
KYC       Setting=Fine-tuned, Vi...   2026.01   0