Clean Table on Real-World Unseen
[Chart: Success Rate over time, Jul 17, 2025 to Mar 18, 2026. Highlighted: VGM+AnyPos, Success Rate 60.]
Updated 27d ago
Evaluation Results
| Method | Configuration | Date | Success Rate |
| --- | --- | --- | --- |
| VGM+AnyPos | Pipeline Architecture=... | 2025.07 | 60 |
| π0.5-ADV | Base Model=π0.5, Strat... | 2026.03 | 48 |
| VPP | Pipeline Architecture=VPP | 2025.07 | 40 |
| Qwen2.5-VL-ADV | Base Model=Qwen2.5-VL,... | 2026.03 | 24 |
| InternVL3.5-ADV | Base Model=InternVL3.5... | 2026.03 | 16 |
| π0.5 | Base Model=π0.5, Strat... | 2026.03 | 16 |
| Qwen2.5-VL-Diffusion | Base Model=Qwen2.5-VL,... | 2026.03 | 8 |
| InternVL3.5-Diffusion | Base Model=InternVL3.5... | 2026.03 | 8 |