Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Clean Table on Real-World Unseen
Loading...
48
Success Rate
π0.5-ADV
6.4
17.2
28
38.8
Mar 18, 2026
Success Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Success Rate
π0.5-ADV
Base Model=π0.5, Strat...
2026.03
48
Qwen2.5-VL-ADV
Base Model=Qwen2.5-VL,...
2026.03
24
InternVL3.5-ADV
Base Model=InternVL3.5...
2026.03
16
π0.5
Base Model=π0.5, Strat...
2026.03
16
Qwen2.5-VL-Diffusion
Base Model=Qwen2.5-VL,...
2026.03
8
InternVL3.5-Diffusion
Base Model=InternVL3.5...
2026.03
8
Feedback
Search any
task
Search any
task