Push T on Push T Real-world
Top result: Hi-WM, Success Rate 97.1
[Chart: Success Rate over time. Date shown: Apr 23, 2026]
Updated 1mo ago
Evaluation Results
Method   Policy   Date      Success Rate
Hi-WM    Pi0      2026.04   97.1
Hi-WM    DP       2026.04   85.3
WM-CL    Pi0      2026.04   79.4
Base     Pi0      2026.04   76.5
WM-CL    DP       2026.04   64.7
Base     DP       2026.04   52.9