Tool Usage on Real-world tool usage
[Chart: Success Rate over time, Sep 19, 2025 to May 3, 2026. Top result: Compose by Focus, Success Rate 90.]
Updated 26d ago
Evaluation Results
| Method | Details | Date | Success Rate |
|---|---|---|---|
| Compose by Focus | representation=Scene G... | 2025.09 | 90 |
| DexSim2Real | zero-shot=true, real_d... | 2026.05 | 67.8 |
| DP3 | | 2025.09 | 60 |
| DrEureka | zero-shot=true, real_d... | 2026.05 | 53.4 |
| Act3D | zero-shot=false, real_... | 2026.05 | 52.8 |
| RVT | zero-shot=false, real_... | 2026.05 | 51.3 |
| PerAct | zero-shot=false, real_... | 2026.05 | 48.7 |
| DeXtreme | zero-shot=true, real_d... | 2026.05 | 45.3 |
| Diffusion Policy | | 2025.09 | 40 |
| ADR | zero-shot=true, real_d... | 2026.05 | 38.2 |
| RAPS | zero-shot=true, real_d... | 2026.05 | 35.1 |
| Vanilla DR | zero-shot=true, real_d... | 2026.05 | 28.4 |
| π0 | | 2025.09 | 7.5 |