Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Simulation Task Generalization on lock-in benchmark T6 [S]
Loading...
13
Success Rate
DeLock
-0.52
2.99
6.5
10.01
Apr 25, 2026
Success Rate
Updated 1mo ago
Evaluation Results
Method
Method
Links
Success Rate
DeLock
2026.04
13
Spatial Forcing
2026.04
0
Feedback
Search any
task
Search any
task