Simulation Task Generalization on lock-in benchmark T1
[Chart: Success Count. Top entry: DeLock, 16 (Apr 25, 2026)]
Updated 1mo ago
Evaluation Results
Method                      Success Count
DeLock (2026.04)            16
Spatial Forcing (2026.04)   2