Simulation Task Generalization on lock-in benchmark T5 [S]
[Figure: Success Count by method. DeLock: 11 (Apr 25, 2026).]
Updated 1mo ago
Evaluation Results
Method                      Success Count
DeLock (2026.04)            11
Spatial Forcing (2026.04)   0