| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| AGENT new concepts from new initial states (test) | FTL-IGM | Accuracy73 | 9 | 3mo ago | |
| HM3D and MP3D Open-vocabulary objects v0.2 (val) | PLMD† | Success Rate (SR)35.4 | 4 | 26d ago | |
| MiniGrid RedBall (unseen seeds) | LLM4Teach | Success Rate95.8 | 4 | 3mo ago | |
| MiniGrid RedBlueDoor (unseen seeds) | LLM4Teach | Success Rate95.6 | 4 | 3mo ago | |
| MiniGrid LavaCrossing (unseen seeds) | LLM4Teach | Success Rate93.1 | 4 | 3mo ago | |
| MiniGrid DoorKey (unseen seeds) | LLM4Teach | Success Rate97 | 4 | 3mo ago | |
| MoCap human experiment (New Initial State) | FTL-IGM | Percentage Depicts Concept80 | 3 | 3mo ago | |
| MoCap human experiment New Concept | FTL-IGM | Success Rate73.3 | 3 | 3mo ago | |
| MoCap human experiment (train) | FTL-IGM | Success Time Percentage80 | 3 | 3mo ago |