| Dataset Name | SOTA Method | Metric | Trend | Updated | |
|---|---|---|---|---|---|
| NaVQA Long memory horizon | STaR | SR 77 | 3 | 4d ago | |
| NaVQA Medium memory horizon | STaR | SR 84 | 3 | 4d ago | |
| NaVQA Short memory horizon | STaR | SR 89 | 3 | 4d ago | |
| Pixel Diffusion Models Human Evaluation set | | Human Count 72 | 2 | 4d ago | |
| Latent Diffusion Models reasoning evaluation set | | Human Score 73 | 2 | 4d ago | |