| Dataset Name | SOTA Method | Metric | Value |  | Updated |
|---|---|---|---|---|---|
| StableToolBench (STB) | Llama3.1-8B | EM Accuracy | 49.25 | 16 | 3mo ago |
| WebShop (WS) | Qwen2.5-7B | EM Accuracy | 79.05 | 16 | 3mo ago |
| TextWorld (TW) | Qwen2.5-7B | EM Accuracy | 70.6 | 16 | 3mo ago |
| SciWorld (SW) | Llama3.1-8B | EM Accuracy | 98.64 | 16 | 3mo ago |
| ALFWorld (AW) | Qwen2.5-7B | EM Accuracy | 99.87 | 16 | 3mo ago |
| LiDAR synthetic (NSP) | KE | RMSE | 22.4 | 10 | 7d ago |
| Doppler radar tracking (Free) (test) | OKE | RMSE | 102.76 | 7 | 7d ago |
| Doppler radar tracking Const_a (test) | OKE | RMSE | 94.65 | 7 | 7d ago |
| Doppler radar tracking Const_v (test) | KE | RMSE | 85.4 | 7 | 7d ago |
| Doppler radar tracking Close (test) | KE | RMSE | 22.7 | 7 | 7d ago |
| Doppler radar tracking Toy (test) | OKE | RMSE | 86.93 | 7 | 7d ago |
| MOT20 | KE | RMSE | 0.4599 | 7 | 7d ago |
| LiDAR synthetic N=2000 trajectories (test) | KE | RMSE | 22.4 | 7 | 7d ago |