| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Off-policy prediction | Two-state environment | Steady-state AUC Error3.67 | 9 | |
| Linear off-policy prediction | New two-state environment | Max RMSE3.89 | 8 | |
| Linear off-policy prediction | Two-state environment | Max RMSE1.697 | 8 |