| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Off-policy prediction | Boyan chain | Tail-average RMSE0.166 | 16 | |
| Off-policy prediction | Boyan Chain environment | Steady-state AUC Error0.1669 | 9 | |
| Policy Evaluation | 14-State Boyan Chain on-policy | Sum of sqrt MSE25.06 | 7 | |
| Policy Evaluation | 14-State Boyan Chain on-policy | MSE0.1 | 7 |