Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Boyan Chain

Benchmarks

Task NameDataset NameSOTA ResultTrend
Off-policy predictionBoyan chain
Tail-average RMSE0.166
16
Off-policy predictionBoyan Chain environment
Steady-state AUC Error0.1669
9
Policy Evaluation14-State Boyan Chain on-policy
Sum of sqrt MSE25.06
7
Policy Evaluation14-State Boyan Chain on-policy
MSE0.1
7
Showing 4 of 4 rows