Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Two-state

Benchmarks

Task NameDataset NameSOTA ResultTrend
Off-policy predictionTwo-state
RMSVE0
9
Off-policy predictionNew two-state
Tail-Average RMSE4.131
7
Off-policy predictionTwo-state
Tail-average RMSE0.82
7
Off-policy predictionTwo-state
Steady-state AUC error1.05
6
Showing 4 of 4 rows