Baird

Benchmarks

Task Name                    | Dataset Name      | SOTA Result             | Trend
Linear off-policy prediction | Baird environment | Max RMSE: 2.21          | 8
Off-policy prediction        | Baird             | Final RMSPBE: 0.0082    | 6
Off-policy prediction        | Baird             | Tail-average RMSE: 1.41 | 5