Share your thoughts, 1 month free Claude Pro on usSee more

Linear off-policy prediction on Baird environment

2.21Max RMSE

TD

Updated 2mo ago

Evaluation Results

Method	Links
TD 2026.05		2.21	10
ETD 2026.05		2.41	50
GTD2 2026.05		5.318	0
TETD 2026.05		5.44	50
TDRC 2026.05		11.35	0
TDC 2026.05		11.5	0
CETD 2026.05		17.54	0
RETD 2026.05		21.16	0