Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Linear off-policy prediction on Two-state environment

1.697Max RMSE

GTD2

1.480882.939694.39855.85731May 2, 2026
Updated 27d ago

Evaluation Results

MethodLinks
2026.05
1.6970
2026.05
1.720
2026.05
1.7540
2026.05
1.8090
2026.05
1.9160
2026.05
2.0430
2026.05
2.6590
2026.05
7.110