Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Linear off-policy prediction on New two-state environment
Loading...
3.89
Max RMSE
ETD
1.4488
17.9269
34.405
50.8831
May 2, 2026
Max RMSE
Divergence Count
Updated 27d ago
Evaluation Results
Method
Method
Links
Max RMSE
Divergence Count
ETD
alpha=0.01, total runs=10
2026.05
3.89
1
GTD2
alpha=0.01, total runs=10
2026.05
8.735
0
TETD
alpha=0.01, total runs=10
2026.05
8.747
0
RETD
alpha=0.01, total runs=10
2026.05
8.758
0
TDRC
alpha=0.01, total runs=10
2026.05
8.79
0
TDC
alpha=0.01, total runs=10
2026.05
8.794
0
TD
alpha=0.01, total runs=10
2026.05
8.802
0
CETD
alpha=0.01, total runs=10
2026.05
64.92
0
Feedback
Search any
task
Search any
task