Linear off-policy prediction on Two-state environment
[Chart: Max RMSE by date. Top entry: GTD2, Max RMSE 1.697, May 2, 2026. Metrics shown: Max RMSE, Divergence Count.]
Updated 27d ago
Evaluation Results
Method  Settings                   Date     Max RMSE  Divergence Count
GTD2    alpha=0.01, total runs=10  2026.05  1.697     0
ETD     alpha=0.01, total runs=10  2026.05  1.72      0
RETD    alpha=0.01, total runs=10  2026.05  1.754     0
TETD    alpha=0.01, total runs=10  2026.05  1.809     0
TDRC    alpha=0.01, total runs=10  2026.05  1.916     0
CETD    alpha=0.01, total runs=10  2026.05  2.043     0
TDC     alpha=0.01, total runs=10  2026.05  2.659     0
TD      alpha=0.01, total runs=10  2026.05  7.1       10
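
The two metric columns read as per-method summaries over the 10 runs. The page does not define either metric, so the Python sketch below is only one plausible reading, not the benchmark's actual procedure. It assumes each run produces a curve of RMSE values over training steps, that Max RMSE is the largest finite value seen across all runs, and that a run counts as diverged if its RMSE becomes non-finite or exceeds a threshold. The function name `summarize_runs` and the parameter `divergence_threshold` are hypothetical.

```python
import math


def summarize_runs(rmse_curves, divergence_threshold):
    """Summarize per-run RMSE curves into (max_rmse, divergence_count).

    ASSUMPTIONS (not stated on the leaderboard page):
      - rmse_curves is a list with one list of RMSE values per run.
      - Max RMSE is the largest finite RMSE over all runs and steps.
      - A run has diverged if any RMSE is NaN/inf or above the threshold.
    """
    max_rmse = 0.0
    divergence_count = 0
    for curve in rmse_curves:
        # Flag the run if any value is non-finite or above the threshold.
        diverged = any(
            (not math.isfinite(v)) or v > divergence_threshold for v in curve
        )
        if diverged:
            divergence_count += 1
        # Track the largest finite RMSE so NaN/inf cannot hide the peak.
        finite_values = [v for v in curve if math.isfinite(v)]
        if finite_values:
            max_rmse = max(max_rmse, max(finite_values))
    return max_rmse, divergence_count


if __name__ == "__main__":
    # Toy curves for illustration only; these are not benchmark data.
    toy_curves = [[0.5, 1.2, 0.9], [0.4, float("inf")], [2.0, 8.0]]
    print(summarize_runs(toy_curves, divergence_threshold=5.0))
```

Under this reading, a method can report a finite Max RMSE and still diverge in every run, which is the pattern in the TD row (Max RMSE 7.1, Divergence Count 10 out of 10 runs).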