Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Off-policy Evaluation on DOPE averaged (three tasks)
Loading...
0.37
Normalized Value Gap
GALILEO
0.3564
0.4482
0.54
0.6318
Jun 10, 2022
Normalized Value Gap
Rank Correlation
Regret@1
Updated 4d ago
Evaluation Results
Method
Method
Links
Normalized Value Gap
Rank Correlation
Regret@1
GALILEO
2022.06
0.37
0.44
0.09
Best DICE
2022.06
0.48
0.15
0.42
FQE (L2)
2022.06
0.54
-0.19
0.34
Doubly Rubost
2022.06
0.57
-0.14
0.33
IS
2022.06
0.67
-0.4
0.36
VPM
2022.06
0.71
0.29
0.17
Feedback
Search any
task
Search any
task