Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Policy Learning on CASdatasets Counterfactual (test)
Loading...
-9.38
V(pi)
Optimal
-10.4408
-10.1654
-9.89
-9.6146
Jan 27, 2026
V(pi)
Delta1(pi)
Delta2(pi)
Updated 1mo ago
Evaluation Results
Method
Method
Links
V(pi)
Delta1(pi)
Delta2(pi)
Optimal
fairness_criterion=Cou...
2026.01
-9.38
0.38
1.1
DFL
fairness_criterion=Cou...
2026.01
-10.18
0.02
0.12
VB2
fairness_criterion=Cou...
2026.01
-10.23
0.04
0.11
VB1
fairness_criterion=Cou...
2026.01
-10.4
0
0.49
ADVB
fairness_criterion=Cou...
2026.01
-10.4
0
0.49
Feedback
Search any
task
Search any
task