Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Policy learning from action-inclusive feedback on OpenML (K ≥ 3, N ≥ 70,000)
Loading...
58.41
Policy Accuracy
CB Policy
10.05
22.605
35.16
47.715
Jun 16, 2022
Policy Accuracy
Constant Action
Performance vs CB
Updated 1mo ago
Evaluation Results
Method
Method
Links
Policy Accuracy
Constant Action
Performance vs CB
CB Policy
2022.06
58.41
-
-
AI-IGL
Feedback=Action-Inclusive
2022.06
50.11
-
0.79
IGL (full CI)
Feedback=full CI
2022.06
11.91
-
0.22
Constant Action
2022.06
-
22.57
-
Feedback
Search any
task
Search any
task