Policy learning from action-inclusive feedback on OpenML (K ≥ 3, N ≥ 70,000)

Top result: CB Policy, 58.41 Policy Accuracy

Updated 5mo ago

Evaluation Results

Method	Date	Policy Accuracy	(unlabeled)	(unlabeled)
CB Policy	2022.06	58.41	-	-
AI-IGL	2022.06	50.11	-	0.79
IGL (full CI)	2022.06	11.91	-	0.22
Constant Action	2022.06	-	22.57	-