
Policy learning from action-inclusive feedback on OpenML K ≥ 3

57.98 Policy Accuracy

CB Policy

Updated 5mo ago

Evaluation Results

Method	Date	Policy Accuracy		
CB Policy	2022.06	57.98	-	-
AI-IGL	2022.06	35.74	-	0.59
IGL (full CI)	2022.06	15.65	-	0.3
Constant Action	2022.06	-	25.28	-