Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Off-Policy Evaluation on MIMIC-III (standard random split)
Loading...
67.9
FQE
OPL-MT-MNAR
52.196
56.273
60.35
64.427
Apr 23, 2026
FQE
FQE 95% CI
Updated 1mo ago
Evaluation Results
Method
Method
Links
FQE
FQE 95% CI
OPL-MT-MNAR
Type=Model-free
2026.04
67.9
0.673
Clinician
Type=Behavior
2026.04
52.8
0.52
Feedback
Search any
task
Search any
task