Conformal Risk Control
About
We extend conformal prediction to control the expected value of any monotone loss function. The algorithm generalizes split conformal prediction together with its coverage guarantee. Like conformal prediction, the conformal risk control procedure is tight up to an $\mathcal{O}(1/n)$ factor. We also introduce extensions of the idea to distribution shift, quantile risk control, multiple and adversarial risk control, and expectations of U-statistics. Worked examples from computer vision and natural language processing demonstrate the usage of our algorithm to bound the false negative rate, graph distance, and token-level F1-score.
Anastasios N. Angelopoulos, Stephen Bates, Adam Fisch, Lihua Lei, Tal Schuster• 2022
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Conformal Prediction | CIFAR-100 | Avg Prediction Set Size2.7219 | 32 | |
| Reinforcement Learning from Verifiable Rewards | HEAD-QA | AR16.5 | 30 | |
| Medical Image Segmentation | MSD Pancreas (test) | DSC45.19 | 30 | |
| Medical Image Segmentation | CAMUS (test) | DSC81.07 | 22 | |
| Medical Image Segmentation | ACDC-LV (test) | Coverage99.8 | 10 | |
| Medical Image Segmentation | ACDC-RV (test) | Coverage99.8 | 10 | |
| Distribution Shift Robustness | Sixteen Adversarial Cells MedQA + GSM8K (eval) | Violations4 | 10 | |
| Expert-Iteration RLVR | MedQA, HEAD-QA, ARC-C, and CaseHOLD | Pathwise Clean Score4 | 10 | |
| Mathematical Reasoning | GSM8K | AR (%)28.9 | 10 | |
| Natural Language Inference | medNLI | AR (%)28.9 | 10 |
Showing 10 of 20 rows