Stop Suppressing the Tail: Causal Inference for Extreme Events

About

Estimating how an outcome responds to a continuous treatment (the Average Dose-Response Function, or ADRF) is a core causal-inference primitive. However, when outcomes possess heavy tails, standard robust double machine learning (DML) deliberately suppresses these extremes to stabilize the bulk average. In high-stakes settings, such as financial returns or climate losses, this omitted 1-in-1000 extreme event is the actual target quantity. Furthermore, current methods that read the tail from a model's residuals suffer from circular dependence, causing tail shape inferences to shift drastically based solely on whether the core estimator is switched between Huber and Welsch. The research proposes an ADRF estimator that emits a structured tail-shape output alongside the standard point estimate. Its tail diagnostic (PDHTE+JK) evaluates the per-treatment tail shape from the outcome centered by a pilot median, successfully breaking the circular dependence and rendering the diagnostic invariant to the choice of core method. The output encompasses four treatment-conditional quantities: tail shape $\hat{\xi}(t)$, deep-tail return levels $\hat{Q}_{\alpha}(t)$, conditional shortfalls $\hat{S}_{\alpha}(t)$, the recovered mean ADRF, and an explicit refusal mechanism that declines extrapolation when extreme-value modeling is unsupported by the data. Compared to kernel-weighted quantile regression (QR), the proposed estimator reduces deep-tail ($\alpha=0.001$) return-level MAE by 11% and conditional-shortfall MAE by 25.5% across a heavy-tailed panel. It also achieves a 20-29% MAE reduction in sample-scarce regimes ($n\le2000$). On freMTPL2 motor-insurance claims, it successfully triggered an explicit extrapolation refusal on the log-claim scale, which neither QR nor loss-only DML can produce.

Eichi Uehara• 2026

Related benchmarks

Task	Dataset	Result
Tail regime classification	Synthetic Tail DGPs Fréchet and Gumbel-domain n=3000	Classification Accuracy6	6
Return-level estimation	10 DGPs alpha = 0.001 (deep tail)	MAE15	3
Return-level estimation	10 DGPs alpha = 0.01	MAE2.71	3
ADRF Estimation	10 DGP Verification Panel 320 cells	Mean Relative MAE-11.6	2
Conditional Shortfall Estimation	DGP sinusoidal_pareto n=100,000 oracle, n=3000 train (holdout)	MAE8.86	2
Conditional Shortfall Estimation	DGP sinusoidal_asymmetric n=100,000 oracle, n=3000 train (holdout)	MAE1.25	2
Conditional Shortfall Estimation	DGP sinusoidal_heavytail n=100,000 oracle, n=3000 train (holdout)	MAE3.83	2
Conditional Shortfall Estimation	DGP sinusoidal_two_paretos n=100,000 oracle, n=3000 train (holdout)	Mean Absolute Error (MAE)5.13	2
Conditional Shortfall Estimation	DGP regime_switch n=100,000 oracle, n=3000 train (holdout)	MAE6.86	2
Conditional Shortfall Estimation	DGP pareto_plus_gaussian n=100,000 oracle, n=3000 train (holdout)	MAE6.07	2

Showing 10 of 14 rows

Other info

Follow for update

@wizwand_team Discord