Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Density-Guided Robust Counterfactual Explanations on Tabular Data under Model Multiplicity

About

Counterfactual explanations (CEs) are essential for actionable recourse, yet their reliability is often compromised in low-density regions, where classifiers exhibit high variance. Unlike existing methods that rely on expensive ensemble intersections to define stability, we propose \textit{DensityFlow}, a generative framework that constructs robust CEs by adhering to the high-confidence data manifold. Specifically, we model the counterfactual generation as continuous-time dynamics parameterized by Neural ODE, guided by a differentiable density score to actively avoid uncertain, low-density areas. This density score is learned via Noise Contrastive Estimation, effectively leveraging a $(K{+}1)$-way discriminator to estimate density ratios. For black-box settings, we introduce a local proxy distillation mechanism that aligns a lightweight surrogate with the target model strictly within the trajectory of CE generation, enabling efficient gradient-based optimization with minimal queries. Experiments demonstrate that \textit{DensityFlow} achieves superior validity under model multiplicity while significantly reducing query costs compared to ensemble-based baselines. Our implementation is available at https://github.com/G-AILab/DensityFlow.

Jun Tan, Qing Guo, Zicheng Xu, Jinglin Li, Qi Fang, Ning Gui• 2026

Related benchmarks

TaskDatasetResultRank
Counterfactual ExplanationsCOMPAS
Validity72.9
21
Counterfactual Explanationsmoons
Validity99.7
19
Counterfactual ExplanationsHELOC
Validity75.7
19
Counterfactual ExplanationAdult
Cost1.597
5
Counterfactual Explanationsblood
Cost1.527
5
Counterfactual Explanationscircles
Cost0.683
5
Counterfactual ExplanationsSpirals
Cost0.487
5
Counterfactual Explanationschessboard
Cost1.088
5
Counterfactual Explanation PlausibilitySpirals
LOF Score1.07
5
Showing 9 of 9 rows

Other info

Follow for update