Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CounterFlowNet: From Minimal Changes to Meaningful Counterfactual Explanations

About

Counterfactual explanations (CFs) provide human-interpretable insights into model's predictions by identifying minimal changes to input features that would alter the model's output. However, existing methods struggle to generate multiple high-quality explanations that (1) affect only a small portion of the features, (2) can be applied to tabular data with heterogeneous features, and (3) are consistent with the user-defined constraints. We propose CounterFlowNet, a generative approach that formulates CF generation as sequential feature modification using conditional Generative Flow Networks (GFlowNet). CounterFlowNet is trained to sample CFs proportionally to a user-specified reward function that can encode key CF desiderata: validity, sparsity, proximity and plausibility, encouraging high-quality explanations. The sequential formulation yields highly sparse edits, while a unified action space seamlessly supports continuous and categorical features. Moreover, actionability constraints, such as immutability and monotonicity of features, can be enforced at inference time via action masking, without retraining. Experiments on eight datasets under two evaluation protocols demonstrate that CounterFlowNet achieves superior trade-offs between validity, sparsity, plausibility, and diversity with full satisfaction of the given constraints.

Oleksii Furman, Patryk Marsza{\l}ek, Jan Mas{\l}owski, Piotr Gai\'nski, Maciej Zi\k{e}ba, Marek \'Smieja• 2026

Related benchmarks

TaskDatasetResultRank
Counterfactual ExplanationBank Protocol B, B=64
Validity1
12
Counterfactual Explanation GenerationGerman Credit Protocol A B=4
Sparsity69.09
6
Counterfactual Explanation GenerationStudent Performance Protocol A B=4
Sparsity71.18
6
Counterfactual Explanation GenerationAdult Income Protocol A B=4
Sparsity62.36
5
Counterfactual Explanation GenerationGraduate Admission Protocol A B=4
Sparsity55.41
5
Counterfactual ExplanationAdult Income Protocol B, B=64
Validity100
4
Counterfactual ExplanationDefault Protocol B, B=64
Validity96
4
Showing 7 of 7 rows

Other info

Follow for update