A p-value for Process Tracing and other N=1 Studies
About
We introduce a method for calculating \(p\)-values to test causal hypotheses in qualitative research \emph{a la} process tracing. As in an experiment, our \(p\)-value tells us how often one would make the same or more compelling observations favoring one theory while entertaining a rival theory. We adapt Fisher's (1935) randomization-based urn model to the reality of qualitative researchers, who cannot randomize history, but can make observations about historical processes. Our test includes a method of sensitivity analysis which allows researchers to account for the possibility of observation bias, as well as a framework for representing the varying strenght of individual pieces of evidence, altoguether informing the robustness of qualitative causal inefernce. We provide simulations and replications of previously published work to illustrate how to execute our test using any type of qualitative data about events that took place within one case. This approach adds to the pluralistic turn in the use of probability theory in theory-testing process tracing by offering a simple model with provable conservatism, while relying on few assumptions the consequences of which can be directly assessed.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Anti-customization | VGG-Face2 (test) | -- | 16 | |
| Image Purification | VGGFace2 ASPL (test) | IMS-0.05 | 12 | |
| Image Purification | VGGFace2 EASPL (test) | IMS-0.08 | 12 | |
| Image Purification | VGGFace2 PhotoGuard (test) | IMS-0.12 | 12 | |
| Image Purification | VGGFace2 MetaCloak (test) | IMS-0.16 | 12 | |
| Image Purification | VGGFace2 FSMG (test) | IMS-0.15 | 12 | |
| Image Purification | VGGFace2 AdvDM (test) | IMS-0.19 | 12 | |
| Image Purification | VGGFace2 Glaze (test) | IMS-0.22 | 12 | |
| Adversarial Purification | Diffusion-based Purification (val) | LPIPS0.384 | 7 |