A p-value for Process Tracing and other N=1 Studies

About

We introduce a method for calculating \(p\)-values to test causal hypotheses in qualitative research \emph{a la} process tracing. As in an experiment, our \(p\)-value tells us how often one would make the same or more compelling observations favoring one theory while entertaining a rival theory. We adapt Fisher's (1935) randomization-based urn model to the reality of qualitative researchers, who cannot randomize history, but can make observations about historical processes. Our test includes a method of sensitivity analysis which allows researchers to account for the possibility of observation bias, as well as a framework for representing the varying strenght of individual pieces of evidence, altoguether informing the robustness of qualitative causal inefernce. We provide simulations and replications of previously published work to illustrate how to execute our test using any type of qualitative data about events that took place within one case. This approach adds to the pluralistic turn in the use of probability theory in theory-testing process tracing by offering a simple model with provable conservatism, while relying on few assumptions the consequences of which can be directly assessed.

Matias Lopez, Jake Bowers• 2023

Related benchmarks

Task	Dataset	Result
Anti-customization	VGG-Face2 (test)	--	16
Image Purification	VGGFace2 ASPL (test)	IMS-0.05	12
Image Purification	VGGFace2 EASPL (test)	IMS-0.08	12
Image Purification	VGGFace2 PhotoGuard (test)	IMS-0.12	12
Image Purification	VGGFace2 MetaCloak (test)	IMS-0.16	12
Image Purification	VGGFace2 FSMG (test)	IMS-0.15	12
Image Purification	VGGFace2 AdvDM (test)	IMS-0.19	12
Image Purification	VGGFace2 Glaze (test)	IMS-0.22	12
Adversarial Purification	Diffusion-based Purification (val)	LPIPS0.384	7

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord