Learning to Pay Attention: Unsupervised Modeling of Attentive and Inattentive Respondents in Survey Data
About
The integrity of behavioral and social-science surveys depends on detecting inattentive respondents who provide random or low-effort answers. Traditional safeguards, such as attention checks, are often costly, reactive, and inconsistent. We propose a unified, label-free framework for inattentiveness detection that scores response coherence using complementary unsupervised views: geometric reconstruction (Autoencoders) and probabilistic dependency modeling (Chow-Liu trees). While we introduce a "Percentile Loss" objective to improve Autoencoder robustness against anomalies, our primary contribution is identifying the structural conditions that enable unsupervised quality control. Across nine heterogeneous real-world datasets, we find that detection effectiveness is driven less by model complexity than by survey structure: instruments with coherent, overlapping item batteries exhibit strong covariance patterns that allow even linear models to reliably separate attentive from inattentive respondents. This reveals a critical ``Psychometric-ML Alignment'': the same design principles that maximize measurement reliability (e.g., internal consistency) also maximize algorithmic detectability. The framework provides survey platforms with a scalable, domain-agnostic diagnostic tool that links data quality directly to instrument design, enabling auditing without additional respondent burden.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Randomness Detection | Attention 1 | Recall@h18 | 12 | |
| Randomness Detection | Attention 2 | R@h33 | 12 | |
| Randomness Detection | Intersection | R@h18 | 12 | |
| Randomness Detection | Uhalt 2020 | R@h50 | 4 | |
| Randomness Detection | O’Grady 2019 | R@h20 | 4 | |
| Randomness Detection | Buchanan and Scofield 2018 | R@h27 | 4 | |
| Randomness Detection | Moss Union | R@h0.21 | 4 | |
| Randomness Detection | Mastroianni and Dana | R@h50 | 4 | |
| Randomness Detection | Union | R@h48 | 4 | |
| Randomness Detection | Robinson-Cimpian 2014 | R@h10 | 4 |