LogitDynamics: Reliable ViT Error Detection from Layerwise Logit Trajectories

About

Reliable confidence estimation is critical when deploying vision models. We study error prediction: determining whether an image classifier's output is correct using only signals from a single forward pass. Motivated by internal-signal hallucination detection in large language models, we investigate whether similar depth-wise signals exist in Vision Transformers (ViTs). We propose a simple method that models how class evidence evolves across layers. By attaching lightweight linear heads to intermediate layers, we extract features from the last L layers that capture both the logits of the predicted class and its top-K competitors, as well as statistics describing instability of top-ranked classes across depth. A linear probe trained on these features predicts the error indicator. Across datasets, our method improves or matches AUCPR over baselines and shows stronger cross-dataset generalization while requiring minimal additional computation.

Ido Beigelman, Moti Freiman• 2026

Related benchmarks

Task	Dataset	Result
Trustworthiness Prediction	ImageNet-1k (val)	AUPR64.58	25
Error Prediction	CIFAR-100 (test)	AUCPR44.3	9
Error Prediction	Places365 (val)	AUCPR72.32	9

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord