
UAT-LITE: Inference-Time Uncertainty-Aware Attention for Pretrained Transformers

About

Neural NLP models are often miscalibrated, assigning high confidence to incorrect predictions, which undermines selective prediction and high-stakes deployment. Post-hoc calibration methods adjust output probabilities but leave internal computation unchanged, while ensemble and Bayesian approaches improve uncertainty estimates at substantial training or storage cost. We propose UAT-LITE, an inference-time framework that makes self-attention uncertainty-aware in pretrained transformer classifiers using approximate Bayesian inference via Monte Carlo dropout. Token-level epistemic uncertainty is estimated from stochastic forward passes and used to modulate self-attention during contextualization, without modifying pretrained weights or training objectives. We additionally introduce a layerwise variance decomposition to diagnose how predictive uncertainty accumulates across transformer depth. Across SQuAD 2.0 answerability, MNLI, and SST-2, UAT-LITE reduces Expected Calibration Error by approximately 20% on average relative to a fine-tuned BERT-base baseline while preserving task accuracy, and improves selective prediction and robustness under distribution shift.
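The core mechanism described above can be sketched in a heavily simplified form: run several dropout-perturbed forward passes, take the per-token variance as an epistemic uncertainty estimate, then penalize attention logits for uncertain key tokens. This is a minimal NumPy illustration under assumed simplifications (dropout applied directly to token representations rather than inside transformer layers; the function names, the subtractive logit penalty, and the `lam` scaling parameter are illustrative, not the paper's exact formulation).

```python
import numpy as np

rng = np.random.default_rng(0)

def mc_dropout_uncertainty(hidden, n_passes=8, p_drop=0.1):
    """Token-level epistemic uncertainty via Monte Carlo dropout.

    hidden: (seq_len, d) token representations.
    Simplification: each "stochastic pass" is just an inverted-dropout
    mask applied to the representations. Returns the per-token variance
    across passes, averaged over feature dimensions -> shape (seq_len,).
    """
    samples = []
    for _ in range(n_passes):
        mask = rng.binomial(1, 1 - p_drop, hidden.shape) / (1 - p_drop)
        samples.append(hidden * mask)
    samples = np.stack(samples)               # (n_passes, seq_len, d)
    return samples.var(axis=0).mean(axis=-1)  # (seq_len,)

def uncertainty_aware_attention(q, k, v, u, lam=1.0):
    """Scaled dot-product attention with logits penalized by the
    uncertainty u of each key token (illustrative modulation scheme)."""
    d = q.shape[-1]
    logits = q @ k.T / np.sqrt(d) - lam * u[None, :]
    w = np.exp(logits - logits.max(axis=-1, keepdims=True))
    w = w / w.sum(axis=-1, keepdims=True)     # rows sum to 1
    return w @ v, w
```

In this toy setting, tokens whose representations vary more across dropout passes receive proportionally less attention mass; no weights are retrained, matching the inference-time framing of the abstract.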

Elias Hossain, Shubhashis Roy Dipta, Subash Neupane, Rajib Rana, Ravid Shwartz-Ziv, Ivan Garibay, Niloofar Yousefi • 2026

Related benchmarks

Task                          | Dataset                   | Result               | Rank
Question Answering            | MedQA                     | Accuracy 24.9        | 70
Natural Language Inference    | MNLI (val)                | --                   | 26
Question Answering            | SQuAD v2.0 (val)          | --                   | 21
Question Answering            | PubMedQA                  | Accuracy 64          | 9
Distribution Shift Robustness | MNLI matched → mismatched | ID ECE 0.0219        | 2
Selective Prediction          | MNLI                      | Coverage@0.9 86.92   | 2
Selective Prediction          | SQuAD 2.0                 | Coverage@0.9 69.04   | 2
