Sketched Lanczos uncertainty score: a low-memory summary of the Fisher information
About
Current uncertainty quantification is memory and compute expensive, which hinders practical uptake. To counter, we develop Sketched Lanczos Uncertainty (SLU): an architecture-agnostic uncertainty score that can be applied to pre-trained neural networks with minimal overhead. Importantly, the memory use of SLU only grows logarithmically with the number of model parameters. We combine Lanczos' algorithm with dimensionality reduction techniques to compute a sketch of the leading eigenvectors of a matrix. Applying this novel algorithm to the Fisher information matrix yields a cheap and reliable uncertainty score. Empirically, SLU yields well-calibrated uncertainties, reliably detects out-of-distribution examples, and consistently outperforms existing methods in the low-memory regime.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Out-of-Distribution Detection | CIFAR-10 vs SVHN (test) | AUROC0.76 | 101 | |
| Out-of-Distribution Detection | CIFAR-10 vs CIFAR-100 (test) | AUROC54 | 93 | |
| Out-of-Distribution Detection | FashionMNIST (ID) vs MNIST (OoD) | AUROC0.94 | 61 | |
| Out-of-Distribution Detection | MNIST vs FASHIONMNIST (test) | AUROC0.95 | 27 | |
| Out-of-Distribution Detection | CELEBA vs Hold-out (avg) (test) | AUROC72 | 16 | |
| Out-of-Distribution Detection | CELEBA vs FOOD-101 (test) | AUROC0.95 | 16 | |
| Out-of-Distribution Detection | CIFAR-10 Corruption avg (test) | AUROC57 | 16 | |
| Out-of-Distribution Detection | MNIST vs KMNIST (test) | AUROC0.46 | 16 | |
| Out-of-Distribution Detection | MNIST vs Rotation (test) | AUROC0.61 | 16 | |
| Out-of-Distribution Detection | FASHIONMNIST vs Rotation (test) | AUROC0.75 | 8 |