Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Test-Time Training with Self-Supervision for Generalization under Distribution Shifts

About

In this paper, we propose Test-Time Training, a general approach for improving the performance of predictive models when training and test data come from different distributions. We turn a single unlabeled test sample into a self-supervised learning problem, on which we update the model parameters before making a prediction. This also extends naturally to data in an online stream. Our simple approach leads to improvements on diverse image classification benchmarks aimed at evaluating robustness to distribution shifts.

Yu Sun, Xiaolong Wang, Zhuang Liu, John Miller, Alexei A. Efros, Moritz Hardt• 2019

Related benchmarks

TaskDatasetResultRank
Image ClassificationCIFAR-10C Severity Level 5 (test)
Average Error Rate (Severity 5)19.1
127
Image ClassificationImageNet-C level 5
Avg Top-1 Acc (ImageNet-C L5)83.6
110
Robot ManipulationLIBERO Object
Success Rate97.2
70
Image ClassificationCIFAR-100C Level 5 (test)
Gaussian Acc83.8
51
Robotic ManipulationLIBERO Long
Success Rate89.1
44
Diabetic Retinopathy ClassificationDEEPDR (test)
Accuracy0.524
30
Robot ManipulationLIBERO Average
Success Rate94.7
26
Image ClassificationCIFAR-10C level 5 (test)
Mean Error12.1
26
Diabetic Retinopathy (DR) gradingAPTOS
Accuracy51.7
25
Diabetic Retinopathy (DR) gradingMessidor
ACC64.8
22
Showing 10 of 41 rows

Other info

Code

Follow for update