Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

ClusT3: Information Invariant Test-Time Training

About

Deep Learning models have shown remarkable performance in a broad range of vision tasks. However, they are often vulnerable against domain shifts at test-time. Test-time training (TTT) methods have been developed in an attempt to mitigate these vulnerabilities, where a secondary task is solved at training time simultaneously with the main task, to be later used as an self-supervised proxy task at test-time. In this work, we propose a novel unsupervised TTT technique based on the maximization of Mutual Information between multi-scale feature maps and a discrete latent representation, which can be integrated to the standard training as an auxiliary clustering task. Experimental results demonstrate competitive classification performance on different popular test-time adaptation benchmarks.

Gustavo A. Vargas Hakim, David Osowiechi, Mehrdad Noori, Milad Cheraghalikhani, Ismail Ben Ayed, Christian Desrosiers• 2023

Related benchmarks

TaskDatasetResultRank
Reading ComprehensionC3
Accuracy49.32
73
Aspect-level Sentiment AnalysisCOTE BD
F1 Score90.14
34
Semantic SimilarityLCQMC
Accuracy75.07
17
Natural Language InferenceOCNLI
Accuracy55.19
17
Sentiment AnalysisChnSent
Accuracy90.43
17
Aspect-based Sentiment AnalysisCOTE-MFW syntactically perturbed
F1 Score85.42
17
Reading ComprehensionDRCD
F1 Score83.82
17
Poetry MatchingCCPM
F1 Score72.81
17
Relation ExtractionFinRE
F1 Score72.6
17
Question AnsweringCMRC syntactically perturbed 2018
F1 Score74.77
17
Showing 10 of 23 rows

Other info

Follow for update