Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Beyond Entropy: Region Confidence Proxy for Wild Test-Time Adaptation

About

Wild Test-Time Adaptation (WTTA) is proposed to adapt a source model to unseen domains under extreme data scarcity and multiple shifts. Previous approaches mainly focused on sample selection strategies, while overlooking the fundamental problem on underlying optimization. Initially, we critically analyze the widely-adopted entropy minimization framework in WTTA and uncover its significant limitations in noisy optimization dynamics that substantially hinder adaptation efficiency. Through our analysis, we identify region confidence as a superior alternative to traditional entropy, however, its direct optimization remains computationally prohibitive for real-time applications. In this paper, we introduce a novel region-integrated method ReCAP that bypasses the lengthy process. Specifically, we propose a probabilistic region modeling scheme that flexibly captures semantic changes in embedding space. Subsequently, we develop a finite-to-infinite asymptotic approximation that transforms the intractable region confidence into a tractable and upper-bounded proxy. These innovations significantly unlock the overlooked potential dynamics in local region in a concise solution. Our extensive experiments demonstrate the consistent superiority of ReCAP over existing methods across various datasets and wild scenarios.

Zixuan Hu, Yichun Hu, Xiaotong Li, Shixiang Tang, Ling-Yu Duan• 2025

Related benchmarks

TaskDatasetResultRank
Vision-Language NavigationR2R-CE (val-unseen)
Success Rate (SR)60
677
Vision-and-Language NavigationR2R (val unseen)
Success Rate (SR)75
448
Vision-and-Language NavigationREVERIE (val unseen)
SPL36.22
225
Vision-Language NavigationR2R (val seen)
Success Rate (SR)81
150
Vision-and-Language NavigationREVERIE Unseen (test)
Success Rate (SR)53.07
110
Vision-and-Language NavigationR2R-CE (val-seen)
SR71
79
Vision-and-Language NavigationREVERIE seen (val)
SR74.72
64
Showing 7 of 7 rows

Other info

Follow for update