GeneralizeFormer: Layer-Adaptive Model Generation across Test-Time Distribution Shifts

Ambekar, Sameer; Xiao, Zehao; Zhen, Xiantong; Snoek, Cees G. M.

arXiv:2502.12195 (cs)

[Submitted on 15 Feb 2025]

Abstract: We consider the problem of test-time domain generalization, where a model is trained on several source domains and adjusted on target domains never seen during training. Unlike common methods that fine-tune the model or adjust the classifier parameters online, we propose to generate the parameters of multiple layers on the fly during inference with a lightweight meta-learned transformer, which we call GeneralizeFormer. The layer-wise parameters are generated per target batch without fine-tuning or online adjustment. As a result, our method is more effective in dynamic scenarios with multiple target distributions and avoids forgetting valuable source distribution characteristics. Moreover, by considering layer-wise gradients, the proposed method adapts itself to various distribution shifts. To reduce computation and time costs, we fix the convolutional parameters and generate only the parameters of the Batch Normalization layers and the linear classifier. Experiments on six widely used domain generalization datasets demonstrate the benefits and abilities of the proposed method to efficiently handle various distribution shifts, generalize in dynamic scenarios, and avoid forgetting.
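
As a rough illustration of the "generate parameters per target batch, without fine-tuning" idea described in the abstract, the sketch below derives normalization scale and shift values for each incoming batch while leaving the stored source parameters untouched. It is not the authors' method: the meta-learned transformer and the layer-wise gradients are replaced here by a simple hand-written rule (re-estimating normalization from the target batch's own statistics), and every name in it (`generate_affine`, `predict_batch`, `SOURCE_GAMMA`, `SOURCE_BETA`) is hypothetical.

```python
import math
from typing import List, Tuple

Batch = List[List[float]]  # rows are samples, columns are feature channels


def batch_stats(x: Batch) -> Tuple[List[float], List[float]]:
    """Per-channel mean and (biased) variance of one batch."""
    n, channels = len(x), len(x[0])
    mean = [sum(row[c] for row in x) / n for c in range(channels)]
    var = [sum((row[c] - mean[c]) ** 2 for row in x) / n for c in range(channels)]
    return mean, var


def generate_affine(source_gamma: List[float], source_beta: List[float],
                    x: Batch, eps: float = 1e-5) -> Tuple[List[float], List[float]]:
    """Toy stand-in for a parameter generator (ASSUMPTION, not the paper's transformer).

    Returns fresh per-batch scale/shift lists; the source lists are never mutated.
    """
    mean, var = batch_stats(x)
    gamma = [g / math.sqrt(v + eps) for g, v in zip(source_gamma, var)]
    beta = [b - g_t * m for b, g_t, m in zip(source_beta, gamma, mean)]
    return gamma, beta


def apply_affine(x: Batch, gamma: List[float], beta: List[float]) -> Batch:
    """Apply a per-channel affine transform y = gamma * x + beta."""
    return [[g * v + b for v, g, b in zip(row, gamma, beta)] for row in x]


# Source-trained parameters stay fixed for the whole test stream.
SOURCE_GAMMA = [1.0, 1.0]
SOURCE_BETA = [0.0, 0.0]


def predict_batch(x: Batch) -> Batch:
    """Generate parameters for this batch only, use them, then discard them."""
    gamma, beta = generate_affine(SOURCE_GAMMA, SOURCE_BETA, x)
    return apply_affine(x, gamma, beta)


# Two batches standing in for two different target distributions.
batch_a = [[10.0, 0.0], [12.0, 2.0], [14.0, 4.0]]
batch_b = [[-5.0, 100.0], [-4.0, 110.0], [-3.0, 120.0]]
out_a = predict_batch(batch_a)
out_b = predict_batch(batch_b)
```

Because nothing is written back to `SOURCE_GAMMA` or `SOURCE_BETA`, processing `batch_a` has no effect on how `batch_b` is handled, which is the property the abstract associates with dynamic scenarios and with avoiding forgetting.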

Comments:	WACV 2025
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2502.12195 [cs.LG]
	(or arXiv:2502.12195v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2502.12195

Submission history

From: Sameer Ambekar
[v1] Sat, 15 Feb 2025 10:10:49 UTC (6,176 KB)
