Self-supervised Transformation Learning for Equivariant Representations

Yu, Jaemyung; Choi, Jaehyun; Lee, Dong-Jae; Hong, HyeongGwon; Kim, Junmo

arXiv:2501.08712 (cs)

[Submitted on 15 Jan 2025]

Abstract: Unsupervised representation learning has significantly advanced various machine learning tasks. In the computer vision domain, state-of-the-art approaches utilize transformations like random crop and color jitter to achieve invariant representations, embedding semantically the same inputs despite transformations. However, this can degrade performance in tasks requiring precise features, such as localization or flower classification. To address this, recent research incorporates equivariant representation learning, which captures transformation-sensitive information. However, current methods depend on transformation labels and thus struggle with interdependency and complex transformations. We propose Self-supervised Transformation Learning (STL), replacing transformation labels with transformation representations derived from image pairs. The proposed method ensures transformation representation is image-invariant and learns corresponding equivariant transformations, enhancing performance without increased batch complexity. We demonstrate the approach's effectiveness across diverse classification and detection tasks, outperforming existing methods in 7 out of 11 benchmarks and excelling in detection. By integrating complex transformations like AugMix, unusable by prior equivariant methods, this approach enhances performance across tasks, underscoring its adaptability and resilience. Additionally, its compatibility with various base models highlights its flexibility and broad applicability. The code is available at this https URL.
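
The distinction the abstract draws between invariant and equivariant representations, and the idea of reading a transformation off an image pair rather than a label, can be illustrated with a toy sketch. This is not the STL implementation: the 2-D vectors standing in for images, the rotation standing in for an augmentation, and the names `rotate`, `invariant_feature`, `equivariant_feature`, and `transformation_repr` are all assumptions made for illustration only.

```python
import math

def rotate(v, theta):
    """Rotate a 2-D vector by theta radians; a stand-in for an image transformation."""
    x, y = v
    c, s = math.cos(theta), math.sin(theta)
    return (c * x - s * y, s * x + c * y)

def invariant_feature(v):
    """Invariant representation: the norm discards the rotation entirely."""
    return math.hypot(v[0], v[1])

def equivariant_feature(v):
    """Equivariant representation: an identity encoder, whose output rotates with its input."""
    return v

def transformation_repr(z, z_t):
    """Transformation representation from a pair of features: signed angle from z to z_t.
    No transformation label is passed in; only the two representations are used."""
    cross = z[0] * z_t[1] - z[1] * z_t[0]
    dot = z[0] * z_t[0] + z[1] * z_t[1]
    return math.atan2(cross, dot)

theta = 0.7  # hypothetical transformation parameter, used only to generate the pairs
image_a, image_b = (1.0, 2.0), (-3.0, 0.5)  # two toy "images"

# Each pair holds the representation of an image and of its transformed copy.
pair_a = (equivariant_feature(image_a), equivariant_feature(rotate(image_a, theta)))
pair_b = (equivariant_feature(image_b), equivariant_feature(rotate(image_b, theta)))

t_a = transformation_repr(*pair_a)
t_b = transformation_repr(*pair_b)

# The invariant feature is identical before and after the transformation.
norm_before = invariant_feature(image_a)
norm_after = invariant_feature(rotate(image_a, theta))
```

In this toy setting `t_a` and `t_b` agree even though they come from different images, which is the sense in which a transformation representation can be image-invariant. In the paper's setting the encoder and the transformation representation are learned, and the transformations (for example AugMix) have no such closed form.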

Comments:	38th Conference on Neural Information Processing Systems (NeurIPS 2024)
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2501.08712 [cs.CV]
	(or arXiv:2501.08712v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.08712

Submission history

From: Jaemyung Yu
[v1] Wed, 15 Jan 2025 10:54:21 UTC (1,930 KB)
