Denoising Cosine Similarity: A Theory-Driven Approach for Efficient Representation Learning

Nakagawa, Takumi; Sanada, Yutaro; Waida, Hiroki; Zhang, Yuhui; Wada, Yuichiro; Takanashi, Kōsaku; Yamada, Tomonori; Kanamori, Takafumi

Statistics > Machine Learning

arXiv:2304.09552 (stat)

[Submitted on 19 Apr 2023]

Title:Denoising Cosine Similarity: A Theory-Driven Approach for Efficient Representation Learning

Authors:Takumi Nakagawa, Yutaro Sanada, Hiroki Waida, Yuhui Zhang, Yuichiro Wada, Kōsaku Takanashi, Tomonori Yamada, Takafumi Kanamori

View PDF

Abstract:Representation learning has been increasing its impact on the research and practice of machine learning, since it enables to learn representations that can apply to various downstream tasks efficiently. However, recent works pay little attention to the fact that real-world datasets used during the stage of representation learning are commonly contaminated by noise, which can degrade the quality of learned representations. This paper tackles the problem to learn robust representations against noise in a raw dataset. To this end, inspired by recent works on denoising and the success of the cosine-similarity-based objective functions in representation learning, we propose the denoising Cosine-Similarity (dCS) loss. The dCS loss is a modified cosine-similarity loss and incorporates a denoising property, which is supported by both our theoretical and empirical findings. To make the dCS loss implementable, we also construct the estimators of the dCS loss with statistical guarantees. Finally, we empirically show the efficiency of the dCS loss over the baseline objective functions in vision and speech domains.

Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:2304.09552 [stat.ML]
	(or arXiv:2304.09552v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2304.09552

Submission history

From: Hiroki Waida [view email]
[v1] Wed, 19 Apr 2023 10:33:39 UTC (3,311 KB)

Statistics > Machine Learning

Title:Denoising Cosine Similarity: A Theory-Driven Approach for Efficient Representation Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Denoising Cosine Similarity: A Theory-Driven Approach for Efficient Representation Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators