StyTr^2: Unbiased Image Style Transfer with Transformers

Deng, Yingying; Tang, Fan; Pan, Xingjia; Dong, Weiming; Ma, Chongyang; Xu, Changsheng

arXiv:2105.14576v2 (cs)

[Submitted on 30 May 2021 (v1), revised 8 Jun 2021 (this version, v2), latest version 1 Apr 2022 (v3)]

Abstract: The goal of image style transfer is to render an image with the artistic features of a style reference while maintaining the original content. Due to the locality and spatial invariance of CNNs, it is difficult to extract and maintain the global information of input images. As a result, traditional neural style transfer methods are usually biased, and content leak can be observed when the style transfer process is run several times with the same reference style image. To address this critical issue, we take long-range dependencies of input images into account for unbiased style transfer by proposing a transformer-based approach, namely StyTr^2. In contrast with visual transformers for other vision tasks, StyTr^2 contains two different transformer encoders that generate domain-specific sequences for content and style, respectively. Following the encoders, a multi-layer transformer decoder stylizes the content sequence according to the style sequence. In addition, we analyze the deficiency of existing positional encoding methods and propose content-aware positional encoding (CAPE), which is scale-invariant and better suited to the image style transfer task. Qualitative and quantitative experiments demonstrate the effectiveness of the proposed StyTr^2 compared to state-of-the-art CNN-based and flow-based approaches.
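
The decoder step described in the abstract, stylizing the content sequence according to the style sequence, can be illustrated with a single cross-attention operation. The following is a minimal sketch rather than the authors' implementation: it assumes that content tokens act as queries and style tokens act as keys and values, and it omits the learned projections, multiple heads, multiple layers, and CAPE. The names `softmax` and `cross_attention` are hypothetical helpers introduced only for this illustration.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(content_seq, style_seq):
    """Scaled dot-product cross-attention (illustrative assumption).

    content_seq: list of content tokens (queries), each a list of d floats.
    style_seq:   list of style tokens (keys and values), each a list of d floats.
    Returns one output token per content token; each output is a weighted
    average of the style tokens.
    """
    d = len(content_seq[0])
    scale = math.sqrt(d)
    outputs = []
    for query in content_seq:
        # Similarity of this content token to every style token.
        scores = [
            sum(q * k for q, k in zip(query, key)) / scale
            for key in style_seq
        ]
        weights = softmax(scores)
        # Mix the style tokens according to the attention weights.
        mixed = [
            sum(w * value[j] for w, value in zip(weights, style_seq))
            for j in range(d)
        ]
        outputs.append(mixed)
    return outputs


if __name__ == "__main__":
    content = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # three content tokens
    style = [[2.0, 0.0], [0.0, 2.0]]                # two style tokens
    for token in cross_attention(content, style):
        print(token)
```

Because every content token attends to every style token, each output depends on the whole style sequence rather than on a local neighborhood, which is the long-range dependency the abstract contrasts with CNNs. The output sequence has the same length as the content sequence.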

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Image and Video Processing (eess.IV)
Cite as:	arXiv:2105.14576 [cs.CV]
	(or arXiv:2105.14576v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2105.14576

Submission history

From: Yingying Deng
[v1] Sun, 30 May 2021 15:57:09 UTC (8,311 KB)
[v2] Tue, 8 Jun 2021 02:40:39 UTC (17,524 KB)
[v3] Fri, 1 Apr 2022 09:05:25 UTC (32,001 KB)
