MixSA: Training-free Reference-based Sketch Extraction via Mixture-of-Self-Attention

Yang, Rui; Wu, Xiaojun; He, Shengfeng

doi:10.1109/TVCG.2024.3502395

Computer Science > Computer Vision and Pattern Recognition

arXiv:2501.00816 (cs)

[Submitted on 1 Jan 2025]

Title:MixSA: Training-free Reference-based Sketch Extraction via Mixture-of-Self-Attention

Authors:Rui Yang, Xiaojun Wu, Shengfeng He

View PDF HTML (experimental)

Abstract:Current sketch extraction methods either require extensive training or fail to capture a wide range of artistic styles, limiting their practical applicability and versatility. We introduce Mixture-of-Self-Attention (MixSA), a training-free sketch extraction method that leverages strong diffusion priors for enhanced sketch perception. At its core, MixSA employs a mixture-of-self-attention technique, which manipulates self-attention layers by substituting the keys and values with those from reference sketches. This allows for the seamless integration of brushstroke elements into initial outline images, offering precise control over texture density and enabling interpolation between styles to create novel, unseen styles. By aligning brushstroke styles with the texture and contours of colored images, particularly in late decoder layers handling local textures, MixSA addresses the common issue of color averaging by adjusting initial outlines. Evaluated with various perceptual metrics, MixSA demonstrates superior performance in sketch quality, flexibility, and applicability. This approach not only overcomes the limitations of existing methods but also empowers users to generate diverse, high-fidelity sketches that more accurately reflect a wide range of artistic expressions.

Comments:	25 pages, 25 figures; Accepted by IEEE IEEE Transactions on Visualization and Computer Graphics, 2024 (TVCG)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2501.00816 [cs.CV]
	(or arXiv:2501.00816v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.00816
Related DOI:	https://doi.org/10.1109/TVCG.2024.3502395

Submission history

From: Rui Yang [view email]
[v1] Wed, 1 Jan 2025 12:03:37 UTC (41,090 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:MixSA: Training-free Reference-based Sketch Extraction via Mixture-of-Self-Attention

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:MixSA: Training-free Reference-based Sketch Extraction via Mixture-of-Self-Attention

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators