Nested Attention: Semantic-aware Attention Values for Concept Personalization

Patashnik, Or; Gal, Rinon; Ostashev, Daniil; Tulyakov, Sergey; Aberman, Kfir; Cohen-Or, Daniel


arXiv:2501.01407 (cs)

[Submitted on 2 Jan 2025]



Abstract: Personalizing text-to-image models to generate images of specific subjects across diverse scenes and styles is a rapidly advancing field. Current approaches often face challenges in maintaining a balance between identity preservation and alignment with the input text prompt. Some methods rely on a single textual token to represent a subject, which limits expressiveness, while others employ richer representations but disrupt the model's prior, diminishing prompt alignment. In this work, we introduce Nested Attention, a novel mechanism that injects a rich and expressive image representation into the model's existing cross-attention layers. Our key idea is to generate query-dependent subject values, derived from nested attention layers that learn to select relevant subject features for each region in the generated image. We integrate these nested layers into an encoder-based personalization method, and show that they enable high identity preservation while adhering to input text prompts. Our approach is general and can be trained on various domains. Additionally, its prior preservation allows us to combine multiple personalized subjects from different domains in a single image.
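
The idea of query-dependent subject values can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes single-head scaled dot-product attention, a single subject token at position `subject_index` in the prompt, and subject image features that have already been projected to keys and values (`k_subj`, `v_subj`). All function and parameter names are hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def nested_cross_attention(q, k_text, v_text, k_subj, v_subj, subject_index):
    """Illustrative cross-attention with a query-dependent subject value.

    q:              (N, d) queries, one per region of the generated image
    k_text, v_text: (T, d) keys and values of the prompt tokens
    k_subj, v_subj: (M, d) keys and values of the subject image features
                    (assumed to be already projected; hypothetical inputs)
    subject_index:  position of the subject token in the prompt
    """
    n, d = q.shape
    scale = 1.0 / np.sqrt(d)

    # Outer attention: each image region attends over the prompt tokens.
    attn = softmax(q @ k_text.T * scale)            # (N, T)

    # Nested attention: each region selects its own mix of subject features.
    nested_attn = softmax(q @ k_subj.T * scale)     # (N, M)
    subj_values = nested_attn @ v_subj              # (N, d), one value per query

    # Replace the fixed value of the subject token by the per-query value.
    values = np.broadcast_to(v_text, (n,) + v_text.shape).copy()  # (N, T, d)
    values[:, subject_index, :] = subj_values

    # Weighted sum of (now query-dependent) values.
    out = np.einsum("nt,ntd->nd", attn, values)     # (N, d)
    return out, attn, nested_attn


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    N, T, M, d = 6, 4, 5, 8
    q = rng.normal(size=(N, d))
    k_text, v_text = rng.normal(size=(T, d)), rng.normal(size=(T, d))
    k_subj, v_subj = rng.normal(size=(M, d)), rng.normal(size=(M, d))
    out, attn, nested = nested_cross_attention(
        q, k_text, v_text, k_subj, v_subj, subject_index=2
    )
    print(out.shape, attn.shape, nested.shape)
```

In the degenerate case of a single subject feature whose value equals the subject token's own text value, the sketch reduces to ordinary cross-attention, which is a quick check that only the subject token's value has been made query-dependent.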

Comments:	Project page at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
Cite as:	arXiv:2501.01407 [cs.CV]
	(or arXiv:2501.01407v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2501.01407

Submission history

From: Or Patashnik
[v1] Thu, 2 Jan 2025 18:52:11 UTC (48,669 KB)
