PromptArtisan: Multi-instruction Image Editing in Single Pass with Complete Attention Control

Swami, Kunal; Chittersu, Raghu; Adlinge, Pranav; Irny, Rajeev; Doodekula, Shashavali; Shukla, Alok

Computer Science > Computer Vision and Pattern Recognition

arXiv:2502.10258 (cs)

[Submitted on 14 Feb 2025]

Title:PromptArtisan: Multi-instruction Image Editing in Single Pass with Complete Attention Control

Authors:Kunal Swami, Raghu Chittersu, Pranav Adlinge, Rajeev Irny, Shashavali Doodekula, Alok Shukla

View PDF HTML (experimental)

Abstract:We present PromptArtisan, a groundbreaking approach to multi-instruction image editing that achieves remarkable results in a single pass, eliminating the need for time-consuming iterative refinement. Our method empowers users to provide multiple editing instructions, each associated with a specific mask within the image. This flexibility allows for complex edits involving mask intersections or overlaps, enabling the realization of intricate and nuanced image transformations. PromptArtisan leverages a pre-trained InstructPix2Pix model in conjunction with a novel Complete Attention Control Mechanism (CACM). This mechanism ensures precise adherence to user instructions, granting fine-grained control over the editing process. Furthermore, our approach is zero-shot, requiring no additional training, and boasts improved processing complexity compared to traditional iterative methods. By seamlessly integrating multi-instruction capabilities, single-pass efficiency, and complete attention control, PromptArtisan unlocks new possibilities for creative and efficient image editing workflows, catering to both novice and expert users alike.

Comments:	Accepted in ICASSP 2025
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2502.10258 [cs.CV]
	(or arXiv:2502.10258v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2502.10258

Submission history

From: Kunal Swami [view email]
[v1] Fri, 14 Feb 2025 16:11:57 UTC (16,892 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:PromptArtisan: Multi-instruction Image Editing in Single Pass with Complete Attention Control

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:PromptArtisan: Multi-instruction Image Editing in Single Pass with Complete Attention Control

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators