Face0: Instantaneously Conditioning a Text-to-Image Model on a Face

Valevski, Dani; Wasserman, Danny; Matias, Yossi; Leviathan, Yaniv

Computer Science > Computer Vision and Pattern Recognition

arXiv:2306.06638 (cs)

[Submitted on 11 Jun 2023]

Title:Face0: Instantaneously Conditioning a Text-to-Image Model on a Face

Authors:Dani Valevski, Danny Wasserman, Yossi Matias, Yaniv Leviathan

View PDF

Abstract:We present Face0, a novel way to instantaneously condition a text-to-image generation model on a face, in sample time, without any optimization procedures such as fine-tuning or inversions. We augment a dataset of annotated images with embeddings of the included faces and train an image generation model, on the augmented dataset. Once trained, our system is practically identical at inference time to the underlying base model, and is therefore able to generate images, given a user-supplied face image and a prompt, in just a couple of seconds. Our method achieves pleasing results, is remarkably simple, extremely fast, and equips the underlying model with new capabilities, like controlling the generated images both via text or via direct manipulation of the input face embeddings. In addition, when using a fixed random vector instead of a face embedding from a user supplied image, our method essentially solves the problem of consistent character generation across images. Finally, while requiring further research, we hope that our method, which decouples the model's textual biases from its biases on faces, might be a step towards some mitigation of biases in future text-to-image models.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Graphics (cs.GR); Machine Learning (cs.LG)
Cite as:	arXiv:2306.06638 [cs.CV]
	(or arXiv:2306.06638v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2306.06638

Submission history

From: Dani Valevski [view email]
[v1] Sun, 11 Jun 2023 09:52:03 UTC (19,359 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Face0: Instantaneously Conditioning a Text-to-Image Model on a Face

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Face0: Instantaneously Conditioning a Text-to-Image Model on a Face

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators