Omni-ID: Holistic Identity Representation Designed for Generative Tasks

Qian, Guocheng; Wang, Kuan-Chieh; Patashnik, Or; Heravi, Negin; Ostashev, Daniil; Tulyakov, Sergey; Cohen-Or, Daniel; Aberman, Kfir

Computer Science > Computer Vision and Pattern Recognition

arXiv:2412.09694 (cs)

[Submitted on 12 Dec 2024]

Title:Omni-ID: Holistic Identity Representation Designed for Generative Tasks

Authors:Guocheng Qian, Kuan-Chieh Wang, Or Patashnik, Negin Heravi, Daniil Ostashev, Sergey Tulyakov, Daniel Cohen-Or, Kfir Aberman

View PDF HTML (experimental)

Abstract:We introduce Omni-ID, a novel facial representation designed specifically for generative tasks. Omni-ID encodes holistic information about an individual's appearance across diverse expressions and poses within a fixed-size representation. It consolidates information from a varied number of unstructured input images into a structured representation, where each entry represents certain global or local identity features. Our approach uses a few-to-many identity reconstruction training paradigm, where a limited set of input images is used to reconstruct multiple target images of the same individual in various poses and expressions. A multi-decoder framework is further employed to leverage the complementary strengths of diverse decoders during training. Unlike conventional representations, such as CLIP and ArcFace, which are typically learned through discriminative or contrastive objectives, Omni-ID is optimized with a generative objective, resulting in a more comprehensive and nuanced identity capture for generative tasks. Trained on our MFHQ dataset -- a multi-view facial image collection, Omni-ID demonstrates substantial improvements over conventional representations across various generative tasks.

Comments:	Webpage: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2412.09694 [cs.CV]
	(or arXiv:2412.09694v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.09694

Submission history

From: Guocheng Qian [view email]
[v1] Thu, 12 Dec 2024 19:21:20 UTC (39,714 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Omni-ID: Holistic Identity Representation Designed for Generative Tasks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Omni-ID: Holistic Identity Representation Designed for Generative Tasks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators