Dynamic Concepts Personalization from Single Videos

Abdal, Rameen; Patashnik, Or; Skorokhodov, Ivan; Menapace, Willi; Siarohin, Aliaksandr; Tulyakov, Sergey; Cohen-Or, Daniel; Aberman, Kfir

arXiv:2502.14844 (cs)

[Submitted on 20 Feb 2025]

Abstract: Personalizing generative text-to-image models has seen remarkable progress, but extending this personalization to text-to-video models presents unique challenges. Unlike personalization of static concepts, personalizing text-to-video models has the potential to capture dynamic concepts, i.e., entities defined not only by their appearance but also by their motion. In this paper, we introduce Set-and-Sequence, a novel framework for personalizing Diffusion Transformer (DiT)-based generative video models with dynamic concepts. Our approach imposes a spatio-temporal weight space within an architecture that does not explicitly separate spatial and temporal features. This is achieved in two key stages. In the first stage, we fine-tune Low-Rank Adaptation (LoRA) layers using an unordered set of frames from the video to learn an identity LoRA basis that represents the appearance, free from temporal interference. In the second stage, with the identity LoRAs frozen, we augment their coefficients with Motion Residuals and fine-tune them on the full video sequence, capturing motion dynamics. Our Set-and-Sequence framework results in a spatio-temporal weight space that effectively embeds dynamic concepts into the video model's output domain, enabling unprecedented editability and compositionality while setting a new benchmark for personalizing dynamic concepts.
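
The two stages can be illustrated with a toy weight composition. The sketch below is one possible reading of the abstract, not the authors' implementation: it assumes the adapted weight has the form W0 + basis · (coeffs + residual), where the basis and coefficients come from stage 1 and an additive residual on the coefficients is the only new quantity in stage 2. All names (`effective_weight`, `basis`, `coeffs`, `motion_residual`) and all numbers are hypothetical, and no training is performed.

```python
# Toy sketch of a two-stage low-rank weight update.
# Assumption: adapted weight = w0 + basis @ (coeffs + residual).
# Names and values are illustrative only, not taken from the paper.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]

def add(a, b):
    """Element-wise sum of two same-shaped matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def effective_weight(w0, basis, coeffs, residual=None):
    """Return w0 + basis @ (coeffs + residual).

    basis    : (d_out x r) low-rank directions from stage 1, kept frozen
    coeffs   : (r x d_in) coefficients from stage 1
    residual : (r x d_in) stage-2 correction to the coefficients (assumed form)
    """
    if residual is not None:
        coeffs = add(coeffs, residual)
    return add(w0, matmul(basis, coeffs))

# Frozen pretrained weight (2 x 2) with a rank-1 adapter.
w0 = [[1.0, 0.0], [0.0, 1.0]]
basis = [[1.0], [2.0]]            # d_out x r
coeffs = [[0.5, 0.0]]             # r x d_in
motion_residual = [[0.0, 0.25]]   # r x d_in

# Stage 1 (unordered frames): appearance-only adapted weight.
w_identity = effective_weight(w0, basis, coeffs)

# Stage 2 (full sequence): same basis, coefficients shifted by the residual.
w_dynamic = effective_weight(w0, basis, coeffs, motion_residual)
```

Under this assumed form, the stage-2 weight differs from the stage-1 weight only by `basis @ motion_residual`, so the motion update stays inside the subspace spanned by the stage-1 basis.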

Comments:	Webpage: this https URL
Subjects:	Graphics (cs.GR); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2502.14844 [cs.GR]
	(or arXiv:2502.14844v1 [cs.GR] for this version)
	https://doi.org/10.48550/arXiv.2502.14844

Submission history

From: Rameen Abdal
[v1] Thu, 20 Feb 2025 18:53:39 UTC (24,859 KB)
