Zero-shot Inversion Process for Image Attribute Editing with Diffusion Models

Feng, Zhanbo; Ling, Zenan; Gong, Ci; Zhou, Feng; Li, Jie; Qiu, Robert C.

Computer Science > Computer Vision and Pattern Recognition

arXiv:2308.15854 (cs)

[Submitted on 30 Aug 2023 (v1), last revised 11 Oct 2023 (this version, v2)]

Title:Zero-shot Inversion Process for Image Attribute Editing with Diffusion Models

Authors:Zhanbo Feng, Zenan Ling, Ci Gong, Feng Zhou, Jie Li, Robert C. Qiu

View PDF

Abstract:Denoising diffusion models have shown outstanding performance in image editing. Existing works tend to use either image-guided methods, which provide a visual reference but lack control over semantic coherence, or text-guided methods, which ensure faithfulness to text guidance but lack visual quality. To address the problem, we propose the Zero-shot Inversion Process (ZIP), a framework that injects a fusion of generated visual reference and text guidance into the semantic latent space of a \textit{frozen} pre-trained diffusion model. Only using a tiny neural network, the proposed ZIP produces diverse content and attributes under the intuitive control of the text prompt. Moreover, ZIP shows remarkable robustness for both in-domain and out-of-domain attribute manipulation on real images. We perform detailed experiments on various benchmark datasets. Compared to state-of-the-art methods, ZIP produces images of equivalent quality while providing a realistic editing effect.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2308.15854 [cs.CV]
	(or arXiv:2308.15854v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.15854

Submission history

From: Zhanbo Feng [view email]
[v1] Wed, 30 Aug 2023 08:40:15 UTC (57,781 KB)
[v2] Wed, 11 Oct 2023 02:34:23 UTC (36,461 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Zero-shot Inversion Process for Image Attribute Editing with Diffusion Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Zero-shot Inversion Process for Image Attribute Editing with Diffusion Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators