Diffusing Colors: Image Colorization with Text Guided Diffusion

Zabari, Nir; Azulay, Aharon; Gorkor, Alexey; Halperin, Tavi; Fried, Ohad

arXiv:2312.04145 (cs)

[Submitted on 7 Dec 2023]

Abstract: The colorization of grayscale images is a complex and subjective task with significant challenges. Despite recent progress in employing large-scale datasets with deep neural networks, difficulties with controllability and visual quality persist. To tackle these issues, we present a novel image colorization framework that utilizes image diffusion techniques with granular text prompts. This integration not only produces colorization outputs that are semantically appropriate but also greatly improves the level of control users have over the colorization process. Our method provides a balance between automation and control, outperforming existing techniques in terms of visual quality and semantic coherence. We leverage a pretrained generative diffusion model and show that we can fine-tune it for the colorization task without losing its generative power or attention to text prompts. Moreover, we present a novel CLIP-based ranking model that evaluates color vividness, enabling automatic selection of the most suitable level of vividness based on the specific scene semantics. Our approach holds particular potential for color enhancement and historical image colorization.

Comments:	SIGGRAPH Asia 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Machine Learning (cs.LG)
Cite as:	arXiv:2312.04145 [cs.CV]
	(or arXiv:2312.04145v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2312.04145

Submission history

From: Nir Zabari
[v1] Thu, 7 Dec 2023 08:59:20 UTC (32,701 KB)
