Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss

Gupta, Yatharth; Jaddipal, Vishnu V.; Prabhala, Harish; Paul, Sayak; Von Platen, Patrick

Computer Science > Computer Vision and Pattern Recognition

arXiv:2401.02677 (cs)

[Submitted on 5 Jan 2024]

Title:Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss

Authors:Yatharth Gupta, Vishnu V. Jaddipal, Harish Prabhala, Sayak Paul, Patrick Von Platen

View PDF HTML (experimental)

Abstract:Stable Diffusion XL (SDXL) has become the best open source text-to-image model (T2I) for its versatility and top-notch image quality. Efficiently addressing the computational demands of SDXL models is crucial for wider reach and applicability. In this work, we introduce two scaled-down variants, Segmind Stable Diffusion (SSD-1B) and Segmind-Vega, with 1.3B and 0.74B parameter UNets, respectively, achieved through progressive removal using layer-level losses focusing on reducing the model size while preserving generative quality. We release these models weights at this https URL. Our methodology involves the elimination of residual networks and transformer blocks from the U-Net structure of SDXL, resulting in significant reductions in parameters, and latency. Our compact models effectively emulate the original SDXL by capitalizing on transferred knowledge, achieving competitive results against larger multi-billion parameter SDXL. Our work underscores the efficacy of knowledge distillation coupled with layer-level losses in reducing model size while preserving the high-quality generative capabilities of SDXL, thus facilitating more accessible deployment in resource-constrained environments.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2401.02677 [cs.CV]
	(or arXiv:2401.02677v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2401.02677

Submission history

From: Yatharth Gupta [view email]
[v1] Fri, 5 Jan 2024 07:21:46 UTC (29,468 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators