Latent Directions: A Simple Pathway to Bias Mitigation in Generative AI

Olmos, Carolina Lopez; Neophytou, Alexandros; Sengupta, Sunando; Papadopoulos, Dim P.

Computer Science > Computer Vision and Pattern Recognition

arXiv:2406.06352 (cs)

[Submitted on 10 Jun 2024]

Title:Latent Directions: A Simple Pathway to Bias Mitigation in Generative AI

Authors:Carolina Lopez Olmos, Alexandros Neophytou, Sunando Sengupta, Dim P. Papadopoulos

View PDF HTML (experimental)

Abstract:Mitigating biases in generative AI and, particularly in text-to-image models, is of high importance given their growing implications in society. The biased datasets used for training pose challenges in ensuring the responsible development of these models, and mitigation through hard prompting or embedding alteration, are the most common present solutions. Our work introduces a novel approach to achieve diverse and inclusive synthetic images by learning a direction in the latent space and solely modifying the initial Gaussian noise provided for the diffusion process. Maintaining a neutral prompt and untouched embeddings, this approach successfully adapts to diverse debiasing scenarios, such as geographical biases. Moreover, our work proves it is possible to linearly combine these learned latent directions to introduce new mitigations, and if desired, integrate it with text embedding adjustments. Furthermore, text-to-image models lack transparency for assessing bias in outputs, unless visually inspected. Thus, we provide a tool to empower developers to select their desired concepts to mitigate. The project page with code is available online.

Comments:	Accepted at CVPR workshop 2024, proceedings of ReGenAI: First Workshop on Responsible Generative AI
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2406.06352 [cs.CV]
	(or arXiv:2406.06352v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2406.06352

Submission history

From: Carolina Lopez Olmos [view email]
[v1] Mon, 10 Jun 2024 15:13:51 UTC (3,645 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Latent Directions: A Simple Pathway to Bias Mitigation in Generative AI

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Latent Directions: A Simple Pathway to Bias Mitigation in Generative AI

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators