Skip to main content

Showing 1–7 of 7 results for author: Orgad, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.05846  [pdf, other

    cs.CV cs.CL

    Diffusion Lens: Interpreting Text Encoders in Text-to-Image Pipelines

    Authors: Michael Toker, Hadas Orgad, Mor Ventura, Dana Arad, Yonatan Belinkov

    Abstract: Text-to-image diffusion models (T2I) use a latent representation of a text prompt to guide the image generation process. However, the process by which the encoder produces the text representation is unknown. We propose the Diffusion Lens, a method for analyzing the text encoder of T2I models by generating images from its intermediate representations. Using the Diffusion Lens, we perform an extensi… ▽ More

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Project webpage: tokeron.github.io/DiffusionLensWeb

    ACM Class: I.2.7; I.4.0

  2. arXiv:2308.14761  [pdf, other

    cs.CV cs.LG

    Unified Concept Editing in Diffusion Models

    Authors: Rohit Gandikota, Hadas Orgad, Yonatan Belinkov, Joanna MaterzyƄska, David Bau

    Abstract: Text-to-image models suffer from various safety issues that may limit their suitability for deployment. Previous methods have separately addressed individual issues of bias, copyright, and offensive content in text-to-image models. However, in the real world, all of these issues appear simultaneously in the same model. We present a method that tackles all issues with a single approach. Our method,… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

  3. arXiv:2306.00738  [pdf, other

    cs.CL cs.CV

    ReFACT: Updating Text-to-Image Models by Editing the Text Encoder

    Authors: Dana Arad, Hadas Orgad, Yonatan Belinkov

    Abstract: Our world is marked by unprecedented technological, global, and socio-political transformations, posing a significant challenge to text-to-image generative models. These models encode factual associations within their parameters that can quickly become outdated, diminishing their utility for end-users. To that end, we introduce ReFACT, a novel approach for editing factual associations in text-to-i… ▽ More

    Submitted 7 May, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to NAACL 2024 (Main Conference)

    MSC Class: 68T50 ACM Class: I.2.7

  4. arXiv:2303.08084  [pdf, other

    cs.CV

    Editing Implicit Assumptions in Text-to-Image Diffusion Models

    Authors: Hadas Orgad, Bahjat Kawar, Yonatan Belinkov

    Abstract: Text-to-image diffusion models often make implicit assumptions about the world when generating images. While some assumptions are useful (e.g., the sky is blue), they can also be outdated, incorrect, or reflective of social biases present in the training data. Thus, there is a need to control these assumptions without requiring explicit user input or costly re-training. In this work, we aim to edi… ▽ More

    Submitted 25 August, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

    Comments: Project page: https://time-diffusion.github.io/

  5. arXiv:2212.10563  [pdf, other

    cs.CL

    BLIND: Bias Removal With No Demographics

    Authors: Hadas Orgad, Yonatan Belinkov

    Abstract: Models trained on real-world data tend to imitate and amplify social biases. Common methods to mitigate biases require prior information on the types of biases that should be mitigated (e.g., gender or racial bias) and the social groups associated with each data sample. In this work, we introduce BLIND, a method for bias removal with no prior knowledge of the demographics in the dataset. While tra… ▽ More

    Submitted 11 June, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted to ACL 2023 main conference

    MSC Class: 68T50 ACM Class: I.2.7

  6. arXiv:2210.11471  [pdf, other

    cs.CL

    Choose Your Lenses: Flaws in Gender Bias Evaluation

    Authors: Hadas Orgad, Yonatan Belinkov

    Abstract: Considerable efforts to measure and mitigate gender bias in recent years have led to the introduction of an abundance of tasks, datasets, and metrics used in this vein. In this position paper, we assess the current paradigm of gender bias evaluation and identify several flaws in it. First, we highlight the importance of extrinsic bias metrics that measure how a model's performance on some task is… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: Accepted to the 4th Workshop on Gender Bias in Natural Language Processing

    MSC Class: 68T50 ACM Class: I.2.7

  7. arXiv:2204.06827  [pdf, other

    cs.CL

    How Gender Debiasing Affects Internal Model Representations, and Why It Matters

    Authors: Hadas Orgad, Seraphina Goldfarb-Tarrant, Yonatan Belinkov

    Abstract: Common studies of gender bias in NLP focus either on extrinsic bias measured by model performance on a downstream task or on intrinsic bias found in models' internal representations. However, the relationship between extrinsic and intrinsic bias is relatively unknown. In this work, we illuminate this relationship by measuring both quantities together: we debias a model during downstream fine-tunin… ▽ More

    Submitted 16 May, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted to NAACL 2022

    MSC Class: 68T50 ACM Class: I.2.7