Skip to main content

Showing 1–13 of 13 results for author: Moayeri, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.07844  [pdf, other

    cs.CV

    Understanding and Mitigating Compositional Issues in Text-to-Image Generative Models

    Authors: Arman Zarei, Keivan Rezaei, Samyadeep Basu, Mehrdad Saberi, Mazda Moayeri, Priyatham Kattakinda, Soheil Feizi

    Abstract: Recent text-to-image diffusion-based generative models have the stunning ability to generate highly detailed and photo-realistic images and achieve state-of-the-art low FID scores on challenging image generation benchmarks. However, one of the primary failure modes of these text-to-image generative models is in composing attributes, objects, and their associated relationships accurately into an im… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2405.17247  [pdf, other

    cs.LG

    An Introduction to Vision-Language Modeling

    Authors: Florian Bordes, Richard Yuanzhe Pang, Anurag Ajay, Alexander C. Li, Adrien Bardes, Suzanne Petryk, Oscar Mañas, Zhiqiu Lin, Anas Mahmoud, Bargav Jayaraman, Mark Ibrahim, Melissa Hall, Yunyang Xiong, Jonathan Lebensold, Candace Ross, Srihari Jayakumar, Chuan Guo, Diane Bouchacourt, Haider Al-Tahan, Karthik Padthe, Vasu Sharma, Hu Xu, Xiaoqing Ellen Tan, Megan Richards, Samuel Lavoie , et al. (16 additional authors not shown)

    Abstract: Following the recent popularity of Large Language Models (LLMs), several attempts have been made to extend them to the visual domain. From having a visual assistant that could guide us through unfamiliar environments to generative models that produce images using only a high-level text description, the vision-language model (VLM) applications will significantly impact our relationship with technol… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  3. arXiv:2404.16717  [pdf, other

    cs.CV cs.AI cs.HC

    Embracing Diversity: Interpretable Zero-shot classification beyond one vector per class

    Authors: Mazda Moayeri, Michael Rabbat, Mark Ibrahim, Diane Bouchacourt

    Abstract: Vision-language models enable open-world classification of objects without the need for any retraining. While this zero-shot paradigm marks a significant advance, even today's best models exhibit skewed performance when objects are dissimilar from their typical depiction. Real world objects such as pears appear in a variety of forms -- from diced to whole, on a table or in a bowl -- yet standard V… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted to FAccT 2024

  4. arXiv:2404.08030  [pdf, other

    cs.CV cs.AI

    Rethinking Artistic Copyright Infringements in the Era of Text-to-Image Generative Models

    Authors: Mazda Moayeri, Samyadeep Basu, Sriram Balasubramanian, Priyatham Kattakinda, Atoosa Chengini, Robert Brauneis, Soheil Feizi

    Abstract: Recent text-to-image generative models such as Stable Diffusion are extremely adept at mimicking and generating copyrighted content, raising concerns amongst artists that their unique styles may be improperly copied. Understanding how generative models copy "artistic style" is more complex than duplicating a single image, as style is comprised by a set of elements (or signature) that frequently co… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  5. arXiv:2310.00164  [pdf, other

    cs.CV

    PRIME: Prioritizing Interpretability in Failure Mode Extraction

    Authors: Keivan Rezaei, Mehrdad Saberi, Mazda Moayeri, Soheil Feizi

    Abstract: In this work, we study the challenge of providing human-understandable descriptions for failure modes in trained image classification models. Existing works address this problem by first identifying clusters (or directions) of incorrectly classified samples in a latent space and then aiming to provide human-understandable text descriptions for them. We observe that in some cases, describing text d… ▽ More

    Submitted 14 March, 2024; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: Accepted to ICLR 2024

  6. arXiv:2306.07304  [pdf, other

    cs.LG cs.AI

    A Holistic Approach to Unifying Automatic Concept Extraction and Concept Importance Estimation

    Authors: Thomas Fel, Victor Boutin, Mazda Moayeri, Rémi Cadène, Louis Bethune, Léo andéol, Mathieu Chalvidal, Thomas Serre

    Abstract: In recent years, concept-based approaches have emerged as some of the most promising explainability methods to help us interpret the decisions of Artificial Neural Networks (ANNs). These methods seek to discover intelligible visual 'concepts' buried within the complex patterns of ANN activations in two key steps: (1) concept extraction followed by (2) importance estimation. While these two steps a… ▽ More

    Submitted 29 October, 2023; v1 submitted 11 June, 2023; originally announced June 2023.

    Journal ref: Conference on Neural Information Processing Systems (NeurIPS), 2023

  7. arXiv:2305.06386  [pdf, other

    cs.CV cs.AI cs.HC cs.LG

    Text-To-Concept (and Back) via Cross-Model Alignment

    Authors: Mazda Moayeri, Keivan Rezaei, Maziar Sanjabi, Soheil Feizi

    Abstract: We observe that the map** between an image's representation in one model to its representation in another can be learned surprisingly well with just a linear layer, even across diverse models. Building on this observation, we propose $\textit{text-to-concept}$, where features from a fixed pretrained model are aligned linearly to the CLIP space, so that text embeddings from CLIP's text encoder be… ▽ More

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: Accepted to ICML 2023 and CVPR4XAI workshop 2023

  8. arXiv:2212.02648  [pdf, other

    cs.CV cs.AI cs.HC cs.LG

    Spuriosity Rankings: Sorting Data to Measure and Mitigate Biases

    Authors: Mazda Moayeri, Wenxiao Wang, Sahil Singla, Soheil Feizi

    Abstract: We present a simple but effective method to measure and mitigate model biases caused by reliance on spurious cues. Instead of requiring costly changes to one's data or model training, our method better utilizes the data one already has by sorting them. Specifically, we rank images within their classes based on spuriosity (the degree to which common spurious cues are present), proxied via deep neur… ▽ More

    Submitted 30 October, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: Accepted to NeurIPS '23 (Spotlight). Camera ready version

  9. arXiv:2211.09859  [pdf, other

    cs.CV

    Data-Centric Debugging: mitigating model failures via targeted data collection

    Authors: Sahil Singla, Atoosa Malemir Chegini, Mazda Moayeri, Soheil Feiz

    Abstract: Deep neural networks can be unreliable in the real world when the training set does not adequately cover all the settings where they are deployed. Focusing on image classification, we consider the setting where we have an error distribution $\mathcal{E}$ representing a deployment scenario where the model fails. We have access to a small set of samples $\mathcal{E}_{sample}$ from $\mathcal{E}$ and… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

  10. arXiv:2209.07592  [pdf, other

    cs.LG cs.CV

    Explicit Tradeoffs between Adversarial and Natural Distributional Robustness

    Authors: Mazda Moayeri, Kiarash Banihashem, Soheil Feizi

    Abstract: Several existing works study either adversarial or natural distributional robustness of deep neural networks separately. In practice, however, models need to enjoy both types of robustness to ensure reliability. In this work, we bridge this gap and show that in fact, explicit tradeoffs exist between adversarial and natural distributional robustness. We first consider a simple linear regression set… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: Accepted to NeurIPS 2022

  11. arXiv:2203.15566  [pdf, other

    cs.CV cs.AI

    Core Risk Minimization using Salient ImageNet

    Authors: Sahil Singla, Mazda Moayeri, Soheil Feizi

    Abstract: Deep neural networks can be unreliable in the real world especially when they heavily use spurious features for their predictions. Recently, Singla & Feizi (2022) introduced the Salient Imagenet dataset by annotating and localizing core and spurious features of ~52k samples from 232 classes of Imagenet. While this dataset is useful for evaluating the reliance of pretrained models on spurious featu… ▽ More

    Submitted 27 March, 2022; originally announced March 2022.

  12. arXiv:2201.10766  [pdf, other

    cs.CV

    A Comprehensive Study of Image Classification Model Sensitivity to Foregrounds, Backgrounds, and Visual Attributes

    Authors: Mazda Moayeri, Phillip Pope, Yogesh Balaji, Soheil Feizi

    Abstract: While datasets with single-label supervision have propelled rapid advances in image classification, additional annotations are necessary in order to quantitatively assess how models make predictions. To this end, for a subset of ImageNet samples, we collect segmentation masks for the entire object and $18$ informative attributes. We call this dataset RIVAL10 (RIch Visual Attributes with Localizati… ▽ More

    Submitted 26 January, 2022; originally announced January 2022.

  13. arXiv:2108.13797  [pdf, other

    cs.CR cs.LG

    Sample Efficient Detection and Classification of Adversarial Attacks via Self-Supervised Embeddings

    Authors: Mazda Moayeri, Soheil Feizi

    Abstract: Adversarial robustness of deep models is pivotal in ensuring safe deployment in real world settings, but most modern defenses have narrow scope and expensive costs. In this paper, we propose a self-supervised method to detect adversarial attacks and classify them to their respective threat models, based on a linear model operating on the embeddings from a pre-trained self-supervised encoder. We us… ▽ More

    Submitted 30 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021