Skip to main content

Showing 1–6 of 6 results for author: Paes, L M

.
  1. arXiv:2407.08571  [pdf, other

    cs.AI cs.IR cs.IT cs.LG stat.ML

    Multi-Group Proportional Representation

    Authors: Alex Oesterling, Claudio Mayrink Verdun, Carol Xuan Long, Alex Glynn, Lucas Monteiro Paes, Sajani Vithana, Martina Cardone, Flavio P. Calmon

    Abstract: Image search and retrieval tasks can perpetuate harmful stereotypes, erase cultural identities, and amplify social disparities. Current approaches to mitigate these representational harms balance the number of retrieved items across population groups defined by a small number of (often binary) attributes. However, most existing methods overlook intersectional groups determined by combinations of g… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 35 pages, 24 figures. Under review

  2. arXiv:2405.19562  [pdf, other

    cs.CY cs.CL cs.LG

    Selective Explanations

    Authors: Lucas Monteiro Paes, Dennis Wei, Flavio P. Calmon

    Abstract: Feature attribution methods explain black-box machine learning (ML) models by assigning importance scores to input features. These methods can be computationally expensive for large ML models. To address this challenge, there has been increasing efforts to develop amortized explainers, where a machine learning model is trained to predict feature attribution scores with only one inference. Despite… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  3. arXiv:2403.14459  [pdf, other

    cs.CL cs.AI

    Multi-Level Explanations for Generative Language Models

    Authors: Lucas Monteiro Paes, Dennis Wei, Hyo ** Do, Hendrik Strobelt, Ronny Luss, Amit Dhurandhar, Manish Nagireddy, Karthikeyan Natesan Ramamurthy, Prasanna Sattigeri, Werner Geyer, Soumya Ghosh

    Abstract: Perturbation-based explanation methods such as LIME and SHAP are commonly applied to text classification. This work focuses on their extension to generative language models. To address the challenges of text as output and long text inputs, we propose a general framework called MExGen that can be instantiated with different attribution algorithms. To handle text output, we introduce the notion of s… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  4. arXiv:2402.16979  [pdf, other

    cs.CY cs.LG cs.SI

    Algorithmic Arbitrariness in Content Moderation

    Authors: Juan Felipe Gomez, Caio Vieira Machado, Lucas Monteiro Paes, Flavio P. Calmon

    Abstract: Machine learning (ML) is widely used to moderate online content. Despite its scalability relative to human moderation, the use of ML introduces unique challenges to content moderation. One such challenge is predictive multiplicity: multiple competing models for content classification may perform equally well on average, yet assign conflicting predictions to the same content. This multiplicity can… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  5. arXiv:2312.03867  [pdf, other

    cs.LG cs.CY cs.IT stat.ML

    Multi-Group Fairness Evaluation via Conditional Value-at-Risk Testing

    Authors: Lucas Monteiro Paes, Ananda Theertha Suresh, Alex Beutel, Flavio P. Calmon, Ahmad Beirami

    Abstract: Machine learning (ML) models used in prediction and classification tasks may display performance disparities across population groups determined by sensitive attributes (e.g., race, sex, age). We consider the problem of evaluating the performance of a fixed ML model across population groups defined by multiple sensitive attributes (e.g., race and sex and age). Here, the sample complexity for estim… ▽ More

    Submitted 25 May, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: Accepted for publication in the IEEE Journal on Selected Areas in Information Theory (JSAIT)

  6. arXiv:2306.05500  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    Word-Level Explanations for Analyzing Bias in Text-to-Image Models

    Authors: Alexander Lin, Lucas Monteiro Paes, Sree Harsha Tanneru, Suraj Srinivas, Himabindu Lakkaraju

    Abstract: Text-to-image models take a sentence (i.e., prompt) and generate images associated with this input prompt. These models have created award wining-art, videos, and even synthetic datasets. However, text-to-image (T2I) models can generate images that underrepresent minorities based on race and sex. This paper investigates which word in the input prompt is responsible for bias in generated images. We… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

    Comments: 5 main pages, 3 pages in appendix, and 3 figures