Skip to main content

Showing 1–4 of 4 results for author: Berrios, W

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.15108  [pdf, other

    cs.CV cs.AI

    Leveraging Diffusion Perturbations for Measuring Fairness in Computer Vision

    Authors: Nicholas Lui, Bryan Chia, William Berrios, Candace Ross, Douwe Kiela

    Abstract: Computer vision models have been known to encode harmful biases, leading to the potentially unfair treatment of historically marginalized groups, such as people of color. However, there remains a lack of datasets balanced along demographic traits that can be used to evaluate the downstream fairness of these models. In this work, we demonstrate that diffusion models can be leveraged to create such… ▽ More

    Submitted 11 February, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: The Appendix can be found at https://bit.ly/dp-appendix; Added link to code and fixed formatting (Feb 10 2024)

  2. arXiv:2308.08003  [pdf, other

    cs.HC cs.LG

    BI-LAVA: Biocuration with Hierarchical Image Labeling through Active Learning and Visual Analysis

    Authors: Juan Trelles, Andrew Wentzel, William Berrios, G. Elisabeta Marai

    Abstract: In the biomedical domain, taxonomies organize the acquisition modalities of scientific images in hierarchical structures. Such taxonomies leverage large sets of correct image labels and provide essential information about the importance of a scientific publication, which could then be used in biocuration tasks. However, the hierarchical nature of the labels, the overhead of processing images, the… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 15 pages, 6 figures

  3. arXiv:2306.16410  [pdf, other

    cs.CL cs.CV

    Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language

    Authors: William Berrios, Gautam Mittal, Tristan Thrush, Douwe Kiela, Amanpreet Singh

    Abstract: We propose LENS, a modular approach for tackling computer vision problems by leveraging the power of large language models (LLMs). Our system uses a language model to reason over outputs from a set of independent and highly descriptive vision modules that provide exhaustive information about an image. We evaluate the approach on pure computer vision settings such as zero- and few-shot object recog… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  4. arXiv:2203.06649  [pdf, other

    q-bio.NC cs.AI cs.CV cs.LG cs.NE

    Joint rotational invariance and adversarial training of a dual-stream Transformer yields state of the art Brain-Score for Area V4

    Authors: William Berrios, Arturo Deza

    Abstract: Modern high-scoring models of vision in the brain score competition do not stem from Vision Transformers. However, in this paper, we provide evidence against the unexpected trend of Vision Transformers (ViT) being not perceptually aligned with human visual representations by showing how a dual-stream Transformer, a CrossViT$~\textit{a la}$ Chen et al. (2021), under a joint rotationally-invariant a… ▽ More

    Submitted 17 October, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: Under review