Skip to main content

Showing 1–13 of 13 results for author: Gustafson, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.00035  [pdf, other

    cs.CV cs.AI

    FACET: Fairness in Computer Vision Evaluation Benchmark

    Authors: Laura Gustafson, Chloe Rolland, Nikhila Ravi, Quentin Duval, Aaron Adcock, Cheng-Yang Fu, Melissa Hall, Candace Ross

    Abstract: Computer vision models have known performance disparities across attributes such as gender and skin tone. This means during tasks such as classification and detection, model performance differs for certain classes based on the demographics of the people in the image. These disparities have been shown to exist, but until now there has not been a unified approach to measure these differences for com… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

  2. arXiv:2304.05391  [pdf, other

    cs.CV

    Pinpointing Why Object Recognition Performance Degrades Across Income Levels and Geographies

    Authors: Laura Gustafson, Megan Richards, Melissa Hall, Caner Hazirbas, Diane Bouchacourt, Mark Ibrahim

    Abstract: Despite impressive advances in object-recognition, deep learning systems' performance degrades significantly across geographies and lower income levels raising pressing concerns of inequity. Addressing such performance gaps remains a challenge, as little is understood about why performance degrades across incomes or geographies. We take a step in this direction by annotating images from Dollar Str… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  3. arXiv:2304.02643  [pdf, other

    cs.CV cs.AI cs.LG

    Segment Anything

    Authors: Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alexander C. Berg, Wan-Yen Lo, Piotr Dollár, Ross Girshick

    Abstract: We introduce the Segment Anything (SA) project: a new task, model, and dataset for image segmentation. Using our efficient model in a data collection loop, we built the largest segmentation dataset to date (by far), with over 1 billion masks on 11M licensed and privacy respecting images. The model is designed and trained to be promptable, so it can transfer zero-shot to new image distributions and… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Project web-page: https://segment-anything.com

  4. arXiv:2302.08572  [pdf, other

    cs.CV cs.HC cs.SI

    Towards Reliable Assessments of Demographic Disparities in Multi-Label Image Classifiers

    Authors: Melissa Hall, Bobbie Chern, Laura Gustafson, Denisse Ventura, Harshad Kulkarni, Candace Ross, Nicolas Usunier

    Abstract: Disaggregated performance metrics across demographic groups are a hallmark of fairness assessments in computer vision. These metrics successfully incentivized performance improvements on person-centric tasks such as face analysis and are used to understand risks of modern models. However, there is a lack of discussion on the vulnerabilities of these measurements for more complex computer vision ta… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

  5. arXiv:2301.11100  [pdf, other

    cs.CV cs.CY cs.HC

    Vision-Language Models Performing Zero-Shot Tasks Exhibit Gender-based Disparities

    Authors: Melissa Hall, Laura Gustafson, Aaron Adcock, Ishan Misra, Candace Ross

    Abstract: We explore the extent to which zero-shot vision-language models exhibit gender bias for different vision tasks. Vision models traditionally required task-specific labels for representing concepts, as well as finetuning; zero-shot models like CLIP instead perform tasks with an open-vocabulary, meaning they do not need a fixed set of labels, by using text embeddings to represent concepts. With these… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

  6. arXiv:2201.11706  [pdf, other

    cs.LG cs.CV

    A Systematic Study of Bias Amplification

    Authors: Melissa Hall, Laurens van der Maaten, Laura Gustafson, Maxwell Jones, Aaron Adcock

    Abstract: Recent research suggests that predictions made by machine-learning models can amplify biases present in the training data. When a model amplifies bias, it makes certain predictions at a higher rate for some groups than expected based on training-data statistics. Mitigating such bias amplification requires a deep understanding of the mechanics in modern machine learning that give rise to that ampli… ▽ More

    Submitted 19 October, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

  7. arXiv:2201.08371  [pdf, other

    cs.CV

    Revisiting Weakly Supervised Pre-Training of Visual Perception Models

    Authors: Mannat Singh, Laura Gustafson, Aaron Adcock, Vinicius de Freitas Reis, Bugra Gedik, Raj Prateek Kosaraju, Dhruv Mahajan, Ross Girshick, Piotr Dollár, Laurens van der Maaten

    Abstract: Model pre-training is a cornerstone of modern visual recognition systems. Although fully supervised pre-training on datasets like ImageNet is still the de-facto standard, recent studies suggest that large-scale weakly supervised pre-training can outperform fully supervised approaches. This paper revisits weakly-supervised pre-training of models using hashtag supervision with modern versions of res… ▽ More

    Submitted 2 April, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: CVPR 2022

  8. arXiv:2006.10734  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Forward Prediction for Physical Reasoning

    Authors: Rohit Girdhar, Laura Gustafson, Aaron Adcock, Laurens van der Maaten

    Abstract: Physical reasoning requires forward prediction: the ability to forecast what will happen next given some initial world state. We study the performance of state-of-the-art forward-prediction models in the complex physical-reasoning tasks of the PHYRE benchmark. We do so by incorporating models that operate on object or pixel-based representations of the world into simple physical-reasoning agents.… ▽ More

    Submitted 29 March, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Webpage/code/models: https://facebookresearch.github.io/phyre-fwd/

  9. arXiv:1908.05656  [pdf, other

    cs.LG cs.AI stat.ML

    PHYRE: A New Benchmark for Physical Reasoning

    Authors: Anton Bakhtin, Laurens van der Maaten, Justin Johnson, Laura Gustafson, Ross Girshick

    Abstract: Understanding and reasoning about physics is an important ability of intelligent agents. We develop the PHYRE benchmark for physical reasoning that contains a set of simple classical mechanics puzzles in a 2D physical environment. The benchmark is designed to encourage the development of learning algorithms that are sample-efficient and generalize well across puzzles. We test several modern learni… ▽ More

    Submitted 15 August, 2019; originally announced August 2019.

  10. arXiv:1906.12266  [pdf, other

    cs.LG cs.AI stat.ML

    Growing Action Spaces

    Authors: Gregory Farquhar, Laura Gustafson, Zeming Lin, Shimon Whiteson, Nicolas Usunier, Gabriel Synnaeve

    Abstract: In complex tasks, such as those with large combinatorial action spaces, random exploration may be too inefficient to achieve meaningful learning progress. In this work, we use a curriculum of progressively growing action spaces to accelerate learning. We assume the environment is out of our control, but that the agent may set an internal curriculum by initially restricting its action space. Our ap… ▽ More

    Submitted 28 June, 2019; originally announced June 2019.

  11. Performance-Efficiency Trade-off of Low-Precision Numerical Formats in Deep Neural Networks

    Authors: Zachariah Carmichael, Hamed F. Langroudi, Char Khazanov, Jeffrey Lillie, John L. Gustafson, Dhireesha Kudithipudi

    Abstract: Deep neural networks (DNNs) have been demonstrated as effective prognostic models across various domains, e.g. natural language processing, computer vision, and genomics. However, modern-day DNNs demand high compute and memory storage for executing any reasonably complex task. To optimize the inference time and alleviate the power consumption of these networks, DNN accelerators with low-precision… ▽ More

    Submitted 25 March, 2019; originally announced March 2019.

    Comments: 9 pages, Proceedings of the ACM Conference for Next Generation Arithmetic (CoNGA) 2019

  12. arXiv:1812.01762  [pdf, other

    cs.DC cs.LG cs.NE

    Deep Positron: A Deep Neural Network Using the Posit Number System

    Authors: Zachariah Carmichael, Hamed F. Langroudi, Char Khazanov, Jeffrey Lillie, John L. Gustafson, Dhireesha Kudithipudi

    Abstract: The recent surge of interest in Deep Neural Networks (DNNs) has led to increasingly complex networks that tax computational and memory resources. Many DNNs presently use 16-bit or 32-bit floating point operations. Significant performance and power gains can be obtained when DNN accelerators support low-precision numerical formats. Despite considerable research, there is still a knowledge gap on ho… ▽ More

    Submitted 18 January, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: 6 pages, Design, Automation and Test in Europe 2019

  13. arXiv:1709.09108  [pdf

    cs.AI cs.MS cs.PF

    Tensors Come of Age: Why the AI Revolution will help HPC

    Authors: John L. Gustafson, Lenore M. Mullin

    Abstract: This article discusses how the automation of tensor algorithms, based on A Mathematics of Arrays and Psi Calculus, and a new way to represent numbers, Unum Arithmetic, enables mechanically provable, scalable, portable, and more numerically accurate software.

    Submitted 26 September, 2017; originally announced September 2017.

    Comments: To be published in this years 30th anniversary edition of HPCwire