Skip to main content

Showing 1–11 of 11 results for author: Adcock, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.00035  [pdf, other

    cs.CV cs.AI

    FACET: Fairness in Computer Vision Evaluation Benchmark

    Authors: Laura Gustafson, Chloe Rolland, Nikhila Ravi, Quentin Duval, Aaron Adcock, Cheng-Yang Fu, Melissa Hall, Candace Ross

    Abstract: Computer vision models have known performance disparities across attributes such as gender and skin tone. This means during tasks such as classification and detection, model performance differs for certain classes based on the demographics of the people in the image. These disparities have been shown to exist, but until now there has not been a unified approach to measure these differences for com… ▽ More

    Submitted 31 August, 2023; originally announced September 2023.

  2. arXiv:2303.13496  [pdf, other

    cs.CV cs.AI cs.LG

    The effectiveness of MAE pre-pretraining for billion-scale pretraining

    Authors: Mannat Singh, Quentin Duval, Kalyan Vasudev Alwala, Haoqi Fan, Vaibhav Aggarwal, Aaron Adcock, Armand Joulin, Piotr Dollár, Christoph Feichtenhofer, Ross Girshick, Rohit Girdhar, Ishan Misra

    Abstract: This paper revisits the standard pretrain-then-finetune paradigm used in computer vision for visual recognition tasks. Typically, state-of-the-art foundation models are pretrained using large scale (weakly) supervised datasets with billions of images. We introduce an additional pre-pretraining stage that is simple and uses the self-supervised MAE technique to initialize the model. While MAE has on… ▽ More

    Submitted 24 January, 2024; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: ICCV 2023. Models available at https://github.com/facebookresearch/maws/

  3. arXiv:2301.11100  [pdf, other

    cs.CV cs.CY cs.HC

    Vision-Language Models Performing Zero-Shot Tasks Exhibit Gender-based Disparities

    Authors: Melissa Hall, Laura Gustafson, Aaron Adcock, Ishan Misra, Candace Ross

    Abstract: We explore the extent to which zero-shot vision-language models exhibit gender bias for different vision tasks. Vision models traditionally required task-specific labels for representing concepts, as well as finetuning; zero-shot models like CLIP instead perform tasks with an open-vocabulary, meaning they do not need a fixed set of labels, by using text embeddings to represent concepts. With these… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

  4. arXiv:2301.02560  [pdf, other

    cs.CV

    GeoDE: a Geographically Diverse Evaluation Dataset for Object Recognition

    Authors: Vikram V. Ramaswamy, Sing Yu Lin, Dora Zhao, Aaron B. Adcock, Laurens van der Maaten, Deepti Ghadiyaram, Olga Russakovsky

    Abstract: Current dataset collection methods typically scrape large amounts of data from the web. While this technique is extremely scalable, data collected in this way tends to reinforce stereotypical biases, can contain personally identifiable information, and typically originates from Europe and North America. In this work, we rethink the dataset collection paradigm and introduce GeoDE, a geographically… ▽ More

    Submitted 7 April, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

  5. arXiv:2201.11706  [pdf, other

    cs.LG cs.CV

    A Systematic Study of Bias Amplification

    Authors: Melissa Hall, Laurens van der Maaten, Laura Gustafson, Maxwell Jones, Aaron Adcock

    Abstract: Recent research suggests that predictions made by machine-learning models can amplify biases present in the training data. When a model amplifies bias, it makes certain predictions at a higher rate for some groups than expected based on training-data statistics. Mitigating such bias amplification requires a deep understanding of the mechanics in modern machine learning that give rise to that ampli… ▽ More

    Submitted 19 October, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

  6. arXiv:2201.08371  [pdf, other

    cs.CV

    Revisiting Weakly Supervised Pre-Training of Visual Perception Models

    Authors: Mannat Singh, Laura Gustafson, Aaron Adcock, Vinicius de Freitas Reis, Bugra Gedik, Raj Prateek Kosaraju, Dhruv Mahajan, Ross Girshick, Piotr Dollár, Laurens van der Maaten

    Abstract: Model pre-training is a cornerstone of modern visual recognition systems. Although fully supervised pre-training on datasets like ImageNet is still the de-facto standard, recent studies suggest that large-scale weakly supervised pre-training can outperform fully supervised approaches. This paper revisits weakly-supervised pre-training of models using hashtag supervision with modern versions of res… ▽ More

    Submitted 2 April, 2022; v1 submitted 20 January, 2022; originally announced January 2022.

    Comments: CVPR 2022

  7. arXiv:2111.09887  [pdf, other

    cs.CV cs.LG

    PyTorchVideo: A Deep Learning Library for Video Understanding

    Authors: Haoqi Fan, Tullie Murrell, Heng Wang, Kalyan Vasudev Alwala, Yanghao Li, Yilei Li, Bo Xiong, Nikhila Ravi, Meng Li, Haichuan Yang, Jitendra Malik, Ross Girshick, Matt Feiszli, Aaron Adcock, Wan-Yen Lo, Christoph Feichtenhofer

    Abstract: We introduce PyTorchVideo, an open-source deep-learning library that provides a rich set of modular, efficient, and reproducible components for a variety of video understanding tasks, including classification, detection, self-supervised learning, and low-level processing. The library covers a full stack of video understanding tools including multimodal data loading, transformations, and models tha… ▽ More

    Submitted 18 November, 2021; originally announced November 2021.

    Comments: Technical report

  8. arXiv:2006.10734  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Forward Prediction for Physical Reasoning

    Authors: Rohit Girdhar, Laura Gustafson, Aaron Adcock, Laurens van der Maaten

    Abstract: Physical reasoning requires forward prediction: the ability to forecast what will happen next given some initial world state. We study the performance of state-of-the-art forward-prediction models in the complex physical-reasoning tasks of the PHYRE benchmark. We do so by incorporating models that operate on object or pixel-based representations of the world into simple physical-reasoning agents.… ▽ More

    Submitted 29 March, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Webpage/code/models: https://facebookresearch.github.io/phyre-fwd/

  9. arXiv:1411.1546  [pdf, other

    cs.DS cs.SI physics.soc-ph stat.AP

    Tree decompositions and social graphs

    Authors: Aaron B. Adcock, Blair D. Sullivan, Michael W. Mahoney

    Abstract: Recent work has established that large informatics graphs such as social and information networks have non-trivial tree-like structure when viewed at moderate size scales. Here, we present results from the first detailed empirical evaluation of the use of tree decomposition (TD) heuristics for structure identification and extraction in social graphs. Although TDs have historically been used in str… ▽ More

    Submitted 3 May, 2016; v1 submitted 6 November, 2014; originally announced November 2014.

    Comments: v2 has 44 pages, 21 figures, 7 tables, 107 references. To appear in Internet Mathematics

  10. arXiv:1410.5845  [pdf, other

    cs.CC

    Zig-Zag Numberlink is NP-Complete

    Authors: Aaron Adcock, Erik D. Demaine, Martin L. Demaine, Michael P. O'Brien, Felix Reidl, Fernando Sánchez Villaamil, Blair D. Sullivan

    Abstract: When can $t$ terminal pairs in an $m \times n$ grid be connected by $t$ vertex-disjoint paths that cover all vertices of the grid? We prove that this problem is NP-complete. Our hardness result can be compared to two previous NP-hardness proofs: Lynch's 1975 proof without the ``cover all vertices'' constraint, and Kotsuma and Takenaga's 2010 proof when the paths are restricted to have the fewest p… ▽ More

    Submitted 21 October, 2014; originally announced October 2014.

  11. arXiv:1210.0866  [pdf, other

    cs.CV cs.CG math.AT

    Classification of Hepatic Lesions using the Matching Metric

    Authors: Aaron Adcock, Daniel Rubin, Gunnar Carlsson

    Abstract: In this paper we present a methodology of classifying hepatic (liver) lesions using multidimensional persistent homology, the matching metric (also called the bottleneck distance), and a support vector machine. We present our classification results on a dataset of 132 lesions that have been outlined and annotated by radiologists. We find that topological features are useful in the classification o… ▽ More

    Submitted 2 October, 2012; originally announced October 2012.