Skip to main content

Showing 1–21 of 21 results for author: Black, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.14782  [pdf, other

    cs.CL

    Lessons from the Trenches on Reproducible Evaluation of Language Models

    Authors: Stella Biderman, Hailey Schoelkopf, Lintang Sutawika, Leo Gao, Jonathan Tow, Baber Abbasi, Alham Fikri Aji, Pawan Sasanka Ammanamanchi, Sidney Black, Jordan Clive, Anthony DiPofi, Julen Etxaniz, Benjamin Fattori, Jessica Zosa Forde, Charles Foster, Jeffrey Hsu, Mimansa Jaiswal, Wilson Y. Lee, Haonan Li, Charles Lovering, Niklas Muennighoff, Ellie Pavlick, Jason Phang, Aviya Skowron, Samson Tan , et al. (5 additional authors not shown)

    Abstract: Effective evaluation of language models remains an open challenge in NLP. Researchers and engineers face methodological issues such as the sensitivity of models to evaluation setup, difficulty of proper comparisons across methods, and the lack of reproducibility and transparency. In this paper we draw on three years of experience in evaluating large language models to provide guidance and lessons… ▽ More

    Submitted 29 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2402.17891  [pdf, other

    cs.CV

    Weakly Supervised Co-training with Swap** Assignments for Semantic Segmentation

    Authors: Xinyu Yang, Hossein Rahmani, Sue Black, Bryan M. Williams

    Abstract: Class activation maps (CAMs) are commonly employed in weakly supervised semantic segmentation (WSSS) to produce pseudo-labels. Due to incomplete or excessive class activation, existing studies often resort to offline CAM refinement, introducing additional stages or proposing offline modules. This can cause optimization difficulties for single-stage methods and limit generalizability. In this study… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  3. arXiv:2402.06694  [pdf

    cs.LG cs.AI

    Scaling Intelligent Agents in Combat Simulations for Wargaming

    Authors: Scotty Black, Christian Darken

    Abstract: Remaining competitive in future conflicts with technologically-advanced competitors requires us to accelerate our research and development in artificial intelligence (AI) for wargaming. More importantly, leveraging machine learning for intelligent combat behavior development will be key to one day achieving superhuman performance in this domain--elevating the quality and accelerating the speed of… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2402.06075

    Journal ref: I/ITSEC Conference Proceedings 2023

  4. Scaling Artificial Intelligence for Digital Wargaming in Support of Decision-Making

    Authors: Scotty Black, Christian Darken

    Abstract: In this unprecedented era of technology-driven transformation, it becomes more critical than ever that we aggressively invest in develo** robust artificial intelligence (AI) for wargaming in support of decision-making. By advancing AI-enabled systems and pairing these with human judgment, we will be able to enhance all-domain awareness, improve the speed and quality of our decision cycles, offer… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Report number: STO-MP-MSG-207-23

    Journal ref: NATO STO-MP-MSG-207 2023

  5. arXiv:2312.13770  [pdf, other

    cs.CV

    3D Points Splatting for Real-Time Dynamic Hand Reconstruction

    Authors: Zheheng Jiang, Hossein Rahmani, Sue Black, Bryan M. Williams

    Abstract: We present 3D Points Splatting Hand Reconstruction (3D-PSHR), a real-time and photo-realistic hand reconstruction approach. We propose a self-adaptive canonical points upsampling strategy to achieve high-resolution hand geometry representation. This is followed by a self-adaptive deformation that deforms the hand from the canonical space to the target pose, adapting to the dynamic changing of cano… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  6. arXiv:2307.08456  [pdf, other

    eess.IV cs.CV

    Domain Adaptation using Silver Standard Masks for Lateral Ventricle Segmentation in FLAIR MRI

    Authors: Owen Crystal, Pejman J. Maralani, Sandra Black, Alan R. Moody, April Khademi

    Abstract: Lateral ventricular volume (LVV) is an important biomarker for clinical investigation. We present the first transfer learning-based LVV segmentation method for fluid-attenuated inversion recovery (FLAIR) MRI. To mitigate covariate shifts between source and target domains, this work proposes an domain adaptation method that optimizes performance on three target datasets. Silver standard (SS) masks… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: 16 pages, 3 figures

  7. arXiv:2307.05745  [pdf, other

    cs.CR cs.DC

    CloudSec: An Extensible Automated Reasoning Framework for Cloud Security Policies

    Authors: Joe Stubbs, Smruti Padhy, Richard Cardone, Steven Black

    Abstract: Users increasingly create, manage and share digital resources, including sensitive data, via cloud platforms and APIs. Platforms encode the rules governing access to these resources, referred to as \textit{security policies}, using different systems and semantics. As the number of resources and rules grows, the challenge of reasoning about them collectively increases. Formal methods tools, such as… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  8. arXiv:2304.14299  [pdf, other

    cs.CV

    A Probabilistic Attention Model with Occlusion-aware Texture Regression for 3D Hand Reconstruction from a Single RGB Image

    Authors: Zheheng Jiang, Hossein Rahmani, Sue Black, Bryan M. Williams

    Abstract: Recently, deep learning based approaches have shown promising results in 3D hand reconstruction from a single RGB image. These approaches can be roughly divided into model-based approaches, which are heavily dependent on the model's parameter space, and model-free approaches, which require large numbers of 3D ground truths to reduce depth ambiguity and struggle in weakly-supervised scenarios. To o… ▽ More

    Submitted 27 April, 2023; originally announced April 2023.

  9. arXiv:2304.08557  [pdf, other

    cs.CR cs.DC

    A Decentralized Authorization and Security Framework for Distributed Research Workflows

    Authors: Richard Cardone, Smruti Padhy, Steven Black, Sean Cleveland, Joe Stubbs

    Abstract: Research challenges such as climate change and the search for habitable planets increasingly use academic and commercial computing resources distributed across different institutions and physical sites. Furthermore, such analyses often require a level of automation that precludes direct human interaction, and securing these workflows involves adherence to security policies across institutions. In… ▽ More

    Submitted 13 May, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

    Comments: 10 pages. Short version of this paper to be published on COMPSAC 2023 proceedings

    ACM Class: H.4.0

  10. arXiv:2211.12312  [pdf, other

    cs.LG cs.AI

    Interpreting Neural Networks through the Polytope Lens

    Authors: Sid Black, Lee Sharkey, Leo Grinsztajn, Eric Winsor, Dan Braun, Jacob Merizian, Kip Parker, Carlos Ramón Guevara, Beren Millidge, Gabriel Alfour, Connor Leahy

    Abstract: Mechanistic interpretability aims to explain what a neural network has learned at a nuts-and-bolts level. What are the fundamental primitives of neural network representations? Previous mechanistic descriptions have used individual neurons or their linear combinations to understand the representations a network has learned. But there are clues that neurons and their linear combinations are not the… ▽ More

    Submitted 22 November, 2022; originally announced November 2022.

    Comments: 22/11/22 initial upload

  11. arXiv:2204.06745  [pdf, other

    cs.CL

    GPT-NeoX-20B: An Open-Source Autoregressive Language Model

    Authors: Sid Black, Stella Biderman, Eric Hallahan, Quentin Anthony, Leo Gao, Laurence Golding, Horace He, Connor Leahy, Kyle McDonell, Jason Phang, Michael Pieler, USVSN Sai Prashanth, Shivanshu Purohit, Laria Reynolds, Jonathan Tow, Ben Wang, Samuel Weinbach

    Abstract: We introduce GPT-NeoX-20B, a 20 billion parameter autoregressive language model trained on the Pile, whose weights will be made freely and openly available to the public through a permissive license. It is, to the best of our knowledge, the largest dense autoregressive model that has publicly available weights at the time of submission. In this work, we describe \model{}'s architecture and trainin… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Comments: To appear in the Proceedings of the ACL Workshop on Challenges & Perspectives in Creating Large Language Models

  12. arXiv:2203.06823  [pdf, other

    eess.IV cs.CV

    SKM-TEA: A Dataset for Accelerated MRI Reconstruction with Dense Image Labels for Quantitative Clinical Evaluation

    Authors: Arjun D Desai, Andrew M Schmidt, Elka B Rubin, Christopher M Sandino, Marianne S Black, Valentina Mazzoli, Kathryn J Stevens, Robert Boutin, Christopher Ré, Garry E Gold, Brian A Hargreaves, Akshay S Chaudhari

    Abstract: Magnetic resonance imaging (MRI) is a cornerstone of modern medical imaging. However, long image acquisition times, the need for qualitative expert analysis, and the lack of (and difficulty extracting) quantitative indicators that are sensitive to tissue health have curtailed widespread clinical and research studies. While recent machine learning methods for MRI reconstruction and analysis have sh… ▽ More

    Submitted 13 March, 2022; originally announced March 2022.

    Comments: Accepted to NeurIPS Datasets & Benchmarks (2021)

  13. arXiv:2203.06060  [pdf, other

    eess.IV cs.CV

    ROOD-MRI: Benchmarking the robustness of deep learning segmentation models to out-of-distribution and corrupted data in MRI

    Authors: Lyndon Boone, Mahdi Biparva, Parisa Mojiri Forooshani, Joel Ramirez, Mario Masellis, Robert Bartha, Sean Symons, Stephen Strother, Sandra E. Black, Chris Heyn, Anne L. Martel, Richard H. Swartz, Maged Goubran

    Abstract: Deep artificial neural networks (DNNs) have moved to the forefront of medical image analysis due to their success in classification, segmentation, and detection challenges. A principal challenge in large-scale deployment of DNNs in neuroimage analysis is the potential for shifts in signal-to-noise ratio, contrast, resolution, and presence of artifacts from site to site due to variances in scanners… ▽ More

    Submitted 11 March, 2022; originally announced March 2022.

    Comments: 30 pages, 13 figures. For associated GitHub repository, see https://github.com/AICONSlab/roodmri

  14. arXiv:2112.05253  [pdf, other

    cs.CV cs.CL

    MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based Finetuning

    Authors: Constantin Eichenberg, Sidney Black, Samuel Weinbach, Letitia Parcalabescu, Anette Frank

    Abstract: Large-scale pretraining is fast becoming the norm in Vision-Language (VL) modeling. However, prevailing VL approaches are limited by the requirement for labeled data and the use of complex multi-step pretraining objectives. We present MAGMA - a simple method for augmenting generative language models with additional modalities using adapter-based finetuning. Building on Frozen, we train a series of… ▽ More

    Submitted 24 October, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: 13 pages, 6 figures, 2 tables. Minor improvements. Accepted at EMNLP 2022

    ACM Class: I.2.7; I.4.8; I.5.1

  15. arXiv:2108.02234  [pdf, other

    cs.CV

    Multi-Branch with Attention Network for Hand-Based Person Recognition

    Authors: Nathanael L. Baisa, Bryan Williams, Hossein Rahmani, Plamen Angelov, Sue Black

    Abstract: In this paper, we propose a novel hand-based person recognition method for the purpose of criminal investigations since the hand image is often the only available information in cases of serious crime such as sexual abuse. Our proposed method, Multi-Branch with Attention Network (MBA-Net), incorporates both channel and spatial attention modules in branches in addition to a global (without attentio… ▽ More

    Submitted 30 June, 2022; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: arXiv admin note: text overlap with arXiv:2101.05260

  16. arXiv:2106.05746  [pdf, other

    cs.CV

    The 2021 Hotel-ID to Combat Human Trafficking Competition Dataset

    Authors: Rashmi Kamath, Gregory Rolwes, Samuel Black, Abby Stylianou

    Abstract: Hotel recognition is an important task for human trafficking investigations since victims are often photographed in hotel rooms. Identifying these hotels is vital to trafficking investigations since they can help track down current and future victims who might be taken to the same places. Hotel recognition is a challenging fine grained visual classification task as there can be little similarity b… ▽ More

    Submitted 14 June, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: CVPR 2021 Workshop on Fine-Grained Visual Categorization (FGVC)

  17. arXiv:2101.05260  [pdf, other

    cs.CV

    Hand-Based Person Identification using Global and Part-Aware Deep Feature Representation Learning

    Authors: Nathanael L. Baisa, Bryan Williams, Hossein Rahmani, Plamen Angelov, Sue Black

    Abstract: In cases of serious crime, including sexual abuse, often the only available information with demonstrated potential for identification is images of the hands. Since this evidence is captured in uncontrolled situations, it is difficult to analyse. As global approaches to feature comparison are limited in this case, it is important to extend to consider local information. In this work, we propose ha… ▽ More

    Submitted 26 March, 2022; v1 submitted 13 January, 2021; originally announced January 2021.

  18. arXiv:2101.00027  [pdf, other

    cs.CL

    The Pile: An 800GB Dataset of Diverse Text for Language Modeling

    Authors: Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, Charles Foster, Jason Phang, Horace He, Anish Thite, Noa Nabeshima, Shawn Presser, Connor Leahy

    Abstract: Recent work has demonstrated that increased training dataset diversity improves general cross-domain knowledge and downstream generalization capability for large-scale language models. With this in mind, we present \textit{the Pile}: an 825 GiB English text corpus targeted at training large-scale language models. The Pile is constructed from 22 diverse high-quality subsets -- both existing and new… ▽ More

    Submitted 31 December, 2020; originally announced January 2021.

  19. arXiv:2012.12406  [pdf

    cs.CV q-bio.QM q-bio.TO

    Open source software for automatic subregional assessment of knee cartilage degradation using quantitative T2 relaxometry and deep learning

    Authors: Kevin A. Thomas, Dominik Krzemiński, Łukasz Kidziński, Rohan Paul, Elka B. Rubin, Eni Halilaj, Marianne S. Black, Akshay Chaudhari, Garry E. Gold, Scott L. Delp

    Abstract: Objective: We evaluate a fully-automated femoral cartilage segmentation model for measuring T2 relaxation values and longitudinal changes using multi-echo spin echo (MESE) MRI. We have open sourced this model and corresponding segmentations. Methods: We trained a neural network to segment femoral cartilage from MESE MRIs. Cartilage was divided into 12 subregions along medial-lateral, superficial-d… ▽ More

    Submitted 22 December, 2020; originally announced December 2020.

  20. arXiv:1911.03283  [pdf, other

    cs.CL

    Composing and Embedding the Words-as-Classifiers Model of Grounded Semantics

    Authors: Daniele Moro, Stacy Black, Casey Kennington

    Abstract: The words-as-classifiers model of grounded lexical semantics learns a semantic fitness score between physical entities and the words that are used to denote those entities. In this paper, we explore how such a model can incrementally perform composition and how the model can be unified with a distributional representation. For the latter, we leverage the classifier coefficients as an embedding. Fo… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: 10 pages

  21. Perivascular Spaces Segmentation in Brain MRI Using Optimal 3D Filtering

    Authors: Lucia Ballerini, Ruggiero Lovreglio, Maria del C. Valdes-Hernandez, Joel Ramirez, Bradley J. MacIntosh, Sandra E. Black, Joanna M. Wardlaw

    Abstract: Perivascular Spaces (PVS) are a recently recognised feature of Small Vessel Disease (SVD), also indicating neuroinflammation, and are an important part of the brain's circulation and glymphatic drainage system. Quantitative analysis of PVS on Magnetic Resonance Images (MRI) is important for understanding their relationship with neurological diseases. In this work, we propose a segmentation techniq… ▽ More

    Submitted 25 April, 2017; originally announced April 2017.