Search | arXiv e-print repository

Lessons from the Trenches on Reproducible Evaluation of Language Models

Authors: Stella Biderman, Hailey Schoelkopf, Lintang Sutawika, Leo Gao, Jonathan Tow, Baber Abbasi, Alham Fikri Aji, Pawan Sasanka Ammanamanchi, Sidney Black, Jordan Clive, Anthony DiPofi, Julen Etxaniz, Benjamin Fattori, Jessica Zosa Forde, Charles Foster, Jeffrey Hsu, Mimansa Jaiswal, Wilson Y. Lee, Haonan Li, Charles Lovering, Niklas Muennighoff, Ellie Pavlick, Jason Phang, Aviya Skowron, Samson Tan , et al. (5 additional authors not shown)

Abstract: Effective evaluation of language models remains an open challenge in NLP. Researchers and engineers face methodological issues such as the sensitivity of models to evaluation setup, difficulty of proper comparisons across methods, and the lack of reproducibility and transparency. In this paper we draw on three years of experience in evaluating large language models to provide guidance and lessons… ▽ More Effective evaluation of language models remains an open challenge in NLP. Researchers and engineers face methodological issues such as the sensitivity of models to evaluation setup, difficulty of proper comparisons across methods, and the lack of reproducibility and transparency. In this paper we draw on three years of experience in evaluating large language models to provide guidance and lessons for researchers. First, we provide an overview of common challenges faced in language model evaluation. Second, we delineate best practices for addressing or lessening the impact of these challenges on research. Third, we present the Language Model Evaluation Harness (lm-eval): an open source library for independent, reproducible, and extensible evaluation of language models that seeks to address these issues. We describe the features of the library as well as case studies in which the library has been used to alleviate these methodological concerns. △ Less

Submitted 29 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

arXiv:2402.17891 [pdf, other]

Weakly Supervised Co-training with Swap** Assignments for Semantic Segmentation

Authors: Xinyu Yang, Hossein Rahmani, Sue Black, Bryan M. Williams

Abstract: Class activation maps (CAMs) are commonly employed in weakly supervised semantic segmentation (WSSS) to produce pseudo-labels. Due to incomplete or excessive class activation, existing studies often resort to offline CAM refinement, introducing additional stages or proposing offline modules. This can cause optimization difficulties for single-stage methods and limit generalizability. In this study… ▽ More Class activation maps (CAMs) are commonly employed in weakly supervised semantic segmentation (WSSS) to produce pseudo-labels. Due to incomplete or excessive class activation, existing studies often resort to offline CAM refinement, introducing additional stages or proposing offline modules. This can cause optimization difficulties for single-stage methods and limit generalizability. In this study, we aim to reduce the observed CAM inconsistency and error to mitigate reliance on refinement processes. We propose an end-to-end WSSS model incorporating guided CAMs, wherein our segmentation model is trained while concurrently optimizing CAMs online. Our method, Co-training with Swap** Assignments (CoSA), leverages a dual-stream framework, where one sub-network learns from the swapped assignments generated by the other. We introduce three techniques: i) soft perplexity-based regularization to penalize uncertain regions; ii) a threshold-searching approach to dynamically revise the confidence threshold; and iii) contrastive separation to address the coexistence problem. CoSA demonstrates exceptional performance, achieving mIoU of 76.2\% and 51.0\% on VOC and COCO validation datasets, respectively, surpassing existing baselines by a substantial margin. Notably, CoSA is the first single-stage approach to outperform all existing multi-stage methods including those with additional supervision. Code is avilable at \url{https://github.com/youshyee/CoSA}. △ Less

Submitted 27 February, 2024; originally announced February 2024.

arXiv:2402.06694 [pdf]

Scaling Intelligent Agents in Combat Simulations for Wargaming

Authors: Scotty Black, Christian Darken

Abstract: Remaining competitive in future conflicts with technologically-advanced competitors requires us to accelerate our research and development in artificial intelligence (AI) for wargaming. More importantly, leveraging machine learning for intelligent combat behavior development will be key to one day achieving superhuman performance in this domain--elevating the quality and accelerating the speed of… ▽ More Remaining competitive in future conflicts with technologically-advanced competitors requires us to accelerate our research and development in artificial intelligence (AI) for wargaming. More importantly, leveraging machine learning for intelligent combat behavior development will be key to one day achieving superhuman performance in this domain--elevating the quality and accelerating the speed of our decisions in future wars. Although deep reinforcement learning (RL) continues to show promising results in intelligent agent behavior development in games, it has yet to perform at or above the human level in the long-horizon, complex tasks typically found in combat modeling and simulation. Capitalizing on the proven potential of RL and recent successes of hierarchical reinforcement learning (HRL), our research is investigating and extending the use of HRL to create intelligent agents capable of performing effectively in these large and complex simulation environments. Our ultimate goal is to develop an agent capable of superhuman performance that could then serve as an AI advisor to military planners and decision-makers. This papers covers our ongoing approach and the first three of our five research areas aimed at managing the exponential growth of computations that have thus far limited the use of AI in combat simulations: (1) develo** an HRL training framework and agent architecture for combat units; (2) develo** a multi-model framework for agent decision-making; (3) develo** dimension-invariant observation abstractions of the state space to manage the exponential growth of computations; (4) develo** an intrinsic rewards engine to enable long-term planning; and (5) implementing this framework into a higher-fidelity combat simulation. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Comments: arXiv admin note: text overlap with arXiv:2402.06075

Journal ref: I/ITSEC Conference Proceedings 2023

arXiv:2402.06075 [pdf]

doi 10.14339/STO-MP-MSG-207-23-PDF

Scaling Artificial Intelligence for Digital Wargaming in Support of Decision-Making

Authors: Scotty Black, Christian Darken

Abstract: In this unprecedented era of technology-driven transformation, it becomes more critical than ever that we aggressively invest in develo** robust artificial intelligence (AI) for wargaming in support of decision-making. By advancing AI-enabled systems and pairing these with human judgment, we will be able to enhance all-domain awareness, improve the speed and quality of our decision cycles, offer… ▽ More In this unprecedented era of technology-driven transformation, it becomes more critical than ever that we aggressively invest in develo** robust artificial intelligence (AI) for wargaming in support of decision-making. By advancing AI-enabled systems and pairing these with human judgment, we will be able to enhance all-domain awareness, improve the speed and quality of our decision cycles, offer recommendations for novel courses of action, and more rapidly counter our adversary's actions. It therefore becomes imperative that we accelerate the development of AI to help us better address the complexity of modern challenges and dilemmas that currently requires human intelligence and, if possible, attempt to surpass human intelligence--not to replace humans, but to augment and better inform human decision-making at machine speed. Although deep reinforcement learning continues to show promising results in intelligent agent behavior development for the long-horizon, complex tasks typically found in combat modeling and simulation, further research is needed to enable the scaling of AI to deal with these intricate and expansive state-spaces characteristic of wargaming for either concept development, education, or analysis. To help address this challenge, in our research, we are develo** and implementing a hierarchical reinforcement learning framework that includes a multi-model approach and dimension-invariant observation abstractions. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Report number: STO-MP-MSG-207-23

Journal ref: NATO STO-MP-MSG-207 2023

arXiv:2312.13770 [pdf, other]

3D Points Splatting for Real-Time Dynamic Hand Reconstruction

Authors: Zheheng Jiang, Hossein Rahmani, Sue Black, Bryan M. Williams

Abstract: We present 3D Points Splatting Hand Reconstruction (3D-PSHR), a real-time and photo-realistic hand reconstruction approach. We propose a self-adaptive canonical points upsampling strategy to achieve high-resolution hand geometry representation. This is followed by a self-adaptive deformation that deforms the hand from the canonical space to the target pose, adapting to the dynamic changing of cano… ▽ More We present 3D Points Splatting Hand Reconstruction (3D-PSHR), a real-time and photo-realistic hand reconstruction approach. We propose a self-adaptive canonical points upsampling strategy to achieve high-resolution hand geometry representation. This is followed by a self-adaptive deformation that deforms the hand from the canonical space to the target pose, adapting to the dynamic changing of canonical points which, in contrast to the common practice of subdividing the MANO model, offers greater flexibility and results in improved geometry fitting. To model texture, we disentangle the appearance color into the intrinsic albedo and pose-aware shading, which are learned through a Context-Attention module. Moreover, our approach allows the geometric and the appearance models to be trained simultaneously in an end-to-end manner. We demonstrate that our method is capable of producing animatable, photorealistic and relightable hand reconstructions using multiple datasets, including monocular videos captured with handheld smartphones and large-scale multi-view videos featuring various hand poses. We also demonstrate that our approach achieves real-time rendering speeds while simultaneously maintaining superior performance compared to existing state-of-the-art methods. △ Less

Submitted 21 December, 2023; originally announced December 2023.

arXiv:2310.20520 [pdf, other]

Efficiency of gratings for silica fiber-coupled internal Smith-Purcell radiation and Cherenkov diffraction radiation -- a quantitative numerical study

Authors: Andrzej Szczepkowicz, Dmytro Konakhovych, Damian Sniezek, Dylan S. Black, R. Joel England, Yen-Chieh Huang, Levi Schachter

Abstract: We propose a setup for measuring visible and near-visible internal Smith-Purcell radiation and Cherenkov Diffraction Radiation, based on silica and silicon, and perform quantitative numerical analysis of its radiation efficiency. We calculate the total radiated energy per electron and the spectral distribution of different radiation orders, taking into account material dispersion and absorption. F… ▽ More We propose a setup for measuring visible and near-visible internal Smith-Purcell radiation and Cherenkov Diffraction Radiation, based on silica and silicon, and perform quantitative numerical analysis of its radiation efficiency. We calculate the total radiated energy per electron and the spectral distribution of different radiation orders, taking into account material dispersion and absorption. For an optimized silica grating of 200 micrometer length, the total radiated energy reaches 2~eV per electron for 2 MeV electrons. Above the Cherenkov threshold, in most cases the energy of Cherenkov diffraction radiation is several times higher than the energy of internal Smith-Purcell radiation, but for some geometries and frequency ranges first or second order radiation may dominate over Cherenkov radiation (zeroth order). Radiation up to the 4th order is detected in the simulation. The spectrum of Cherenkov radiation is highly resonant, with local minima for frequencies where maxima of the other radiation orders occur. The spectrum of Cherenkov radiation takes a frequency comb-like shape for a uniform layer of Si on SiO2 substrate, due to a Fabry-Perot effect occurring for the evanescent field of the moving electron. The proposed setup could become a prototype of a non-invasive particle beam monitor for both conventional and laser particle accelerators. △ Less

Submitted 31 October, 2023; originally announced October 2023.

arXiv:2310.02434 [pdf, other]

Subrelativistic Alternating Phase Focusing Dielectric Laser Accelerators

Authors: Payton Broaddus, Thilo Egenolf, Dylan S. Black, Melanie Murillo, Clarisse Woodahl, Yu Miao, Uwe Niedermayer, Robert L. Byer, Kenneth J. Leedle, Olav Solgaard

Abstract: We demonstrate a silicon-based electron accelerator that uses laser optical near fields to both accelerate and confine electrons over extended distances. Two dielectric laser accelerator (DLA) designs were tested, each consisting of two arrays of silicon pillars pumped symmetrically by pulse front tilted laser beams, designed for average acceleration gradients 35 and 50 MeV/m respectively. The DLA… ▽ More We demonstrate a silicon-based electron accelerator that uses laser optical near fields to both accelerate and confine electrons over extended distances. Two dielectric laser accelerator (DLA) designs were tested, each consisting of two arrays of silicon pillars pumped symmetrically by pulse front tilted laser beams, designed for average acceleration gradients 35 and 50 MeV/m respectively. The DLAs are designed to act as alternating phase focusing (APF) lattices, where electrons, depending on the electron-laser interaction phase, will alternate between opposing longitudinal and transverse focusing and defocusing forces. By incorporating fractional period drift sections that alter the synchronous phase between $\pm 60^\circ$ off crest, electrons captured in the designed acceleration bucket experience half the peak gradient as average gradient while also experiencing strong confinement forces that enable long interaction lengths. We demonstrate APF accelerators with interaction lengths up to 708 $μ$m and energy gains up to 23.7 $\pm$ 1.07 keV FWHM, a 25$\%$ increase from starting energy, demonstrating the ability to achieve substantial energy gains with subrelativistic DLA. △ Less

Submitted 12 March, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

Comments: 16 pages

arXiv:2307.08456 [pdf, other]

Domain Adaptation using Silver Standard Masks for Lateral Ventricle Segmentation in FLAIR MRI

Authors: Owen Crystal, Pejman J. Maralani, Sandra Black, Alan R. Moody, April Khademi

Abstract: Lateral ventricular volume (LVV) is an important biomarker for clinical investigation. We present the first transfer learning-based LVV segmentation method for fluid-attenuated inversion recovery (FLAIR) MRI. To mitigate covariate shifts between source and target domains, this work proposes an domain adaptation method that optimizes performance on three target datasets. Silver standard (SS) masks… ▽ More Lateral ventricular volume (LVV) is an important biomarker for clinical investigation. We present the first transfer learning-based LVV segmentation method for fluid-attenuated inversion recovery (FLAIR) MRI. To mitigate covariate shifts between source and target domains, this work proposes an domain adaptation method that optimizes performance on three target datasets. Silver standard (SS) masks were generated from the target domain using a novel conventional image processing ventricular segmentation algorithm and used to supplement the gold standard (GS) data from the source domain, Canadian Atherosclerosis Imaging Network (CAIN). Four models were tested on held-out test sets from four datasets: 1) SS+GS: trained on target SS masks and fine-tuned on source GS masks, 2) GS+SS: trained on source GS masks and fine-tuned on target SS masks, 3) trained on source GS (GS CAIN Only) and 4) trained on target SS masks (SS Only). The SS+GS model had the best and most consistent performance (mean DSC = 0.89, CoV = 0.05) and showed significantly (p < 0.05) higher DSC compared to the GS-only model on three target domains. Results suggest pre-training with noisy labels from the target domain allows the model to adapt to the dataset-specific characteristics and provides robust parameter initialization while fine-tuning with GS masks allows the model to learn detailed features. This method has wide application to other medical imaging problems where labeled data is scarce, and can be used as a per-dataset calibration method to accelerate wide-scale adoption. △ Less

Submitted 17 July, 2023; originally announced July 2023.

Comments: 16 pages, 3 figures

arXiv:2307.05745 [pdf, other]

CloudSec: An Extensible Automated Reasoning Framework for Cloud Security Policies

Authors: Joe Stubbs, Smruti Padhy, Richard Cardone, Steven Black

Abstract: Users increasingly create, manage and share digital resources, including sensitive data, via cloud platforms and APIs. Platforms encode the rules governing access to these resources, referred to as \textit{security policies}, using different systems and semantics. As the number of resources and rules grows, the challenge of reasoning about them collectively increases. Formal methods tools, such as… ▽ More Users increasingly create, manage and share digital resources, including sensitive data, via cloud platforms and APIs. Platforms encode the rules governing access to these resources, referred to as \textit{security policies}, using different systems and semantics. As the number of resources and rules grows, the challenge of reasoning about them collectively increases. Formal methods tools, such as Satisfiability Modulo Theories (SMT) libraries, can be used to automate the analysis of security policies, but several challenges, including the highly specialized, technical nature of the libraries as well as their variable performance, prevent their broad adoption in cloud systems. In this paper, we present CloudSec, an extensible framework for reasoning about cloud security policies using SMT. CloudSec provides a high-level API that can be used to encode different types of cloud security policies without knowledge of SMT. Further, it is trivial for applications written with CloudSec to utilize and switch between different SMT libraries such as Z3 and CVC5. We demonstrate the use of CloudSec to analyze security policies in Tapis, a cloud-based API for distributed computational research used by tens of thousands of researchers. △ Less

Submitted 7 July, 2023; originally announced July 2023.

arXiv:2304.14299 [pdf, other]

A Probabilistic Attention Model with Occlusion-aware Texture Regression for 3D Hand Reconstruction from a Single RGB Image

Authors: Zheheng Jiang, Hossein Rahmani, Sue Black, Bryan M. Williams

Abstract: Recently, deep learning based approaches have shown promising results in 3D hand reconstruction from a single RGB image. These approaches can be roughly divided into model-based approaches, which are heavily dependent on the model's parameter space, and model-free approaches, which require large numbers of 3D ground truths to reduce depth ambiguity and struggle in weakly-supervised scenarios. To o… ▽ More Recently, deep learning based approaches have shown promising results in 3D hand reconstruction from a single RGB image. These approaches can be roughly divided into model-based approaches, which are heavily dependent on the model's parameter space, and model-free approaches, which require large numbers of 3D ground truths to reduce depth ambiguity and struggle in weakly-supervised scenarios. To overcome these issues, we propose a novel probabilistic model to achieve the robustness of model-based approaches and reduced dependence on the model's parameter space of model-free approaches. The proposed probabilistic model incorporates a model-based network as a prior-net to estimate the prior probability distribution of joints and vertices. An Attention-based Mesh Vertices Uncertainty Regression (AMVUR) model is proposed to capture dependencies among vertices and the correlation between joints and mesh vertices to improve their feature representation. We further propose a learning based occlusion-aware Hand Texture Regression model to achieve high-fidelity texture reconstruction. We demonstrate the flexibility of the proposed probabilistic model to be trained in both supervised and weakly-supervised scenarios. The experimental results demonstrate our probabilistic model's state-of-the-art accuracy in 3D hand and texture reconstruction from a single image in both training schemes, including in the presence of severe occlusions. △ Less

Submitted 27 April, 2023; originally announced April 2023.

arXiv:2304.08557 [pdf, other]

A Decentralized Authorization and Security Framework for Distributed Research Workflows

Authors: Richard Cardone, Smruti Padhy, Steven Black, Sean Cleveland, Joe Stubbs

Abstract: Research challenges such as climate change and the search for habitable planets increasingly use academic and commercial computing resources distributed across different institutions and physical sites. Furthermore, such analyses often require a level of automation that precludes direct human interaction, and securing these workflows involves adherence to security policies across institutions. In… ▽ More Research challenges such as climate change and the search for habitable planets increasingly use academic and commercial computing resources distributed across different institutions and physical sites. Furthermore, such analyses often require a level of automation that precludes direct human interaction, and securing these workflows involves adherence to security policies across institutions. In this paper, we present a decentralized authorization and security framework that enables researchers to utilize resources across different sites while allowing service providers to maintain autonomy over their secrets and authorization policies. We describe this framework as part of the Tapis platform, a web-based, hosted API used by researchers from multiple institutions, and we measure the performance of various authorization and security queries, including cross-site queries. We conclude with two use case studies -- a project at the University of Hawaii to study climate change and the NASA NEID telescope project that searches the galaxy for exoplanets. △ Less

Submitted 13 May, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

Comments: 10 pages. Short version of this paper to be published on COMPSAC 2023 proceedings

ACM Class: H.4.0

arXiv:2211.12312 [pdf, other]

Interpreting Neural Networks through the Polytope Lens

Authors: Sid Black, Lee Sharkey, Leo Grinsztajn, Eric Winsor, Dan Braun, Jacob Merizian, Kip Parker, Carlos Ramón Guevara, Beren Millidge, Gabriel Alfour, Connor Leahy

Abstract: Mechanistic interpretability aims to explain what a neural network has learned at a nuts-and-bolts level. What are the fundamental primitives of neural network representations? Previous mechanistic descriptions have used individual neurons or their linear combinations to understand the representations a network has learned. But there are clues that neurons and their linear combinations are not the… ▽ More Mechanistic interpretability aims to explain what a neural network has learned at a nuts-and-bolts level. What are the fundamental primitives of neural network representations? Previous mechanistic descriptions have used individual neurons or their linear combinations to understand the representations a network has learned. But there are clues that neurons and their linear combinations are not the correct fundamental units of description: directions cannot describe how neural networks use nonlinearities to structure their representations. Moreover, many instances of individual neurons and their combinations are polysemantic (i.e. they have multiple unrelated meanings). Polysemanticity makes interpreting the network in terms of neurons or directions challenging since we can no longer assign a specific feature to a neural unit. In order to find a basic unit of description that does not suffer from these problems, we zoom in beyond just directions to study the way that piecewise linear activation functions (such as ReLU) partition the activation space into numerous discrete polytopes. We call this perspective the polytope lens. The polytope lens makes concrete predictions about the behavior of neural networks, which we evaluate through experiments on both convolutional image classifiers and language models. Specifically, we show that polytopes can be used to identify monosemantic regions of activation space (while directions are not in general monosemantic) and that the density of polytope boundaries reflect semantic boundaries. We also outline a vision for what mechanistic interpretability might look like through the polytope lens. △ Less

Submitted 22 November, 2022; originally announced November 2022.

Comments: 22/11/22 initial upload

arXiv:2204.06745 [pdf, other]

GPT-NeoX-20B: An Open-Source Autoregressive Language Model

Authors: Sid Black, Stella Biderman, Eric Hallahan, Quentin Anthony, Leo Gao, Laurence Golding, Horace He, Connor Leahy, Kyle McDonell, Jason Phang, Michael Pieler, USVSN Sai Prashanth, Shivanshu Purohit, Laria Reynolds, Jonathan Tow, Ben Wang, Samuel Weinbach

Abstract: We introduce GPT-NeoX-20B, a 20 billion parameter autoregressive language model trained on the Pile, whose weights will be made freely and openly available to the public through a permissive license. It is, to the best of our knowledge, the largest dense autoregressive model that has publicly available weights at the time of submission. In this work, we describe \model{}'s architecture and trainin… ▽ More We introduce GPT-NeoX-20B, a 20 billion parameter autoregressive language model trained on the Pile, whose weights will be made freely and openly available to the public through a permissive license. It is, to the best of our knowledge, the largest dense autoregressive model that has publicly available weights at the time of submission. In this work, we describe \model{}'s architecture and training and evaluate its performance on a range of language-understanding, mathematics, and knowledge-based tasks. We find that GPT-NeoX-20B is a particularly powerful few-shot reasoner and gains far more in performance when evaluated five-shot than similarly sized GPT-3 and FairSeq models. We open-source the training and evaluation code, as well as the model weights, at https://github.com/EleutherAI/gpt-neox. △ Less

Submitted 14 April, 2022; originally announced April 2022.

Comments: To appear in the Proceedings of the ACL Workshop on Challenges & Perspectives in Creating Large Language Models

arXiv:2203.06823 [pdf, other]

SKM-TEA: A Dataset for Accelerated MRI Reconstruction with Dense Image Labels for Quantitative Clinical Evaluation

Authors: Arjun D Desai, Andrew M Schmidt, Elka B Rubin, Christopher M Sandino, Marianne S Black, Valentina Mazzoli, Kathryn J Stevens, Robert Boutin, Christopher Ré, Garry E Gold, Brian A Hargreaves, Akshay S Chaudhari

Abstract: Magnetic resonance imaging (MRI) is a cornerstone of modern medical imaging. However, long image acquisition times, the need for qualitative expert analysis, and the lack of (and difficulty extracting) quantitative indicators that are sensitive to tissue health have curtailed widespread clinical and research studies. While recent machine learning methods for MRI reconstruction and analysis have sh… ▽ More Magnetic resonance imaging (MRI) is a cornerstone of modern medical imaging. However, long image acquisition times, the need for qualitative expert analysis, and the lack of (and difficulty extracting) quantitative indicators that are sensitive to tissue health have curtailed widespread clinical and research studies. While recent machine learning methods for MRI reconstruction and analysis have shown promise for reducing this burden, these techniques are primarily validated with imperfect image quality metrics, which are discordant with clinically-relevant measures that ultimately hamper clinical deployment and clinician trust. To mitigate this challenge, we present the Stanford Knee MRI with Multi-Task Evaluation (SKM-TEA) dataset, a collection of quantitative knee MRI (qMRI) scans that enables end-to-end, clinically-relevant evaluation of MRI reconstruction and analysis tools. This 1.6TB dataset consists of raw-data measurements of ~25,000 slices (155 patients) of anonymized patient MRI scans, the corresponding scanner-generated DICOM images, manual segmentations of four tissues, and bounding box annotations for sixteen clinically relevant pathologies. We provide a framework for using qMRI parameter maps, along with image reconstructions and dense image labels, for measuring the quality of qMRI biomarker estimates extracted from MRI reconstruction, segmentation, and detection techniques. Finally, we use this framework to benchmark state-of-the-art baselines on this dataset. We hope our SKM-TEA dataset and code can enable a broad spectrum of research for modular image reconstruction and image analysis in a clinically informed manner. Dataset access, code, and benchmarks are available at https://github.com/StanfordMIMI/skm-tea. △ Less

Submitted 13 March, 2022; originally announced March 2022.

Comments: Accepted to NeurIPS Datasets & Benchmarks (2021)

arXiv:2203.06060 [pdf, other]

ROOD-MRI: Benchmarking the robustness of deep learning segmentation models to out-of-distribution and corrupted data in MRI

Authors: Lyndon Boone, Mahdi Biparva, Parisa Mojiri Forooshani, Joel Ramirez, Mario Masellis, Robert Bartha, Sean Symons, Stephen Strother, Sandra E. Black, Chris Heyn, Anne L. Martel, Richard H. Swartz, Maged Goubran

Abstract: Deep artificial neural networks (DNNs) have moved to the forefront of medical image analysis due to their success in classification, segmentation, and detection challenges. A principal challenge in large-scale deployment of DNNs in neuroimage analysis is the potential for shifts in signal-to-noise ratio, contrast, resolution, and presence of artifacts from site to site due to variances in scanners… ▽ More Deep artificial neural networks (DNNs) have moved to the forefront of medical image analysis due to their success in classification, segmentation, and detection challenges. A principal challenge in large-scale deployment of DNNs in neuroimage analysis is the potential for shifts in signal-to-noise ratio, contrast, resolution, and presence of artifacts from site to site due to variances in scanners and acquisition protocols. DNNs are famously susceptible to these distribution shifts in computer vision. Currently, there are no benchmarking platforms or frameworks to assess the robustness of new and existing models to specific distribution shifts in MRI, and accessible multi-site benchmarking datasets are still scarce or task-specific. To address these limitations, we propose ROOD-MRI: a platform for benchmarking the Robustness of DNNs to Out-Of-Distribution (OOD) data, corruptions, and artifacts in MRI. The platform provides modules for generating benchmarking datasets using transforms that model distribution shifts in MRI, implementations of newly derived benchmarking metrics for image segmentation, and examples for using the methodology with new models and tasks. We apply our methodology to hippocampus, ventricle, and white matter hyperintensity segmentation in several large studies, providing the hippocampus dataset as a publicly available benchmark. By evaluating modern DNNs on these datasets, we demonstrate that they are highly susceptible to distribution shifts and corruptions in MRI. We show that while data augmentation strategies can substantially improve robustness to OOD data for anatomical segmentation tasks, modern DNNs using augmentation still lack robustness in more challenging lesion-based segmentation tasks. We finally benchmark U-Nets and transformer-based models, finding consistent differences in robustness to particular classes of transforms across architectures. △ Less

Submitted 11 March, 2022; originally announced March 2022.

Comments: 30 pages, 13 figures. For associated GitHub repository, see https://github.com/AICONSlab/roodmri

arXiv:2112.05253 [pdf, other]

MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based Finetuning

Authors: Constantin Eichenberg, Sidney Black, Samuel Weinbach, Letitia Parcalabescu, Anette Frank

Abstract: Large-scale pretraining is fast becoming the norm in Vision-Language (VL) modeling. However, prevailing VL approaches are limited by the requirement for labeled data and the use of complex multi-step pretraining objectives. We present MAGMA - a simple method for augmenting generative language models with additional modalities using adapter-based finetuning. Building on Frozen, we train a series of… ▽ More Large-scale pretraining is fast becoming the norm in Vision-Language (VL) modeling. However, prevailing VL approaches are limited by the requirement for labeled data and the use of complex multi-step pretraining objectives. We present MAGMA - a simple method for augmenting generative language models with additional modalities using adapter-based finetuning. Building on Frozen, we train a series of VL models that autoregressively generate text from arbitrary combinations of visual and textual input. The pretraining is entirely end-to-end using a single language modeling objective, simplifying optimization compared to previous approaches. Importantly, the language model weights remain unchanged during training, allowing for transfer of encyclopedic knowledge and in-context learning abilities from language pretraining. MAGMA outperforms Frozen on open-ended generative tasks, achieving state of the art results on the OKVQA benchmark and competitive results on a range of other popular VL benchmarks, while pretraining on 0.2% of the number of samples used to train SimVLM. △ Less

Submitted 24 October, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

Comments: 13 pages, 6 figures, 2 tables. Minor improvements. Accepted at EMNLP 2022

ACM Class: I.2.7; I.4.8; I.5.1

arXiv:2111.10709 [pdf, other]

doi 10.1103/PhysRevLett.127.164802

Electron Pulse Compression with Optical Beat Note

Authors: Zhexin Zhao, Kenneth J. Leedle, Dylan S. Black, Olav Solgaard, Robert L. Byer, Shanhui Fan

Abstract: Compressing electron pulses is important in many applications of electron beam systems. In this study, we propose to use optical beat notes to compress electron pulses. The beat frequency is chosen to match the initial electron pulse duration, which enables the compression of electron pulses with a wide range of durations. This functionality extends the optical control of electron beams, which is… ▽ More Compressing electron pulses is important in many applications of electron beam systems. In this study, we propose to use optical beat notes to compress electron pulses. The beat frequency is chosen to match the initial electron pulse duration, which enables the compression of electron pulses with a wide range of durations. This functionality extends the optical control of electron beams, which is important in compact electron beam systems such as dielectric laser accelerators. We also find that the dominant frequency of the electron charge density changes continuously along its drift trajectory, which may open up new opportunities in coherent interaction between free electrons and quantum or classical systems. △ Less

Submitted 20 November, 2021; originally announced November 2021.

Journal ref: Phys. Rev. Lett. 127, 164802 (2021)

arXiv:2108.02234 [pdf, other]

Multi-Branch with Attention Network for Hand-Based Person Recognition

Authors: Nathanael L. Baisa, Bryan Williams, Hossein Rahmani, Plamen Angelov, Sue Black

Abstract: In this paper, we propose a novel hand-based person recognition method for the purpose of criminal investigations since the hand image is often the only available information in cases of serious crime such as sexual abuse. Our proposed method, Multi-Branch with Attention Network (MBA-Net), incorporates both channel and spatial attention modules in branches in addition to a global (without attentio… ▽ More In this paper, we propose a novel hand-based person recognition method for the purpose of criminal investigations since the hand image is often the only available information in cases of serious crime such as sexual abuse. Our proposed method, Multi-Branch with Attention Network (MBA-Net), incorporates both channel and spatial attention modules in branches in addition to a global (without attention) branch to capture global structural information for discriminative feature learning. The attention modules focus on the relevant features of the hand image while suppressing the irrelevant backgrounds. In order to overcome the weakness of the attention mechanisms, equivariant to pixel shuffling, we integrate relative positional encodings into the spatial attention module to capture the spatial positions of pixels. Extensive evaluations on two large multi-ethnic and publicly available hand datasets demonstrate that our proposed method achieves state-of-the-art performance, surpassing the existing hand-based identification methods. △ Less

Submitted 30 June, 2022; v1 submitted 4 August, 2021; originally announced August 2021.

Comments: arXiv admin note: text overlap with arXiv:2101.05260

arXiv:2106.05746 [pdf, other]

The 2021 Hotel-ID to Combat Human Trafficking Competition Dataset

Authors: Rashmi Kamath, Gregory Rolwes, Samuel Black, Abby Stylianou

Abstract: Hotel recognition is an important task for human trafficking investigations since victims are often photographed in hotel rooms. Identifying these hotels is vital to trafficking investigations since they can help track down current and future victims who might be taken to the same places. Hotel recognition is a challenging fine grained visual classification task as there can be little similarity b… ▽ More Hotel recognition is an important task for human trafficking investigations since victims are often photographed in hotel rooms. Identifying these hotels is vital to trafficking investigations since they can help track down current and future victims who might be taken to the same places. Hotel recognition is a challenging fine grained visual classification task as there can be little similarity between different rooms within the same hotel, and high similarity between rooms from different hotels (especially if they are from the same chain). Hotel recognition to combat human trafficking poses additional challenges as investigative images are often low quality, contain uncommon camera angles and are highly occluded. Here, we present the 2021 Hotel-ID dataset to help raise awareness for this problem and generate novel approaches. The dataset consists of hotel room images that have been crowd-sourced and uploaded through the TraffickCam mobile application. The quality of these images is similar to investigative images and hence models trained on these images have good chances of accurately narrowing down on the correct hotel. △ Less

Submitted 14 June, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

Comments: CVPR 2021 Workshop on Fine-Grained Visual Categorization (FGVC)

arXiv:2105.07682 [pdf, other]

Internal Smith-Purcell radiation and its interplay with Cherenkov diffraction radiation in silicon -- a combined time and frequency domain numerical study

Authors: Dmytro Konakhovych, Damian Sniezek, Oskar Warmusz, Dylan S. Black, Zhexin Zhao, R. Joel England, Andrzej Szczepkowicz

Abstract: We consider radiation generated by an electron travelling parallel to a planar rectangular silicon grating: Smith-Purcell radiation to the vacuum side, internal Smith-Purcell radiation into the dielectric, and Cherenkov radiation into the dielectric. Internal Smith-Purcell radiation dominates over the other two radiation mechanisms in the range where conventional Smith-Purcell radiation is forbidd… ▽ More We consider radiation generated by an electron travelling parallel to a planar rectangular silicon grating: Smith-Purcell radiation to the vacuum side, internal Smith-Purcell radiation into the dielectric, and Cherenkov radiation into the dielectric. Internal Smith-Purcell radiation dominates over the other two radiation mechanisms in the range where conventional Smith-Purcell radiation is forbidden. This observation may lead to improved design of contactless particle beam monitors. △ Less

Submitted 17 May, 2021; originally announced May 2021.

Comments: 5 pages, 5 figures

arXiv:2101.05260 [pdf, other]

Hand-Based Person Identification using Global and Part-Aware Deep Feature Representation Learning

Authors: Nathanael L. Baisa, Bryan Williams, Hossein Rahmani, Plamen Angelov, Sue Black

Abstract: In cases of serious crime, including sexual abuse, often the only available information with demonstrated potential for identification is images of the hands. Since this evidence is captured in uncontrolled situations, it is difficult to analyse. As global approaches to feature comparison are limited in this case, it is important to extend to consider local information. In this work, we propose ha… ▽ More In cases of serious crime, including sexual abuse, often the only available information with demonstrated potential for identification is images of the hands. Since this evidence is captured in uncontrolled situations, it is difficult to analyse. As global approaches to feature comparison are limited in this case, it is important to extend to consider local information. In this work, we propose hand-based person identification by learning both global and local deep feature representations. Our proposed method, Global and Part-Aware Network (GPA-Net), creates global and local branches on the conv-layer for learning robust discriminative global and part-level features. For learning the local (part-level) features, we perform uniform partitioning on the conv-layer in both horizontal and vertical directions. We retrieve the parts by conducting a soft partition without explicitly partitioning the images or requiring external cues such as pose estimation. We make extensive evaluations on two large multi-ethnic and publicly available hand datasets, demonstrating that our proposed method significantly outperforms competing approaches. △ Less

Submitted 26 March, 2022; v1 submitted 13 January, 2021; originally announced January 2021.

arXiv:2101.00027 [pdf, other]

The Pile: An 800GB Dataset of Diverse Text for Language Modeling

Authors: Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, Charles Foster, Jason Phang, Horace He, Anish Thite, Noa Nabeshima, Shawn Presser, Connor Leahy

Abstract: Recent work has demonstrated that increased training dataset diversity improves general cross-domain knowledge and downstream generalization capability for large-scale language models. With this in mind, we present \textit{the Pile}: an 825 GiB English text corpus targeted at training large-scale language models. The Pile is constructed from 22 diverse high-quality subsets -- both existing and new… ▽ More Recent work has demonstrated that increased training dataset diversity improves general cross-domain knowledge and downstream generalization capability for large-scale language models. With this in mind, we present \textit{the Pile}: an 825 GiB English text corpus targeted at training large-scale language models. The Pile is constructed from 22 diverse high-quality subsets -- both existing and newly constructed -- many of which derive from academic or professional sources. Our evaluation of the untuned performance of GPT-2 and GPT-3 on the Pile shows that these models struggle on many of its components, such as academic writing. Conversely, models trained on the Pile improve significantly over both Raw CC and CC-100 on all components of the Pile, while improving performance on downstream evaluations. Through an in-depth exploratory analysis, we document potentially concerning aspects of the data for prospective users. We make publicly available the code used in its construction. △ Less

Submitted 31 December, 2020; originally announced January 2021.

arXiv:2012.12406 [pdf]

Open source software for automatic subregional assessment of knee cartilage degradation using quantitative T2 relaxometry and deep learning

Authors: Kevin A. Thomas, Dominik Krzemiński, Łukasz Kidziński, Rohan Paul, Elka B. Rubin, Eni Halilaj, Marianne S. Black, Akshay Chaudhari, Garry E. Gold, Scott L. Delp

Abstract: Objective: We evaluate a fully-automated femoral cartilage segmentation model for measuring T2 relaxation values and longitudinal changes using multi-echo spin echo (MESE) MRI. We have open sourced this model and corresponding segmentations. Methods: We trained a neural network to segment femoral cartilage from MESE MRIs. Cartilage was divided into 12 subregions along medial-lateral, superficial-d… ▽ More Objective: We evaluate a fully-automated femoral cartilage segmentation model for measuring T2 relaxation values and longitudinal changes using multi-echo spin echo (MESE) MRI. We have open sourced this model and corresponding segmentations. Methods: We trained a neural network to segment femoral cartilage from MESE MRIs. Cartilage was divided into 12 subregions along medial-lateral, superficial-deep, and anterior-central-posterior boundaries. Subregional T2 values and four-year changes were calculated using a musculoskeletal radiologist's segmentations (Reader 1) and the model's segmentations. These were compared using 28 held out images. A subset of 14 images were also evaluated by a second expert (Reader 2) for comparison. Results: Model segmentations agreed with Reader 1 segmentations with a Dice score of 0.85 +/- 0.03. The model's estimated T2 values for individual subregions agreed with those of Reader 1 with an average Spearman correlation of 0.89 and average mean absolute error (MAE) of 1.34 ms. The model's estimated four-year change in T2 for individual regions agreed with Reader 1 with an average correlation of 0.80 and average MAE of 1.72 ms. The model agreed with Reader 1 at least as closely as Reader 2 agreed with Reader 1 in terms of Dice score (0.85 vs 0.75) and subregional T2 values. Conclusions: We present a fast, fully-automated model for segmentation of MESE MRIs. Assessments of cartilage health using its segmentations agree with those of an expert as closely as experts agree with one another. This has the potential to accelerate osteoarthritis research. △ Less

Submitted 22 December, 2020; originally announced December 2020.

arXiv:2008.02147 [pdf, other]

doi 10.1103/PhysRevApplied.15.L021002

Low Energy Spread Attosecond Bunching and Coherent Electron Acceleration in Dielectric Nanostructures

Authors: Uwe Niedermayer, Dylan S. Black, Kenneth J. Leedle, Yu Miao, Robert L. Byer, Olav Solgaard

Abstract: We demonstrate a compact technique to compress electron pulses to attosecond length, while kee** the energy spread reasonably small. The technique is based on Dielectric Laser Acceleration (DLA) in nanophotonic silicon structures. Unlike previous ballistic optical microbunching demonstrations, we use a modulator-demodulator scheme to compress phase space in the time and energy coordinates. With… ▽ More We demonstrate a compact technique to compress electron pulses to attosecond length, while kee** the energy spread reasonably small. The technique is based on Dielectric Laser Acceleration (DLA) in nanophotonic silicon structures. Unlike previous ballistic optical microbunching demonstrations, we use a modulator-demodulator scheme to compress phase space in the time and energy coordinates. With a second stage, we show that these pulses can be coherently accelerated, producing a net energy gain of $1.5\pm0.1$ keV, which is significantly larger than the remaining energy spread of $0.88 \,_{-0.2}^{+0.0}$ keV FWHM. We show that by linearly swee** the phase between the two stages, the energy spectrum can be coherently moved in a periodic manner, while kee** the energy spread roughly constant. After leaving the buncher, the electron pulse is also transversely focused, and can be matched into a following accelerator lattice. Thus, this setup is the prototype injector into a scalable DLA based on Alternating Phase Focusing (APF). △ Less

Submitted 5 August, 2020; originally announced August 2020.

Journal ref: Phys. Rev. Applied 15, 021002 (2021)

arXiv:2001.09583 [pdf, other]

Design of a multi-channel photonic crystal dielectric laser accelerator

Authors: Zhexin Zhao, Dylan S. Black, R. Joel England, Tyler W. Hughes, Yu Miao, Olav Solgaard, Robert L. Byer, Shanhui Fan

Abstract: To be useful for most scientific and medical applications, compact particle accelerators will require much higher average current than enabled by current architectures. For this purpose, we propose a photonic crystal architecture for a dielectric laser accelerator, referred to as a multi-input multi-output silicon accelerator (MIMOSA), that enables simultaneous acceleration of multiple electron be… ▽ More To be useful for most scientific and medical applications, compact particle accelerators will require much higher average current than enabled by current architectures. For this purpose, we propose a photonic crystal architecture for a dielectric laser accelerator, referred to as a multi-input multi-output silicon accelerator (MIMOSA), that enables simultaneous acceleration of multiple electron beams, increasing the total electron throughput by at least one order of magnitude. To achieve this, we show that the photonic crystal must support a mode at the $Γ$ point in reciprocal space, with a normalized frequency equal to the normalized speed of the phase matched electron. We show that the figure of merit of the MIMOSA can be inferred from the eigenmodes of the corresponding infinitely periodic structure, which provides a powerful approach to design such devices. Additionally, we extend the MIMOSA architecture to electron deflectors and other electron manipulation functionalities. These additional functionalities, combined with the increased electron throughput of these devices, permit all-optical on-chip manipulation of electron beams in a fully integrated architecture compatible with current fabrication technologies, which opens the way to unconventional electron beam sha**, imaging, and radiation generation. △ Less

Submitted 26 January, 2020; originally announced January 2020.

arXiv:1911.03283 [pdf, other]

Composing and Embedding the Words-as-Classifiers Model of Grounded Semantics

Authors: Daniele Moro, Stacy Black, Casey Kennington

Abstract: The words-as-classifiers model of grounded lexical semantics learns a semantic fitness score between physical entities and the words that are used to denote those entities. In this paper, we explore how such a model can incrementally perform composition and how the model can be unified with a distributional representation. For the latter, we leverage the classifier coefficients as an embedding. Fo… ▽ More The words-as-classifiers model of grounded lexical semantics learns a semantic fitness score between physical entities and the words that are used to denote those entities. In this paper, we explore how such a model can incrementally perform composition and how the model can be unified with a distributional representation. For the latter, we leverage the classifier coefficients as an embedding. For composition, we leverage the underlying mechanics of three different classifier types (i.e., logistic regression, decision trees, and multi-layer perceptrons) to arrive at a several systematic approaches to composition unique to each classifier including both denotational and connotational methods of composition. We compare these approaches to each other and to prior work in a visual reference resolution task using the refCOCO dataset. Our results demonstrate the need to expand upon existing composition strategies and bring together grounded and distributional representations. △ Less

Submitted 8 November, 2019; originally announced November 2019.

Comments: 10 pages

arXiv:1905.12822 [pdf, other]

doi 10.1126/science.aay5734

On-chip integrated laser-driven particle accelerator

Authors: Neil V. Sapra, Ki Youl Yang, Dries Vercruysse, Kenneth J. Leedle, Dylan S. Black, R. Joel England, Logan Su, Yu Miao, Olav Solgaard, Robert L. Byer, Jelena Vučković

Abstract: Particle accelerators represent an indispensable tool in science and industry. However, the size and cost of conventional radio-frequency accelerators limit the utility and reach of this technology. Dielectric laser accelerators (DLAs) provide a compact and cost-effective solution to this problem by driving accelerator nanostructures with visible or near-infrared (NIR) pulsed lasers, resulting in… ▽ More Particle accelerators represent an indispensable tool in science and industry. However, the size and cost of conventional radio-frequency accelerators limit the utility and reach of this technology. Dielectric laser accelerators (DLAs) provide a compact and cost-effective solution to this problem by driving accelerator nanostructures with visible or near-infrared (NIR) pulsed lasers, resulting in a 10$^4$ reduction of scale. Current implementations of DLAs rely on free-space lasers directly incident on the accelerating structures, limiting the scalability and integrability of this technology. Here we present the first experimental demonstration of a waveguide-integrated DLA, designed using a photonic inverse design approach. These on-chip devices accelerate sub-relativistic electrons of initial energy 83.4 keV by 1.21 keV over 30 um, providing peak acceleration gradients of 40.3 MeV/m. This progress represents a significant step towards a completely integrated MeV-scale dielectric laser accelerator. △ Less

Submitted 29 May, 2019; originally announced May 2019.

arXiv:1902.00170 [pdf, other]

doi 10.1103/PhysRevLett.122.104801

Laser-Driven Electron Lensing in Silicon Microstructures

Authors: Dylan S. Black, Kenneth J. Leedle, Yu Miao, Uwe Niedermayer, Robert L. Byer, Olav Solgaard

Abstract: We demonstrate a laser-driven, tunable electron lens fabricated in monolithic silicon. The lens consists of an array of silicon pillars pumped symmetrically by two 300 fs, 1.95 $μ$m wavelength, nJ-class laser pulses from an optical parametric amplifier. The optical near-field of the pillar structure focuses electrons in the plane perpendicular to the pillar axes. With 100 $\pm$ 10 MV/m incident la… ▽ More We demonstrate a laser-driven, tunable electron lens fabricated in monolithic silicon. The lens consists of an array of silicon pillars pumped symmetrically by two 300 fs, 1.95 $μ$m wavelength, nJ-class laser pulses from an optical parametric amplifier. The optical near-field of the pillar structure focuses electrons in the plane perpendicular to the pillar axes. With 100 $\pm$ 10 MV/m incident laser fields, the lens focal length is measured to be 50 $\pm$ 4 $μ$m, which corresponds to an equivalent quadrupole focusing gradient $B'$ of 1.4 $\pm$ 0.1 MT/m. By varying the incident laser field strength, the lens can be tuned from a 21 $\pm$ 2 $μ$m focal length ($B'>3.3$ MT/m) to focal lengths on the cm-scale. △ Less

Submitted 31 January, 2019; originally announced February 2019.

Journal ref: Physical Review Letters, 122(10), 2019

arXiv:1810.06788 [pdf, other]

doi 10.1093/mnras/sty3177

Narrow Transient Absorptions in Late-Time Optical Spectra of Type Ia Supernovae: Evidence for Large Clumps of Iron-Rich Ejecta?

Authors: Christine S. Black, Robert A. Fesen, Jerod T. Parrent

Abstract: An examination of late-time, optical spectra of type Ia supernovae revealed surprisingly narrow absorption features which only become visible a few months after maximum light. These features, most clearly seen in the late-time spectra of the bright, recent type Ia supernovae ASASSN-14lp and SN 2017bzc, appear as narrow absorptions at ~4840 A, ~5000 A, and as a sharp inflection at ~4760 A on the re… ▽ More An examination of late-time, optical spectra of type Ia supernovae revealed surprisingly narrow absorption features which only become visible a few months after maximum light. These features, most clearly seen in the late-time spectra of the bright, recent type Ia supernovae ASASSN-14lp and SN 2017bzc, appear as narrow absorptions at ~4840 A, ~5000 A, and as a sharp inflection at ~4760 A on the red side of the prominent late-time 4700 A feature. A survey of on-line archival data revealed similar features present in the spectra of ten other normal and 91T-like SNe Ia, including SN 2011fe. Unlike blue spectral features which exhibit progressive red-ward shifts, these narrow absorptions remain at the same wavelength from epoch to epoch for an individual SN, but can appear at slightly different wavelengths for each object. These features are also transient, appearing and then fading in one to three months. After ruling out instrumental, data reduction, and atmospheric affects, we discuss possible explanations including progenitor mass-loss material, interaction with material from previous novae events, and absorption by large discrete clumps of high-velocity Fe-rich ejecta. △ Less

Submitted 21 November, 2018; v1 submitted 15 October, 2018; originally announced October 2018.

Comments: Accepted by MNRAS, 11 pages, 9 figures, 2 tables

arXiv:1711.02174 [pdf, other]

doi 10.1093/mnras/sty072

A Distance Estimate to the Cygnus Loop Based on the Distances to Two Stars Located Within the Remnant

Authors: Robert A. Fesen, Jack M. M. Neustadt, Christine S. Black, Dan Milisavljevic

Abstract: Underlying nearly every quantitative discussion of the Cygnus Loop supernova remnant is uncertainty about its distance. Here we present optical images and spectra of nebulosities around two stars whose mass-loss material appears to have interacted with the remnant's expanding shock front and thus can be used to estimate the Cygnus Loop's distance. Narrow passband images reveal a small emission-lin… ▽ More Underlying nearly every quantitative discussion of the Cygnus Loop supernova remnant is uncertainty about its distance. Here we present optical images and spectra of nebulosities around two stars whose mass-loss material appears to have interacted with the remnant's expanding shock front and thus can be used to estimate the Cygnus Loop's distance. Narrow passband images reveal a small emission-line nebula surrounding an M4 red giant near the remnant's eastern nebula NGC 6992. Optical spectra of the nebula show it to be shock-heated with significantly higher electron densities than seen in the remnant's filaments. This along with a bow-shaped morphology suggests it is likely red giant mass-loss material shocked and accelerated by passage of the Cygnus Loop's blast wave. We also identify a B7 V star located along the remnant's northwestern limb which also appears to have interacted with the remnant's shock wave. It lies within a small arc of nebulosity in an unusually complex region of highly curved and distorted filaments along the remnant's northern shock front suggestive of a localized disturbance of the shock front due to the B star's stellar winds. Based on the assumption that these two stars lie inside the remnant, combined with an estimated distance to a molecular cloud situated along the remnant's western limb, we propose a distance to the Cygnus Loop of 1.0 +/- 0.2 kpc. Although larger than several recent estimates of 500 - 800 pc, a distance ~1 kpc helps resolve difficulties with the remnant's postshock cosmic ray and gas pressure ratio and estimated supernova explosion energy. △ Less

Submitted 25 November, 2017; v1 submitted 6 November, 2017; originally announced November 2017.

Comments: 16 pages, 14 figures

arXiv:1709.04441 [pdf, other]

doi 10.1103/PhysRevApplied.9.054017

On-Chip Laser Power Delivery System for Dielectric Laser Accelerators

Authors: Tyler W. Hughes, Si Tan, Zhexin Zhao, Neil V. Sapra, Yun Jo Lee, Kenneth J. Leedle, Huiyang Deng, Yu Miao, Dylan S. Black, Minghao Qi, Olav Solgaard, James S. Harris, Jelena Vuckovic, Robert L. Byer, Shanhui Fan

Abstract: We propose an on-chip optical power delivery system for dielectric laser accelerators based on a fractal 'tree-branch' dielectric waveguide network. This system replaces experimentally demanding free-space manipulations of the driving laser beam with chip-integrated techniques based on precise nano-fabrication, enabling access to orders of magnitude increases in the interaction length and total en… ▽ More We propose an on-chip optical power delivery system for dielectric laser accelerators based on a fractal 'tree-branch' dielectric waveguide network. This system replaces experimentally demanding free-space manipulations of the driving laser beam with chip-integrated techniques based on precise nano-fabrication, enabling access to orders of magnitude increases in the interaction length and total energy gain for these miniature accelerators. Based on computational modeling, in the relativistic regime, our laser delivery system is estimated to provide 21 keV of energy gain over an acceleration length of 192 um with a single laser input, corresponding to a 108 MV/m acceleration gradient. The system may achieve 1 MeV of energy gain over a distance less than 1 cm by sequentially illuminating 49 identical structures. These findings are verified by detailed numerical simulation and modeling of the subcomponents and we provide a discussion of the main constraints, challenges, and relevant parameters in regards to on-chip laser coupling for dielectric laser accelerators. △ Less

Submitted 13 September, 2017; originally announced September 2017.

Comments: 14 pages

Journal ref: Phys. Rev. Applied 9, 054017 (2018)

arXiv:1709.02257 [pdf, other]

doi 10.3847/1538-4357/aa8999

The Transition of a Type IIL Supernova into a Supernova Remnant: Late-time Observations of SN 2013by

Authors: C. S. Black, D. Milisavljevic, R. Margutti, R. A. Fesen, D. Patnaude, S. Parker

Abstract: We present early-time Swift and Chandra X-ray data along with late-time optical and near-infrared observations of SN 2013by, a Type IIL supernova (SN) that occurred in the nearby spiral galaxy ESO 138$-$G10 (D $\sim 14.8$ Mpc). Optical and NIR photometry and spectroscopy follow the late-time evolution of the supernova from days +89 to +457 post-maximum brightness. The optical spectra and X-ray lig… ▽ More We present early-time Swift and Chandra X-ray data along with late-time optical and near-infrared observations of SN 2013by, a Type IIL supernova (SN) that occurred in the nearby spiral galaxy ESO 138$-$G10 (D $\sim 14.8$ Mpc). Optical and NIR photometry and spectroscopy follow the late-time evolution of the supernova from days +89 to +457 post-maximum brightness. The optical spectra and X-ray light curves are consistent with the picture of a SN having prolonged interaction with circumstellar material (CSM) that accelerates the transition from supernova to supernova remnant (SNR). Specifically, we find SN 2013by's H$α$ profile exhibits significant broadening ($\sim$ 10,000 km s$^{-1}$) on day +457, the likely consequence of high-velocity, H-rich material being excited by a reverse shock. A relatively flat X-ray light curve is observed that cannot be modeled using inverse-Compton scattering processes alone but requires an additional energy source most likely originating from the SN-CSM interaction. In addition, we see the first overtone of CO emission near 2.3 $μ$m on day +152, signaling the formation of molecules and dust in the SN ejecta and is the first time CO has been detected in a Type IIL supernova. We compare SN 2013by to Type IIP supernovae whose spectra show the rarely observed SN-to-SNR transition in varying degrees and conclude that Type IIL SNe may enter the remnant phase at earlier epochs than their Type IIP counterparts. △ Less

Submitted 11 September, 2017; v1 submitted 5 September, 2017; originally announced September 2017.

Comments: Accepted for publication in ApJ. 9 Pages, 10 figures, 3 tables

arXiv:1704.07699 [pdf, other]

doi 10.1038/s41598-018-19781-5

Perivascular Spaces Segmentation in Brain MRI Using Optimal 3D Filtering

Authors: Lucia Ballerini, Ruggiero Lovreglio, Maria del C. Valdes-Hernandez, Joel Ramirez, Bradley J. MacIntosh, Sandra E. Black, Joanna M. Wardlaw

Abstract: Perivascular Spaces (PVS) are a recently recognised feature of Small Vessel Disease (SVD), also indicating neuroinflammation, and are an important part of the brain's circulation and glymphatic drainage system. Quantitative analysis of PVS on Magnetic Resonance Images (MRI) is important for understanding their relationship with neurological diseases. In this work, we propose a segmentation techniq… ▽ More Perivascular Spaces (PVS) are a recently recognised feature of Small Vessel Disease (SVD), also indicating neuroinflammation, and are an important part of the brain's circulation and glymphatic drainage system. Quantitative analysis of PVS on Magnetic Resonance Images (MRI) is important for understanding their relationship with neurological diseases. In this work, we propose a segmentation technique based on the 3D Frangi filtering for extraction of PVS from MRI. Based on prior knowledge from neuroradiological ratings of PVS, we used ordered logit models to optimise Frangi filter parameters in response to the variability in the scanner's parameters and study protocols. We optimized and validated our proposed models on two independent cohorts, a dementia sample (N=20) and patients who previously had mild to moderate stroke (N=48). Results demonstrate the robustness and generalisability of our segmentation method. Segmentation-based PVS burden estimates correlated with neuroradiological assessments (Spearman's $ρ$ = 0.74, p $<$ 0.001), suggesting the great potential of our proposed method △ Less

Submitted 25 April, 2017; originally announced April 2017.

arXiv:1610.07831 [pdf, ps, other]

doi 10.1002/asna.201613274

Another look at the size of the low-surface brightness galaxy VCC 1661 in the Virgo Cluster

Authors: Andreas Koch, Christine S. Black, R. Michael Rich, Francis A. Longstaff, Michelle L. M. Collins, Joachim Janz

Abstract: We present new wide-field images of the low-surface brightness Virgo Cluster dwarf galaxy VCC 1661. The extant literature lists a broad range of radii for this object, covering a factor of more than four, depending on the filters used and the details of the analyses. While some studies find a radius typical of other Virgo dwarfs and note the normality of this object, any larger spatial extent, tak… ▽ More We present new wide-field images of the low-surface brightness Virgo Cluster dwarf galaxy VCC 1661. The extant literature lists a broad range of radii for this object, covering a factor of more than four, depending on the filters used and the details of the analyses. While some studies find a radius typical of other Virgo dwarfs and note the normality of this object, any larger spatial extent, taken at face value, would render this galaxy the largest dwarf in the Virgo Cluster samples. Confirmation of a large extent of dwarf galaxies has often led to the discovery of tidal tails and would then, also in VCC 1661, indicate a severe state of tidal disruption. Given the importance of galactic sizes for assessing tidal interactions of the satellites with their hosts, we thus combine our surface brightness profile with data from the literature to investigate further the nature of this galaxy. However, our new characteristic radius for VCC 1661 of $r_e=24.1$"$\pm7.7$" and the previously noted smooth appearance of its isophotes are fully consistent with the remainder of the ACSVCS dwarf galaxy population without any need to invoke tidal perturbations. △ Less

Submitted 25 October, 2016; originally announced October 2016.

Comments: 8 pages, 6 figures, accepted for publication in Astronomische Nachrichten

arXiv:1605.01739 [pdf, other]

doi 10.3847/0004-637X/826/1/12

The Intrinsic Eddington Ratio Distribution of Active Galactic Nuclei in Star-forming Galaxies from the Sloan Digital Sky Survey

Authors: M. L. Jones, R. C. Hickox, C. S. Black, K. N. Hainline, M. A. DiPompeo, A. D. Goulding

Abstract: An important question in extragalactic astronomy concerns the distribution of black hole accretion rates of active galactic nuclei (AGN). Based on observations at X-ray wavelengths, the observed Eddington ratio distribution appears as a power law, while optical studies have often yielded a lognormal distribution. There is increasing evidence that these observed discrepancies may be due to contamin… ▽ More An important question in extragalactic astronomy concerns the distribution of black hole accretion rates of active galactic nuclei (AGN). Based on observations at X-ray wavelengths, the observed Eddington ratio distribution appears as a power law, while optical studies have often yielded a lognormal distribution. There is increasing evidence that these observed discrepancies may be due to contamination by star formation and other selection effects. Using a sample of galaxies from the Sloan Digital Sky Survey Data Release 7, we test if an intrinsic Eddington ratio distribution that takes the form of a Schechter function is consistent with previous work that suggests that young galaxies in optical surveys have an observed lognormal Eddington ratio distribution. We simulate the optical emission line properties of a population of galaxies and AGN using a broad instantaneous luminosity distribution described by a Schechter function near the Eddington limit. This simulated AGN population is then compared to observed galaxies via the positions on an emission line excitation diagram and Eddington ratio distributions. We present an improved method for extracting the AGN distribution using BPT diagnostics that allows us to probe over one order of magnitude lower in Eddington ratio counteracting the effects of dilution by star formation. We conclude that for optically selected AGN in young galaxies, the intrinsic Eddington ratio distribution is consistent with a possibly universal, broad power law with an exponential cutoff, as this distribution is observed in old optically selected galaxies and in X-rays. △ Less

Submitted 5 May, 2016; originally announced May 2016.

Comments: 12 pages, 12 figures. Accepted for publication in ApJ

arXiv:1604.01044 [pdf, other]

doi 10.1093/mnras/stw1680

Progressive Red Shifts in the Late-Time Spectra of Type Ia Supernovae

Authors: C. S. Black, R. A. Fesen, J. T. Parrent

Abstract: We examine the evolution of late-time, optical nebular features of Type Ia supernovae (SNe Ia) using a sample consisting of 160 spectra of 27 normal SNe Ia taken from the literature as well as unpublished spectra of SN 2008Q and ASASSN-14lp. Particular attention was given to nebular features between 4000-6000 A in terms of temporal changes in width and central wavelength. Analysis of the prominent… ▽ More We examine the evolution of late-time, optical nebular features of Type Ia supernovae (SNe Ia) using a sample consisting of 160 spectra of 27 normal SNe Ia taken from the literature as well as unpublished spectra of SN 2008Q and ASASSN-14lp. Particular attention was given to nebular features between 4000-6000 A in terms of temporal changes in width and central wavelength. Analysis of the prominent late-time 4700 A feature shows a progressive central wavelength shift from ~4600 A to longer wavelengths out to at least day +300 for our entire sample. We find no evidence for the feature's red-ward shift slowing or halting at an [Fe III] blend centroid of 4701 A as has been proposed. The width of the feature also steadily increases with a FWHM ~170 A at day +100 growing to 200 A or more by day +350. Two weaker adjacent features around 4850 and 5000 A exhibit very similar red shifts to that of the 4700 A feature but show no change in width until very late times. We discuss possible causes for the observed red-ward shifting of these late-time optical features including contribution from [Co II] emission at early nebular epochs and the emergence of additional features at later times. We conclude that the ubiquitous red shift of these common late-time, nebular SN Ia spectral features is not mainly due to a decrease in a blueshift of forbidden Fe lines but the result, in part, of decreasing velocity and/or optical depth of permitted Fe lines. △ Less

Submitted 8 August, 2016; v1 submitted 4 April, 2016; originally announced April 2016.

Comments: 12 pages, 9 figures, accepted for publication in MNRAS

arXiv:1412.3122 [pdf, other]

doi 10.1093/mnras/stu2641

A 3D Kinematic Study of the Northern Ejecta "Jet" of the Crab Nebula

Authors: Christine S. Black, Robert A. Fesen

Abstract: We present [O III] 4959,5007 emission line spectra (FWHM = 40 km/s) of the Crab Nebula's northern ejecta `jet'. These data, along with a recent [O III] image of the Crab, are used to build 3-dimensional models of the jet and adjacent remnant nebulosity to better understand the jet's properties and possible formation. We find that the jet's radial velocities range from -190 to +480 km/s with transv… ▽ More We present [O III] 4959,5007 emission line spectra (FWHM = 40 km/s) of the Crab Nebula's northern ejecta `jet'. These data, along with a recent [O III] image of the Crab, are used to build 3-dimensional models of the jet and adjacent remnant nebulosity to better understand the jet's properties and possible formation. We find that the jet's radial velocities range from -190 to +480 km/s with transverse velocities from 1600 to 2650 km/s from base to tip. The jet appears virtually hollow in [O III] emission with the exception of some material at the jet's base where the it connects with the remnant. Our 3D reconstructions indicate that the jet is elliptical in shape and slightly funnel-like rather than a straight cylindrical tube as previously thought. At the base of the jet we find evidence for a significant opening or "channel" in the Crab's main nebula shell. Our analysis of the jet's expansion properties and location supports the theory that the jet may simply represent the highest velocity component of the Crab's N-S bipolar expansion. △ Less

Submitted 11 December, 2014; v1 submitted 9 December, 2014; originally announced December 2014.

Comments: 11 pages, 8 figures, 2 tables; Accepted for publication in MNRAS; minor changes

arXiv:1310.5788 [pdf, other]

Forbidden minors for graphs with no first obstruction to parametric Feynman integration

Authors: Samson Black, Iain Crump, Matt DeVos, Karen Yeats

Abstract: We give a characterization of 3-connected graphs which are planar and forbid cube, octahedron, and $H$ minors, where $H$ is the graph which is one $Δ-Y$ away from each of the cube and the octahedron. Next we say a graph is Feynman 5-split if no choice of edge ordering gives an obstruction to parametric Feynman integration at the fifth step. The 3-connected Feynman 5-split graphs turn out to be pre… ▽ More We give a characterization of 3-connected graphs which are planar and forbid cube, octahedron, and $H$ minors, where $H$ is the graph which is one $Δ-Y$ away from each of the cube and the octahedron. Next we say a graph is Feynman 5-split if no choice of edge ordering gives an obstruction to parametric Feynman integration at the fifth step. The 3-connected Feynman 5-split graphs turn out to be precisely those characterized above. Finally we derive the full list of forbidden minors for Feynman 5-split graphs of any connectivity. △ Less

Submitted 2 October, 2014; v1 submitted 21 October, 2013; originally announced October 2013.

Comments: A few changes suggested by the referees. 55 pages. Code and output is included with the source

MSC Class: Primary 05C75; Secondary 81T18

arXiv:1207.2762 [pdf, ps, other]

doi 10.1088/2041-8205/755/1/L13

Threshing in Action - The tidal disruption of a dwarf galaxy by the Hydra I Cluster

Authors: Andreas Koch, Andreas Burkert, R. Michael Rich, Michelle L. M. Collins, Christine S. Black, Michael Hilker, Andrew J. Benson

Abstract: We report on the discovery of strong tidal features around a dwarf spheroidal galaxy in the Hydra I galaxy cluster, indicating its ongoing tidal disruption. This very low surface brightness object, HCC-087, was originally classified as an early-type dwarf in the Hydra Cluster Catalogue (HCC), but our re-analysis of the ESO-VLT/FORS images of the HCC unearthed a clear indication of an S-shaped morp… ▽ More We report on the discovery of strong tidal features around a dwarf spheroidal galaxy in the Hydra I galaxy cluster, indicating its ongoing tidal disruption. This very low surface brightness object, HCC-087, was originally classified as an early-type dwarf in the Hydra Cluster Catalogue (HCC), but our re-analysis of the ESO-VLT/FORS images of the HCC unearthed a clear indication of an S-shaped morphology and a large spatial extent. Its shape, luminosity (M_V=-11.6 mag), and physical size (at a half-light radius of 3.1 kpc and a full length of ~5.9 kpc) are comparable to the recently discovered NGC 4449B and the Sagittarius dwarf spheroidal, all of which are undergoing clear tidal disruption. Aided by N-body simulations we argue that HCC-087 is currently at its first apocenter, at 150 kpc, around the cluster center and that it is being tidally disrupted by the galaxy cluster's potential itself. An interaction with the near-by (50 kpc) S0 cluster galaxy HCC-005, at M* ~ 3 x 10^10 M_sun is rather unlikely, as this constellation requires a significant amount of dynamical friction and thus low relative velocities. The S-shaped morphology and large spatial extent of the satellite would, however, also appear if HCC-087 would orbit the cluster center. These features appear to be characteristic properties of satellites that are seen in the process of being tidally disrupted, independent of the environment of the destruction. An important finding of our simulations is an orientation of the tidal tails perpendicular to the orbit. △ Less

Submitted 11 July, 2012; originally announced July 2012.

Comments: 5 pages, 4 figures, accepted for publication in the Astrophysical Journal Letters. Some figure sizes reduced

arXiv:1002.4860 [pdf, ps, other]

doi 10.1142/S0218216511009741

A state-sum formula for the Alexander polynomial

Authors: Samson Black

Abstract: We develop a diagrammatic formalism for calculating the Alexander polynomial of the closure of a braid as a state-sum. Our main tools are the Markov trace formulas for the HOMFLY-PT polynomial and Young's semi-normal representations of the Iwahori-Hecke algebras of type A. We develop a diagrammatic formalism for calculating the Alexander polynomial of the closure of a braid as a state-sum. Our main tools are the Markov trace formulas for the HOMFLY-PT polynomial and Young's semi-normal representations of the Iwahori-Hecke algebras of type A. △ Less

Submitted 3 June, 2010; v1 submitted 25 February, 2010; originally announced February 2010.

Comments: 8 pages

MSC Class: 57M27; 57M25

Journal ref: SAMSON BLACK, J. Knot Theory Ramifications, 21, 1250008 (2012)

arXiv:0911.3358 [pdf, ps, other]

doi 10.1063/1.3427224

A multi-purpose modular system for high-resolution microscopy at high hydrostatic pressure

Authors: Hugh Vass, S. Lucas Black, Eva M. Herzig, F. Bruce Ward, Paul S. Clegg, Rosalind J. Allen

Abstract: We have developed a modular system for high-resolution microscopy at high hydrostatic pressure. The system consists of a pressurised cell of volume ~100 microlitres, a temperature controlled holder, a ram and a piston. We have made each of these components in several versions which can be interchanged to allow a wide range of applications. Here, we report two pressure cells with pressure ranges… ▽ More We have developed a modular system for high-resolution microscopy at high hydrostatic pressure. The system consists of a pressurised cell of volume ~100 microlitres, a temperature controlled holder, a ram and a piston. We have made each of these components in several versions which can be interchanged to allow a wide range of applications. Here, we report two pressure cells with pressure ranges 0.1-700MPa and 0.1-100MPa, which can be combined with hollow or solid rams and pistons. Our system is designed to work with fluorescent samples (using a confocal or epifluorescence microscope), but also allows for transmitted light microscopy via the hollow ram and piston. The system allows precise control of pressure and temperature [-20-70C], as well as rapid pressure quenching. We demonstrate its performance and versatility with two applications: time-resolved imaging of colloidal phase transitions caused by pressure changes between 0.1MPa and 101MPa, and imaging the growth of Escherichia coli bacteria at 50MPa. We also show that the isotropic-nematic phase transition of pentyl-cyanobiphenyl (5CB) liquid crystal provides a simple, convenient and accurate method for calibrating pressure in the range 0.1-200MPa. △ Less

Submitted 17 November, 2009; originally announced November 2009.

Showing 1–41 of 41 results for author: Black, S