Search | arXiv e-print repository

arXiv:2407.03428 [pdf, other]

NEBULA: Neural Empirical Bayes Under Latent Representations for Efficient and Controllable Design of Molecular Libraries

Authors: Ewa M. Nowara, Pedro O. Pinheiro, Sai Pooja Mahajan, Omar Mahmood, Andrew Martin Watkins, Saeed Saremi, Michael Maser

Abstract: We present NEBULA, the first latent 3D generative model for scalable generation of large molecular libraries around a seed compound of interest. Such libraries are crucial for scientific discovery, but it remains challenging to generate large numbers of high quality samples efficiently. 3D-voxel-based methods have recently shown great promise for generating high quality samples de novo from random… ▽ More We present NEBULA, the first latent 3D generative model for scalable generation of large molecular libraries around a seed compound of interest. Such libraries are crucial for scientific discovery, but it remains challenging to generate large numbers of high quality samples efficiently. 3D-voxel-based methods have recently shown great promise for generating high quality samples de novo from random noise (Pinheiro et al., 2023). However, sampling in 3D-voxel space is computationally expensive and use in library generation is prohibitively slow. Here, we instead perform neural empirical Bayes sampling (Saremi & Hyvarinen, 2019) in the learned latent space of a vector-quantized variational autoencoder. NEBULA generates large molecular libraries nearly an order of magnitude faster than existing methods without sacrificing sample quality. Moreover, NEBULA generalizes better to unseen drug-like molecules, as demonstrated on two public datasets and multiple recently released drugs. We expect the approach herein to be highly enabling for machine learning-based drug discovery. The code is available at https://github.com/prescient-design/nebula △ Less

Submitted 3 July, 2024; originally announced July 2024.

arXiv:2407.00236 [pdf, other]

Closed-Form Test Functions for Biophysical Sequence Optimization Algorithms

Authors: Samuel Stanton, Robert Alberstein, Nathan Frey, Andrew Watkins, Kyunghyun Cho

Abstract: There is a growing body of work seeking to replicate the success of machine learning (ML) on domains like computer vision (CV) and natural language processing (NLP) to applications involving biophysical data. One of the key ingredients of prior successes in CV and NLP was the broad acceptance of difficult benchmarks that distilled key subproblems into approachable tasks that any junior researcher… ▽ More There is a growing body of work seeking to replicate the success of machine learning (ML) on domains like computer vision (CV) and natural language processing (NLP) to applications involving biophysical data. One of the key ingredients of prior successes in CV and NLP was the broad acceptance of difficult benchmarks that distilled key subproblems into approachable tasks that any junior researcher could investigate, but good benchmarks for biophysical domains are rare. This scarcity is partially due to a narrow focus on benchmarks which simulate biophysical data; we propose instead to carefully abstract biophysical problems into simpler ones with key geometric similarities. In particular we propose a new class of closed-form test functions for biophysical sequence optimization, which we call Ehrlich functions. We provide empirical results demonstrating these functions are interesting objects of study and can be non-trivial to solve with a standard genetic optimization baseline. △ Less

Submitted 28 June, 2024; originally announced July 2024.

arXiv:2406.11962 [pdf, other]

The properties of AGN in dwarf galaxies identified via SED fitting

Authors: B. Bichang'a, S. Kaviraj, I. Lazar, R. A. Jackson, S. Das, D. J. B. Smith, A. E. Watkins, G. Martin

Abstract: Given their dominance of the galaxy number density, dwarf galaxies are central to our understanding of galaxy formation. While the incidence of AGN and their impact on galaxy evolution has been extensively studied in massive galaxies, much less is known about the role of AGN in the evolution of dwarfs. We search for radiatively-efficient AGN in the nearby (0.1 < z < 0.3) dwarf (10^8 MSun < M < 10^… ▽ More Given their dominance of the galaxy number density, dwarf galaxies are central to our understanding of galaxy formation. While the incidence of AGN and their impact on galaxy evolution has been extensively studied in massive galaxies, much less is known about the role of AGN in the evolution of dwarfs. We search for radiatively-efficient AGN in the nearby (0.1 < z < 0.3) dwarf (10^8 MSun < M < 10^10 MSun) population, using SED fitting (via Prospector) applied to deep ultraviolet to mid-infrared photometry of 508 dwarf galaxies. Around a third (32 +/- 2 per cent) of our dwarfs show signs of AGN activity. We compare the properties of our dwarf AGN to control samples, constructed from non-AGN, which have the same distributions of redshift and stellar mass as their AGN counterparts. KS tests between the AGN and control distributions indicates that the AGN do not show differences in their distances to nodes, filaments and nearby massive galaxies from their control counterparts. This indicates that AGN triggering in the dwarf regime is not strongly correlated with local environment. The fraction of AGN hosts with early-type morphology and those that are interacting are also indistinguishable from the controls within the uncertainties, suggesting that interactions do not play a significant role in inducing AGN activity in our sample. Finally, the star formation activity in dwarf AGN is only slightly lower than that in their control counterparts, suggesting that the presence of radiatively-efficient AGN does not lead to significant, prompt quenching of star formation in these systems. △ Less

Submitted 17 June, 2024; originally announced June 2024.

Comments: Accepted for publication in MNRAS

arXiv:2405.06184 [pdf]

RT-utils: A Minimal Python Library for RT-struct Manipulation

Authors: Asim Shrestha, Adam Watkins, Fereshteh Yousefirizi, Arman Rahmim, Carlos F. Uribe

Abstract: Towards the need for automated and precise AI-based analysis of medical images, we present RT-utils, a specialized Python library tuned for the manipulation of radiotherapy (RT) structures stored in DICOM format. RT-utils excels in converting the polygon contours into binary masks, ensuring accuracy and efficiency. By converting DICOM RT structures into standardized formats such as NumPy arrays an… ▽ More Towards the need for automated and precise AI-based analysis of medical images, we present RT-utils, a specialized Python library tuned for the manipulation of radiotherapy (RT) structures stored in DICOM format. RT-utils excels in converting the polygon contours into binary masks, ensuring accuracy and efficiency. By converting DICOM RT structures into standardized formats such as NumPy arrays and SimpleITK Images, RT-utils optimizes inputs for computational solutions such as AI-based automated segmentation techniques or radiomics analysis. Since its inception in 2020, RT-utils has been used extensively with a focus on simplifying complex data processing tasks. RT-utils offers researchers a powerful solution to enhance workflows and drive significant advancements in medical imaging. △ Less

Submitted 9 May, 2024; originally announced May 2024.

arXiv:2404.19003 [pdf, other]

Implications on star-formation-rate indicators from HII regions and diffuse ionised gas in the M101 Group

Authors: A. E. Watkins, J. C. Mihos, P. Harding, R. Garner III

Abstract: We examine the connection between diffuse ionised gas (DIG), HII regions, and field O and B stars in the nearby spiral M101 and its dwarf companion NGC 5474 using ultra-deep H$α$ narrow-band imaging and archival GALEX UV imaging. We find a strong correlation between DIG H$α$ surface brightness and the incident ionising flux leaked from the nearby HII regions, which we reproduce well using simple C… ▽ More We examine the connection between diffuse ionised gas (DIG), HII regions, and field O and B stars in the nearby spiral M101 and its dwarf companion NGC 5474 using ultra-deep H$α$ narrow-band imaging and archival GALEX UV imaging. We find a strong correlation between DIG H$α$ surface brightness and the incident ionising flux leaked from the nearby HII regions, which we reproduce well using simple Cloudy simulations. While we also find a strong correlation between H$α$ and co-spatial FUV surface brightness in DIG, the extinction-corrected integrated UV colours in these regions imply stellar populations too old to produce the necessary ionising photon flux. Combined, this suggests that HII region leakage, not field OB stars, is the primary source of DIG in the M101 Group. Corroborating this interpretation, we find systematic disagreement between the H$α$- and FUV-derived star formation rates (SFRs) in the DIG, with SFR$_{{\rm H}α} < $SFR$_{\rm FUV}$ everywhere. Within HII regions, we find a constant SFR ratio of 0.44 to a limit of $\sim10^{-5}$ M$_{\odot}$~yr$^{-1}$. This result is in tension with other studies of star formation in spiral galaxies, which typically show a declining SFR$_{{\rm H}α}/$SFR$_{\rm FUV}$ ratio at low SFR. We reproduce such trends only when considering spatially averaged photometry that mixes HII regions, DIG, and regions lacking H$α$ entirely, suggesting that the declining trends found in other galaxies may result purely from the relative fraction of diffuse flux, leaky compact HII regions, and non-ionising FUV-emitting stellar populations in different regions within the galaxy. △ Less

Submitted 29 April, 2024; originally announced April 2024.

Comments: 17 pages, 12 figures, accepted for publication in MNRAS

arXiv:2404.12241 [pdf, other]

Introducing v0.5 of the AI Safety Benchmark from MLCommons

Authors: Bertie Vidgen, Adarsh Agrawal, Ahmed M. Ahmed, Victor Akinwande, Namir Al-Nuaimi, Najla Alfaraj, Elie Alhajjar, Lora Aroyo, Trupti Bavalatti, Max Bartolo, Borhane Blili-Hamelin, Kurt Bollacker, Rishi Bomassani, Marisa Ferrara Boston, Siméon Campos, Kal Chakra, Canyu Chen, Cody Coleman, Zacharie Delpierre Coudert, Leon Derczynski, Debojyoti Dutta, Ian Eisenberg, James Ezick, Heather Frase, Brian Fuller , et al. (75 additional authors not shown)

Abstract: This paper introduces v0.5 of the AI Safety Benchmark, which has been created by the MLCommons AI Safety Working Group. The AI Safety Benchmark has been designed to assess the safety risks of AI systems that use chat-tuned language models. We introduce a principled approach to specifying and constructing the benchmark, which for v0.5 covers only a single use case (an adult chatting to a general-pu… ▽ More This paper introduces v0.5 of the AI Safety Benchmark, which has been created by the MLCommons AI Safety Working Group. The AI Safety Benchmark has been designed to assess the safety risks of AI systems that use chat-tuned language models. We introduce a principled approach to specifying and constructing the benchmark, which for v0.5 covers only a single use case (an adult chatting to a general-purpose assistant in English), and a limited set of personas (i.e., typical users, malicious users, and vulnerable users). We created a new taxonomy of 13 hazard categories, of which 7 have tests in the v0.5 benchmark. We plan to release version 1.0 of the AI Safety Benchmark by the end of 2024. The v1.0 benchmark will provide meaningful insights into the safety of AI systems. However, the v0.5 benchmark should not be used to assess the safety of AI systems. We have sought to fully document the limitations, flaws, and challenges of v0.5. This release of v0.5 of the AI Safety Benchmark includes (1) a principled approach to specifying and constructing the benchmark, which comprises use cases, types of systems under test (SUTs), language and context, personas, tests, and test items; (2) a taxonomy of 13 hazard categories with definitions and subcategories; (3) tests for seven of the hazard categories, each comprising a unique set of test items, i.e., prompts. There are 43,090 test items in total, which we created with templates; (4) a grading system for AI systems against the benchmark; (5) an openly available platform, and downloadable tool, called ModelBench that can be used to evaluate the safety of AI systems on the benchmark; (6) an example evaluation report which benchmarks the performance of over a dozen openly available chat-tuned language models; (7) a test specification for the benchmark. △ Less

Submitted 13 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

arXiv:2404.04802 [pdf, ps, other]

Bright Star Subtraction Pipeline for LSST: Progress Review

Authors: Amir E. Bazkiaei, Lee S. Kelvin, Sarah Brough, Simon J. O'Toole, Aaron Watkins, Morgen A. Schmitz

Abstract: We present the Bright Star Subtraction (BSS) pipeline for the Vera C. Rubin Observatory's Legacy Survey of Space and Time (LSST). This pipeline generates an extended PSF model using observed stars and subtracts the model from the bright stars in LSST data. When testing the pipeline on Hyper Suprime-Cam (HSC) data, we find that the shape of the extended PSF model depends on the location of the dete… ▽ More We present the Bright Star Subtraction (BSS) pipeline for the Vera C. Rubin Observatory's Legacy Survey of Space and Time (LSST). This pipeline generates an extended PSF model using observed stars and subtracts the model from the bright stars in LSST data. When testing the pipeline on Hyper Suprime-Cam (HSC) data, we find that the shape of the extended PSF model depends on the location of the detector on the camera's focal plane. The closer a detector is to the edge of the focal plane, the less the extended PSF model is circularly symmetric. We introduce an algorithm that allows the user to consider the location dependency of the model. △ Less

Submitted 7 April, 2024; originally announced April 2024.

Comments: 4 pages, 1 figure; Astronomical Data Analysis Software & Systems XXXIII proceeding

arXiv:2402.12440 [pdf, other]

doi 10.1093/mnras/stae510

The morphological mix of dwarf galaxies in the nearby Universe

Authors: Ilin Lazar, Sugata Kaviraj, Aaron E. Watkins, Garreth Martin, Brian Bichang'a, Ryan A. Jackson

Abstract: We use a complete, unbiased sample of 257 dwarf (10^8 MSun < Mstar < 10^9.5 MSun) galaxies at z < 0.08, in the COSMOS field, to study the morphological mix of the dwarf population in low-density environments. Visual inspection of extremely deep optical images and their unsharp-masked counterparts reveals three principal dwarf morphological classes. 43 and 45 per cent of dwarfs exhibit the traditio… ▽ More We use a complete, unbiased sample of 257 dwarf (10^8 MSun < Mstar < 10^9.5 MSun) galaxies at z < 0.08, in the COSMOS field, to study the morphological mix of the dwarf population in low-density environments. Visual inspection of extremely deep optical images and their unsharp-masked counterparts reveals three principal dwarf morphological classes. 43 and 45 per cent of dwarfs exhibit the traditional `early-type' (elliptical/S0) and `late-type' (spiral) morphologies respectively. However, 10 per cent populate a `featureless' class, that lacks both the central light concentration seen in early-types and any spiral structure - this class is missing in the massive-galaxy regime. 14, 27 and 19 per cent of early-type, late-type and featureless dwarfs respectively show evidence for interactions, which drive around 20 per cent of the overall star formation activity in the dwarf population. Compared to their massive counterparts, dwarf early-types show a much lower incidence of interactions, are significantly less concentrated and share similar rest-frame colours as dwarf late-types. This suggests that the formation histories of dwarf and massive early-types are different, with dwarf early-types being shaped less by interactions and more by secular processes. The lack of large groups or clusters in COSMOS at z < 0.08, and the fact that our dwarf morphological classes show similar local density, suggests that featureless dwarfs in low-density environments are created via internal baryonic feedback, rather than by environmental processes. Finally, while interacting dwarfs can be identified using the asymmetry parameter, it is challenging to cleanly separate early and late-type dwarfs using traditional morphological parameters, such as `CAS', M20 and the Gini coefficient (unlike in the massive-galaxy regime). △ Less

Submitted 1 March, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

Comments: Accepted for publication in MNRAS, 20 pages, 19 figures (very minor textual changes to match published version)

arXiv:2401.12297 [pdf, other]

doi 10.1093/mnras/stae236

Strategies for optimal sky subtraction in the low surface brightness regime

Authors: A. E. Watkins, S. Kaviraj, C. C. Collins, J. H. Knapen, L. S. Kelvin, P. -A. Duc, J. Román, J. C. Mihos

Abstract: The low surface brightness (LSB) regime ($μ_{g} \gtrsim 26$ mag arcsec$^{-2}$) comprises a vast, mostly unexplored discovery space, from dwarf galaxies to the diffuse interstellar medium. Accessing this regime requires precisely removing instrumental signatures and light contamination, including, most critically, night sky emission. This is not trivial, as faint astrophysical and instrumental cont… ▽ More The low surface brightness (LSB) regime ($μ_{g} \gtrsim 26$ mag arcsec$^{-2}$) comprises a vast, mostly unexplored discovery space, from dwarf galaxies to the diffuse interstellar medium. Accessing this regime requires precisely removing instrumental signatures and light contamination, including, most critically, night sky emission. This is not trivial, as faint astrophysical and instrumental contamination can bias sky models at the precision needed to characterize LSB structures. Using idealized synthetic images, we assess how this bias impacts two common LSB-oriented sky-estimation algorithms: 1.) masking and parametric modelling, and 2.) stacking and smoothing dithered exposures. Undetected flux limits both methods by imposing a pedestal offset to all derived sky models. Careful, deep masking of fixed sources can mitigate this, but source density always imposes a fundamental limit. Stellar scattered light can contribute $\sim28$--$29$ mag arcsec$^{-2}$ of background flux even in low-density fields; its removal is critical prior to sky estimation. For complex skies, image combining is an effective non-parametric approach, although it strongly depends on observing strategy and adds noise to images on the smoothing kernel scale. Preemptive subtraction of fixed sources may be the only practical approach for robust sky estimation. We thus tested a third algorithm, subtracting a preliminary sky-subtracted coadd from exposures to isolate sky emission. Unfortunately, initial errors in sky estimation propagate through all subsequent sky models, making the method impractical. For large-scale surveys like LSST, where key science goals constrain observing strategy, masking and modelling remains the optimal sky estimation approach, assuming stellar scattered light is removed first. △ Less

Submitted 22 January, 2024; originally announced January 2024.

Comments: 18 pages, 11 figures, accepted for publication in MNRAS

arXiv:2401.03985 [pdf, other]

BST1047+1156: A (Failing) Ultradiffuse Tidal Dwarf in the Leo I Group

Authors: J. Christopher Mihos, Patrick R. Durrell, Aaron E. Watkins, Stacy S. McGaugh, John J. Feldmeier

Abstract: We use deep Hubble Space Telescope imaging to study the resolved stellar populations in BST1047+1156, a gas-rich, ultradiffuse dwarf galaxy found in the intragroup environment of the Leo I galaxy group. While our imaging reaches approximately two magnitudes below the tip of the red giant branch at the Leo I distance of 11 Mpc, we find no evidence for an old red giant sequence that would signal an… ▽ More We use deep Hubble Space Telescope imaging to study the resolved stellar populations in BST1047+1156, a gas-rich, ultradiffuse dwarf galaxy found in the intragroup environment of the Leo I galaxy group. While our imaging reaches approximately two magnitudes below the tip of the red giant branch at the Leo I distance of 11 Mpc, we find no evidence for an old red giant sequence that would signal an extended star formation history for the object. Instead, we clearly detect the red and blue helium burning sequences of its stellar populations, as well as the fainter blue main sequence, all indicative of a recent burst of star formation having taken place over the past 50--250 Myr. Comparing to isochrones for young metal-poor stellar populations, we infer this post-starburst population to be moderately metal poor, with metallicity [M/H] in the range -1 to -1.5. The combination of a young, moderately metal-poor post starburst population and no old stars motivates a scenario in which BST1047 was recently formed during a weak burst of star formation in gas that was tidally stripped from the outskirts of the neighboring massive spiral M96. BST1047's extremely diffuse nature, lack of ongoing star formation, and disturbed HI morphology all argue that it is a transitory object, a "failing tidal dwarf" in the process of being disrupted by interactions within the Leo I group. Finally, in the environment surrounding BST1047, our imaging also reveals the old, metal-poor ([M/H]=-1.3 +/- 0.2) stellar halo of M96 at a projected radius of 50 kpc. △ Less

Submitted 8 January, 2024; originally announced January 2024.

Comments: 15 pages, 9 figures, accepted for publication in The Astrophysical Journal

arXiv:2312.02180 [pdf, other]

The Role of Low-energy (< 20 eV) Secondary Electrons in the Extraterrestrial Synthesis of Prebiotic Molecules

Authors: Qin Tong Wu, Hannah Anderson, Aurland K. Watkins, Devyani Arora, Kennedy Barnes, Marco Padovani, Christopher N. Shingledecker, Christopher R. Arumainayagam, James B. R. Battat

Abstract: We demonstrate for the first time that Galactic cosmic rays with energies as high as 1e10 eV can trigger a cascade of low-energy (< 20 eV) secondary electrons that could be a significant contributor to the interstellar synthesis of prebiotic molecules whose delivery by comets, meteorites, and interplanetary dust particles may have kick-started life on Earth. We explore the relative importance of l… ▽ More We demonstrate for the first time that Galactic cosmic rays with energies as high as 1e10 eV can trigger a cascade of low-energy (< 20 eV) secondary electrons that could be a significant contributor to the interstellar synthesis of prebiotic molecules whose delivery by comets, meteorites, and interplanetary dust particles may have kick-started life on Earth. We explore the relative importance of low-energy (< 20 eV) secondary electrons--agents of radiation chemistry--and low-energy (< 10 eV), non-ionizing photons--instigators of photochemistry. Our calculations indicate fluxes of 100 electrons/cm2/s for low-energy secondary electrons produced within interstellar ices due to incident attenuated Galactic cosmic-ray (CR) protons. Consequently, in certain star-forming regions where internal high-energy radiation sources produce ionization rates that are observed to be a thousand times greater than the typical interstellar Galactic ionization rate, the flux of low-energy secondary electrons should far exceed that of non-ionizing photons. Because reaction cross-sections can be several orders of magnitude larger for electrons than for photons, even in the absence of such enhancement our calculations indicate that secondary low-energy electrons are at least as significant as low-energy (< 10 eV) non-ionizing photons in the interstellar synthesis of prebiotic molecules. Most importantly, our results demonstrate the pressing need for explicitly incorporating low-energy electrons in current and future astrochemical simulations of cosmic ices. Such models are critically important for interpreting James Webb Space Telescope infrared measurements, which are currently being used to probe the origins of life by studying complex organic molecules found in ices near star-forming regions. △ Less

Submitted 28 November, 2023; originally announced December 2023.

Comments: 14 pages, 6 figures

arXiv:2311.03237 [pdf, other]

doi 10.1051/0004-6361/202347729

Constraining the top-light initial mass function in the extended ultraviolet disk of M83

Authors: R. P. V. Rautio, A. E. Watkins, H. Salo, A. Venhola, J. H. Knapen, S. Comerón

Abstract: The universality or non-universality of the initial mass function (IMF) has significant implications for determining star formation rates and star formation histories from photometric properties of stellar populations. We reexamine whether the IMF is deficient in high-mass stars (top-light) in the low-density environment of the outer disk of M83 and constrain the shape of the IMF therein. Using ar… ▽ More The universality or non-universality of the initial mass function (IMF) has significant implications for determining star formation rates and star formation histories from photometric properties of stellar populations. We reexamine whether the IMF is deficient in high-mass stars (top-light) in the low-density environment of the outer disk of M83 and constrain the shape of the IMF therein. Using archival Galaxy Evolution Explorer (GALEX) far ultraviolet (FUV) and near ultraviolet (NUV) data and new deep OmegaCAM narrowband H$α$ imaging, we constructed a catalog of FUV-selected objects in the outer disk of M83. We counted H$α$-bright clusters and clusters that are blue in FUV$-$NUV in the catalog, measured the maximum flux ratio $F_{\mathrm{H}α}/f_{λ\mathrm{FUV}}$ among the clusters, and measured the total flux ratio $ΣF_{\mathrm{H}α}/Σf_{λ\mathrm{FUV}}$ over the catalog. We then compared these measurements to predictions from stellar population synthesis models made with a standard Salpeter IMF, truncated IMFs, and steep IMFs. We also investigated the effect of varying the assumed internal extinction on our results. We are not able to reproduce our observations with models using the standard Salpeter IMF or the truncated IMFs. It is only when assuming an average internal extinction of $0.10 < A_{\mathrm{V}} < 0.15$ in the outer disk stellar clusters that models with steep IMFs ($α> 3.1$) simultaneously reproduce the observed cluster counts, the maximum observed $F_{\mathrm{H}α}/f_{λ\mathrm{FUV}}$, and the observed $ΣF_{\mathrm{H}α}/Σf_{λ\mathrm{FUV}}$. Our results support a non-universal IMF that is deficient in high-mass stars in low-density environments. △ Less

Submitted 6 November, 2023; originally announced November 2023.

Comments: 18 pages, 15 figures, accepted to Astronomy & Astrophysics

Journal ref: A&A 681, A76 (2024)

arXiv:2311.02289 [pdf, other]

Fidelity and variability in the interlayer electronic structure of the kagome superconductor CsV3Sb5

Authors: Aurland K. Watkins, Dirk Johrendt, Vojtech Vlcek, Stephen D. Wilson, Ram Seshadri

Abstract: The AV3Sb5 (A = K, Rb, Cs) kagome materials host an interplay of emergent phenomena including superconductivity, charge density wave states, and non-trivial electronic structure topology. The band structures of these materials exhibit a rich variety of features like Dirac crossings, saddle points associated with van Hove singularities, and flat bands prompting significant investigations into the i… ▽ More The AV3Sb5 (A = K, Rb, Cs) kagome materials host an interplay of emergent phenomena including superconductivity, charge density wave states, and non-trivial electronic structure topology. The band structures of these materials exhibit a rich variety of features like Dirac crossings, saddle points associated with van Hove singularities, and flat bands prompting significant investigations into the in-plane electronic behavior. However, recent findings including the charge density wave ordering and effects due to pressure or chemical do** point to the importance of understanding interactions between kagome layers. Probing this c-axis electronic structure via experimental methods remains challenging due to limitations of the crystals and, therefore, rigorous computational approaches are necessary to study the interlayer interactions. Here we use first-principles approaches to study the electronic structure of CsV3Sb5 with emphasis on the kz dispersion. We find that the inclusion of nonlocal and dynamical many-body correlation has a substantial impact on the interlayer band structure. We present new band behavior that additionally supports the integration of symmetry in accurately plotting electronic structures and influences further analysis like the calculation of topological invariants. △ Less

Submitted 3 November, 2023; originally announced November 2023.

Comments: 9 pages, 4 figures

arXiv:2310.07053 [pdf, other]

doi 10.1021/acs.chemmater.3c02814

Soft-Chemical Synthesis, Structure Evolution, and Insulator-to-Metal Transition in a Prototypical Metal Oxide, λ-RhO$_2$

Authors: Juan R. Chamorro, Julia L. Zuo, Euan N. Bassey, Aurland K. Watkins, Guomin Zhu, Arava Zohar, Kira E. Wyckoff, Tiffany L. Kinnibrugh, Saul H. Lapidus, Susanne Stemmer, Raphaële J. Clément, Stephen D. Wilson, Ram Seshadri

Abstract: $λ$-RhO$_2$, a prototype 4d transition metal oxide, has been prepared by oxidative delithiation of spinel LiRh$_2$O$_4$ using ceric ammonium nitrate. Average-structure studies of this RhO$_2… ▽ More $λ$-RhO$_2$, a prototype 4d transition metal oxide, has been prepared by oxidative delithiation of spinel LiRh$_2$O$_4$ using ceric ammonium nitrate. Average-structure studies of this RhO$_2$ polytype, including synchrotron powder X-ray diffraction and electron diffraction, indicate the room temperature structure to be tetragonal, in the space group I41/amd, with a first-order structural transition to cubic Fd-3m at T = 345 K on warming. Synchrotron X-ray pair distribution function analysis and $^7$Li solid state nuclear magnetic resonance measurements suggest that the room temperature structure displays local Rh-Rh bonding. The formation of these local dimers appears to be associated with a metal-to insulator transition with a non-magnetic ground state, as also supported by density functional theory-based electronic structure calculations. This contribution demonstrates the power of soft chemistry to kinetically stabilize a surprisingly simple binary oxide compound. △ Less

Submitted 10 October, 2023; originally announced October 2023.

arXiv:2309.03332 [pdf, other]

doi 10.1103/PhysRevMaterials.7.094402

Magnetic order in the $S_{\mathrm{eff}}$ = 1/2 triangular-lattice compound NdCd$_3$P$_3$

Authors: Juan R. Chamorro, Azzedin R. Jackson, Aurland K. Watkins, Ram Seshadri, Stephen D. Wilson

Abstract: We present and characterize a new member of the $R$Cd$_3$P$_3$ ($R$= rare earth) family of materials, NdCd$_3$P$_3$, which possesses Nd$^{3+}$ cations arranged on well-separated triangular lattice layers. Magnetic susceptibility and heat capacity measurements demonstrate a likely $S_{\mathrm{eff}}$ = 1/2 ground state, and also reveal the formation of long-range antiferromagnetic order at… ▽ More We present and characterize a new member of the $R$Cd$_3$P$_3$ ($R$= rare earth) family of materials, NdCd$_3$P$_3$, which possesses Nd$^{3+}$ cations arranged on well-separated triangular lattice layers. Magnetic susceptibility and heat capacity measurements demonstrate a likely $S_{\mathrm{eff}}$ = 1/2 ground state, and also reveal the formation of long-range antiferromagnetic order at $T_{N} = 0.34$ K. Via measurements of magnetization, heat capacity, and electrical resistivity, we characterize the electronic properties of NdCd$_3$P$_3$ and compare results to density functional theory calculations. △ Less

Submitted 6 September, 2023; originally announced September 2023.

Comments: Accepted for publication at Physical Review Materials

arXiv:2308.05326 [pdf, other]

OpenProteinSet: Training data for structural biology at scale

Authors: Gustaf Ahdritz, Nazim Bouatta, Sachin Kadyan, Lukas Jarosch, Daniel Berenberg, Ian Fisk, Andrew M. Watkins, Stephen Ra, Richard Bonneau, Mohammed AlQuraishi

Abstract: Multiple sequence alignments (MSAs) of proteins encode rich biological information and have been workhorses in bioinformatic methods for tasks like protein design and protein structure prediction for decades. Recent breakthroughs like AlphaFold2 that use transformers to attend directly over large quantities of raw MSAs have reaffirmed their importance. Generation of MSAs is highly computationally… ▽ More Multiple sequence alignments (MSAs) of proteins encode rich biological information and have been workhorses in bioinformatic methods for tasks like protein design and protein structure prediction for decades. Recent breakthroughs like AlphaFold2 that use transformers to attend directly over large quantities of raw MSAs have reaffirmed their importance. Generation of MSAs is highly computationally intensive, however, and no datasets comparable to those used to train AlphaFold2 have been made available to the research community, hindering progress in machine learning for proteins. To remedy this problem, we introduce OpenProteinSet, an open-source corpus of more than 16 million MSAs, associated structural homologs from the Protein Data Bank, and AlphaFold2 protein structure predictions. We have previously demonstrated the utility of OpenProteinSet by successfully retraining AlphaFold2 on it. We expect OpenProteinSet to be broadly useful as training and validation data for 1) diverse tasks focused on protein structure, function, and design and 2) large-scale multimodal machine learning research. △ Less

Submitted 10 August, 2023; originally announced August 2023.

arXiv:2306.11681 [pdf, other]

MoleCLUEs: Molecular Conformers Maximally In-Distribution for Predictive Models

Authors: Michael Maser, Natasa Tagasovska, Jae Hyeon Lee, Andrew Watkins

Abstract: Structure-based molecular ML (SBML) models can be highly sensitive to input geometries and give predictions with large variance. We present an approach to mitigate the challenge of selecting conformations for such models by generating conformers that explicitly minimize predictive uncertainty. To achieve this, we compute estimates of aleatoric and epistemic uncertainties that are differentiable w.… ▽ More Structure-based molecular ML (SBML) models can be highly sensitive to input geometries and give predictions with large variance. We present an approach to mitigate the challenge of selecting conformations for such models by generating conformers that explicitly minimize predictive uncertainty. To achieve this, we compute estimates of aleatoric and epistemic uncertainties that are differentiable w.r.t. latent posteriors. We then iteratively sample new latents in the direction of lower uncertainty by gradient descent. As we train our predictive models jointly with a conformer decoder, the new latent embeddings can be mapped to their corresponding inputs, which we call \textit{MoleCLUEs}, or (molecular) counterfactual latent uncertainty explanations \citep{antoran2020getting}. We assess our algorithm for the task of predicting drug properties from 3D structure with maximum confidence. We additionally analyze the structure trajectories obtained from conformer optimizations, which provide insight into the sources of uncertainty in SBML. △ Less

Submitted 6 November, 2023; v1 submitted 20 June, 2023; originally announced June 2023.

Comments: NeurIPS 2023 AI for Science Workshop

arXiv:2306.09414 [pdf]

Optimizing Roman's High Latitude Wide Area Survey for Low Surface Brightness Astronomy

Authors: Mireia Montes, Francesca Annibali, Michele Bellazzini, Alejandro S. Borlaff, Sarah Brough, Fernando Buitrago, Nushkia Chamba, Chris Collins, Ian Dell'Antonio, Ivanna Escala, Anthony H. Gonzalez, Benne Holwerda, Sugata Kaviraj, Johan Knapen, Anton Koekemoer, Seppo Laine, Pamela Marcum, Garreth Martin, David Martinez-Delgado, Chris Mihos, Massimo Ricotti, Ignacio Trujillo, Aaron E. Watkins

Abstract: One of the last remaining frontiers in optical/near-infrared observational astronomy is the low surface brightness regime (LSB, V-band surface brightness, $μ_V>$ 27 AB mag/arcsec$^2$). These are the structures at very low stellar surface densities, largely unseen by even current wide-field surveys such as the Legacy Survey. Studying this domain promises to be transformative for our understanding o… ▽ More One of the last remaining frontiers in optical/near-infrared observational astronomy is the low surface brightness regime (LSB, V-band surface brightness, $μ_V>$ 27 AB mag/arcsec$^2$). These are the structures at very low stellar surface densities, largely unseen by even current wide-field surveys such as the Legacy Survey. Studying this domain promises to be transformative for our understanding of star formation in low-mass galaxies, the hierarchical assembly of galaxies and galaxy clusters, and the nature of dark matter. It is thus essential to reach depths beyond $μ_V$ = 30 AB mag/arcsec$^2$ to detect the faintest extragalactic sources, such as dwarf galaxies and the stellar halos around galaxies and within galaxy clusters. The High Latitude Wide Area Survey offers a unique opportunity to statistically study the LSB universe at unprecedented depths in the IR over an area of $\sim$2000 square degrees. The high spatial resolution will minimize source confusion, allowing an unbiased characterization of LSB structures, including the identification of stars in nearby LSB galaxies and globular clusters. In addition, the combination of Roman with other upcoming deep imaging observatories (such as Rubin) will provide multi-wavelength coverage to derive photometric redshifts and infer the stellar populations of LSB objects. △ Less

Submitted 15 June, 2023; originally announced June 2023.

Comments: White paper submitted to the call for input for the Roman Space Telescope's Core Community Surveys

arXiv:2306.07473 [pdf, other]

3D molecule generation by denoising voxel grids

Authors: Pedro O. Pinheiro, Joshua Rackers, Joseph Kleinhenz, Michael Maser, Omar Mahmood, Andrew Martin Watkins, Stephen Ra, Vishnu Sresht, Saeed Saremi

Abstract: We propose a new score-based approach to generate 3D molecules represented as atomic densities on regular grids. First, we train a denoising neural network that learns to map from a smooth distribution of noisy molecules to the distribution of real molecules. Then, we follow the neural empirical Bayes framework (Saremi and Hyvarinen, 19) and generate molecules in two steps: (i) sample noisy densit… ▽ More We propose a new score-based approach to generate 3D molecules represented as atomic densities on regular grids. First, we train a denoising neural network that learns to map from a smooth distribution of noisy molecules to the distribution of real molecules. Then, we follow the neural empirical Bayes framework (Saremi and Hyvarinen, 19) and generate molecules in two steps: (i) sample noisy density grids from a smooth distribution via underdamped Langevin Markov chain Monte Carlo, and (ii) recover the "clean" molecule by denoising the noisy grid with a single step. Our method, VoxMol, generates molecules in a fundamentally different way than the current state of the art (ie, diffusion models applied to atom point clouds). It differs in terms of the data representation, the noise model, the network architecture and the generative modeling algorithm. Our experiments show that VoxMol captures the distribution of drug-like molecules better than state of the art, while being faster to generate samples. △ Less

Submitted 8 March, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

arXiv:2305.08598 [pdf, other]

doi 10.1145/3593013.3593978

Humans, AI, and Context: Understanding End-Users' Trust in a Real-World Computer Vision Application

Authors: Sunnie S. Y. Kim, Elizabeth Anne Watkins, Olga Russakovsky, Ruth Fong, Andrés Monroy-Hernández

Abstract: Trust is an important factor in people's interactions with AI systems. However, there is a lack of empirical studies examining how real end-users trust or distrust the AI system they interact with. Most research investigates one aspect of trust in lab settings with hypothetical end-users. In this paper, we provide a holistic and nuanced understanding of trust in AI through a qualitative case study… ▽ More Trust is an important factor in people's interactions with AI systems. However, there is a lack of empirical studies examining how real end-users trust or distrust the AI system they interact with. Most research investigates one aspect of trust in lab settings with hypothetical end-users. In this paper, we provide a holistic and nuanced understanding of trust in AI through a qualitative case study of a real-world computer vision application. We report findings from interviews with 20 end-users of a popular, AI-based bird identification app where we inquired about their trust in the app from many angles. We find participants perceived the app as trustworthy and trusted it, but selectively accepted app outputs after engaging in verification behaviors, and decided against app adoption in certain high-stakes scenarios. We also find domain knowledge and context are important factors for trust-related assessment and decision-making. We discuss the implications of our findings and provide recommendations for future research on trust in AI. △ Less

Submitted 15 May, 2023; originally announced May 2023.

Comments: FAccT 2023

arXiv:2302.13733 [pdf, other]

doi 10.1093/mnras/stad654

A possible signature of the influence of tidal perturbations in dwarf galaxy scaling relations

Authors: A. E. Watkins, H. Salo, S. Kaviraj, C. A. Collins, J. H. Knapen, A. Venhola, J. Román

Abstract: Dwarf galaxies are excellent cosmological probes, because their shallow potential wells make them very sensitive to the key processes that drive galaxy evolution, including baryonic feedback, tidal interactions, and ram pressure strip**. However, some of the key parameters of dwarf galaxies, which help trace the effects of these processes, are still debated, including the relationship between th… ▽ More Dwarf galaxies are excellent cosmological probes, because their shallow potential wells make them very sensitive to the key processes that drive galaxy evolution, including baryonic feedback, tidal interactions, and ram pressure strip**. However, some of the key parameters of dwarf galaxies, which help trace the effects of these processes, are still debated, including the relationship between their sizes and masses. We re-examine the Fornax Cluster dwarf population from the point of view of isomass-radius--stellar mass relations (IRSMRs) using the Fornax Deep Survey Dwarf galaxy Catalogue, with the centrally located (among dwarfs) $3.63 \mathcal{M}_{\odot}$~pc$^{-2}$ isodensity radius defining our fiducial relation. This relation is a powerful diagnostic tool for identifying dwarfs with unusual structure, as dwarf galaxies' remarkable monotonicity in light profile shapes, as a function of stellar mass, reduces the relation's scatter tremendously. By examining how different dwarf properties (colour, tenth-nearest-neighbour distance, etc.) correlate with distance from our fiducial relation, we find a significant population of structural outliers with comparatively lower central mass surface density and larger half-light-radii, residing in locally denser regions in the cluster, albeit with similar red colours. We propose that these faint, extended outliers likely formed through tidal disturbances, which make the dwarfs more diffuse, but with little mass loss. Comparing these outliers with ultra-diffuse galaxies (UDGs), we find that the term UDG lacks discriminatory power; UDGs in the Fornax Cluster lie both on and off of IRSMRs defined at small radii, while IRSMR outliers with masses below $\sim 10^{7.5} \mathcal{M}_{\odot}$ are excluded from the UDG classification due to their small effective radii. △ Less

Submitted 27 February, 2023; originally announced February 2023.

Comments: 16 pages (+2 appendix), 10 figures, accepted for publication in MNRAS

arXiv:2302.07754 [pdf, other]

SupSiam: Non-contrastive Auxiliary Loss for Learning from Molecular Conformers

Authors: Michael Maser, Ji Won Park, Joshua Yao-Yu Lin, Jae Hyeon Lee, Nathan C. Frey, Andrew Watkins

Abstract: We investigate Siamese networks for learning related embeddings for augmented samples of molecular conformers. We find that a non-contrastive (positive-pair only) auxiliary task aids in supervised training of Euclidean neural networks (E3NNs) and increases manifold smoothness (MS) around point-cloud geometries. We demonstrate this property for multiple drug-activity prediction tasks while maintain… ▽ More We investigate Siamese networks for learning related embeddings for augmented samples of molecular conformers. We find that a non-contrastive (positive-pair only) auxiliary task aids in supervised training of Euclidean neural networks (E3NNs) and increases manifold smoothness (MS) around point-cloud geometries. We demonstrate this property for multiple drug-activity prediction tasks while maintaining relevant performance metrics, and propose an extension of MS to probabilistic and regression settings. We provide an analysis of representation collapse, finding substantial effects of task-weighting, latent dimension, and regularization. We expect the presented protocol to aid in the development of reliable E3NNs from molecular conformers, even for small-data drug discovery programs. △ Less

Submitted 15 February, 2023; originally announced February 2023.

Comments: Submitted to the MLDD workshop, ICLR 2023

arXiv:2302.06631 [pdf, other]

doi 10.1093/mnras/stad224

Relaxed blue ellipticals: accretion-driven stellar growth is a key evolutionary channel for low mass elliptical galaxies

Authors: Ilin Lazar, Sugata Kaviraj, Garreth Martin, Clotilde Laigle, Aaron E. Watkins, Ryan A. Jackson

Abstract: How elliptical galaxies form is a key question in observational cosmology. While the formation of massive ellipticals is strongly linked to mergers, the low mass (Mstar < 10^9.5 MSun) regime remains less well explored. In particular, studying elliptical populations when they are blue, and therefore rapidly building stellar mass, offers strong constraints on their formation. Here, we study 108 blue… ▽ More How elliptical galaxies form is a key question in observational cosmology. While the formation of massive ellipticals is strongly linked to mergers, the low mass (Mstar < 10^9.5 MSun) regime remains less well explored. In particular, studying elliptical populations when they are blue, and therefore rapidly building stellar mass, offers strong constraints on their formation. Here, we study 108 blue, low-mass ellipticals (which have a median stellar mass of 10^8.7 MSun) at z < 0.3 in the COSMOS field. Visual inspection of extremely deep optical HSC images indicates that less than 3 per cent of these systems have visible tidal features, a factor of 2 less than the incidence of tidal features in a control sample of galaxies with the same distribution of stellar mass and redshift. This suggests that the star formation activity in these objects is not driven by mergers or interactions but by secular gas accretion. We combine accurate physical parameters from the COSMOS2020 catalog, with measurements of local density and the locations of galaxies in the cosmic web, to show that our blue ellipticals reside in low-density environments, further away from nodes and large-scale filaments than other galaxies. At similar stellar masses and environments, blue ellipticals outnumber their normal (red) counterparts by a factor of 2. Thus, these systems are likely progenitors of not only normal ellipticals at similar stellar mass but, given their high star formation rates, also of ellipticals at higher stellar masses. Secular gas accretion, therefore, likely plays a significant (and possibly dominant) role in the stellar assembly of elliptical galaxies in the low mass regime. △ Less

Submitted 13 February, 2023; originally announced February 2023.

Comments: Published in MNRAS

Journal ref: MNRAS, 520, 2109 (2023)

arXiv:2211.07463 [pdf, other]

doi 10.3847/1538-4357/aca27a

Deep Narrowband Photometry of the M101 Group: Strong-Line Abundances of 720 HII Regions

Authors: Ray Garner III, J. Christopher Mihos, Paul Harding, Aaron E. Watkins, Stacy S. McGaugh

Abstract: We present deep, narrowband imaging of the nearby spiral galaxy M101 and its satellites to analyze the oxygen abundances of their HII regions. Using CWRU's Burrell Schmidt telescope, we add to the narrowband dataset of the M101 Group, consisting of H$α$, H$β$, and [OIII] emission lines, the blue [OII]$λ$3727 emission line for the first time. This allows for complete spatial coverage of the oxygen… ▽ More We present deep, narrowband imaging of the nearby spiral galaxy M101 and its satellites to analyze the oxygen abundances of their HII regions. Using CWRU's Burrell Schmidt telescope, we add to the narrowband dataset of the M101 Group, consisting of H$α$, H$β$, and [OIII] emission lines, the blue [OII]$λ$3727 emission line for the first time. This allows for complete spatial coverage of the oxygen abundance of the entire M101 Group. We used the strong-line ratio $R_{23}$ to estimate oxygen abundances for the HII regions in our sample, utilizing three different calibration techniques to provide a baseline estimate of the oxygen abundances. This results in ~650 HII regions for M101, 10 HII regions for NGC 5477, and ~60 HII regions for NGC 5474, the largest sample for this Group to date. M101 shows a strong abundance gradient while the satellite galaxies present little or no gradient. There is some evidence for a flattening of the gradient in M101 beyond $R \sim 14 \text{ kpc}$. Additionally, M101 shows signs of azimuthal abundance variations to the west and southwest. The radial and azimuthal abundance variations in M101 are likely explained by an interaction it had with its most massive satellite NGC 5474 ~300 Myr ago combined with internal dynamical effects such as corotation. △ Less

Submitted 14 November, 2022; originally announced November 2022.

Comments: 24 pages, 14 figures, 5 tables. Accepted to ApJ

arXiv:2210.04096 [pdf, other]

PropertyDAG: Multi-objective Bayesian optimization of partially ordered, mixed-variable properties for biological sequence design

Authors: Ji Won Park, Samuel Stanton, Saeed Saremi, Andrew Watkins, Henri Dwyer, Vladimir Gligorijevic, Richard Bonneau, Stephen Ra, Kyunghyun Cho

Abstract: Bayesian optimization offers a sample-efficient framework for navigating the exploration-exploitation trade-off in the vast design space of biological sequences. Whereas it is possible to optimize the various properties of interest jointly using a multi-objective acquisition function, such as the expected hypervolume improvement (EHVI), this approach does not account for objectives with a hierarch… ▽ More Bayesian optimization offers a sample-efficient framework for navigating the exploration-exploitation trade-off in the vast design space of biological sequences. Whereas it is possible to optimize the various properties of interest jointly using a multi-objective acquisition function, such as the expected hypervolume improvement (EHVI), this approach does not account for objectives with a hierarchical dependency structure. We consider a common use case where some regions of the Pareto frontier are prioritized over others according to a specified $\textit{partial ordering}$ in the objectives. For instance, when designing antibodies, we would like to maximize the binding affinity to a target antigen only if it can be expressed in live cell culture -- modeling the experimental dependency in which affinity can only be measured for antibodies that can be expressed and thus produced in viable quantities. In general, we may want to confer a partial ordering to the properties such that each property is optimized conditioned on its parent properties satisfying some feasibility condition. To this end, we present PropertyDAG, a framework that operates on top of the traditional multi-objective BO to impose this desired ordering on the objectives, e.g. expression $\rightarrow$ affinity. We demonstrate its performance over multiple simulated active learning iterations on a penicillin production task, toy numerical problem, and a real-world antibody design task. △ Less

Submitted 8 October, 2022; originally announced October 2022.

Comments: 9 pages, 7 figures. Submitted to NeurIPS 2022 AI4Science Workshop

arXiv:2210.03735 [pdf, other]

doi 10.1145/3544548.3581001

"Help Me Help the AI": Understanding How Explainability Can Support Human-AI Interaction

Authors: Sunnie S. Y. Kim, Elizabeth Anne Watkins, Olga Russakovsky, Ruth Fong, Andrés Monroy-Hernández

Abstract: Despite the proliferation of explainable AI (XAI) methods, little is understood about end-users' explainability needs and behaviors around XAI explanations. To address this gap and contribute to understanding how explainability can support human-AI interaction, we conducted a mixed-methods study with 20 end-users of a real-world AI application, the Merlin bird identification app, and inquired abou… ▽ More Despite the proliferation of explainable AI (XAI) methods, little is understood about end-users' explainability needs and behaviors around XAI explanations. To address this gap and contribute to understanding how explainability can support human-AI interaction, we conducted a mixed-methods study with 20 end-users of a real-world AI application, the Merlin bird identification app, and inquired about their XAI needs, uses, and perceptions. We found that participants desire practically useful information that can improve their collaboration with the AI, more so than technical system details. Relatedly, participants intended to use XAI explanations for various purposes beyond understanding the AI's outputs: calibrating trust, improving their task skills, changing their behavior to supply better inputs to the AI, and giving constructive feedback to developers. Finally, among existing XAI approaches, participants preferred part-based explanations that resemble human reasoning and explanations. We discuss the implications of our findings and provide recommendations for future XAI design. △ Less

Submitted 16 February, 2023; v1 submitted 2 October, 2022; originally announced October 2022.

Comments: CHI 2023

Journal ref: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI '23), April 23-28, 2023, Hamburg, Germany. ACM, New York, NY, USA

arXiv:2208.13527 [pdf, other]

doi 10.1051/0004-6361/202142447

Linking star formation thresholds and truncations in the thin and thick disks of the low-mass galaxy UGC 7321

Authors: Simón Díaz-García, Sébastien Comerón, Stéphane Courteau, Aaron E. Watkins, Johan H. Knapen, Javier Román

Abstract: Thin and thick disks are found in most spiral galaxies, yet their formation scenarios remain uncertain. Whether thick disks form through slow or fast, internal or environmental, processes is unclear. The physical origin of outer truncations in thin and thick disks, observed as a drop in optical and near-infrared (NIR) surface brightness profiles, is also a much debated topic. These truncations hav… ▽ More Thin and thick disks are found in most spiral galaxies, yet their formation scenarios remain uncertain. Whether thick disks form through slow or fast, internal or environmental, processes is unclear. The physical origin of outer truncations in thin and thick disks, observed as a drop in optical and near-infrared (NIR) surface brightness profiles, is also a much debated topic. These truncations have been linked to star formation (SF) thresholds in Milky-Way type galaxies, but no such connection has been made for their low-mass counterparts or in thick disks. Our photometric analysis of the edge-on galaxy UGC 7321 offers a possible breakthrough. This well-studied diffuse, isolated, bulgeless, ultra-thin galaxy is thought to be under-evolved both dynamically and in SF. It is an ideal target to disentangle internal effects in the formation of thick disks and truncations. Our axial light profiles from deep far- and near-ultraviolet (UV; GALEX) images, tracing recent SF, and optical (DESI grz) and NIR (Spitzer 3.6 microns) images, tracing old stellar populations, enable a detailed identification of an outer truncation in all probed wavelengths in both the thin and thick disks. After deprojecting to a face-on view, a sharp truncation signature is found at a stellar density of roughly 1.5 solar masses per square parsec, in agreement with theoretical expectations of gas density SF thresholds. The redder colours beyond the truncation radius are indicative of stellar migration towards the outer regions. We thus show that thick disks and truncations can form via internal mechanisms alone, given the pristine nature of UGC 7321. We report the discovery of a truncation at and above the mid-plane of a diffuse galaxy that is linked to a SF threshold; this poses a constraint on physically-motivated disk size measurements among low-mass galaxies. △ Less

Submitted 4 November, 2022; v1 submitted 29 August, 2022; originally announced August 2022.

Comments: Accepted for publication in A&A (August 29, 2022). 7 pages, 3 figures

Journal ref: A&A 667, A109 (2022)

arXiv:2206.00035 [pdf, ps, other]

Weaving Privacy and Power: On the Privacy Practices of Labor Organizers in the U.S. Technology Industry

Authors: Sayash Kapoor, Matthew Sun, Mona Wang, Klaudia Jaźwińska, Elizabeth Anne Watkins

Abstract: We investigate the privacy practices of labor organizers in the computing technology industry and explore the changes in these practices as a response to remote work. Our study is situated at the intersection of two pivotal shifts in workplace dynamics: (a) the increase in online workplace communications due to remote work, and (b) the resurgence of the labor movement and an increase in collective… ▽ More We investigate the privacy practices of labor organizers in the computing technology industry and explore the changes in these practices as a response to remote work. Our study is situated at the intersection of two pivotal shifts in workplace dynamics: (a) the increase in online workplace communications due to remote work, and (b) the resurgence of the labor movement and an increase in collective action in workplaces -- especially in the tech industry, where this phenomenon has been dubbed the tech worker movement. Through a series of qualitative interviews with 29 tech workers involved in collective action, we investigate how labor organizers assess and mitigate risks to privacy while engaging in these actions. Among the most common risks that organizers experienced are retaliation from their employer, lateral worker conflict, emotional burnout, and the possibility of information about the collective effort leaking to management. Depending on the nature and source of the risk, organizers use a blend of digital security practices and community-based mechanisms. We find that digital security practices are more relevant when the threat comes from management, while community management and moderation are central to protecting organizers from lateral worker conflict. Since labor organizing is a collective rather than individual project, individual privacy and collective privacy are intertwined, sometimes in conflict and often mutually constitutive. Notions of privacy that solely center individuals are often incompatible with the needs of organizers, who noted that safety in numbers could only be achieved when workers presented a united front to management. We conclude with design recommendations that can help create safer, more secure and more private tools to better address the risks that organizers face. △ Less

Submitted 31 May, 2022; originally announced June 2022.

Comments: Accepted to CSCW 2022

arXiv:2205.04259 [pdf, other]

Multi-segment preserving sampling for deep manifold sampler

Authors: Daniel Berenberg, Jae Hyeon Lee, Simon Kelow, Ji Won Park, Andrew Watkins, Vladimir Gligorijević, Richard Bonneau, Stephen Ra, Kyunghyun Cho

Abstract: Deep generative modeling for biological sequences presents a unique challenge in reconciling the bias-variance trade-off between explicit biological insight and model flexibility. The deep manifold sampler was recently proposed as a means to iteratively sample variable-length protein sequences by exploiting the gradients from a function predictor. We introduce an alternative approach to this guide… ▽ More Deep generative modeling for biological sequences presents a unique challenge in reconciling the bias-variance trade-off between explicit biological insight and model flexibility. The deep manifold sampler was recently proposed as a means to iteratively sample variable-length protein sequences by exploiting the gradients from a function predictor. We introduce an alternative approach to this guided sampling procedure, multi-segment preserving sampling, that enables the direct inclusion of domain-specific knowledge by designating preserved and non-preserved segments along the input sequence, thereby restricting variation to only select regions. We present its effectiveness in the context of antibody design by training two models: a deep manifold sampler and a GPT-2 language model on nearly six million heavy chain sequences annotated with the IGHV1-18 gene. During sampling, we restrict variation to only the complementarity-determining region 3 (CDR3) of the input. We obtain log probability scores from a GPT-2 model for each sampled CDR3 and demonstrate that multi-segment preserving sampling generates reasonable designs while maintaining the desired, preserved regions. △ Less

Submitted 9 May, 2022; originally announced May 2022.

arXiv:2203.07675 [pdf, other]

doi 10.1093/mnras/stac1003

Preparing for low surface brightness science with the Vera C. Rubin Observatory: characterisation of tidal features from mock images

Authors: G. Martin, A. E. Bazkiaei, M. Spavone, E. Iodice, J. C. Mihos, M. Montes, J. A. Benavides, S. Brough, J. L. Carlin, C. A. Collins, P. A. Duc, F. A. Gómez, G. Galaz, H. M. Hernández-Toledo, R. A. Jackson, S. Kaviraj, J. H. Knapen, C. Martínez-Lombilla, S. McGee, D. O'Ryan, D. J. Prole, R. M. Rich, J. Román, E. A. Shah, T. K. Starkenburg , et al. (28 additional authors not shown)

Abstract: Tidal features in the outskirts of galaxies yield unique information about their past interactions and are a key prediction of the hierarchical structure formation paradigm. The Vera C. Rubin Observatory is poised to deliver deep observations for potentially of millions of objects with visible tidal features, but the inference of galaxy interaction histories from such features is not straightforwa… ▽ More Tidal features in the outskirts of galaxies yield unique information about their past interactions and are a key prediction of the hierarchical structure formation paradigm. The Vera C. Rubin Observatory is poised to deliver deep observations for potentially of millions of objects with visible tidal features, but the inference of galaxy interaction histories from such features is not straightforward. Utilising automated techniques and human visual classification in conjunction with realistic mock images produced using the NEWHORIZON cosmological simulation, we investigate the nature, frequency and visibility of tidal features and debris across a range of environments and stellar masses. In our simulated sample, around 80 per cent of the flux in the tidal features around Milky Way or greater mass galaxies is detected at the 10-year depth of the Legacy Survey of Space and Time (30-31 mag / sq. arcsec), falling to 60 per cent assuming a shallower final depth of 29.5 mag / sq. arcsec. The fraction of total flux found in tidal features increases towards higher masses, rising to 10 per cent for the most massive objects in our sample (M*~10^{11.5} Msun). When observed at sufficient depth, such objects frequently exhibit many distinct tidal features with complex shapes. The interpretation and characterisation of such features varies significantly with image depth and object orientation, introducing significant biases in their classification. Assuming the data reduction pipeline is properly optimised, we expect the Rubin Observatory to be capable of recovering much of the flux found in the outskirts of Milky Way mass galaxies, even at intermediate redshifts (z<0.2). △ Less

Submitted 7 May, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

Comments: 29 pages, 25 figures; accepted for publication in MNRAS following minor corrections

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 513, Issue 1, June 2022, Pages 1459-1487,

arXiv:2203.01455 [pdf]

A relationship and not a thing: A relational approach to algorithmic accountability and assessment documentation

Authors: Jacob Metcalf, Emanuel Moss, Ranjit Singh, Emnet Tafese, Elizabeth Anne Watkins

Abstract: Central to a number of scholarly, regulatory, and public conversations about algorithmic accountability is the question of who should have access to documentation that reveals the inner workings, intended function, and anticipated consequences of algorithmic systems, potentially establishing new routes for impacted publics to contest the operations of these systems. Currently, developers largely h… ▽ More Central to a number of scholarly, regulatory, and public conversations about algorithmic accountability is the question of who should have access to documentation that reveals the inner workings, intended function, and anticipated consequences of algorithmic systems, potentially establishing new routes for impacted publics to contest the operations of these systems. Currently, developers largely have a monopoly on information about how their systems actually work and are incentivized to maintain their own ignorance about aspects of how their systems affect the world. Increasingly, legislators, regulators and advocates have turned to assessment documentation in order to address the gap between the public's experience of algorithmic harms and the obligations of developers to document and justify their design decisions. However, issues of standing and expertise currently prevent publics from cohering around shared interests in preventing and redressing algorithmic harms; as we demonstrate with multiple cases, courts often find computational harms non-cognizable and rarely require developers to address material claims of harm. Constructed with a triadic accountability relationship, algorithmic impact assessment regimes could alter this situation by establishing procedural rights around public access to reporting and documentation. Develo** a relational approach to accountability, we argue that robust accountability regimes must establish opportunities for publics to cohere around shared experiences and interests, and to contest the outcomes of algorithmic systems that affect their lives. Furthermore, algorithmic accountability policies currently under consideration in many jurisdictions must provide the public with adequate standing and opportunities to access and contest the documentation provided by the actors and the judgments passed by the forum. △ Less

Submitted 2 March, 2022; originally announced March 2022.

arXiv:2203.01157 [pdf, other]

doi 10.1145/3514094.3534138

Artificial Concepts of Artificial Intelligence: Institutional Compliance and Resistance in AI Startups

Authors: Amy A. Winecoff, Elizabeth Anne Watkins

Abstract: Scholars and industry practitioners have debated how to best develop interventions for ethical artificial intelligence (AI). Such interventions recommend that companies building and using AI tools change their technical practices, but fail to wrangle with critical questions about the organizational and institutional context in which AI is developed. In this paper, we contribute descriptive researc… ▽ More Scholars and industry practitioners have debated how to best develop interventions for ethical artificial intelligence (AI). Such interventions recommend that companies building and using AI tools change their technical practices, but fail to wrangle with critical questions about the organizational and institutional context in which AI is developed. In this paper, we contribute descriptive research around the life of "AI" as a discursive concept and organizational practice in an understudied sphere--emerging AI startups--and with a focus on extra-organizational pressures faced by entrepreneurs. Leveraging a theoretical lens for how organizations change, we conducted semi-structured interviews with 23 entrepreneurs working at early-stage AI startups. We find that actors within startups both conform to and resist institutional pressures. Our analysis identifies a central tension for AI entrepreneurs: they often valued scientific integrity and methodological rigor; however, influential external stakeholders either lacked the technical knowledge to appreciate entrepreneurs' emphasis on rigor or were more focused on business priorities. As a result, entrepreneurs adopted hyped marketing messages about AI that diverged from their scientific values, but attempted to preserve their legitimacy internally. Institutional pressures and organizational constraints also influenced entrepreneurs' modeling practices and their response to actual or impending regulation. We conclude with a discussion for how such pressures could be used as leverage for effective interventions towards building ethical AI. △ Less

Submitted 14 June, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

arXiv:2202.09519 [pdf, other]

The four-fifths rule is not disparate impact: a woeful tale of epistemic trespassing in algorithmic fairness

Authors: Elizabeth Anne Watkins, Michael McKenna, Jiahao Chen

Abstract: Computer scientists are trained to create abstractions that simplify and generalize. However, a premature abstraction that omits crucial contextual details creates the risk of epistemic trespassing, by falsely asserting its relevance into other contexts. We study how the field of responsible AI has created an imperfect synecdoche by abstracting the four-fifths rule (a.k.a. the 4/5 rule or 80% rule… ▽ More Computer scientists are trained to create abstractions that simplify and generalize. However, a premature abstraction that omits crucial contextual details creates the risk of epistemic trespassing, by falsely asserting its relevance into other contexts. We study how the field of responsible AI has created an imperfect synecdoche by abstracting the four-fifths rule (a.k.a. the 4/5 rule or 80% rule), a single part of disparate impact discrimination law, into the disparate impact metric. This metric incorrectly introduces a new deontic nuance and new potentials for ethical harms that were absent in the original 4/5 rule. We also survey how the field has amplified the potential for harm in codifying the 4/5 rule into popular AI fairness software toolkits. The harmful erasure of legal nuances is a wake-up call for computer scientists to self-critically re-evaluate the abstractions they create and use, particularly in the interdisciplinary field of AI ethics. △ Less

Submitted 18 February, 2022; originally announced February 2022.

Comments: 10 pages, 1 figure, 2 tables

Report number: P22-1-v0.2.2 MSC Class: 68T27; 03B70 ACM Class: K.4; K.5; F.4; I.2

arXiv:2201.08381 [pdf, other]

doi 10.1051/0004-6361/202142627

Stellar masses, sizes, and radial profiles for 465 nearby early-type galaxies: an extension to the Spitzer Survey of Stellar Structure in Galaxies (S$^{4}$G)

Authors: A. E. Watkins, H. Salo, E. Laurikainen, S. Díaz-García, S. Comerón, J. Janz, A. H. Su, R. Buta, E. Athanassoula, A. Bosma, L. C. Ho, B. W. Holwerda, T. Kim, J. H. Knapen, S. Laine, K. Menéndez-Delmestre, R. F. Peletier, K. Sheth, D. Zaritsky

Abstract: The Spitzer Survey of Stellar Structure in Galaxies (S$^{4}$G) is a detailed study of over 2300 nearby galaxies in the near-infrared (NIR), which has been critical to our understanding of the detailed structures of nearby galaxies. Because the sample galaxies were selected only using radio-derived velocities, however, the survey favored late-type disk galaxies over lenticulars and ellipticals. A f… ▽ More The Spitzer Survey of Stellar Structure in Galaxies (S$^{4}$G) is a detailed study of over 2300 nearby galaxies in the near-infrared (NIR), which has been critical to our understanding of the detailed structures of nearby galaxies. Because the sample galaxies were selected only using radio-derived velocities, however, the survey favored late-type disk galaxies over lenticulars and ellipticals. A follow-up Spitzer survey was conducted to rectify this bias, adding 465 early-type galaxies (ETGs) to the original sample, to be analyzed in a manner consistent with the initial survey. We present the data release of this ETG extension, up to the third data processing pipeline (P3): surface photometry. We produce curves of growth and radial surface brightness profiles (with and without inclination corrections) using reduced and masked Spitzer IRAC 3.6$μ$m and 4.5$μ$m images produced through Pipelines 1 and 2, respectively. From these profiles, we derive the following integrated quantities: total magnitudes, stellar masses, concentration parameters, and galaxy size metrics. We showcase NIR scaling relations for ETGs among these quantities. We examine general trends across the whole S$^{4}$G and ETG extension among our derived parameters, highlighting differences between ETGs and late-type galaxies (LTGs). ETGs are, on average, more massive and more concentrated than LTGs, and also show subtle distinctions among ETG morphological sub-types. We also derive the following scaling relations and compare with previous results in visible light: mass--size (both half-light and isophotal), mass--concentration, mass--surface brightness (central, effective, and within 1 kpc), and mass--color. We find good agreement with previous works, though some relations (e.g., mass--central surface brightness) will require more careful multi-component decompositions to be fully understood. △ Less

Submitted 20 January, 2022; originally announced January 2022.

Comments: 25 pages, 17 figures, accepted for publication in A&A

Journal ref: A&A 660, A69 (2022)

arXiv:2201.03862 [pdf, other]

doi 10.5281/zenodo.7195671

Rubin-Euclid Derived Data Products: Initial Recommendations

Authors: Leanne P. Guy, Jean-Charles Cuillandre, Etienne Bachelet, Manda Banerji, Franz E. Bauer, Thomas Collett, Christopher J. Conselice, Siegfried Eggl, Annette Ferguson, Adriano Fontana, Catherine Heymans, Isobel M. Hook, Éric Aubourg, Hervé Aussel, James Bosch, Benoit Carry, Henk Hoekstra, Konrad Kuijken, Francois Lanusse, Peter Melchior, Joseph Mohr, Michele Moresco, Reiko Nakajima, Stéphane Paltani, Michael Troxel , et al. (95 additional authors not shown)

Abstract: This report is the result of a joint discussion between the Rubin and Euclid scientific communities. The work presented in this report was focused on designing and recommending an initial set of Derived Data products (DDPs) that could realize the science goals enabled by joint processing. All interested Rubin and Euclid data rights holders were invited to contribute via an online discussion forum… ▽ More This report is the result of a joint discussion between the Rubin and Euclid scientific communities. The work presented in this report was focused on designing and recommending an initial set of Derived Data products (DDPs) that could realize the science goals enabled by joint processing. All interested Rubin and Euclid data rights holders were invited to contribute via an online discussion forum and a series of virtual meetings. Strong interest in enhancing science with joint DDPs emerged from across a wide range of astrophysical domains: Solar System, the Galaxy, the Local Volume, from the nearby to the primaeval Universe, and cosmology. △ Less

Submitted 13 October, 2022; v1 submitted 11 January, 2022; originally announced January 2022.

Comments: Report of the Rubin-Euclid Derived Data Products Working Group, 78 pages, 11 figures

arXiv:2201.00566 [pdf, other]

doi 10.1051/0004-6361/202142440

The multifarious ionization sources and disturbed kinematics of extraplanar gas in five low-mass galaxies

Authors: R. P. V. Rautio, A. E. Watkins, S. Comerón, H. Salo, S. Díaz-García, J. Janz

Abstract: We investigate the origin of the extraplanar diffuse ionized gas (eDIG) and its predominant ionization mechanisms in five nearby (17-46 Mpc) low-mass ($10^9\text{-}10^{10}$ $M_{\odot}$) edge-on disk galaxies: ESO 157-49, ESO 469-15, ESO 544-27, IC 217, and IC 1553. We acquired Multi Unit Spectroscopic Explorer (MUSE) integral field spectroscopy and deep narrowband H$α$ imaging of our sample galaxi… ▽ More We investigate the origin of the extraplanar diffuse ionized gas (eDIG) and its predominant ionization mechanisms in five nearby (17-46 Mpc) low-mass ($10^9\text{-}10^{10}$ $M_{\odot}$) edge-on disk galaxies: ESO 157-49, ESO 469-15, ESO 544-27, IC 217, and IC 1553. We acquired Multi Unit Spectroscopic Explorer (MUSE) integral field spectroscopy and deep narrowband H$α$ imaging of our sample galaxies. To investigate the connection between in-plane star formation and eDIG, we perform a photometric analysis of our narrowband H$α$ imaging. We measure eDIG scale heights of $h_{z\text{eDIG}} = 0.59 \text{-} 1.39$ kpc and find a positive correlation between them and specific star formation rates. In all galaxies, we also find a strong correlation between extraplanar and midplane radial H$α$ profiles. Using our MUSE data, we investigate the origin of eDIG via kinematics. We find ionized gas rotation velocity lags above the midplane with values between 10 and 27 km s$^{-1}$ kpc$^{-1}$. While we do find hints of an accretion origin for the ionized gas in ESO 157-49, IC 217, and IC 1553, overall the ionized gas kinematics of our galaxies do not match a steady galaxy model or any simplistic model of accretion or internal origin for the gas. We also construct standard diagnostic diagrams and emission-line maps (EW(H$α$), [NII]/H$α$, [SII]//H$α$, [OIII]/H$β$) and find regions consistent with mixed OB star and hot low-mass evolved stars (HOLMES) ionization, and mixed OB-shock ionization. Our results suggest that OB stars are the primary driver of eDIG ionization, while both HOLMES and shocks may locally contribute to the ionization of eDIG to a significant degree. Despite our galaxies' similar structures and masses, we find a surprisingly composite image of ionization mechanisms and a multifarious origin for the eDIG. △ Less

Submitted 3 January, 2022; originally announced January 2022.

Comments: 21 pages, 14 figures, accepted to Astronomy & Astrophysics

Journal ref: A&A 659, A153 (2022)

arXiv:2112.14787 [pdf, other]

Harnessing re-programmable phase transitions to control the propagation of sound waves

Authors: Audrey A. Watkins, Austin Eichelberg, Osama R. Bilal

Abstract: Metamaterials can enable peculiar static and dynamic behavior (such as negative effective mass density, dynamical stiffness, and Poisson's ratio) due to their geometry rather than their chemical composition. The geometry of these metamaterials can be thought of as the phase of the material, which is usually fixed once the material is fabricated. While there exist many theoretical and numerical stu… ▽ More Metamaterials can enable peculiar static and dynamic behavior (such as negative effective mass density, dynamical stiffness, and Poisson's ratio) due to their geometry rather than their chemical composition. The geometry of these metamaterials can be thought of as the phase of the material, which is usually fixed once the material is fabricated. While there exist many theoretical and numerical studies of metamaterials that can change phase, or re-program, experimental realizations remain limited due to challenges in manufacturability, the destructive nature of the re-programming and inherent non-linearities. Through a combination of analytical, numerical and experimental analyses, we utilize tunable, self-assembled, nonlinear magnetic lattices to realize metamaterials with reversible phase transitions. Our metamaterials are composed of free-floating disks, with embedded permanent magnets, confined within magnetic boundaries. We exploit the non-destructive nature of the adjustable magnetic boundaries to create a set of re-programmable metamaterials to control the propagation of sound waves. Furthermore, we demonstrate a robust, real-time tunable wave filter at ultra-low frequencies. Our findings can expand the metamaterials horizon into functional and tunable devices. △ Less

Submitted 29 December, 2021; originally announced December 2021.

arXiv:2112.03784 [pdf, ps, other]

Qualitative Analysis for Human Centered AI

Authors: Orestis Papakyriakopoulos, Elizabeth Anne Watkins, Amy Winecoff, Klaudia Jaźwińska, Tithi Chattopadhyay

Abstract: Human-centered artificial intelligence (AI) posits that machine learning and AI should be developed and applied in a socially aware way. In this article, we argue that qualitative analysis (QA) can be a valuable tool in this process, supplementing, informing, and extending the possibilities of AI models. We show this by describing how QA can be integrated in the current prediction paradigm of AI,… ▽ More Human-centered artificial intelligence (AI) posits that machine learning and AI should be developed and applied in a socially aware way. In this article, we argue that qualitative analysis (QA) can be a valuable tool in this process, supplementing, informing, and extending the possibilities of AI models. We show this by describing how QA can be integrated in the current prediction paradigm of AI, assisting scientists in the process of selecting data, variables, and model architectures. Furthermore, we argue that QA can be a part of novel paradigms towards Human Centered AI. QA can support scientists and practitioners in practical problem solving and situated model development. It can also promote participatory design approaches, reveal understudied and emerging issues in AI systems, and assist policy making. △ Less

Submitted 7 December, 2021; originally announced December 2021.

Journal ref: HCAI:Human Centered AI workshop at Neural Information Processing Systems 2021

arXiv:2110.07531 [pdf]

Deep learning models for predicting RNA degradation via dual crowdsourcing

Authors: Hannah K. Wayment-Steele, Wipapat Kladwang, Andrew M. Watkins, Do Soon Kim, Bojan Tunguz, Walter Reade, Maggie Demkin, Jonathan Romano, Roger Wellington-Oguri, John J. Nicol, Jiayang Gao, Kazuki Onodera, Kazuki Fujikawa, Hanfei Mao, Gilles Vandewiele, Michele Tinti, Bram Steenwinckel, Takuya Ito, Taiga Noumi, Shujun He, Keiichiro Ishi, Youhan Lee, Fatih Öztürk, Anthony Chiu, Emin Öztürk , et al. (4 additional authors not shown)

Abstract: Messenger RNA-based medicines hold immense potential, as evidenced by their rapid deployment as COVID-19 vaccines. However, worldwide distribution of mRNA molecules has been limited by their thermostability, which is fundamentally limited by the intrinsic instability of RNA molecules to a chemical degradation reaction called in-line hydrolysis. Predicting the degradation of an RNA molecule is a ke… ▽ More Messenger RNA-based medicines hold immense potential, as evidenced by their rapid deployment as COVID-19 vaccines. However, worldwide distribution of mRNA molecules has been limited by their thermostability, which is fundamentally limited by the intrinsic instability of RNA molecules to a chemical degradation reaction called in-line hydrolysis. Predicting the degradation of an RNA molecule is a key task in designing more stable RNA-based therapeutics. Here, we describe a crowdsourced machine learning competition ("Stanford OpenVaccine") on Kaggle, involving single-nucleotide resolution measurements on 6043 102-130-nucleotide diverse RNA constructs that were themselves solicited through crowdsourcing on the RNA design platform Eterna. The entire experiment was completed in less than 6 months, and 41% of nucleotide-level predictions from the winning model were within experimental error of the ground truth measurement. Furthermore, these models generalized to blindly predicting orthogonal degradation data on much longer mRNA molecules (504-1588 nucleotides) with improved accuracy compared to previously published models. Top teams integrated natural language processing architectures and data augmentation techniques with predictions from previous dynamic programming models for RNA secondary structure. These results indicate that such models are capable of representing in-line hydrolysis with excellent accuracy, supporting their use for designing stabilized messenger RNAs. The integration of two crowdsourcing platforms, one for data set creation and another for machine learning, may be fruitful for other urgent problems that demand scientific discovery on rapid timescales. △ Less

Submitted 22 April, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

arXiv:2109.10178 [pdf, other]

doi 10.1103/PhysRevB.104.L140101

Exploiting Localized Transition Waves to Tune Sound Propagation in Soft Materials

Authors: Audrey A. Watkins, Austin Eichelberg, Osama R. Bilal

Abstract: Programmable materials hold great potential for many applications such as deployable structures, soft robotics, and wave control, however, the presence of instability and disorder might hinder their utilization. Through a combination of analytical, numerical, and experimental analyses, we harness the interplay between instabilities, geometric frustration, and mechanical deformations to control the… ▽ More Programmable materials hold great potential for many applications such as deployable structures, soft robotics, and wave control, however, the presence of instability and disorder might hinder their utilization. Through a combination of analytical, numerical, and experimental analyses, we harness the interplay between instabilities, geometric frustration, and mechanical deformations to control the propagation of sound waves within self-assembled soft materials. We consider levitated magnetic disks confined by a magnetic boundary in-plane. The assemblies can be either ordered or disordered depending on the intrinsic disk symmetry. By applying an external load to the assembly, we observe the nucleation and propagation of different topological defects within the lattices. In the presence of instabilities, the defect propagation gives rise to time-independent localized transition waves. Surprisingly, in the presence of frustration, the applied load briefly introduces deformation-induced order to the material. By further deforming the lattices, new patterns emerge across all disk symmetries. We utilize these patterns to tune sound propagation through the material. Our findings could open new possibilities for designing exotic materials with potential applications ranging from sound control to soft robotics. △ Less

Submitted 21 September, 2021; originally announced September 2021.

arXiv:2105.05167 [pdf, other]

doi 10.3847/1538-4357/ac0055

A Deep Census of Outlying Star Formation in the M101 Group

Authors: Ray Garner III, J. Christopher Mihos, Paul Harding, Aaron E. Watkins

Abstract: We present deep, narrowband imaging of the nearby spiral galaxy M101 and its group environment to search for star-forming dwarf galaxies and outlying HII regions. Using the Burrell Schmidt telescope, we target the brightest emission lines of star-forming regions, H$α$, H$β$, and [OIII], to detect potential outlying star-forming regions. Our survey covers $\sim$6 square degrees around M101, and we… ▽ More We present deep, narrowband imaging of the nearby spiral galaxy M101 and its group environment to search for star-forming dwarf galaxies and outlying HII regions. Using the Burrell Schmidt telescope, we target the brightest emission lines of star-forming regions, H$α$, H$β$, and [OIII], to detect potential outlying star-forming regions. Our survey covers $\sim$6 square degrees around M101, and we detect objects in emission down to an H$α$ flux level of $5.7 \times 10^{-17}$ erg s$^{-1}$ cm$^{-2}$ (equivalent to a limiting SFR of $1.7 \times 10^{-6}$ $M_\odot$ yr$^{-1}$ at the distance of M101). After careful removal of background contaminants and foreground M stars, we detect 19 objects in emission in all three bands, and 8 objects in emission in H$α$ and [OIII]. We compare the structural and photometric properties of the detected sources to Local Group dwarf galaxies and star-forming galaxies in the 11HUGS and SINGG surveys. We find no large population of outlying HII regions or undiscovered star-forming dwarfs in the M101 Group, as most sources (93%) are consistent with being M101 outer disk HII regions. Only two sources were associated with other galaxies: a faint star-forming satellite of the background galaxy NGC 5486, and a faint outlying HII region near the M101 companion NGC 5474. We also find no narrowband emission associated with recently discovered ultradiffuse galaxies and starless HI clouds near M101. The lack of any hidden population of low luminosity star-forming dwarfs around M101 suggests a rather shallow faint end slope (as flat as $α\sim -1.0$) for the star-forming luminosity function in the M101 Group. We discuss our results in the context of tidally-triggered star formation models and the interaction history of the M101 Group. △ Less

Submitted 11 May, 2021; originally announced May 2021.

Comments: 24 pages, 14 figures, 6 tables, accepted for publication in ApJ

arXiv:2101.05699 [pdf, other]

doi 10.1051/0004-6361/202039633

The Fornax Deep Survey (FDS) with the VST XI. The search for signs of preprocessing between the Fornax main cluster and Fornax A group

Authors: Alan H. Su, Heikki Salo, Joachim Janz, Eija Laurikainen, Aku Venhola, Reynier F. Peletier, Enrica Iodice, Michael Hilker, Michele Cantiello, Nicola Napolitano, Marilena Spavone, Maria A. Raj, Glenn van de Ven, Steffen. Mieske, Maurizio Paolillo, Massimo Capaccioli, Edwin A. Valentijn, Aaron E. Watkins

Abstract: We investigate the structural properties of cluster and group galaxies by studying the Fornax main cluster and the infalling Fornax A group, exploring the effects of galaxy preprocessing in this showcase example. Additionally, we compare the structural complexity of Fornax galaxies to those in the Virgo cluster and in the field. Our sample consists of 582 galaxies from the Fornax main cluster and… ▽ More We investigate the structural properties of cluster and group galaxies by studying the Fornax main cluster and the infalling Fornax A group, exploring the effects of galaxy preprocessing in this showcase example. Additionally, we compare the structural complexity of Fornax galaxies to those in the Virgo cluster and in the field. Our sample consists of 582 galaxies from the Fornax main cluster and Fornax A group. We quantified the light distributions of each galaxy based on a combination of aperture photometry, Sérsic+PSF (point spread function) and multi-component decompositions, and non-parametric measures of morphology (Concentration $C$; Asymmetry $A$, Clumpiness $S$; Gini $G$; second order moment of light $M_{20}$), and structural complexity based on multi-component decompositions. These quantities were then compared between the Fornax main cluster and Fornax A group. The structural complexity of Fornax galaxies were also compared to those in Virgo and in the field. Overall, we find significant differences in the distributions of quantities derived from Sérsic profiles ($g'-r'$, $r'-i'$, $R_e$, and $\barμ_{e,r'}$), and non-parametric indices ($A$ and $S$) between the Fornax main cluster and Fornax A group. Moreover, we find significant cluster-centric trends with $r'-i'$, $R_e$, and $\barμ_{e,r'}$, as well as $A$, $S$, $G$, and $M_{20}$ for galaxies in the Fornax main cluster. We find the structural complexity of galaxies increases as a function of the absolute $r'$-band magnitude (and stellar mass), with the largest change occurring between -14 mag $\lesssim M_{r'}\lesssim$ -19 mag. This same trend was observed for galaxies in the Virgo cluster and in the field, which suggests that the formation or maintenance of morphological structures (e.g. bulges, bar) is largely dependent on the stellar mass of the galaxies, rather than their environment. △ Less

Submitted 14 January, 2021; originally announced January 2021.

Comments: Submitted to A&A 9th October 2020, accepted 11th January 2021. For decompositions see https://www.oulu.fi/astronomy/FDS_DECOMP/main/index.html (username=password=sundial)

Journal ref: A&A 647, A100 (2021)

arXiv:2011.02937 [pdf, other]

doi 10.1051/0004-6361/202039382

The complex multi-component outflow of the Seyfert galaxy NGC 7130

Authors: S. Comerón, J. H. Knapen, C. Ramos Almeida, A. E. Watkins

Abstract: AGN are a key ingredient for understanding galactic evolution. AGN-driven outflows are one of the manifestations of feedback. The AO mode for MUSE at the VLT permits to study the innermost tens of parsecs of nearby AGN in the optical. We present a detailed analysis of the ionised gas in the central regions of NGC 7130, an archetypical composite Seyfert and nuclear starburst galaxy. We achieve an a… ▽ More AGN are a key ingredient for understanding galactic evolution. AGN-driven outflows are one of the manifestations of feedback. The AO mode for MUSE at the VLT permits to study the innermost tens of parsecs of nearby AGN in the optical. We present a detailed analysis of the ionised gas in the central regions of NGC 7130, an archetypical composite Seyfert and nuclear starburst galaxy. We achieve an angular resolution of 0.17$^{\prime\prime}$ (50 pc). We performed a multi-component analysis of the main ISM lines and identified nine kinematic components, six of which correspond to the outflow. The outflow is biconic and has velocities of a few $100\,{\rm km\,s^{-1}}$ with respect to the disc. We decompose the approaching side of the outflow into a broad and a narrow component with typical velocity dispersions below and above $\sim200\,{\rm km\,s^{-1}}$, respectively. The blueshifted narrow component has substructure, in particular a collimated plume aligned with the radio jet, indicating that it may be jet-powered. The redshifted lobe is composed of two Narrow Components and a Broad Component. An additional redshifted component is seen outside the main outflow axis. Line ratio diagnostics indicate that the outflow gas in the main axis is AGN-powered whereas the off-axis component has LINER properties. The ionised gas mass outflow rate is $\dot{M}=1.2\pm0.7\,M_{\odot}\,{\rm yr^{-1}}$ and the kinetic power is $\dot{E}_{\rm kin}=(2.7\pm2.0)\times10^{41}\,{\rm erg\,s^{-1}}$, which corresponds to $F_{\rm kin}=0.12\pm0.09\%$ of the bolometric AGN power. The combination of high angular resolution integral field spectroscopy and a careful multi-component decomposition allows a uniquely detailed view of the outflow in NGC 7130, illustrating that AGN kinematics are more complex than traditionally derived from less sophisticated data and analyses. (abridged) △ Less

Submitted 7 April, 2021; v1 submitted 5 November, 2020; originally announced November 2020.

Comments: Published in A&A. This version of the preprint has been corrected with the modifications appearing in the A&A corrigendum

Journal ref: A&A 645, A130 (2021)

arXiv:2009.13525 [pdf, other]

doi 10.3847/1538-4357/abbc1a

Hints for icy pebble migration feeding an oxygen-rich chemistry in the inner planet-forming region of disks

Authors: Andrea Banzatti, Ilaria Pascucci, Arthur D. Bosman, Paola Pinilla, Colette Salyk, Greg J. Herczeg, Klaus M. Pontoppidan, Ivan Vazquez, Andrew Watkins, Sebastiaan Krijt, Nathan Hendler, Feng Long

Abstract: We present a synergic study of protoplanetary disks to investigate links between inner disk gas molecules and the large-scale migration of solid pebbles. The sample includes 63 disks where two types of measurements are available: i) spatially-resolved disk images revealing the radial distribution of disk pebbles (mm-cm dust grains), from millimeter observations with ALMA or the SMA, and ii) infrar… ▽ More We present a synergic study of protoplanetary disks to investigate links between inner disk gas molecules and the large-scale migration of solid pebbles. The sample includes 63 disks where two types of measurements are available: i) spatially-resolved disk images revealing the radial distribution of disk pebbles (mm-cm dust grains), from millimeter observations with ALMA or the SMA, and ii) infrared molecular emission spectra as observed with Spitzer. The line flux ratios of H2O with HCN, C2H2, and CO2 all anti-correlate with the dust disk radius R$_{dust}$, expanding previous results found by Najita et al. (2013) for HCN/H2O and the dust disk mass. By normalization with the dependence on accretion luminosity common to all molecules, only the H2O luminosity maintains a detectable anti-correlation with disk radius, suggesting that the strongest underlying relation is between H2O and R$_{dust}$. If R$_{dust}$ is set by large-scale pebble drift, and if molecular luminosities trace the elemental budgets of inner disk warm gas, these results can be naturally explained with scenarios where the inner disk chemistry is fed by sublimation of oxygen-rich icy pebbles migrating inward from the outer disk. Anti-correlations are also detected between all molecular luminosities and the infrared index n$_{13-30}$, which is sensitive to the presence and size of an inner disk dust cavity. Overall, these relations suggest a physical interconnection between dust and gas evolution both locally and across disk scales. We discuss fundamental predictions to test this interpretation and study the interplay between pebble drift, inner disk depletion, and the chemistry of planet-forming material. △ Less

Submitted 28 September, 2020; originally announced September 2020.

Comments: Accepted for publication on ApJ

arXiv:2009.08381 [pdf, other]

doi 10.3389/fmats.2020.606877

Demultiplexing infrasound phonons with tunable magnetic lattices

Authors: Audrey A. Watkins, Osama R. Bilal

Abstract: Controlling infrasound signals is crucial to many processes ranging from predicting atmospheric events and seismic activities to sensing nuclear detonations. These waves can be manipulated through phononic crystals and acoustic metamaterials. However, at such ultra-low frequencies, the size (usually on the order of meters) and the mass (usually on the order of many kilograms) of these materials ca… ▽ More Controlling infrasound signals is crucial to many processes ranging from predicting atmospheric events and seismic activities to sensing nuclear detonations. These waves can be manipulated through phononic crystals and acoustic metamaterials. However, at such ultra-low frequencies, the size (usually on the order of meters) and the mass (usually on the order of many kilograms) of these materials can hinder its potential applications in the infrasonic domain. Here, we utilize tunable lattices of repelling magnets to guide and sort infrasound waves into different channels based on their frequencies. We construct our lattices by confining meta-atoms (free-floating macroscopic disks with embedded magnets) within a magnetic boundary. By changing the confining boundary, we control the meta-atoms' spacing and therefore the intensity of their coupling potentials and wave propagation characteristics. As a demonstration of principle, we present the first experimental realization of an infrasound phonon demultiplexer (i.e., guiding ultra-low frequency waves into different channels based on their frequencies). The realized platform can be utilized to manipulate ultra-low frequency waves, within a relatively small volume, while utilizing negligible mass. In addition, the self-assembly nature of the meta-atoms can be key in creating re-programmable materials with exceptional nonlinear properties. △ Less

Submitted 8 December, 2020; v1 submitted 17 September, 2020; originally announced September 2020.

arXiv:2005.01684 [pdf, other]

doi 10.1093/mnras/staa1245

Original Research By Young Twinkle Students (ORBYTS): Ephemeris Refinement of Transiting Exoplanets

Authors: Billy Edwards, Quentin Changeat, Kai Hou Yip, Angelos Tsiaras, Jake Taylor, Bilal Akhtar, Josef AlDaghir, Pranup Bhattarai, Tushar Bhudia, Aashish Chapagai, Michael Huang, Danyaal Kabir, Vieran Khag, Summyyah Khaliq, Kush Khatri, Jaidev Kneth, Manisha Kothari, Ibrahim Najmudin, Lobanaa Panchalingam, Manthan Patel, Luxshan Premachandran, Adam Qayyum, Prasen Rana, Zain Shaikh, Sheryar Syed , et al. (38 additional authors not shown)

Abstract: We report follow-up observations of transiting exoplanets that have either large uncertainties (>10 minutes) in their transit times or have not been observed for over three years. A fully robotic ground-based telescope network, observations from citizen astronomers and data from TESS have been used to study eight planets, refining their ephemeris and orbital data. Such follow-up observations are k… ▽ More We report follow-up observations of transiting exoplanets that have either large uncertainties (>10 minutes) in their transit times or have not been observed for over three years. A fully robotic ground-based telescope network, observations from citizen astronomers and data from TESS have been used to study eight planets, refining their ephemeris and orbital data. Such follow-up observations are key for ensuring accurate transit times for upcoming ground and space-based telescopes which may seek to characterise the atmospheres of these planets. We find deviations from the expected transit time for all planets, with transits occurring outside the 1 sigma uncertainties for seven planets. Using the newly acquired observations, we subsequently refine their periods and reduce the current predicted ephemeris uncertainties to 0.28 - 4.01 minutes. A significant portion of this work has been completed by students at two high schools in London as part of the Original Research By Young Twinkle Students (ORBYTS) programme. △ Less

Submitted 4 May, 2020; originally announced May 2020.

Comments: Accepted for publication in MNRAS

arXiv:2003.04701 [pdf, other]

On the origins of up-bending breaks in disk galaxies

Authors: Aaron E. Watkins, Jarkko Laine, Sébastien Comerón, Joachim Janz, Heikki Salo

Abstract: Using SPITZER 3.6$μ$m imaging, we investigate the physical and data-driven origins of up-bending (Type III) disk breaks. We apply a robust new break-finding algorithm to 175 low-inclination disk galaxies previously identified as containing Type III breaks, classify each galaxy by its outermost re-classified (via our new algorithm) break type, and compare the local environments of each resulting su… ▽ More Using SPITZER 3.6$μ$m imaging, we investigate the physical and data-driven origins of up-bending (Type III) disk breaks. We apply a robust new break-finding algorithm to 175 low-inclination disk galaxies previously identified as containing Type III breaks, classify each galaxy by its outermost re-classified (via our new algorithm) break type, and compare the local environments of each resulting subgroup. Using three different measures of the local density of galaxies, we find that galaxies with extended outer spheroids (Type IIIs) occupy the highest density environments in our sample, while those with extended down-bending (Type II) disks and symmetric outskirts occupy the lowest density environments. Among outermost breaks, the most common origin of Type III breaks in our sample is methodological; the use of elliptical apertures to measure the radial profiles of asymmetric galaxies usually results in features akin to Type III breaks. △ Less

Submitted 10 March, 2020; originally announced March 2020.

Comments: 4 pages, 1 figure, for IAU Symposium 355

arXiv:2002.08511 [pdf]

doi 10.1016/j.nima.2018.11.103

Tuning of LANSCE 805-MHz High-Energy Linear Accelerator with Reduced Beam Losses

Authors: Y. K. Batygin, F. E. Shelley, H. A. Watkins

Abstract: Suppression of beam losses is essential for successful operation of high-intensity linac. Historically, the values of the field amplitudes and phases of the side-coupled, 805-MHz LANSCE linac modules are maintained using a well-known delta-t tuning procedure. Transverse matching of the beam with accelerator is performed through adjustments of beam ellipses in 4D phase space with accelerator lattic… ▽ More Suppression of beam losses is essential for successful operation of high-intensity linac. Historically, the values of the field amplitudes and phases of the side-coupled, 805-MHz LANSCE linac modules are maintained using a well-known delta-t tuning procedure. Transverse matching of the beam with accelerator is performed through adjustments of beam ellipses in 4D phase space with accelerator lattice using matching quadrupoles. Control of the beam-energy ramp along the length of a proton linear accelerator is required to improve tune of accelerator and decrease beam losses. Time-of-flight measurements of the H- beam energy are now being used to confirm and improve the overall control of the energy ramp along the linac. The time-of-flight method utilizes absolute measurements of beam energy using direct signals from beam at an oscilloscope, as well as the difference in RF phases measured as the beam passes installed delta-t pickup loops. A newly developed BPPM data acquisition system is used. Beam energy measurement along accelerator together with phase scans, klystrons output power control, and delta-t method, allow tuning of accelerator much more close to original design and reduce losses generated by linac. Details of the upgraded tuning procedure and results of tuning and operation are presented. △ Less

Submitted 19 February, 2020; originally announced February 2020.

Journal ref: Nuclear Instruments and Methods in Physics Research, A 916 (2019) 215-225

arXiv:1912.07553 [pdf, ps, other]

doi 10.1103/PhysRevA.101.053848

Nanosecond-timescale development of Faraday rotation in an ultracold gas

Authors: Jonathan R. Gilbert, Mark A. Watkins, Jacob L. Roberts

Abstract: When a gas of ultracold atoms is suddenly illuminated by light that is nearly resonant with an atomic transition, the atoms cannot respond instantaneously. This non-instantaneous response means the gas is initially more transparent to the applied light than in steady-state. The timescale associated with the development of light absorption is set by the atomic excited state lifetime. Similarly, the… ▽ More When a gas of ultracold atoms is suddenly illuminated by light that is nearly resonant with an atomic transition, the atoms cannot respond instantaneously. This non-instantaneous response means the gas is initially more transparent to the applied light than in steady-state. The timescale associated with the development of light absorption is set by the atomic excited state lifetime. Similarly, the index of refraction in the gas also requires time to reach a steady-state value, but the development of the associated phase response is expected to be slower than absorption effects. Faraday rotation is one manifestation of differing indices of refraction for orthogonal circular light polarization components. We have performed experiments measuring the time-dependent development of polarization rotation in an ultracold gas subjected to a magnetic field. Our measurements match theoretical predictions based on solving optical Bloch equations. We are able to identify how parameters such as steady-state optical thickness and applied magnetic field strength influence the development of Faraday rotation. △ Less

Submitted 16 December, 2019; originally announced December 2019.

Journal ref: Phys. Rev. A 101, 053848 (2020)

arXiv:1911.00125 [pdf]

Eclipse time variations and the continued search for companions to short period eclipsing binary systems

Authors: George Faillace, David Pulley, Americo Watkins, John Mallett, Ian Sharp, Xinyu Mai

Abstract: Eclipse time variations have been detected in a number of post common envelope binary systems consisting of a subdwarf B star or white dwarf primary star and cool M type or brown dwarf secondary. In this paper we consider circumbinary hypotheses of two sdB systems, HS 0705+6700 (also known as V470 Cam) and NSVS 14256825 and one white dwarf system, NN Ser. In addition, and for comparison purposes,… ▽ More Eclipse time variations have been detected in a number of post common envelope binary systems consisting of a subdwarf B star or white dwarf primary star and cool M type or brown dwarf secondary. In this paper we consider circumbinary hypotheses of two sdB systems, HS 0705+6700 (also known as V470 Cam) and NSVS 14256825 and one white dwarf system, NN Ser. In addition, and for comparison purposes, we investigate the eclipse time variations of the W UMa system NSVS 01286630 with its stellar circumbinary companion. All four systems have claims of circumbinary objects with computed physical and orbital parameters. We report 108 new observations of minima for these four eclipsing systems observed between 2017 May and 2019 September and combining these with all published data, we investigate how well the published circumbinary object hypotheses fit with our new data. Our new data has shown departure from early predictions for three of the four systems, but it is premature to conclude that these results rule out the presence of circumbinary objects. There is also the possibility (but with no observational proof so far) of detecting close-in transiting circumbinary objects around these systems but these are likely to have periods of days rather than years. △ Less

Submitted 31 October, 2019; originally announced November 2019.

Comments: 14 pages, 5 tables and 6 figures

Showing 1–50 of 72 results for author: Watkins, A