-
Unraveling the Dynamics of SPY Trading Volumes: A Comprehensive Analysis of Daily and Intraday Liquidity Trends
Authors:
Ananya Krishnan,
Martin Pollack,
Alma Cooper
Abstract:
In this project, we investigate the accuracy of forecasting intraday and daily trading volume of the exchange-traded fund SPY. The ability to forecast volume over varying time intervals with high accuracy is a critical element to many trading strategies. After performing exploratory data analysis on intraday and daily SPY data we identify three methods for our analysis: ARIMA and ARIMAX models, wi…
▽ More
In this project, we investigate the accuracy of forecasting intraday and daily trading volume of the exchange-traded fund SPY. The ability to forecast volume over varying time intervals with high accuracy is a critical element to many trading strategies. After performing exploratory data analysis on intraday and daily SPY data we identify three methods for our analysis: ARIMA and ARIMAX models, with or without seasonality, as well as a Frequency Domain Process Representation. To evaluate predictive power of our models, we use mean squared error, mean absolute percentage error, and volume weighted average price (VWAP) tracking error. All models for both intraday and daily data output strong VWAP predictions in comparison to the VWAP estimates produced by naive baseline methodologies. In both cases volume is most accurately forecasted using ARIMA models with exogenous variables in the form of technical indicators, with intraday incorporating a seasonal component and daily not.
△ Less
Submitted 24 June, 2024;
originally announced June 2024.
-
Between Randomness and Arbitrariness: Some Lessons for Reliable Machine Learning at Scale
Authors:
A. Feder Cooper
Abstract:
To develop rigorous knowledge about ML models -- and the systems in which they are embedded -- we need reliable measurements. But reliable measurement is fundamentally challenging, and touches on issues of reproducibility, scalability, uncertainty quantification, epistemology, and more. This dissertation addresses criteria needed to take reliability seriously: both criteria for designing meaningfu…
▽ More
To develop rigorous knowledge about ML models -- and the systems in which they are embedded -- we need reliable measurements. But reliable measurement is fundamentally challenging, and touches on issues of reproducibility, scalability, uncertainty quantification, epistemology, and more. This dissertation addresses criteria needed to take reliability seriously: both criteria for designing meaningful metrics, and for methodologies that ensure that we can dependably and efficiently measure these metrics at scale and in practice. In doing so, this dissertation articulates a research vision for a new field of scholarship at the intersection of machine learning, law, and policy. Within this frame, we cover topics that fit under three different themes: (1) quantifying and mitigating sources of arbitrariness in ML, (2) taming randomness in uncertainty estimation and optimization algorithms, in order to achieve scalability without sacrificing reliability, and (3) providing methods for evaluating generative-AI systems, with specific focuses on quantifying memorization in language models and training latent diffusion models on open-licensed data. By making contributions in these three themes, this dissertation serves as an empirical proof by example that research on reliable measurement for machine learning is intimately and inescapably bound up with research in law and policy. These different disciplines pose similar research questions about reliable measurement in machine learning. They are, in fact, two complementary sides of the same research vision, which, broadly construed, aims to construct machine-learning systems that cohere with broader societal values.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
AuriDESI: Mock Catalogues for the DESI Milky Way Survey
Authors:
Namitha Kizhuprakkat,
Andrew P. Cooper,
Alexander H. Riley,
Sergey E. Koposov,
Jessica Nicole Aguilar,
Steven Ahlen,
Carlos Allende Prieto,
David Brooks,
Todd Claybaugh,
Kyle Dawson,
Axel de la Macorra,
Peter Doel,
Jaime E. Forero-Romero,
Carlos Frenk,
Enrique Gaztañaga,
Oleg Y. Gnedin,
Robert J. J. Grand,
Satya Gontcho A Gontcho,
Klaus Honscheid,
Robert Kehoe,
Martin Landriau,
Marc Manera,
Aaron Meisner,
Ramon Miquel,
Jundan Nie
, et al. (9 additional authors not shown)
Abstract:
The Dark Energy Spectroscopic Instrument Milky Way Survey (DESI MWS) will explore the assembly history of the Milky Way by characterising remnants of ancient dwarf galaxy accretion events and improving constraints on the distribution of dark matter in the outer halo. We present mock catalogues that reproduce the selection criteria of MWS and the format of the final MWS data set. These catalogues c…
▽ More
The Dark Energy Spectroscopic Instrument Milky Way Survey (DESI MWS) will explore the assembly history of the Milky Way by characterising remnants of ancient dwarf galaxy accretion events and improving constraints on the distribution of dark matter in the outer halo. We present mock catalogues that reproduce the selection criteria of MWS and the format of the final MWS data set. These catalogues can be used to test methods for quantifying the properties of stellar halo substructure and reconstructing the Milky Way's accretion history with the MWS data, including the effects of halo-to-halo variance. The mock catalogues are based on a phase-space kernel expansion technique applied to star particles in the Auriga suite of six high-resolution $Λ$CDM magneto-hydrodynamic zoom-in simulations. They include photometric properties (and associated errors) used in DESI target selection and the outputs of the MWS spectral analysis pipeline (radial velocity, metallicity, surface gravity, and temperature). They also include information from the underlying simulation, such as the total gravitational potential and information on the progenitors of accreted halo stars. We discuss how the subset of halo stars observable by MWS in these simulations corresponds to their true content and properties. These mock Milky Ways have rich accretion histories, resulting in a large number of substructures that span the whole stellar halo out to large distances and have substantial overlap in the space of orbital energy and angular momentum.
△ Less
Submitted 13 June, 2024;
originally announced June 2024.
-
Beyond the Rotational Deathline: Radio Emission from Ultra-long Period Magnetars
Authors:
A. J. Cooper,
Z. Wadiasingh
Abstract:
Motivated by the recent detection of ultra-long period radio transients, we investigate new models of coherent radio emission via low-altitude electron-positron pair production in neutron stars beyond rotationally-powered curvature radiation deathlines. We find that plastic motion (akin to 'continental drift') and qualitatively similar thermoelectric action by temperature gradients in the crusts o…
▽ More
Motivated by the recent detection of ultra-long period radio transients, we investigate new models of coherent radio emission via low-altitude electron-positron pair production in neutron stars beyond rotationally-powered curvature radiation deathlines. We find that plastic motion (akin to 'continental drift') and qualitatively similar thermoelectric action by temperature gradients in the crusts of slowly rotating, highly magnetized neutron stars could impart mild local magnetospheric twists. Regardless of which mechanism drives twists, we find that particle acceleration initiates pair cascades across charge-starved gaps above a mild critical twist. Cascades are initiated via resonant inverse-Compton scattered photons or curvature radiation, and may produce broadband coherent radio emission. We compute the pair luminosity (maximum allowed radio luminosity) for these two channels, and derive deathlines and 'active zones' in $P-\dot{P}$ space from a variety of considerations. We find these twist-initiated pair cascades only occur for magnetar-like field strengths $B \gtrsim 10^{14}$ G and long periods: $P_{\rm RICS} \gtrsim 120 \; (T/10^{6.5} {\rm K})^{-5} \, {\rm sec}$ and $P_{\rm curv} \gtrsim 150 \; ({\rm v_{\rm pl}}/10^{3} {\, \rm cm \, yr^{-1}})^{-7/6} \, {\rm sec}$. Using a simplified geometric model, we find that plastic motion or thermoelectrically driven twists might naturally reproduce the observed luminosities, timescales, and timing signatures. We further derive 'active zones' in which rotationally-powered pair creation occurs via resonantly scattered photons, beyond standard curvature deathlines for pulsars. All cascades are generically accompanied by simultaneous (non-)thermal X-ray/UV counterparts which might be detectable with current instrumentation.
△ Less
Submitted 6 June, 2024;
originally announced June 2024.
-
Star Stream Velocity Distributions in CDM and WDM Galactic Halos
Authors:
Raymond G. Carlberg,
Adrian Jenkins,
Carlos S. Frenk,
Andrew P. Cooper
Abstract:
The dark matter subhalos orbiting in a galactic halo perturb the orbits of stars in thin stellar streams. Over time the random velocities in the streams develop non-Gaussian wings. The rate of velocity increase is approximately a random walk at a rate proportional to the number of subhalos, primarily those in the mass range $\approx 10^{6-7} M_\odot$. The distribution of random velocities in long,…
▽ More
The dark matter subhalos orbiting in a galactic halo perturb the orbits of stars in thin stellar streams. Over time the random velocities in the streams develop non-Gaussian wings. The rate of velocity increase is approximately a random walk at a rate proportional to the number of subhalos, primarily those in the mass range $\approx 10^{6-7} M_\odot$. The distribution of random velocities in long, thin, streams is measured in simulated Milky Way-like halos that develop in representative WDM and CDM cosmologies. The radial velocity distributions are well modeled as the sum of a Gaussian and an exponential. The resulting MCMC fits find Gaussian cores of 1-2 km/sec and exponential wings that increase from 3 km/sec for 5.5 keV WDM, 4 km/sec for 7 keV WDM, to 6 km/sec for a CDM halo. The observational prospects to use stream measurements to constrain the nature of galactic dark matter are discussed.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Constraining the physical properties of large-scale jets from black hole X-ray binaries and their impact on the local environment with blast-wave dynamical models
Authors:
Francesco Carotenuto,
Rob Fender,
Alexandra J. Tetarenko,
Stéphane Corbel,
Andrzej A. Zdziarski,
Gulzar Shaik,
Alex J. Cooper,
Irene Di Palma
Abstract:
Relativistic discrete ejecta launched by black hole X-ray binaries (BH XRBs) can be observed to propagate up to parsec-scales from the central object. Observing the final deceleration phase of these jets is crucial to estimate their physical parameters and to reconstruct their full trajectory, with implications for the jet powering mechanism, composition and formation. In this paper we present the…
▽ More
Relativistic discrete ejecta launched by black hole X-ray binaries (BH XRBs) can be observed to propagate up to parsec-scales from the central object. Observing the final deceleration phase of these jets is crucial to estimate their physical parameters and to reconstruct their full trajectory, with implications for the jet powering mechanism, composition and formation. In this paper we present the results of the modelling of the motion of the ejecta from three BH XRBs: MAXI J1820+070, MAXI J1535$-$571 and XTE J1752$-$223, for which high-resolution radio and X-ray observations of jets propagating up to $\sim$15 arcsec ($\sim$0.6 pc at 3 kpc) from the core have been published in the recent years. For each jet, we modeled its entire motion with a dynamical blast-wave model, inferring robust values for the jet Lorentz factor, inclination angle and ejection time. Under several assumptions associated to the ejection duration, the jet opening angle and the available accretion power, we are able to derive stringent constraints on the maximum jet kinetic energy for each source (between $10^{43}$ and $10^{44}$ erg, including also H1743$-$322), as well as placing interesting upper limits on the density of the ISM through which the jets are propagating (from $n_{\rm ISM} \lesssim 0.4$ cm$^{-3}$ down to $n_{\rm ISM} \lesssim 10^{-4}$ cm$^{-3}$). Overall, our results highlight the potential of applying models derived from gamma-ray bursts to the physics of jets from BH XRBs and support the emerging picture of these sources as preferentially embedded in low-density environments.
△ Less
Submitted 26 May, 2024;
originally announced May 2024.
-
BraTS-Path Challenge: Assessing Heterogeneous Histopathologic Brain Tumor Sub-regions
Authors:
Spyridon Bakas,
Siddhesh P. Thakur,
Shahriar Faghani,
Mana Moassefi,
Ujjwal Baid,
Verena Chung,
Sarthak Pati,
Shubham Innani,
Bhakti Baheti,
Jake Albrecht,
Alexandros Karargyris,
Hasan Kassem,
MacLean P. Nasrallah,
Jared T. Ahrendsen,
Valeria Barresi,
Maria A. Gubbiotti,
Giselle Y. López,
Calixto-Hope G. Lucas,
Michael L. Miller,
Lee A. D. Cooper,
Jason T. Huse,
William R. Bell
Abstract:
Glioblastoma is the most common primary adult brain tumor, with a grim prognosis - median survival of 12-18 months following treatment, and 4 months otherwise. Glioblastoma is widely infiltrative in the cerebral hemispheres and well-defined by heterogeneous molecular and micro-environmental histopathologic profiles, which pose a major obstacle in treatment. Correctly diagnosing these tumors and as…
▽ More
Glioblastoma is the most common primary adult brain tumor, with a grim prognosis - median survival of 12-18 months following treatment, and 4 months otherwise. Glioblastoma is widely infiltrative in the cerebral hemispheres and well-defined by heterogeneous molecular and micro-environmental histopathologic profiles, which pose a major obstacle in treatment. Correctly diagnosing these tumors and assessing their heterogeneity is crucial for choosing the precise treatment and potentially enhancing patient survival rates. In the gold-standard histopathology-based approach to tumor diagnosis, detecting various morpho-pathological features of distinct histology throughout digitized tissue sections is crucial. Such "features" include the presence of cellular tumor, geographic necrosis, pseudopalisading necrosis, areas abundant in microvascular proliferation, infiltration into the cortex, wide extension in subcortical white matter, leptomeningeal infiltration, regions dense with macrophages, and the presence of perivascular or scattered lymphocytes. With these features in mind and building upon the main aim of the BraTS Cluster of Challenges https://www.synapse.org/brats2024, the goal of the BraTS-Path challenge is to provide a systematically prepared comprehensive dataset and a benchmarking environment to develop and fairly compare deep-learning models capable of identifying tumor sub-regions of distinct histologic profile. These models aim to further our understanding of the disease and assist in the diagnosis and grading of conditions in a consistent manner.
△ Less
Submitted 17 May, 2024;
originally announced May 2024.
-
The James Webb Interferometer: Space-based interferometric detections of PDS 70 b and c at 4.8 $μ$m
Authors:
Dori Blakely,
Doug Johnstone,
Gabriele Cugno,
Anand Sivaramakrishnan,
Peter Tuthill,
Ruobing Dong,
Benjamin J. S. Pope,
Loïc Albert,
Max Charles,
Rachel A. Cooper,
Matthew De Furio,
Louis Desdoigts,
René Doyon,
Logan Francis,
Alexandra Z. Greenbaum,
David Lafrenière,
James P. Lloyd,
Michael R. Meyer,
Laurent Pueyo,
Shrishmoy Ray,
Joel Sánchez-Bermúdez,
Anthony Soulain,
Deepashri Thatte,
Thomas Vandal
Abstract:
We observed the planet-hosting system PDS 70 with the James Webb Interferometer, JWST's Aperture Masking Interferometric (AMI) mode within NIRISS. Observing with the F480M filter centered at 4.8 $μ$m, we simultaneously fit a geometric model to the outer disk and the two known planetary companions. We re-detect the protoplanets PDS 70 b and c at an SNR of 21 and 11, respectively. Our photometry of…
▽ More
We observed the planet-hosting system PDS 70 with the James Webb Interferometer, JWST's Aperture Masking Interferometric (AMI) mode within NIRISS. Observing with the F480M filter centered at 4.8 $μ$m, we simultaneously fit a geometric model to the outer disk and the two known planetary companions. We re-detect the protoplanets PDS 70 b and c at an SNR of 21 and 11, respectively. Our photometry of both PDS 70 b and c provide evidence for circumplanetary disk emission through fitting SED models to these new measurements and those found in the literature. We also newly detect emission within the disk gap at an SNR of $\sim$4, at a position angle of $207^{+11}_{-10}$ degrees, and an unconstrained separation within $\sim$200 mas. Follow-up observations will be needed to determine the nature of this emission. We place a 5$σ$ upper limit of $Δ$mag = 7.56 on the contrast of the candidate PDS 70 d at 4.8 $μ$m, which indicates that if the previously observed emission at shorter wavelengths is due to a planet, this putative planet has a different atmospheric composition than PDS 70 b or c. Finally, we place upper limits on emission from any additional planets in the disk gap. We find an azimuthally averaged 5$σ$ upper limit of $Δ$mag $\approx$ 7.5 at separations greater than 125 mas. These are the deepest limits to date within $\sim$250 mas at 4.8 $μ$m and the first space-based interferometric observations of this system.
△ Less
Submitted 19 April, 2024;
originally announced April 2024.
-
The Files are in the Computer: Copyright, Memorization, and Generative-AI Systems
Authors:
A. Feder Cooper,
James Grimmelmann
Abstract:
A central issue in copyright lawsuits against companies that produce generative-AI systems is the degree to which a generative-AI model does or does not "memorize" the data it was trained on. Unfortunately, the debate has been clouded by ambiguity over what "memorization" is, leading to legal debates in which participants often talk past one another. In this Essay, we attempt to bring clarity to t…
▽ More
A central issue in copyright lawsuits against companies that produce generative-AI systems is the degree to which a generative-AI model does or does not "memorize" the data it was trained on. Unfortunately, the debate has been clouded by ambiguity over what "memorization" is, leading to legal debates in which participants often talk past one another. In this Essay, we attempt to bring clarity to the conversation over memorization and its relationship to copying that is cognizable by U.S. copyright law.
△ Less
Submitted 2 July, 2024; v1 submitted 18 April, 2024;
originally announced April 2024.
-
Discovery of the optical and radio counterpart to the fast X-ray transient EP240315a
Authors:
J. H. Gillanders,
L. Rhodes,
S. Srivastav,
F. Carotenuto,
J. Bright,
M. E. Huber,
H. F. Stevance,
S. J. Smartt,
K. C. Chambers,
T. -W. Chen,
R. Fender,
A. Andersson,
A. J. Cooper,
P. G. Jonker,
F. J. Cowie,
T. deBoer,
N. Erasmus,
M. D. Fulton,
H. Gao,
J. Herman,
C. -C. Lin,
T. Lowe,
E. A. Magnier,
H. -Y. Miao,
P. Minguez
, et al. (14 additional authors not shown)
Abstract:
Fast X-ray Transients (FXTs) are extragalactic bursts of soft X-rays first identified >10 years ago. Since then, nearly 40 events have been discovered, although almost all of these have been recovered from archival Chandra and XMM-Newton data. To date, optical sky surveys and follow-up searches have not revealed any multi-wavelength counterparts. The Einstein Probe, launched in January 2024, has s…
▽ More
Fast X-ray Transients (FXTs) are extragalactic bursts of soft X-rays first identified >10 years ago. Since then, nearly 40 events have been discovered, although almost all of these have been recovered from archival Chandra and XMM-Newton data. To date, optical sky surveys and follow-up searches have not revealed any multi-wavelength counterparts. The Einstein Probe, launched in January 2024, has started surveying the sky in the soft X-ray regime (0.5-4 keV) and will rapidly increase the sample of FXTs discovered in real time. Here, we report the first discovery of both an optical and radio counterpart to a distant FXT, the fourth source publicly released by the Einstein Probe. We discovered a fast-fading optical transient within the 3 arcmin localisation radius of EP240315a with the all-sky optical survey ATLAS, and our follow-up Gemini spectrum provides a redshift, z=4.859+/-0.002. Furthermore, we uncovered a radio counterpart in the S-band (3.0 GHz) with the MeerKAT radio interferometer. The optical (rest-frame UV) and radio luminosities indicate the FXT most likely originates from either a long gamma-ray burst or a relativistic tidal disruption event. This may be a fortuitous early mission detection by the Einstein Probe or may signpost a mode of discovery for high-redshift, high-energy transients through soft X-ray surveys, combined with locating multi-wavelength counterparts.
△ Less
Submitted 19 June, 2024; v1 submitted 16 April, 2024;
originally announced April 2024.
-
Focused Active Learning for Histopathological Image Classification
Authors:
Arne Schmidt,
Pablo Morales-Álvarez,
Lee A. D. Cooper,
Lee A. Newberg,
Andinet Enquobahrie,
Aggelos K. Katsaggelos,
Rafael Molina
Abstract:
Active Learning (AL) has the potential to solve a major problem of digital pathology: the efficient acquisition of labeled data for machine learning algorithms. However, existing AL methods often struggle in realistic settings with artifacts, ambiguities, and class imbalances, as commonly seen in the medical field. The lack of precise uncertainty estimations leads to the acquisition of images with…
▽ More
Active Learning (AL) has the potential to solve a major problem of digital pathology: the efficient acquisition of labeled data for machine learning algorithms. However, existing AL methods often struggle in realistic settings with artifacts, ambiguities, and class imbalances, as commonly seen in the medical field. The lack of precise uncertainty estimations leads to the acquisition of images with a low informative value. To address these challenges, we propose Focused Active Learning (FocAL), which combines a Bayesian Neural Network with Out-of-Distribution detection to estimate different uncertainties for the acquisition function. Specifically, the weighted epistemic uncertainty accounts for the class imbalance, aleatoric uncertainty for ambiguous images, and an OoD score for artifacts. We perform extensive experiments to validate our method on MNIST and the real-world Panda dataset for the classification of prostate cancer. The results confirm that other AL methods are 'distracted' by ambiguities and artifacts which harm the performance. FocAL effectively focuses on the most informative images, avoiding ambiguities and artifacts during acquisition. For both experiments, FocAL outperforms existing AL approaches, reaching a Cohen's kappa of 0.764 with only 0.69% of the labeled Panda data.
△ Less
Submitted 6 April, 2024;
originally announced April 2024.
-
Reinforcement Learning Design for Quickest Change Detection
Authors:
Austin Cooper,
Sean Meyn
Abstract:
The field of quickest change detection (QCD) concerns design and analysis of algorithms to estimate in real time the time at which an important event takes place, and identify properties of the post-change behavior. It is shown in this paper that approaches based on reinforcement learning (RL) can be adapted based on any "surrogate information state" that is adapted to the observations. Hence we a…
▽ More
The field of quickest change detection (QCD) concerns design and analysis of algorithms to estimate in real time the time at which an important event takes place, and identify properties of the post-change behavior. It is shown in this paper that approaches based on reinforcement learning (RL) can be adapted based on any "surrogate information state" that is adapted to the observations. Hence we are left to choose both the surrogate information state process and the algorithm. For the former, it is argued that there are many choices available, based on a rich theory of asymptotic statistics for QCD. Two approaches to RL design are considered: (i) Stochastic gradient descent based on an actor-critic formulation. Theory is largely complete for this approach: the algorithm is unbiased, and will converge to a local minimum. However, it is shown that variance of stochastic gradients can be very large, necessitating the need for commensurately long run times; (ii) Q-learning algorithms based on a version of the projected Bellman equation. It is shown that the algorithm is stable, in the sense of bounded sample paths, and that a solution to the projected Bellman equation exists under mild conditions. Numerical experiments illustrate these findings, and provide a roadmap for algorithm design in more general settings.
△ Less
Submitted 20 March, 2024;
originally announced March 2024.
-
Stealing Part of a Production Language Model
Authors:
Nicholas Carlini,
Daniel Paleka,
Krishnamurthy Dj Dvijotham,
Thomas Steinke,
Jonathan Hayase,
A. Feder Cooper,
Katherine Lee,
Matthew Jagielski,
Milad Nasr,
Arthur Conmy,
Eric Wallace,
David Rolnick,
Florian Tramèr
Abstract:
We introduce the first model-stealing attack that extracts precise, nontrivial information from black-box production language models like OpenAI's ChatGPT or Google's PaLM-2. Specifically, our attack recovers the embedding projection layer (up to symmetries) of a transformer model, given typical API access. For under \…
▽ More
We introduce the first model-stealing attack that extracts precise, nontrivial information from black-box production language models like OpenAI's ChatGPT or Google's PaLM-2. Specifically, our attack recovers the embedding projection layer (up to symmetries) of a transformer model, given typical API access. For under \$20 USD, our attack extracts the entire projection matrix of OpenAI's Ada and Babbage language models. We thereby confirm, for the first time, that these black-box models have a hidden dimension of 1024 and 2048, respectively. We also recover the exact hidden dimension size of the gpt-3.5-turbo model, and estimate it would cost under \$2,000 in queries to recover the entire projection matrix. We conclude with potential defenses and mitigations, and discuss the implications of possible future work that could extend our attack.
△ Less
Submitted 11 March, 2024;
originally announced March 2024.
-
The DESI Early Data Release White Dwarf Catalogue
Authors:
Christopher J. Manser,
Paula Izquierdo,
Boris T. Gänsicke,
Andrew Swan,
Detlev Koester,
Akshay Robert,
Siyi Xu,
Keith Inight,
Ben Amroota,
N. P. Gentile Fusillo,
Sergey E. Koposov,
Bokyoung Kim,
Arjun Dey,
Carlos Allende Prieto,
J. Aguilar,
S. Ahlen,
R. Blum,
D. Brooks,
T. Claybaugh,
A. P. Cooper,
K. Dawson,
A. de la Macorra,
P. Doel,
J. E. Forero-Romero,
E. Gaztañaga
, et al. (29 additional authors not shown)
Abstract:
The Early Data Release (EDR) of the Dark Energy Spectroscopic Instrument (DESI) comprises spectroscopy obtained from 2020 December 14 to 2021 June 10. White dwarfs were targeted by DESI both as calibration sources and as science targets and were selected based on Gaia photometry and astrometry. Here we present the DESI EDR white dwarf catalogue, which includes 2706 spectroscopically confirmed whit…
▽ More
The Early Data Release (EDR) of the Dark Energy Spectroscopic Instrument (DESI) comprises spectroscopy obtained from 2020 December 14 to 2021 June 10. White dwarfs were targeted by DESI both as calibration sources and as science targets and were selected based on Gaia photometry and astrometry. Here we present the DESI EDR white dwarf catalogue, which includes 2706 spectroscopically confirmed white dwarfs of which approximately 1630 (roughly 60 per cent) have been spectroscopically observed for the first time, as well as 66 white dwarf binary systems. We provide spectral classifications for all white dwarfs, and discuss their distribution within the Gaia Hertzsprung-Russell diagram. We provide atmospheric parameters derived from spectroscopic and photometric fits for white dwarfs with pure hydrogen or helium photospheres, a mixture of those two, and white dwarfs displaying carbon features in their spectra. We also discuss the less abundant systems in the sample, such as those with magnetic fields, and cataclysmic variables. The DESI EDR white dwarf sample is significantly less biased than the sample observed by the Sloan Digital Sky Survey, which is skewed to bluer and therefore hotter white dwarfs, making DESI more complete and suitable for performing statistical studies of white dwarfs.
△ Less
Submitted 28 February, 2024;
originally announced February 2024.
-
AI-Augmented Brainwriting: Investigating the use of LLMs in group ideation
Authors:
Orit Shaer,
Angelora Cooper,
Osnat Mokryn,
Andrew L. Kun,
Hagit Ben Shoshan
Abstract:
The growing availability of generative AI technologies such as large language models (LLMs) has significant implications for creative work. This paper explores twofold aspects of integrating LLMs into the creative process - the divergence stage of idea generation, and the convergence stage of evaluation and selection of ideas. We devised a collaborative group-AI Brainwriting ideation framework, wh…
▽ More
The growing availability of generative AI technologies such as large language models (LLMs) has significant implications for creative work. This paper explores twofold aspects of integrating LLMs into the creative process - the divergence stage of idea generation, and the convergence stage of evaluation and selection of ideas. We devised a collaborative group-AI Brainwriting ideation framework, which incorporated an LLM as an enhancement into the group ideation process, and evaluated the idea generation process and the resulted solution space. To assess the potential of using LLMs in the idea evaluation process, we design an evaluation engine and compared it to idea ratings assigned by three expert and six novice evaluators. Our findings suggest that integrating LLM in Brainwriting could enhance both the ideation process and its outcome. We also provide evidence that LLMs can support idea evaluation. We conclude by discussing implications for HCI education and practice.
△ Less
Submitted 29 February, 2024; v1 submitted 22 February, 2024;
originally announced February 2024.
-
On the Standardization of Behavioral Use Clauses and Their Adoption for Responsible Licensing of AI
Authors:
Daniel McDuff,
Tim Korjakow,
Scott Cambo,
Jesse Josua Benjamin,
Jenny Lee,
Yacine Jernite,
Carlos Muñoz Ferrandis,
Aaron Gokaslan,
Alek Tarkowski,
Joseph Lindley,
A. Feder Cooper,
Danish Contractor
Abstract:
Growing concerns over negligent or malicious uses of AI have increased the appetite for tools that help manage the risks of the technology. In 2018, licenses with behaviorial-use clauses (commonly referred to as Responsible AI Licenses) were proposed to give developers a framework for releasing AI assets while specifying their users to mitigate negative applications. As of the end of 2023, on the…
▽ More
Growing concerns over negligent or malicious uses of AI have increased the appetite for tools that help manage the risks of the technology. In 2018, licenses with behaviorial-use clauses (commonly referred to as Responsible AI Licenses) were proposed to give developers a framework for releasing AI assets while specifying their users to mitigate negative applications. As of the end of 2023, on the order of 40,000 software and model repositories have adopted responsible AI licenses licenses. Notable models licensed with behavioral use clauses include BLOOM (language) and LLaMA2 (language), Stable Diffusion (image), and GRID (robotics). This paper explores why and how these licenses have been adopted, and why and how they have been adapted to fit particular use cases. We use a mixed-methods methodology of qualitative interviews, clustering of license clauses, and quantitative analysis of license adoption. Based on this evidence we take the position that responsible AI licenses need standardization to avoid confusing users or diluting their impact. At the same time, customization of behavioral restrictions is also appropriate in some contexts (e.g., medical domains). We advocate for ``standardized customization'' that can meet users' needs and can be supported via tooling.
△ Less
Submitted 7 February, 2024;
originally announced February 2024.
-
Machine Learning Resistant Amorphous Silicon Physically Unclonable Functions (PUFs)
Authors:
Velat Kilic,
Neil Macfarlane,
Jasper Stround,
Samuel Metais,
Milad Alemohammad,
A. Brinton Cooper,
Amy C. Foster,
Mark A. Foster
Abstract:
We investigate usage of nonlinear wave chaotic amorphous silicon (a-Si) cavities as physically unclonable functions (PUF). Machine learning attacks on integrated electronic PUFs have been demonstrated to be very effective at modeling PUF behavior. Such attacks on integrated a-Si photonic PUFs are investigated through application of algorithms including linear regression, k-nearest neighbor, decisi…
▽ More
We investigate usage of nonlinear wave chaotic amorphous silicon (a-Si) cavities as physically unclonable functions (PUF). Machine learning attacks on integrated electronic PUFs have been demonstrated to be very effective at modeling PUF behavior. Such attacks on integrated a-Si photonic PUFs are investigated through application of algorithms including linear regression, k-nearest neighbor, decision tree ensembles (random forests and gradient boosted trees), and deep neural networks (DNNs). We found that DNNs performed the best among all the algorithms studied but still failed to completely break the a-Si PUF security which we quantify through a private information metric. Furthermore, machine learning resistance of a-Si PUFs were found to be directly related to the strength of their nonlinear response.
△ Less
Submitted 5 February, 2024;
originally announced February 2024.
-
CIS-UNet: Multi-Class Segmentation of the Aorta in Computed Tomography Angiography via Context-Aware Shifted Window Self-Attention
Authors:
Muhammad Imran,
Jonathan R Krebs,
Veera Rajasekhar Reddy Gopu,
Brian Fazzone,
Vishal Balaji Sivaraman,
Amarjeet Kumar,
Chelsea Viscardi,
Robert Evans Heithaus,
Benjamin Shickel,
Yuyin Zhou,
Michol A Cooper,
Wei Shao
Abstract:
Advancements in medical imaging and endovascular grafting have facilitated minimally invasive treatments for aortic diseases. Accurate 3D segmentation of the aorta and its branches is crucial for interventions, as inaccurate segmentation can lead to erroneous surgical planning and endograft construction. Previous methods simplified aortic segmentation as a binary image segmentation problem, overlo…
▽ More
Advancements in medical imaging and endovascular grafting have facilitated minimally invasive treatments for aortic diseases. Accurate 3D segmentation of the aorta and its branches is crucial for interventions, as inaccurate segmentation can lead to erroneous surgical planning and endograft construction. Previous methods simplified aortic segmentation as a binary image segmentation problem, overlooking the necessity of distinguishing between individual aortic branches. In this paper, we introduce Context Infused Swin-UNet (CIS-UNet), a deep learning model designed for multi-class segmentation of the aorta and thirteen aortic branches. Combining the strengths of Convolutional Neural Networks (CNNs) and Swin transformers, CIS-UNet adopts a hierarchical encoder-decoder structure comprising a CNN encoder, symmetric decoder, skip connections, and a novel Context-aware Shifted Window Self-Attention (CSW-SA) as the bottleneck block. Notably, CSW-SA introduces a unique utilization of the patch merging layer, distinct from conventional Swin transformers. It efficiently condenses the feature map, providing a global spatial context and enhancing performance when applied at the bottleneck layer, offering superior computational efficiency and segmentation accuracy compared to the Swin transformers. We trained our model on computed tomography (CT) scans from 44 patients and tested it on 15 patients. CIS-UNet outperformed the state-of-the-art SwinUNetR segmentation model, which is solely based on Swin transformers, by achieving a superior mean Dice coefficient of 0.713 compared to 0.697, and a mean surface distance of 2.78 mm compared to 3.39 mm. CIS-UNet's superior 3D aortic segmentation offers improved precision and optimization for planning endovascular treatments. Our dataset and code will be publicly available.
△ Less
Submitted 23 January, 2024;
originally announced January 2024.
-
Scalable Extraction of Training Data from (Production) Language Models
Authors:
Milad Nasr,
Nicholas Carlini,
Jonathan Hayase,
Matthew Jagielski,
A. Feder Cooper,
Daphne Ippolito,
Christopher A. Choquette-Choo,
Eric Wallace,
Florian Tramèr,
Katherine Lee
Abstract:
This paper studies extractable memorization: training data that an adversary can efficiently extract by querying a machine learning model without prior knowledge of the training dataset. We show an adversary can extract gigabytes of training data from open-source language models like Pythia or GPT-Neo, semi-open models like LLaMA or Falcon, and closed models like ChatGPT. Existing techniques from…
▽ More
This paper studies extractable memorization: training data that an adversary can efficiently extract by querying a machine learning model without prior knowledge of the training dataset. We show an adversary can extract gigabytes of training data from open-source language models like Pythia or GPT-Neo, semi-open models like LLaMA or Falcon, and closed models like ChatGPT. Existing techniques from the literature suffice to attack unaligned models; in order to attack the aligned ChatGPT, we develop a new divergence attack that causes the model to diverge from its chatbot-style generations and emit training data at a rate 150x higher than when behaving properly. Our methods show practical attacks can recover far more data than previously thought, and reveal that current alignment techniques do not eliminate memorization.
△ Less
Submitted 28 November, 2023;
originally announced November 2023.
-
A First Look with JWST Aperture Masking Interferometry (AMI): Resolving Circumstellar Dust around the Wolf-Rayet Binary WR 137 beyond the Rayleigh Limit
Authors:
Ryan M. Lau,
Matthew J. Hankins,
Joel Sanchez-Bermudez,
Deepashri Thatte,
Anthony Soulain,
Rachel A. Cooper,
Anand Sivaramakrishnan,
Michael F. Corcoran,
Alexandra Z. Greenbaum,
Theodore R. Gull,
Yinuo Han,
Olivia C. Jones,
Thomas Madura,
Anthony F. J. Moffat,
Mark R. Morris,
Takashi Onaka,
Christopher M. P. Russell,
Noel D. Richardson,
Nathan Smith,
Peter Tuthill,
Kevin Volk,
Gerd Weigelt,
Peredur M. Williams
Abstract:
We present infrared aperture masking interferometry (AMI) observations of newly formed dust from the colliding winds of the massive binary system Wolf-Rayet (WR) 137 with JWST using the Near Infrared Imager and Slitless Spectrograph (NIRISS). NIRISS AMI observations of WR 137 and a point-spread-function calibrator star, HD~228337, were taken using the F380M and F480M filters in 2022 July and Augus…
▽ More
We present infrared aperture masking interferometry (AMI) observations of newly formed dust from the colliding winds of the massive binary system Wolf-Rayet (WR) 137 with JWST using the Near Infrared Imager and Slitless Spectrograph (NIRISS). NIRISS AMI observations of WR 137 and a point-spread-function calibrator star, HD~228337, were taken using the F380M and F480M filters in 2022 July and August as part of the Director's Discretionary Early Release Science (DD-ERS) program 1349. Interferometric observables (squared visibilities and closure phases) from the WR 137 "interferogram" were extracted and calibrated using three independent software tools: ImPlaneIA, AMICAL, and SAMpip. The analysis of the calibrated observables yielded consistent values except for slightly discrepant closure phases measured by ImPlaneIA. Based on all three sets of calibrated observables, images were reconstructed using three independent software tools: BSMEM, IRBis, and SQUEEZE. All reconstructed image combinations generated consistent images in both F380M and F480M filters. The reconstructed images of WR 137 reveal a bright central core with a $\sim300$ mas linear filament extending to the northwest. A geometric colliding-wind model with dust production constrained to the orbital plane of the binary system and enhanced as the system approaches periapsis provided a general agreement with the interferometric observables and reconstructed images. Based on a colliding-wind dust condensation analysis, we suggest that dust formation within the orbital plane of WR 137 is induced by enhanced equatorial mass-loss from the rapidly rotating O9 companion star, whose axis of rotation is aligned with that of the orbit.
△ Less
Submitted 22 December, 2023; v1 submitted 27 November, 2023;
originally announced November 2023.
-
Domain Knowledge Injection in Bayesian Search for New Materials
Authors:
Zikai Xie,
Xenophon Evangelopoulos,
Joseph Thacker,
Andrew Cooper
Abstract:
In this paper we propose DKIBO, a Bayesian optimization (BO) algorithm that accommodates domain knowledge to tune exploration in the search space. Bayesian optimization has recently emerged as a sample-efficient optimizer for many intractable scientific problems. While various existing BO frameworks allow the input of prior beliefs to accelerate the search by narrowing down the space, incorporatin…
▽ More
In this paper we propose DKIBO, a Bayesian optimization (BO) algorithm that accommodates domain knowledge to tune exploration in the search space. Bayesian optimization has recently emerged as a sample-efficient optimizer for many intractable scientific problems. While various existing BO frameworks allow the input of prior beliefs to accelerate the search by narrowing down the space, incorporating such knowledge is not always straightforward and can often introduce bias and lead to poor performance. Here we propose a simple approach to incorporate structural knowledge in the acquisition function by utilizing an additional deterministic surrogate model to enrich the approximation power of the Gaussian process. This is suitably chosen according to structural information of the problem at hand and acts a corrective term towards a better-informed sampling. We empirically demonstrate the practical utility of the proposed method by successfully injecting domain knowledge in a materials design task. We further validate our method's performance on different experimental settings and ablation analyses.
△ Less
Submitted 25 November, 2023;
originally announced November 2023.
-
Report of the 1st Workshop on Generative AI and Law
Authors:
A. Feder Cooper,
Katherine Lee,
James Grimmelmann,
Daphne Ippolito,
Christopher Callison-Burch,
Christopher A. Choquette-Choo,
Niloofar Mireshghallah,
Miles Brundage,
David Mimno,
Madiha Zahrah Choksi,
Jack M. Balkin,
Nicholas Carlini,
Christopher De Sa,
Jonathan Frankle,
Deep Ganguli,
Bryant Gipson,
Andres Guadamuz,
Swee Leng Harris,
Abigail Z. Jacobs,
Elizabeth Joh,
Gautam Kamath,
Mark Lemley,
Cass Matthews,
Christine McLeavey,
Corynne McSherry
, et al. (10 additional authors not shown)
Abstract:
This report presents the takeaways of the inaugural Workshop on Generative AI and Law (GenLaw), held in July 2023. A cross-disciplinary group of practitioners and scholars from computer science and law convened to discuss the technical, doctrinal, and policy challenges presented by law for Generative AI, and by Generative AI for law, with an emphasis on U.S. law in particular. We begin the report…
▽ More
This report presents the takeaways of the inaugural Workshop on Generative AI and Law (GenLaw), held in July 2023. A cross-disciplinary group of practitioners and scholars from computer science and law convened to discuss the technical, doctrinal, and policy challenges presented by law for Generative AI, and by Generative AI for law, with an emphasis on U.S. law in particular. We begin the report with a high-level statement about why Generative AI is both immensely significant and immensely challenging for law. To meet these challenges, we conclude that there is an essential need for 1) a shared knowledge base that provides a common conceptual language for experts across disciplines; 2) clarification of the distinctive technical capabilities of generative-AI systems, as compared and contrasted to other computer and AI systems; 3) a logical taxonomy of the legal issues these systems raise; and, 4) a concrete research agenda to promote collaboration and knowledge-sharing on emerging issues at the intersection of Generative AI and law. In this report, we synthesize the key takeaways from the GenLaw workshop that begin to address these needs. All of the listed authors contributed to the workshop upon which this report is based, but they and their organizations do not necessarily endorse all of the specific claims in this report.
△ Less
Submitted 2 December, 2023; v1 submitted 10 November, 2023;
originally announced November 2023.
-
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images
Authors:
Aaron Gokaslan,
A. Feder Cooper,
Jasmine Collins,
Landan Seguin,
Austin Jacobson,
Mihir Patel,
Jonathan Frankle,
Cory Stephenson,
Volodymyr Kuleshov
Abstract:
We assemble a dataset of Creative-Commons-licensed (CC) images, which we use to train a set of open diffusion models that are qualitatively competitive with Stable Diffusion 2 (SD2). This task presents two challenges: (1) high-resolution CC images lack the captions necessary to train text-to-image generative models; (2) CC images are relatively scarce. In turn, to address these challenges, we use…
▽ More
We assemble a dataset of Creative-Commons-licensed (CC) images, which we use to train a set of open diffusion models that are qualitatively competitive with Stable Diffusion 2 (SD2). This task presents two challenges: (1) high-resolution CC images lack the captions necessary to train text-to-image generative models; (2) CC images are relatively scarce. In turn, to address these challenges, we use an intuitive transfer learning technique to produce a set of high-quality synthetic captions paired with curated CC images. We then develop a data- and compute-efficient training recipe that requires as little as 3% of the LAION-2B data needed to train existing SD2 models, but obtains comparable quality. These results indicate that we have a sufficient number of CC images (~70 million) for training high-quality models. Our training recipe also implements a variety of optimizations that achieve ~3X training speed-ups, enabling rapid model iteration. We leverage this recipe to train several high-quality text-to-image models, which we dub the CommonCanvas family. Our largest model achieves comparable performance to SD2 on a human evaluation, despite being trained on our CC dataset that is significantly smaller than LAION and using synthetic captions for training. We release our models, data, and code at https://github.com/mosaicml/diffusion/blob/main/assets/common-canvas.md
△ Less
Submitted 25 October, 2023;
originally announced October 2023.
-
Characterisation of ferroelectric domains in magnetite (Fe3O4)
Authors:
S. D. Seddon,
A. Cooper,
T. Fricke,
S. G. Ebbinghaus,
M. Walker,
T. P. A. Hase,
W. J. A. Blackmore,
M. Alexe
Abstract:
Magnetite has long been investigated across many disciplines due to the interplay between its ferroic order parameters, namely its ferrimagnetism, ferroelasticity and ferroelectricty. Despite this, the experimental difficulty in measuring low temperature real space images of the ferroelectric domains has meant that the local behaviour of ferroelectric domains emergent below the 38 K phase transiti…
▽ More
Magnetite has long been investigated across many disciplines due to the interplay between its ferroic order parameters, namely its ferrimagnetism, ferroelasticity and ferroelectricty. Despite this, the experimental difficulty in measuring low temperature real space images of the ferroelectric domains has meant that the local behaviour of ferroelectric domains emergent below the 38 K phase transition have yet to be realised. This work presents real space images of the ferroelectric domains, and uses piezo force microscopy to, as a function of temperature, probe the onset of piezoelectricty and ferroelectricity across the 38 K transition
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
Bayesian cross-validation by parallel Markov Chain Monte Carlo
Authors:
Alex Cooper,
Aki Vehtari,
Catherine Forbes,
Lauren Kennedy,
Dan Simpson
Abstract:
Brute force cross-validation (CV) is a method for predictive assessment and model selection that is general and applicable to a wide range of Bayesian models. Naive or `brute force' CV approaches are often too computationally costly for interactive modeling workflows, especially when inference relies on Markov chain Monte Carlo (MCMC). We propose overcoming this limitation using massively parallel…
▽ More
Brute force cross-validation (CV) is a method for predictive assessment and model selection that is general and applicable to a wide range of Bayesian models. Naive or `brute force' CV approaches are often too computationally costly for interactive modeling workflows, especially when inference relies on Markov chain Monte Carlo (MCMC). We propose overcoming this limitation using massively parallel MCMC. Using accelerator hardware such as graphics processor units (GPUs), our approach can be about as fast (in wall clock time) as a single full-data model fit.
Parallel CV is flexible because it can easily exploit a wide range data partitioning schemes, such as those designed for non-exchangeable data. It can also accommodate a range of scoring rules.
We propose MCMC diagnostics, including a summary of MCMC mixing based on the popular potential scale reduction factor ($\widehat{R}$) and MCMC effective sample size ($\widehat{ESS}$) measures. We also describe a method for determining whether an $\widehat{R}$ diagnostic indicates approximate stationarity of the chains, that may be of more general interest for applications beyond parallel CV. Finally, we show that parallel CV and its diagnostics can be implemented with online algorithms, allowing parallel CV to scale up to very large blocking designs on memory-constrained computing accelerators.
△ Less
Submitted 13 January, 2024; v1 submitted 10 October, 2023;
originally announced October 2023.
-
MWA rapid follow-up of gravitational wave transients: prospects for detecting prompt radio counterparts
Authors:
J. Tian,
G. E. Anderson,
A. J. Cooper,
K. Gourdji,
M. Sokolowski,
A. Rowlinson,
A. Williams,
G. Sleap,
D. Dobie,
D. L. Kaplan,
Tara Murphy,
S. J. Tingay,
F. H. Panther,
P. D. Lasky,
A. Bahramian,
J. C. A. Miller-Jones,
C. W. James,
B. W. Meyers,
S. J. McSweeney,
P. J. Hancock
Abstract:
We present and evaluate the prospects for detecting coherent radio counterparts to gravitational wave (GW) events using Murchison Widefield Array (MWA) triggered observations. The MWA rapid-response system, combined with its buffering mode ($\sim4$ minutes negative latency), enables us to catch any radio signals produced from seconds prior to hours after a binary neutron star (BNS) merger. The lar…
▽ More
We present and evaluate the prospects for detecting coherent radio counterparts to gravitational wave (GW) events using Murchison Widefield Array (MWA) triggered observations. The MWA rapid-response system, combined with its buffering mode ($\sim4$ minutes negative latency), enables us to catch any radio signals produced from seconds prior to hours after a binary neutron star (BNS) merger. The large field of view of the MWA ($\sim1000\,\text{deg}^2$ at 120\,MHz) and its location under the high sensitivity sky region of the LIGO-Virgo-KAGRA (LVK) detector network, forecast a high chance of being on-target for a GW event. We consider three observing configurations for the MWA to follow up GW BNS merger events, including a single dipole per tile, the full array, and four sub-arrays. We then perform a population synthesis of BNS systems to predict the radio detectable fraction of GW events using these configurations. We find that the configuration with four sub-arrays is the best compromise between sky coverage and sensitivity as it is capable of placing meaningful constraints on the radio emission from 12.6\% of GW BNS detections. Based on the timescales of four BNS merger coherent radio emission models, we propose an observing strategy that involves triggering the buffering mode to target coherent signals emitted prior to, during or shortly following the merger, which is then followed by continued recording for up to three hours to target later time post-merger emission. We expect MWA to trigger on $\sim5\text{--}22$ BNS merger events during the LVK O4 observing run, which could potentially result in two detections of predicted coherent emission.
△ Less
Submitted 28 September, 2023;
originally announced September 2023.
-
GW190425: Pan-STARRS and ATLAS coverage of the skymap and limits on optical emission associated with FRB190425
Authors:
S. J. Smartt,
M. Nicholl,
S. Srivastav,
M. E. Huber,
K. C. Chambers,
K. W. Smith,
D. R. Young,
M. D. Fulton,
J. L. Tonry,
C. W. Stubbs,
L. Denneau,
A. J. Cooper,
A. Aamer,
J. P. Anderson,
A. Andersson,
J. Bulger,
T. -W Chen,
P. Clark,
T. de Boer,
H. Gao,
J. H. Gillanders,
A. Lawrence,
C. C. Lin,
T. B. Lowe,
E. A. Magnier
, et al. (10 additional authors not shown)
Abstract:
GW190425 is the second of only two binary neutron star (BNS) merger events to be significantly detected by the LIGO-Virgo- Kagra gravitational wave detectors. With a detection only in LIGO Livingston, the skymap containing the source was large and no plausible electromagnetic counterpart was found in real time searching in 2019. Here we summarise our ATLAS and Pan-STARRS wide-field optical coverag…
▽ More
GW190425 is the second of only two binary neutron star (BNS) merger events to be significantly detected by the LIGO-Virgo- Kagra gravitational wave detectors. With a detection only in LIGO Livingston, the skymap containing the source was large and no plausible electromagnetic counterpart was found in real time searching in 2019. Here we summarise our ATLAS and Pan-STARRS wide-field optical coverage of the skymap beginning within 1 hour and 3 hours respectively of the GW190425 merger time. More recently, a potential coincidence between GW190425 and a fast radio burst FRB 190425 has been suggested, given their spatial and temporal coincidence. The smaller sky localisation area of FRB 190425 and its dispersion measure have led to the identification of a likely host galaxy, UGC 10667 at a distance of 141 +/- 10 Mpc. Our optical imaging covered the galaxy 6.0 hrs after GW190425 was detected and 3.5 hrs after the FRB 190425. No optical emission was detected and further imaging at +1.2 and +13.2 days also revealed no emission. If the FRB 190425 and GW190425 association were real, we highlight our limits on kilonova emission from a BNS merger in UGC 10667. The model for producing FRB 190425 from a BNS merger involves a supramassive magnetised neutron star spinning down by dipole emission on the timescale of hours. We show that magnetar enhanced kilonova emission is ruled out by optical upper limits. The lack of detected optical emission from a kilonova in UGC 10667 disfavours, but does not disprove, the FRB-GW link for this source.
△ Less
Submitted 20 September, 2023;
originally announced September 2023.
-
Talkin' 'Bout AI Generation: Copyright and the Generative-AI Supply Chain
Authors:
Katherine Lee,
A. Feder Cooper,
James Grimmelmann
Abstract:
"Does generative AI infringe copyright?" is an urgent question. It is also a difficult question, for two reasons. First, "generative AI" is not just one product from one company. It is a catch-all name for a massive ecosystem of loosely related technologies, including conversational text chatbots like ChatGPT, image generators like Midjourney and DALL-E, coding assistants like GitHub Copilot, and…
▽ More
"Does generative AI infringe copyright?" is an urgent question. It is also a difficult question, for two reasons. First, "generative AI" is not just one product from one company. It is a catch-all name for a massive ecosystem of loosely related technologies, including conversational text chatbots like ChatGPT, image generators like Midjourney and DALL-E, coding assistants like GitHub Copilot, and systems that compose music and create videos. These systems behave differently and raise different legal issues. The second problem is that copyright law is notoriously complicated, and generative-AI systems manage to touch on a great many corners of it: authorship, similarity, direct and indirect liability, fair use, and licensing, among much else. These issues cannot be analyzed in isolation, because there are connections everywhere.
In this Article, we aim to bring order to the chaos. To do so, we introduce the generative-AI supply chain: an interconnected set of stages that transform training data (millions of pictures of cats) into generations (a new, potentially never-seen-before picture of a cat that has never existed). Breaking down generative AI into these constituent stages reveals all of the places at which companies and users make choices that have copyright consequences. It enables us to trace the effects of upstream technical designs on downstream uses, and to assess who in these complicated sociotechnical systems bears responsibility for infringement when it happens. Because we engage so closely with the technology of generative AI, we are able to shed more light on the copyright questions. We do not give definitive answers as to who should and should not be held liable. Instead, we identify the key decisions that courts will need to make as they grapple with these issues, and point out the consequences that would likely flow from different liability regimes.
△ Less
Submitted 1 March, 2024; v1 submitted 15 September, 2023;
originally announced September 2023.
-
Modular, Multi-Robot Integration of Laboratories: An Autonomous Solid-State Workflow for Powder X-Ray Diffraction
Authors:
Amy. M. Lunt,
Hatem Fakhruldeen,
Gabriella Pizzuto,
Louis Longley,
Alexander White,
Nicola Rankin,
Rob Clowes,
Ben Alston,
Lucia Gigli,
Graeme M. Day,
Andrew I. Cooper,
Sam. Y. Chong
Abstract:
Automation can transform productivity in research activities that use liquid handling, such as organic synthesis, but it has made less impact in materials laboratories, which require sample preparation steps and a range of solid-state characterization techniques. For example, powder X-ray diffraction (PXRD) is a key method in materials and pharmaceutical chemistry, but its end-to-end automation is…
▽ More
Automation can transform productivity in research activities that use liquid handling, such as organic synthesis, but it has made less impact in materials laboratories, which require sample preparation steps and a range of solid-state characterization techniques. For example, powder X-ray diffraction (PXRD) is a key method in materials and pharmaceutical chemistry, but its end-to-end automation is challenging because it involves solid powder handling and sample processing. Here we present a fully autonomous solid-state workflow for PXRD experiments that can match or even surpass manual data quality. The workflow involves 12 steps performed by a team of three multipurpose robots, illustrating the power of flexible, modular automation to integrate complex, multitask laboratories.
△ Less
Submitted 23 November, 2023; v1 submitted 1 September, 2023;
originally announced September 2023.
-
HypBO: Accelerating Black-Box Scientific Experiments Using Experts' Hypotheses
Authors:
Abdoulatif Cisse,
Xenophon Evangelopoulos,
Sam Carruthers,
Vladimir V. Gusev,
Andrew I. Cooper
Abstract:
Robotics and automation offer massive accelerations for solving intractable, multivariate scientific problems such as materials discovery, but the available search spaces can be dauntingly large. Bayesian optimization (BO) has emerged as a popular sample-efficient optimization engine, thriving in tasks where no analytic form of the target function/property is known. Here, we exploit expert human k…
▽ More
Robotics and automation offer massive accelerations for solving intractable, multivariate scientific problems such as materials discovery, but the available search spaces can be dauntingly large. Bayesian optimization (BO) has emerged as a popular sample-efficient optimization engine, thriving in tasks where no analytic form of the target function/property is known. Here, we exploit expert human knowledge in the form of hypotheses to direct Bayesian searches more quickly to promising regions of chemical space. Previous methods have used underlying distributions derived from existing experimental measurements, which is unfeasible for new, unexplored scientific tasks. Also, such distributions cannot capture intricate hypotheses. Our proposed method, which we call HypBO, uses expert human hypotheses to generate improved seed samples. Unpromising seeds are automatically discounted, while promising seeds are used to augment the surrogate model data, thus achieving better-informed sampling. This process continues in a global versus local search fashion, organized in a bilevel optimization framework. We validate the performance of our method on a range of synthetic functions and demonstrate its practical utility on a real chemical design task where the use of expert hypotheses accelerates the search performance significantly.
△ Less
Submitted 28 January, 2024; v1 submitted 22 August, 2023;
originally announced August 2023.
-
Leveraging Multi-modal Sensing for Robotic Insertion Tasks in R&D Laboratories
Authors:
Aaron Butterworth,
Gabriella Pizzuto,
Leszek Pecyna,
Andrew I. Cooper,
Shan Luo
Abstract:
Performing a large volume of experiments in Chemistry labs creates repetitive actions costing researchers time, automating these routines is highly desirable. Previous experiments in robotic chemistry have performed high numbers of experiments autonomously, however, these processes rely on automated machines in all stages from solid or liquid addition to analysis of the final product. In these sys…
▽ More
Performing a large volume of experiments in Chemistry labs creates repetitive actions costing researchers time, automating these routines is highly desirable. Previous experiments in robotic chemistry have performed high numbers of experiments autonomously, however, these processes rely on automated machines in all stages from solid or liquid addition to analysis of the final product. In these systems every transition between machine requires the robotic chemist to pick and place glass vials, however, this is currently performed using open loop methods which require all equipment being used by the robot to be in well defined known locations. We seek to begin closing the loop in this vial handling process in a way which also fosters human-robot collaboration in the chemistry lab environment. To do this the robot must be able to detect valid placement positions for the vials it is collecting, and reliably insert them into the detected locations. We create a single modality visual method for estimating placement locations to provide a baseline before introducing two additional methods of feedback (force and tactile feedback). Our visual method uses a combination of classic computer vision methods and a CNN discriminator to detect possible insertion points, then a vial is grasped and positioned above an insertion point and the multi-modal methods guide the final insertion movements using an efficient search pattern. Through our experiments we show the baseline insertion rate of 48.78% improves to 89.55% with the addition of our "force and vision" multi-modal feedback method.
△ Less
Submitted 2 July, 2023;
originally announced July 2023.
-
Ghostly galaxies: accretion-dominated stellar systems in low-mass dark matter halos
Authors:
Chung-Wen Wang,
Andrew P. Cooper,
Sownak Bose,
Carlos S. Frenk,
Wojciech A. Hellwing
Abstract:
Wide-area deep imaging surveys have discovered large numbers of extremely low surface brightness dwarf galaxies, which challenge galaxy formation theory and, potentially, offer new constraints on the nature of dark matter. Here we discuss one as-yet unexplored formation mechanism that may account for a fraction of low surface brightness dwarfs. We call this the `ghost galaxy' scenario. In this sce…
▽ More
Wide-area deep imaging surveys have discovered large numbers of extremely low surface brightness dwarf galaxies, which challenge galaxy formation theory and, potentially, offer new constraints on the nature of dark matter. Here we discuss one as-yet unexplored formation mechanism that may account for a fraction of low surface brightness dwarfs. We call this the `ghost galaxy' scenario. In this scenario, inefficient radiative cooling prevents star formation in the `main branch' of the merger tree of a low mass dark matter halo, such that almost all its stellar mass is acquired through mergers with less massive (but nevertheless star-forming) progenitors. Present-day systems formed in this way would be `ghostly' isolated stellar halos with no central galaxy. We use merger trees based on the Extended Press-Schechter formalism and the COCO cosmological N-body simulation to demonstrate that mass assembly histories of this kind can occur for low-mass halos in Lambda-CDM, but they are rare. They are most probable in isolated halos of present-day mass ~4x10^9 M_sun, occurring for ~5 per cent of all halos of that mass under standard assumptions about the timing and effect of cosmic reionization. The stellar masses of star-forming progenitors in these systems are highly uncertain; abundance-matching arguments imply a bimodal present-day mass function having a brighter population (median M_star ~3x10^6 M_sun) consistent with the tail of the observed luminosity function of ultra-diffuse galaxies. This suggests observable analogues of these systems may await discovery. We find that a stronger ionizing background (globally or locally) produces brighter and more extended ghost galaxies.
△ Less
Submitted 30 October, 2023; v1 submitted 30 June, 2023;
originally announced June 2023.
-
Control of an environmental spin defect beyond the coherence limit of a central spin
Authors:
Alexander Ungar,
Paola Cappellaro,
Alexandre Cooper,
Won Kyu Calvin Sun
Abstract:
Electronic spin defects in the environment of an optically-active spin can be used to increase the size and hence the performance of solid-state quantum registers, especially for applications in quantum metrology and quantum communication. Previous works on multi-qubit electronic-spin registers in the environment of a Nitrogen-Vacancy (NV) center in diamond have only included spins directly couple…
▽ More
Electronic spin defects in the environment of an optically-active spin can be used to increase the size and hence the performance of solid-state quantum registers, especially for applications in quantum metrology and quantum communication. Previous works on multi-qubit electronic-spin registers in the environment of a Nitrogen-Vacancy (NV) center in diamond have only included spins directly coupled to the NV. As this direct coupling is limited by the central spin coherence time, it significantly restricts the register's maximum attainable size. To address this problem, we present a scalable approach to increase the size of electronic-spin registers. Our approach exploits a weakly-coupled probe spin together with double-resonance control sequences to mediate the transfer of spin polarization between the central NV spin and an environmental spin that is not directly coupled to it. We experimentally realize this approach to demonstrate the detection and coherent control of an unknown electronic spin outside the coherence limit of a central NV. Our work paves the way for engineering larger quantum spin registers with the potential to advance nanoscale sensing, enable correlated noise spectroscopy for error correction, and facilitate the realization of spin-chain quantum wires for quantum communication.
△ Less
Submitted 15 March, 2024; v1 submitted 29 June, 2023;
originally announced June 2023.
-
Possible contribution of X-ray binary jets to the Galactic cosmic ray and neutrino flux
Authors:
Dimitrios Kantzas,
Sera Markoff,
Alex J. Cooper,
Daniele Gaggero,
Maria Petropoulou,
Pedro De La Torre Luque
Abstract:
For over a century, the identification of high-energy cosmic ray (CR) sources remains an open question. For Galactic CRs with energy up to $10^{15}$ eV, supernova remnants (SNRs) have traditionally been thought the main candidate source. However, recent TeV gamma-ray observations have questioned the SNR paradigm. Propagating CRs are deflected by the Galactic magnetic field, hence, gamma-rays and n…
▽ More
For over a century, the identification of high-energy cosmic ray (CR) sources remains an open question. For Galactic CRs with energy up to $10^{15}$ eV, supernova remnants (SNRs) have traditionally been thought the main candidate source. However, recent TeV gamma-ray observations have questioned the SNR paradigm. Propagating CRs are deflected by the Galactic magnetic field, hence, gamma-rays and neutrinos produced via inelastic hadronic interactions are the only means for unveiling the CR sources. In this work, we study the gamma-ray and neutrino emission produced by CRs accelerated inside Galactic jets of stellar-mass black holes in X-ray binaries (BHXBs). We calculate the intrinsic neutrino emission of two prototypical BHXBs, Cygnus X-1 and GX 339-4, for which we have high-quality, quasi-simultaneous multiwavelength spectra. Based on these prototypical sources, we discuss the likelihood of the 35 known Galactic BHXBs to be efficient CR accelerators. Moreover, we estimate the potential contribution to the CR spectrum of a viable population of BHXBs that reside in the Galactic plane. When these BHXBs go into outburst, they may accelerate particles up to 100s of TeV that contribute to the diffuse gamma-ray and neutrino spectra while propagating in the Galactic medium. Using HERMES, an open-source code that calculates the hadronic processes along the line of sight, we discuss the contribution of BHXBs to the diffuse gamma-ray and neutrino fluxes, and compare these to their intrinsic gamma-ray and neutrino emissions. Finally, we discuss the contribution of BHXBs to the observed spectrum of Galactic CRs.
△ Less
Submitted 22 June, 2023;
originally announced June 2023.
-
NANCY: Next-generation All-sky Near-infrared Community surveY
Authors:
Jiwon Jesse Han,
Arjun Dey,
Adrian M. Price-Whelan,
Joan Najita,
Edward F. Schlafly,
Andrew Saydjari,
Risa H. Wechsler,
Ana Bonaca,
David J Schlegel,
Charlie Conroy,
Anand Raichoor,
Alex Drlica-Wagner,
Juna A. Kollmeier,
Sergey E. Koposov,
Gurtina Besla,
Hans-Walter Rix,
Alyssa Goodman,
Douglas Finkbeiner,
Abhijeet Anand,
Matthew Ashby,
Benedict Bahr-Kalus,
Rachel Beaton,
Jayashree Behera,
Eric F. Bell,
Eric C Bellm
, et al. (184 additional authors not shown)
Abstract:
The Nancy Grace Roman Space Telescope is capable of delivering an unprecedented all-sky, high-spatial resolution, multi-epoch infrared map to the astronomical community. This opportunity arises in the midst of numerous ground- and space-based surveys that will provide extensive spectroscopy and imaging together covering the entire sky (such as Rubin/LSST, Euclid, UNIONS, SPHEREx, DESI, SDSS-V, GAL…
▽ More
The Nancy Grace Roman Space Telescope is capable of delivering an unprecedented all-sky, high-spatial resolution, multi-epoch infrared map to the astronomical community. This opportunity arises in the midst of numerous ground- and space-based surveys that will provide extensive spectroscopy and imaging together covering the entire sky (such as Rubin/LSST, Euclid, UNIONS, SPHEREx, DESI, SDSS-V, GALAH, 4MOST, WEAVE, MOONS, PFS, UVEX, NEO Surveyor, etc.). Roman can uniquely provide uniform high-spatial-resolution (~0.1 arcsec) imaging over the entire sky, vastly expanding the science reach and precision of all of these near-term and future surveys. This imaging will not only enhance other surveys, but also facilitate completely new science. By imaging the full sky over two epochs, Roman can measure the proper motions for stars across the entire Milky Way, probing 100 times fainter than Gaia out to the very edge of the Galaxy. Here, we propose NANCY: a completely public, all-sky survey that will create a high-value legacy dataset benefiting innumerable ongoing and forthcoming studies of the universe. NANCY is a pure expression of Roman's potential: it images the entire sky, at high spatial resolution, in a broad infrared bandpass that collects as many photons as possible. The majority of all ongoing astronomical surveys would benefit from incorporating observations of NANCY into their analyses, whether these surveys focus on nearby stars, the Milky Way, near-field cosmology, or the broader universe.
△ Less
Submitted 20 June, 2023;
originally announced June 2023.
-
GTC Follow-up Observations of Very Metal-Poor Star Candidates from DESI
Authors:
Carlos Allende Prieto,
David S. Aguado,
Jonay I. González Hernández,
Rafael Rebolo,
Joan Najita,
Christopher J. Manser,
Constance Rockosi,
Zachary Slepian,
Mar Mezcua,
Monica Valluri,
Rana Ezzeddine,
Sergey E. Koposov,
Andrew P. Cooper,
Arjun Dey,
Boris T. Gänsicke,
Ting S. Li,
Katia Cunha,
Siwei Zou,
Jessica Nicole Aguilar,
Steven Ahlen,
David Brooks,
Todd Claybaugh,
Shaun Cole,
Sarah Eftekharzadeh,
Kevin Fanning
, et al. (26 additional authors not shown)
Abstract:
The observations from the Dark Energy Spectroscopic Instrument (DESI) will significantly increase the numbers of known extremely metal-poor stars by a factor of ~ 10, improving the sample statistics to study the early chemical evolution of the Milky Way and the nature of the first stars. In this paper we report high signal-to-noise follow-up observations of 9 metal-poor stars identified during the…
▽ More
The observations from the Dark Energy Spectroscopic Instrument (DESI) will significantly increase the numbers of known extremely metal-poor stars by a factor of ~ 10, improving the sample statistics to study the early chemical evolution of the Milky Way and the nature of the first stars. In this paper we report high signal-to-noise follow-up observations of 9 metal-poor stars identified during the DESI commissioning with the Optical System for Imaging and low-Intermediate-Resolution Integrated Spectroscopy (OSIRIS) instrument on the 10.4m Gran Telescopio Canarias (GTC). The analysis of the data using a well-vetted methodology confirms the quality of the DESI spectra and the performance of the pipelines developed for the data reduction and analysis of DESI data.
△ Less
Submitted 27 October, 2023; v1 submitted 9 June, 2023;
originally announced June 2023.
-
The Early Data Release of the Dark Energy Spectroscopic Instrument
Authors:
DESI Collaboration,
A. G. Adame,
J. Aguilar,
S. Ahlen,
S. Alam,
G. Aldering,
D. M. Alexander,
R. Alfarsy,
C. Allende Prieto,
M. Alvarez,
O. Alves,
A. Anand,
F. Andrade-Oliveira,
E. Armengaud,
J. Asorey,
S. Avila,
A. Aviles,
S. Bailey,
A. Balaguera-Antolínez,
O. Ballester,
C. Baltay,
A. Bault,
J. Bautista,
J. Behera,
S. F. Beltran
, et al. (240 additional authors not shown)
Abstract:
The Dark Energy Spectroscopic Instrument (DESI) completed its five-month Survey Validation in May 2021. Spectra of stellar and extragalactic targets from Survey Validation constitute the first major data sample from the DESI survey. This paper describes the public release of those spectra, the catalogs of derived properties, and the intermediate data products. In total, the public release includes…
▽ More
The Dark Energy Spectroscopic Instrument (DESI) completed its five-month Survey Validation in May 2021. Spectra of stellar and extragalactic targets from Survey Validation constitute the first major data sample from the DESI survey. This paper describes the public release of those spectra, the catalogs of derived properties, and the intermediate data products. In total, the public release includes good-quality spectral information from 466,447 objects targeted as part of the Milky Way Survey, 428,758 as part of the Bright Galaxy Survey, 227,318 as part of the Luminous Red Galaxy sample, 437,664 as part of the Emission Line Galaxy sample, and 76,079 as part of the Quasar sample. In addition, the release includes spectral information from 137,148 objects that expand the scope beyond the primary samples as part of a series of secondary programs. Here, we describe the spectral data, data quality, data products, Large-Scale Structure science catalogs, access to the data, and references that provide relevant background to using these spectra.
△ Less
Submitted 15 June, 2023; v1 submitted 9 June, 2023;
originally announced June 2023.
-
Validation of the Scientific Program for the Dark Energy Spectroscopic Instrument
Authors:
DESI Collaboration,
A. G. Adame,
J. Aguilar,
S. Ahlen,
S. Alam,
G. Aldering,
D. M. Alexander,
R. Alfarsy,
C. Allende Prieto,
M. Alvarez,
O. Alves,
A. Anand,
F. Andrade-Oliveira,
E. Armengaud,
J. Asorey,
S. Avila,
A. Aviles,
S. Bailey,
A. Balaguera-Antolínez,
O. Ballester,
C. Baltay,
A. Bault,
J. Bautista,
J. Behera,
S. F. Beltran
, et al. (239 additional authors not shown)
Abstract:
The Dark Energy Spectroscopic Instrument (DESI) was designed to conduct a survey covering 14,000 deg$^2$ over five years to constrain the cosmic expansion history through precise measurements of Baryon Acoustic Oscillations (BAO). The scientific program for DESI was evaluated during a five month Survey Validation (SV) campaign before beginning full operations. This program produced deep spectra of…
▽ More
The Dark Energy Spectroscopic Instrument (DESI) was designed to conduct a survey covering 14,000 deg$^2$ over five years to constrain the cosmic expansion history through precise measurements of Baryon Acoustic Oscillations (BAO). The scientific program for DESI was evaluated during a five month Survey Validation (SV) campaign before beginning full operations. This program produced deep spectra of tens of thousands of objects from each of the stellar (MWS), bright galaxy (BGS), luminous red galaxy (LRG), emission line galaxy (ELG), and quasar target classes. These SV spectra were used to optimize redshift distributions, characterize exposure times, determine calibration procedures, and assess observational overheads for the five-year program. In this paper, we present the final target selection algorithms, redshift distributions, and projected cosmology constraints resulting from those studies. We also present a `One-Percent survey' conducted at the conclusion of Survey Validation covering 140 deg$^2$ using the final target selection algorithms with exposures of a depth typical of the main survey. The Survey Validation indicates that DESI will be able to complete the full 14,000 deg$^2$ program with spectroscopically-confirmed targets from the MWS, BGS, LRG, ELG, and quasar programs with total sample sizes of 7.2, 13.8, 7.46, 15.7, and 2.87 million, respectively. These samples will allow exploration of the Milky Way halo, clustering on all scales, and BAO measurements with a statistical precision of 0.28% over the redshift interval $z<1.1$, 0.39% over the redshift interval $1.1<z<1.9$, and 0.46% over the redshift interval $1.9<z<3.5$.
△ Less
Submitted 12 January, 2024; v1 submitted 9 June, 2023;
originally announced June 2023.
-
The Near Infrared Imager and Slitless Spectrograph for the James Webb Space Telescope -- I. Instrument Overview and in-Flight Performance
Authors:
Rene Doyon,
C. J Willott,
John B. Hutchings,
Anand Sivaramakrishnan,
Loic Albert,
David Lafreniere,
Neil Rowlands,
M. Begona Vila,
Andre R. Martel,
Stephanie LaMassa,
David Aldridge,
Etienne Artigau,
Peter Cameron,
Pierre Chayer,
Neil J. Cook,
Rachel A. Cooper,
Antoine Darveau-Bernier,
Jean Dupuis,
Colin Earnshaw,
Nestor Espinoza,
Joseph C. Filippazzo,
Alexander W. Fullerton,
Daniel Gaudreau,
Roman Gawlik,
Paul Goudfrooij
, et al. (38 additional authors not shown)
Abstract:
The Near-Infrared Imager and Slitless Spectrograph (NIRISS) is the science module of the Canadian-built Fine Guidance Sensor (FGS) onboard the James Webb Space Telescope (JWST). NIRISS has four observing modes: 1) broadband imaging featuring seven of the eight NIRCam broadband filters, 2) wide-field slitless spectroscopy (WFSS) at a resolving power of $\sim$150 between 0.8 and 2.2 $μ$m, 3) single-…
▽ More
The Near-Infrared Imager and Slitless Spectrograph (NIRISS) is the science module of the Canadian-built Fine Guidance Sensor (FGS) onboard the James Webb Space Telescope (JWST). NIRISS has four observing modes: 1) broadband imaging featuring seven of the eight NIRCam broadband filters, 2) wide-field slitless spectroscopy (WFSS) at a resolving power of $\sim$150 between 0.8 and 2.2 $μ$m, 3) single-object cross-dispersed slitless spectroscopy (SOSS) enabling simultaneous wavelength coverage between 0.6 and 2.8 $μ$m at R$\sim$700, a mode optimized for exoplanet spectroscopy of relatively bright ($J<6.3$) stars and 4) aperture masking interferometry (AMI) between 2.8 and 4.8 $μ$m enabling high-contrast ($\sim10^{-3}-10^{-4}$) imaging at angular separations between 70 and 400 milliarcsec for relatively bright ($M<8$) sources. This paper presents an overview of the NIRISS instrument, its design, its scientific capabilities, and a summary of in-flight performance. NIRISS shows significantly better response shortward of $\sim2.5\,μ$m resulting in 10-40% sensitivity improvement for broadband and low-resolution spectroscopy compared to pre-flight predictions. Two time-series observations performed during instrument commissioning in the SOSS mode yield very stable spectro-photometry performance within $\sim$10% of the expected noise. The first space-based companion detection of the tight binary star AB Dor AC through AMI was demonstrated.
△ Less
Submitted 5 June, 2023;
originally announced June 2023.
-
Non-stationary Gaussian Process Surrogates
Authors:
Annie Sauer,
Andrew Cooper,
Robert B. Gramacy
Abstract:
We provide a survey of non-stationary surrogate models which utilize Gaussian processes (GPs) or variations thereof, including non-stationary kernel adaptations, partition and local GPs, and spatial war**s through deep Gaussian processes. We also overview publicly available software implementations and conclude with a bake-off involving an 8-dimensional satellite drag computer experiment. Code f…
▽ More
We provide a survey of non-stationary surrogate models which utilize Gaussian processes (GPs) or variations thereof, including non-stationary kernel adaptations, partition and local GPs, and spatial war**s through deep Gaussian processes. We also overview publicly available software implementations and conclude with a bake-off involving an 8-dimensional satellite drag computer experiment. Code for this example is provided in a public git repository.
△ Less
Submitted 30 May, 2023;
originally announced May 2023.
-
The James Webb Space Telescope Mission
Authors:
Jonathan P. Gardner,
John C. Mather,
Randy Abbott,
James S. Abell,
Mark Abernathy,
Faith E. Abney,
John G. Abraham,
Roberto Abraham,
Yasin M. Abul-Huda,
Scott Acton,
Cynthia K. Adams,
Evan Adams,
David S. Adler,
Maarten Adriaensen,
Jonathan Albert Aguilar,
Mansoor Ahmed,
Nasif S. Ahmed,
Tanjira Ahmed,
Rüdeger Albat,
Loïc Albert,
Stacey Alberts,
David Aldridge,
Mary Marsha Allen,
Shaune S. Allen,
Martin Altenburg
, et al. (983 additional authors not shown)
Abstract:
Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astrono…
▽ More
Twenty-six years ago a small committee report, building on earlier studies, expounded a compelling and poetic vision for the future of astronomy, calling for an infrared-optimized space telescope with an aperture of at least $4m$. With the support of their governments in the US, Europe, and Canada, 20,000 people realized that vision as the $6.5m$ James Webb Space Telescope. A generation of astronomers will celebrate their accomplishments for the life of the mission, potentially as long as 20 years, and beyond. This report and the scientific discoveries that follow are extended thank-you notes to the 20,000 team members. The telescope is working perfectly, with much better image quality than expected. In this and accompanying papers, we give a brief history, describe the observatory, outline its objectives and current observing program, and discuss the inventions and people who made it possible. We cite detailed reports on the design and the measured performance on orbit.
△ Less
Submitted 10 April, 2023;
originally announced April 2023.
-
Improved OFDM Signal Cancellation through Window Estimation
Authors:
Daniel Chew,
Samuel Berhanu,
Chris Baumgart,
A. Brinton Cooper
Abstract:
The ability to cancel an OFDM signal is important to many wireless communication systems including Power-Domain Non-orthogonal Multiple Access (PD-NOMA), Rate-Splitting Multiple Access (RSMA), and spectrum underlay for dynamic spectrum access. In this paper, we show that estimating the windowing applied at the transmitter is important to that cancellation. Windowing at the transmitter is a popular…
▽ More
The ability to cancel an OFDM signal is important to many wireless communication systems including Power-Domain Non-orthogonal Multiple Access (PD-NOMA), Rate-Splitting Multiple Access (RSMA), and spectrum underlay for dynamic spectrum access. In this paper, we show that estimating the windowing applied at the transmitter is important to that cancellation. Windowing at the transmitter is a popular means to control the bandwidth of an Orthogonal Frequency Division Multiplexed (OFDM) symbol and is overlooked in most literature on OFDM signal cancellation. We show the limitation to the amount of cancellation that can be achieved without knowledge of OFDM windowing. We show that the window can be estimated from received samples alone, and that window estimate can be used to improve the signal cancellation. The window is estimated in the presence of noise and imperfect estimates of the center frequency offset (CFO) and the channel. We conclude with results using synthetic and over-the-air data where we demonstrate a 5.3 dB improvement to OFDM signal cancellation over existing methods in an over-the-air experiment.
△ Less
Submitted 22 March, 2023;
originally announced March 2023.
-
Spin-parities of sub-threshold resonances in the $^{18}$F(p, $α$)$^{15}$O reaction
Authors:
F. Portillo,
R. Longland,
A. L. Cooper,
S. Hunt,
A. M. Laird,
C. Marshall,
K. Setoodehnia
Abstract:
The $^{18}$F(p, $α$)$^{15}$O reaction is key to determining the $^{18}$F abundance in classical novae. However, the cross section for this reaction has large uncertainties at low energies largely caused by interference effects. Here, we resolve a longstanding issue with unknown spin-parities of sub-threshold states in $^{19}$Ne that reduces these uncertainties. The $^{20}$Ne($^3$He, $^4$He)…
▽ More
The $^{18}$F(p, $α$)$^{15}$O reaction is key to determining the $^{18}$F abundance in classical novae. However, the cross section for this reaction has large uncertainties at low energies largely caused by interference effects. Here, we resolve a longstanding issue with unknown spin-parities of sub-threshold states in $^{19}$Ne that reduces these uncertainties. The $^{20}$Ne($^3$He, $^4$He)$^{19}$Ne neutron pick-up reaction was used to populate $^{19}$Ne excited states, focusing on the energy region of astrophysical interest ($\approx$ 6 - 7 MeV). The experiment was performed at the Triangle Universities Nuclear Laboratory using the high resolution Enge split-pole magnetic spectrograph. Spins and parities were found for states in the astrophysical energy range. In particular, the state at 6.133 MeV (E$_{r}^{\text{c.m.}} = -278$ keV) was found to have spin and parity of $3/2^+$ and we confirm the existence of an unresolved doublet close to 6.288 MeV (E$_{r}^{\text{c.m.}} = -120$ keV) with J$^π$ = $1/2^+$ and a high-spin state. Using these results, we demonstrate a significant factor of two decrease in the reaction rate uncertainties at nova temperatures.
△ Less
Submitted 31 March, 2023; v1 submitted 6 March, 2023;
originally announced March 2023.
-
Democratizing Making: Scaffolding Participation Using e-Waste to Engage Under-resourced Communities in Technology Design
Authors:
Dhaval Vyas,
Awais Hameed Khan,
Anabelle Cooper
Abstract:
Maker culture and DIY practices are central to democratizing the design of technology; enabling non-designers (future end-users) to actively participate in the design process. However, little is known about how individuals from under-resourced communities and low socioeconomic status (SES) backgrounds, can practically leverage maker practices to design technology, creating value for themselves or…
▽ More
Maker culture and DIY practices are central to democratizing the design of technology; enabling non-designers (future end-users) to actively participate in the design process. However, little is known about how individuals from under-resourced communities and low socioeconomic status (SES) backgrounds, can practically leverage maker practices to design technology, creating value for themselves or their communities. To investigate this, we collaborated with an e-waste recycling centre, involving 24 participants (staff and low-SES volunteers) in two participatory maker workshop activities. Participants were provided with a generative e-waste toolkit, through which they repurposed e-waste materials and developed novel technology prototypes that created value from their perspectives and agendas. Our findings unpack three factors that influenced their making: balancing personal and community needs; incorporating convenience and productivity; and re-thinking sustainability and connection; and discuss strategies for scaffolding participation and engagement of under-resourced communities in making using an e-waste generative toolkit to democratize technology design.
△ Less
Submitted 20 February, 2023;
originally announced February 2023.
-
Forward Learning with Top-Down Feedback: Empirical and Analytical Characterization
Authors:
Ravi Srinivasan,
Francesca Mignacco,
Martino Sorbaro,
Maria Refinetti,
Avi Cooper,
Gabriel Kreiman,
Giorgia Dellaferrera
Abstract:
"Forward-only" algorithms, which train neural networks while avoiding a backward pass, have recently gained attention as a way of solving the biologically unrealistic aspects of backpropagation. Here, we first address compelling challenges related to the "forward-only" rules, which include reducing the performance gap with backpropagation and providing an analytical understanding of their dynamics…
▽ More
"Forward-only" algorithms, which train neural networks while avoiding a backward pass, have recently gained attention as a way of solving the biologically unrealistic aspects of backpropagation. Here, we first address compelling challenges related to the "forward-only" rules, which include reducing the performance gap with backpropagation and providing an analytical understanding of their dynamics. To this end, we show that the forward-only algorithm with top-down feedback is well-approximated by an "adaptive-feedback-alignment" algorithm, and we analytically track its performance during learning in a prototype high-dimensional setting. Then, we compare different versions of forward-only algorithms, focusing on the Forward-Forward and PEPITA frameworks, and we show that they share the same learning principles. Overall, our work unveils the connections between three key neuro-inspired learning rules, providing a link between "forward-only" algorithms, i.e., Forward-Forward and PEPITA, and an approximation of backpropagation, i.e., Feedback Alignment.
△ Less
Submitted 22 March, 2024; v1 submitted 10 February, 2023;
originally announced February 2023.
-
DAHe white dwarfs from the DESI survey
Authors:
Christopher J. Manser,
Boris T. Gänsicke,
Keith Inight,
Akshay Robert,
S. Ahlen,
C. Allende Prieto,
D. Brooks,
A. P. Cooper,
A. de la Macorra,
A. Font-Ribera,
K. Honscheid,
T. Kisner,
M. Landriau,
Aaron M. Meisner,
R. Miquel,
Jundan Nie,
C. Poppett,
Gregory Tarlé,
Zhimin Zhou
Abstract:
A new class of white dwarfs, dubbed DAHe, that present Zeeman-split Balmer lines in emission has recently emerged. However, the physical origin of these emission lines remains unclear. We present here a sample of 21 newly identified DAHe systems and determine magnetic field strengths and (for a subset) periods which span the ranges of ~ 6.5 -- 147 MG and ~ 0.4 -- 36 h respectively. All but four of…
▽ More
A new class of white dwarfs, dubbed DAHe, that present Zeeman-split Balmer lines in emission has recently emerged. However, the physical origin of these emission lines remains unclear. We present here a sample of 21 newly identified DAHe systems and determine magnetic field strengths and (for a subset) periods which span the ranges of ~ 6.5 -- 147 MG and ~ 0.4 -- 36 h respectively. All but four of these systems were identified from the Dark Energy Spectroscopic Instrument (DESI) survey sample of more than 47000 white dwarf candidates observed during its first year of observations. We present detailed analysis of the new DAHe WDJ161634.36+541011.51 with a spin period of 95.3 min, which exhibits an anti-correlation between broadband flux and Balmer line strength that is typically observed for this class of systems. All DAHe systems cluster closely on the Gaia Hertzsprung-Russell diagram where they represent ~ 1 per cent of white dwarfs within that region. This grou** further solidifies their unexplained emergence at relatively late cooling times and we discuss this in context of current formation theories. Nine of the new DAHe systems are identifiable from SDSS spectra of white dwarfs that had been previously classified as featureless DC-type systems. We suggest high S/N, unbiased observations of DCs as a possible route for discovering additional DAHe systems.
△ Less
Submitted 8 March, 2023; v1 submitted 2 February, 2023;
originally announced February 2023.
-
Coordinating Distributed Example Orders for Provably Accelerated Training
Authors:
A. Feder Cooper,
Wentao Guo,
Khiem Pham,
Tiancheng Yuan,
Charlie F. Ruan,
Yucheng Lu,
Christopher De Sa
Abstract:
Recent research on online Gradient Balancing (GraB) has revealed that there exist permutation-based example orderings for SGD that are guaranteed to outperform random reshuffling (RR). Whereas RR arbitrarily permutes training examples, GraB leverages stale gradients from prior epochs to order examples -- achieving a provably faster convergence rate than RR. However, GraB is limited by design: whil…
▽ More
Recent research on online Gradient Balancing (GraB) has revealed that there exist permutation-based example orderings for SGD that are guaranteed to outperform random reshuffling (RR). Whereas RR arbitrarily permutes training examples, GraB leverages stale gradients from prior epochs to order examples -- achieving a provably faster convergence rate than RR. However, GraB is limited by design: while it demonstrates an impressive ability to scale-up training on centralized data, it does not naturally extend to modern distributed ML workloads. We therefore propose Coordinated Distributed GraB (CD-GraB), which uses insights from prior work on kernel thinning to translate the benefits of provably faster permutation-based example ordering to distributed settings. With negligible overhead, CD-GraB exhibits a linear speedup in convergence rate over centralized GraB and outperforms distributed RR on a variety of benchmark tasks.
△ Less
Submitted 21 December, 2023; v1 submitted 1 February, 2023;
originally announced February 2023.
-
Arbitrariness and Social Prediction: The Confounding Role of Variance in Fair Classification
Authors:
A. Feder Cooper,
Katherine Lee,
Madiha Zahrah Choksi,
Solon Barocas,
Christopher De Sa,
James Grimmelmann,
Jon Kleinberg,
Siddhartha Sen,
Baobao Zhang
Abstract:
Variance in predictions across different trained models is a significant, under-explored source of error in fair binary classification. In practice, the variance on some data examples is so large that decisions can be effectively arbitrary. To investigate this problem, we take an experimental approach and make four overarching contributions: We: 1) Define a metric called self-consistency, derived…
▽ More
Variance in predictions across different trained models is a significant, under-explored source of error in fair binary classification. In practice, the variance on some data examples is so large that decisions can be effectively arbitrary. To investigate this problem, we take an experimental approach and make four overarching contributions: We: 1) Define a metric called self-consistency, derived from variance, which we use as a proxy for measuring and reducing arbitrariness; 2) Develop an ensembling algorithm that abstains from classification when a prediction would be arbitrary; 3) Conduct the largest to-date empirical study of the role of variance (vis-a-vis self-consistency and arbitrariness) in fair binary classification; and, 4) Release a toolkit that makes the US Home Mortgage Disclosure Act (HMDA) datasets easily usable for future research. Altogether, our experiments reveal shocking insights about the reliability of conclusions on benchmark datasets. Most fair binary classification benchmarks are close-to-fair when taking into account the amount of arbitrariness present in predictions -- before we even try to apply any fairness interventions. This finding calls into question the practical utility of common algorithmic fairness methods, and in turn suggests that we should reconsider how we choose to measure fairness in binary classification.
△ Less
Submitted 6 March, 2024; v1 submitted 27 January, 2023;
originally announced January 2023.
-
Cross-validatory model selection for Bayesian autoregressions with exogenous regressors
Authors:
Alex Cooper,
Dan Simpson,
Lauren Kennedy,
Catherine Forbes,
Aki Vehtari
Abstract:
Bayesian cross-validation (CV) is a popular method for predictive model assessment that is simple to implement and broadly applicable. A wide range of CV schemes is available for time series applications, including generic leave-one-out (LOO) and K-fold methods, as well as specialized approaches intended to deal with serial dependence such as leave-future-out (LFO), h-block, and hv-block.
Existi…
▽ More
Bayesian cross-validation (CV) is a popular method for predictive model assessment that is simple to implement and broadly applicable. A wide range of CV schemes is available for time series applications, including generic leave-one-out (LOO) and K-fold methods, as well as specialized approaches intended to deal with serial dependence such as leave-future-out (LFO), h-block, and hv-block.
Existing large-sample results show that both specialized and generic methods are applicable to models of serially-dependent data. However, large sample consistency results overlook the impact of sampling variability on accuracy in finite samples. Moreover, the accuracy of a CV scheme depends on many aspects of the procedure. We show that poor design choices can lead to elevated rates of adverse selection.
In this paper, we consider the problem of identifying the regression component of an important class of models of data with serial dependence, autoregressions of order p with q exogenous regressors (ARX(p,q)), under the logarithmic scoring rule. We show that when serial dependence is present, scores computed using the joint (multivariate) density have lower variance and better model selection accuracy than the popular pointwise estimator. In addition, we present a detailed case study of the special case of ARX models with fixed autoregressive structure and variance. For this class, we derive the finite-sample distribution of the CV estimators and the model selection statistic. We conclude with recommendations for practitioners.
△ Less
Submitted 10 October, 2023; v1 submitted 19 January, 2023;
originally announced January 2023.
-
The JWST Early Release Science Program for Direct Observations of Exoplanetary Systems: Best Practices for Data Collection in Cycle 2 and Beyond
Authors:
Sasha Hinkley,
Beth Biller,
Andrew Skemer,
Aarynn L. Carter,
Julien Girard,
Dean Hines,
Jens Kammerer,
Jarron Leisenring,
William Balmer,
Elodie Choquet,
Maxwell A. Millar-Blanchaer,
Marshall Perrin,
Laurent Pueyo,
Jason Wang,
Kimberly Ward-Duong,
Anthony Boccaletti,
Brittany Miles,
Polychronis Patapis,
Isabel Rebollido,
Emily Rickman,
B. Sargent,
Kadin Worthen,
Kielan Hoch,
Christine Chen,
Stephanie Sallum
, et al. (13 additional authors not shown)
Abstract:
We present a set of recommended best practices for JWST data collection for members of the community focussed on the direct imaging and spectroscopy of exoplanetary systems. These findings and recommendations are based on the early analysis of the JWST Early Release Science Program 1386, "High-Contrast Imaging of Exoplanets and Exoplanetary Systems with JWST." Our goal is for this information to b…
▽ More
We present a set of recommended best practices for JWST data collection for members of the community focussed on the direct imaging and spectroscopy of exoplanetary systems. These findings and recommendations are based on the early analysis of the JWST Early Release Science Program 1386, "High-Contrast Imaging of Exoplanets and Exoplanetary Systems with JWST." Our goal is for this information to be useful for observers in preparation of JWST proposals for Cycle 2 and beyond. In addition to compiling a set of best practices from our ERS program, in a few cases we also draw on the expertise gained within the instrument commissioning programs, as well as include a handful of data processing best practices. We anticipate that this document will be regularly updated and resubmitted to arXiv.org to ensure that we have distributed our knowledge of best-practices for data collection as widely and efficiently as possible.
△ Less
Submitted 25 January, 2023; v1 submitted 17 January, 2023;
originally announced January 2023.