-
High-dimensional multiple imputation (HDMI) for partially observed confounders including natural language processing-derived auxiliary covariates
Authors:
Janick Weberpals,
Pamela A. Shaw,
Kueiyu Joshua Lin,
Richard Wyss,
Joseph M Plasek,
Li Zhou,
Kerry Ngan,
Thomas DeRamus,
Sudha R. Raman,
Bradley G. Hammill,
Hana Lee,
Sengwee Toh,
John G. Connolly,
Kimberly J. Dandreo,
Fang Tian,
Wei Liu,
Jie Li,
José J. Hernández-Muñoz,
Sebastian Schneeweiss,
Rishi J. Desai
Abstract:
Multiple imputation (MI) models can be improved by including auxiliary covariates (AC), but their performance in high-dimensional data is not well understood. We aimed to develop and compare high-dimensional MI (HDMI) approaches using structured and natural language processing (NLP)-derived AC in studies with partially observed confounders. We conducted a plasmode simulation study using data from…
▽ More
Multiple imputation (MI) models can be improved by including auxiliary covariates (AC), but their performance in high-dimensional data is not well understood. We aimed to develop and compare high-dimensional MI (HDMI) approaches using structured and natural language processing (NLP)-derived AC in studies with partially observed confounders. We conducted a plasmode simulation study using data from opioid vs. non-steroidal anti-inflammatory drug (NSAID) initiators (X) with observed serum creatinine labs (Z2) and time-to-acute kidney injury as outcome. We simulated 100 cohorts with a null treatment effect, including X, Z2, atrial fibrillation (U), and 13 other investigator-derived confounders (Z1) in the outcome generation. We then imposed missingness (MZ2) on 50% of Z2 measurements as a function of Z2 and U and created different HDMI candidate AC using structured and NLP-derived features. We mimicked scenarios where U was unobserved by omitting it from all AC candidate sets. Using LASSO, we data-adaptively selected HDMI covariates associated with Z2 and MZ2 for MI, and with U to include in propensity score models. The treatment effect was estimated following propensity score matching in MI datasets and we benchmarked HDMI approaches against a baseline imputation and complete case analysis with Z1 only. HDMI using claims data showed the lowest bias (0.072). Combining claims and sentence embeddings led to an improvement in the efficiency displaying the lowest root-mean-squared-error (0.173) and coverage (94%). NLP-derived AC alone did not perform better than baseline MI. HDMI approaches may decrease bias in studies with partially observed confounders where missingness depends on unobserved factors.
△ Less
Submitted 17 May, 2024;
originally announced May 2024.
-
Probabilistic Forward Modeling of Galaxy Catalogs with Normalizing Flows
Authors:
John Franklin Crenshaw,
J. Bryce Kalmbach,
Alexander Gagliano,
Ziang Yan,
Andrew J. Connolly,
Alex I. Malz,
Samuel J. Schmidt,
The LSST Dark Energy Science Collaboration
Abstract:
Evaluating the accuracy and calibration of the redshift posteriors produced by photometric redshift (photo-z) estimators is vital for enabling precision cosmology and extragalactic astrophysics with modern wide-field photometric surveys. Evaluating photo-z posteriors on a per-galaxy basis is difficult, however, as real galaxies have a true redshift but not a true redshift posterior. We introduce P…
▽ More
Evaluating the accuracy and calibration of the redshift posteriors produced by photometric redshift (photo-z) estimators is vital for enabling precision cosmology and extragalactic astrophysics with modern wide-field photometric surveys. Evaluating photo-z posteriors on a per-galaxy basis is difficult, however, as real galaxies have a true redshift but not a true redshift posterior. We introduce PZFlow, a Python package for the probabilistic forward modeling of galaxy catalogs with normalizing flows. For catalogs simulated with PZFlow, there is a natural notion of "true" redshift posteriors that can be used for photo-z validation. We use PZFlow to simulate a photometric galaxy catalog where each galaxy has a redshift, noisy photometry, shape information, and a true redshift posterior. We also demonstrate the use of an ensemble of normalizing flows for photo-z estimation. We discuss how PZFlow will be used to validate the photo-z estimation pipeline of the Dark Energy Science Collaboration (DESC), and the wider applicability of PZFlow for statistical modeling of any tabular data.
△ Less
Submitted 7 May, 2024;
originally announced May 2024.
-
Using AI for Wavefront Estimation with the Rubin Observatory Active Optics System
Authors:
John Franklin Crenshaw,
Andrew J. Connolly,
Joshua E. Meyers,
J. Bryce Kalmbach,
Guillem Megias Homar,
Tiago Ribeiro,
Krzysztof Suberlak,
Sandrine Thomas,
Te-wei Tsai
Abstract:
The Vera C. Rubin Observatory will, over a period of 10 years, repeatedly survey the southern sky. To ensure that images generated by Rubin meet the quality requirements for precision science, the observatory will use an Active Optics System (AOS) to correct for alignment and mirror surface perturbations introduced by gravity and temperature gradients in the optical system. To accomplish this Rubi…
▽ More
The Vera C. Rubin Observatory will, over a period of 10 years, repeatedly survey the southern sky. To ensure that images generated by Rubin meet the quality requirements for precision science, the observatory will use an Active Optics System (AOS) to correct for alignment and mirror surface perturbations introduced by gravity and temperature gradients in the optical system. To accomplish this Rubin will use out-of-focus images from sensors located at the edge of the focal plane to learn and correct for perturbations to the wavefront. We have designed and integrated a deep learning model for wavefront estimation into the AOS pipeline. In this paper, we compare the performance of this deep learning approach to Rubin's baseline algorithm when applied to images from two different simulations of the Rubin optical system. We show the deep learning approach is faster and more accurate, achieving the atmospheric error floor both for high-quality images, and low-quality images with heavy blending and vignetting. Compared to the baseline algorithm, the deep learning model is 40x faster, the median error 2x better under ideal conditions, 5x better in the presence of vignetting by the Rubin camera, and 14x better in the presence of blending in crowded fields. In addition, the deep learning model surpasses the required optical quality in simulations of the AOS closed loop. This system promises to increase the survey area useful for precision science by up to 8%. We discuss how this system might be deployed when commissioning and operating Rubin.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
The DECam Ecliptic Exploration Project (DEEP) VI: first multi-year observations of trans-Neptunian objects
Authors:
Hayden Smotherman,
Pedro H. Bernardinelli,
Stephen K. N. Portillo,
Andrew J. Connolly,
J. Bryce Kalmbach,
Steven Stetzler,
Mario Juric,
Dino Bektesvic,
Zachary Langford,
Fred C. Adams,
William J. Oldroyd,
Matthew J. Holman,
Colin Orion Chandler,
Cesar Fuentes,
David W. Gerdes,
Hsing Wen Lin,
Larissa Markwardt,
Andrew McNeill,
Michael Mommert,
Kevin J. Napier,
Matthew J. Payne,
Darin Ragozzine,
Andrew S. Rivkin,
Hilke Schlichting,
Scott S. Sheppard
, et al. (3 additional authors not shown)
Abstract:
We present the first set of trans-Neptunian objects (TNOs) observed on multiple nights in data taken from the DECam Ecliptic Exploration Project (DEEP). Of these 110 TNOs, 105 do not coincide with previously known TNOs and appear to be new discoveries. Each individual detection for our objects resulted from a digital tracking search at TNO rates of motion, using two to four hour exposure sets, and…
▽ More
We present the first set of trans-Neptunian objects (TNOs) observed on multiple nights in data taken from the DECam Ecliptic Exploration Project (DEEP). Of these 110 TNOs, 105 do not coincide with previously known TNOs and appear to be new discoveries. Each individual detection for our objects resulted from a digital tracking search at TNO rates of motion, using two to four hour exposure sets, and the detections were subsequently linked across multiple observing seasons. This procedure allows us to find objects with magnitudes $m_{VR} \approx 26$. The object discovery processing also included a comprehensive population of objects injected into the images, with a recovery and linking rate of at least $94\%$. The final orbits were obtained using a specialized orbit fitting procedure that accounts for the positional errors derived from the digital tracking procedure. Our results include robust orbits and magnitudes for classical TNOs with absolute magnitudes $H \sim 10$, as well as a dynamically detached object found at 76 au (semi-major axis $a\approx 77 \, \mathrm{au}$). We find a disagreement between our population of classical TNOs and the CFEPS-L7 three component model for the Kuiper belt.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
The DECam Ecliptic Exploration Project (DEEP) III: Survey characterization and simulation methods
Authors:
Pedro H. Bernardinelli,
Hayden Smotherman,
Zachary Langford,
Stephen K. N. Portillo,
Andrew J. Connolly,
J. Bryce Kalmbach,
Steven Stetzler,
Mario Juric,
William J. Oldroyd,
Hsing Wen Lin,
Fred C. Adams,
Colin Orion Chandler,
Cesar Fuentes,
David W. Gerdes,
Matthew J. Holman,
Larissa Markwardt,
Andrew McNeill,
Michael Mommert,
Kevin J. Napier,
Matthew J. Payne,
Darin Ragozzine,
Andrew S. Rivkin,
Hilke Schlichting,
Scott S. Sheppard,
Ryder Strauss
, et al. (2 additional authors not shown)
Abstract:
We present a detailed study of the observational biases of the DECam Ecliptic Exploration Project's (DEEP) B1 data release and survey simulation software that enables direct statistical comparisons between models and our data. We inject a synthetic population of objects into the images, and then subsequently recover them in the same processing as our real detections. This enables us to characteriz…
▽ More
We present a detailed study of the observational biases of the DECam Ecliptic Exploration Project's (DEEP) B1 data release and survey simulation software that enables direct statistical comparisons between models and our data. We inject a synthetic population of objects into the images, and then subsequently recover them in the same processing as our real detections. This enables us to characterize the survey's completeness as a function of apparent magnitudes and on-sky rates of motion. We study the statistically optimal functional form for the magnitude, and develop a methodology that can estimate the magnitude and rate efficiencies for all survey's pointing groups simultaneously. We have determined that our peak completeness is on average 80\% in each pointing group, and our magnitude drops to $25\%$ of this value at $m_{25} = 26.22$. We describe the freely available survey simulation software and its methodology. We conclude by using it to infer that our effective search area for objects at 40 au is $14.8°^2$, and that our lack of dynamically cold distant objects means that there at most $8\times 10^3$ objects with $60 < a < 80$ au and absolute magnitudes $H \leq 8$.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
The DECam Ecliptic Exploration Project (DEEP): V. The Absolute Magnitude Distribution of the Cold Classical Kuiper Belt
Authors:
Kevin J. Napier,
Hsing-Wen Lin,
David W. Gerdes,
Fred C. Adams,
Anna M. Simpson,
Matthew W. Porter,
Katherine G. Weber,
Larissa Markwardt,
Gabriel Gowman,
Hayden Smotherman,
Pedro H. Bernardinelli,
Mario Jurić,
Andrew J. Connolly,
J. Bryce Kalmbach,
Stephen K. N. Portillo,
David E. Trilling,
Ryder Strauss,
William J. Oldroyd,
Chadwick A. Trujillo,
Colin Orion Chandler,
Matthew J. Holman,
Hilke E. Schlichting,
Andrew McNeill,
the DEEP Collaboration
Abstract:
The DECam Ecliptic Exploration Project (DEEP) is a deep survey of the trans-Neptunian solar system being carried out on the 4-meter Blanco telescope at Cerro Tololo Inter-American Observatory in Chile using the Dark Energy Camera (DECam). By using a shift-and-stack technique to achieve a mean limiting magnitude of $r \sim 26.2$, DEEP achieves an unprecedented combination of survey area and depth,…
▽ More
The DECam Ecliptic Exploration Project (DEEP) is a deep survey of the trans-Neptunian solar system being carried out on the 4-meter Blanco telescope at Cerro Tololo Inter-American Observatory in Chile using the Dark Energy Camera (DECam). By using a shift-and-stack technique to achieve a mean limiting magnitude of $r \sim 26.2$, DEEP achieves an unprecedented combination of survey area and depth, enabling quantitative leaps forward in our understanding of the Kuiper Belt populations. This work reports results from an analysis of twenty 3 sq.\ deg.\ DECam fields along the invariable plane. We characterize the efficiency and false-positive rates for our moving-object detection pipeline, and use this information to construct a Bayesian signal probability for each detected source. This procedure allows us to treat all of our Kuiper Belt Object (KBO) detections statistically, simultaneously accounting for efficiency and false positives. We detect approximately 2300 candidate sources with KBO-like motion at S/N $>6.5$. We use a subset of these objects to compute the luminosity function of the Kuiper Belt as a whole, as well as the Cold Classical (CC) population. We also investigate the absolute magnitude ($H$) distribution of the CCs, and find consistency with both an exponentially tapered power-law, which is predicted by streaming instability models of planetesimal formation, and a rolling power law. Finally, we provide an updated mass estimate for the Cold Classical Kuiper Belt of $M_{CC}(H_r < 12) = 0.0017^{+0.0010}_{-0.0004} M_{\oplus}$, assuming albedo $p = 0.15$ and density $ρ= 1$ g cm$^{-3}$.
△ Less
Submitted 18 September, 2023;
originally announced September 2023.
-
A Brief Wellbeing Training Session Delivered by a Humanoid Social Robot: A Pilot Randomized Controlled Trial
Authors:
Nicole Robinson,
Jennifer Connolly,
Gavin Suddrey,
David J. Kavanagh
Abstract:
Mental health and psychological distress are rising in adults, showing the importance of wellbeing promotion, support, and technique practice that is effective and accessible. Interactive social robots have been tested to deliver health programs but have not been explored to deliver wellbeing technique training in detail. A pilot randomised controlled trial was conducted to explore the feasibility…
▽ More
Mental health and psychological distress are rising in adults, showing the importance of wellbeing promotion, support, and technique practice that is effective and accessible. Interactive social robots have been tested to deliver health programs but have not been explored to deliver wellbeing technique training in detail. A pilot randomised controlled trial was conducted to explore the feasibility of an autonomous humanoid social robot to deliver a brief mindful breathing technique to promote information around wellbeing. It contained two conditions: brief technique training (Technique) and control designed to represent a simple wait-list activity to represent a relationship-building discussion (Simple Rapport). This trial also explored willingness to discuss health-related topics with a robot. Recruitment uptake rate through convenience sampling was high (53%). A total of 230 participants took part (mean age = 29 years) with 71% being higher education students. There were moderate ratings of technique enjoyment, perceived usefulness, and likelihood to repeat the technique again. Interaction effects were found across measures with scores varying across gender and distress levels. Males with high distress and females with low distress who received the simple rapport activity reported greater comfort to discuss non-health topics than males with low distress and females with high distress. This trial marks a notable step towards the design and deployment of an autonomous wellbeing intervention to investigate the impact of a brief robot-delivered mindfulness training program for a sub-clinical population.
△ Less
Submitted 11 August, 2023;
originally announced August 2023.
-
Estimation of the non-linear parameter in Generalised Diversity-Interactions models is unaffected by change in structure of the interaction terms
Authors:
Rishabh Vishwakarma,
Laura Byrne,
John Connolly,
Rafael de Andrade Moral,
Caroline Brophy
Abstract:
Research over the past few decades has assumed the richness (number of species) to be the primary driver of the biodiversity and ecosystem function (BEF) relationship. However, biodiversity is multi-dimensional, and richness alone does not capture all its attributes. Diversity-Interactions modelling is a regression-based framework that models the biodiversity and ecosystem function relationship by…
▽ More
Research over the past few decades has assumed the richness (number of species) to be the primary driver of the biodiversity and ecosystem function (BEF) relationship. However, biodiversity is multi-dimensional, and richness alone does not capture all its attributes. Diversity-Interactions modelling is a regression-based framework that models the biodiversity and ecosystem function relationship by incorporating species-specific effects along with the interactions between species. The species interactions in a Diversity-Interactions model can take several different forms ranging in complexity from a single interaction term (assuming all pairs of species interact in the same way) to many interaction terms (e.g. assuming a separate interaction for all pairs of species). The specification of the interactions may also include a non-linear parameter (θ) as an exponent to the product of the species proportions to allow for non-linear relationship between the response and species interactions, giving rise to Generalized Diversity-Interactions modelling. The structure of the interaction terms describes the underlying biological processes and thus the selection of a correct structure is important. The inclusion of θ introduces complexity to the model selection process: we could (a) select the interaction structure first by assuming θ=1 and then estimate θ, or (b) estimate θ first and then select the appropriate interaction structure by fixing θ, or (c) test for θ and its inclusion for each interaction structure. It is also unknown whether θ is robust to changes in the underlying interaction structure. Using a simulation study, we test the robustness of θ and compare multiple model selection approaches to identify an optimal and computationally efficient model selection procedure for Generalized Diversity-Interactions models.
△ Less
Submitted 18 January, 2023;
originally announced January 2023.
-
Going beyond richness: Modelling the BEF relationship using species identity, evenness, richness and species interactions via the DImodels R package
Authors:
Rafael A. Moral,
Rishabh Vishwakarma,
John Connolly,
Laura Byrne,
Catherine Hurley,
John A. Finn,
Caroline Brophy
Abstract:
BEF studies aim to understand how ecosystems respond to a gradient of species diversity. Diversity-Interactions (DI) models are suitable for analysing the BEF relationship. These models relate an ecosystem function response of a community to the identity of the species in the community, their evenness (proportions) and interactions. The number of species in the community (richness) is also implici…
▽ More
BEF studies aim to understand how ecosystems respond to a gradient of species diversity. Diversity-Interactions (DI) models are suitable for analysing the BEF relationship. These models relate an ecosystem function response of a community to the identity of the species in the community, their evenness (proportions) and interactions. The number of species in the community (richness) is also implicitly modelled through this approach. It is common in BEF studies to model an ecosystem function as a function of richness; while this can uncover trends in the BEF relationship, by definition, species diversity is much broader than richness alone, and important patterns in the BEF relationship may remain hidden. In this paper, we introduce the DImodels R package for implementing DI models. We also compare DI models to traditional modelling approaches to highlight the advantages of using a multi-dimensional definition of species diversity. We show that using DI models can lead to considerably improved model fit over other methods; it does this by incorporating variation due to the multiple facets of species diversity. Predicting from a DI model is not limited to the study design points, the model can extrapolate to predict for any species composition and proportions (assuming there is sufficient coverage of this space in the study design). Expressing the BEF relationship as a function of richness alone can be useful to capture overall trends. However, collapsing the multiple dimensions of species diversity to a single dimension (such as richness) can result in valuable ecological information being lost. DI modelling provides a framework to test the multiple components of species diversity in the BEF relationship. It facilitates uncovering a deeper ecological understanding of the BEF relationship and can lead to enhanced inference.
△ Less
Submitted 5 May, 2023; v1 submitted 9 December, 2022;
originally announced December 2022.
-
From Data to Software to Science with the Rubin Observatory LSST
Authors:
Katelyn Breivik,
Andrew J. Connolly,
K. E. Saavik Ford,
Mario Jurić,
Rachel Mandelbaum,
Adam A. Miller,
Dara Norman,
Knut Olsen,
William O'Mullane,
Adrian Price-Whelan,
Timothy Sacco,
J. L. Sokoloski,
Ashley Villar,
Viviana Acquaviva,
Tomas Ahumada,
Yusra AlSayyad,
Catarina S. Alves,
Igor Andreoni,
Timo Anguita,
Henry J. Best,
Federica B. Bianco,
Rosaria Bonito,
Andrew Bradshaw,
Colin J. Burke,
Andresa Rodrigues de Campos
, et al. (75 additional authors not shown)
Abstract:
The Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST) dataset will dramatically alter our understanding of the Universe, from the origins of the Solar System to the nature of dark matter and dark energy. Much of this research will depend on the existence of robust, tested, and scalable algorithms, software, and services. Identifying and develo** such tools ahead of time has the po…
▽ More
The Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST) dataset will dramatically alter our understanding of the Universe, from the origins of the Solar System to the nature of dark matter and dark energy. Much of this research will depend on the existence of robust, tested, and scalable algorithms, software, and services. Identifying and develo** such tools ahead of time has the potential to significantly accelerate the delivery of early science from LSST. Develo** these collaboratively, and making them broadly available, can enable more inclusive and equitable collaboration on LSST science.
To facilitate such opportunities, a community workshop entitled "From Data to Software to Science with the Rubin Observatory LSST" was organized by the LSST Interdisciplinary Network for Collaboration and Computing (LINCC) and partners, and held at the Flatiron Institute in New York, March 28-30th 2022. The workshop included over 50 in-person attendees invited from over 300 applications. It identified seven key software areas of need: (i) scalable cross-matching and distributed joining of catalogs, (ii) robust photometric redshift determination, (iii) software for determination of selection functions, (iv) frameworks for scalable time-series analyses, (v) services for image access and reprocessing at scale, (vi) object image access (cutouts) and analysis at scale, and (vii) scalable job execution systems.
This white paper summarizes the discussions of this workshop. It considers the motivating science use cases, identified cross-cutting algorithms, software, and services, their high-level technical specifications, and the principles of inclusive collaborations needed to develop them. We provide it as a useful roadmap of needs, as well as to spur action and collaboration between groups and individuals looking to develop reusable software for early LSST science.
△ Less
Submitted 4 August, 2022;
originally announced August 2022.
-
MUSSES2020J: The Earliest Discovery of a Fast Blue Ultraluminous Transient at Redshift 1.063
Authors:
Ji-an Jiang,
Naoki Yasuda,
Keiichi Maeda,
Nozomu Tominaga,
Mamoru Doi,
Željko Ivezić,
Peter Yoachim,
Kohki Uno,
Takashi J. Moriya,
Brajesh Kumar,
Yen-Chen Pan,
Masayuki Tanaka,
Masaomi Tanaka,
Ken'ichi Nomoto,
Saurabh W. Jha,
Pilar Ruiz-Lapuente,
David Jones,
Toshikazu Shigeyama,
Nao Suzuki,
Mitsuru Kokubo,
Hisanori Furusawa,
Satoshi Miyazaki,
Andrew J. Connolly,
D. K. Sahu,
G. C. Anupama
Abstract:
In this Letter, we report the discovery of an ultraluminous fast-evolving transient in rest-frame UV wavelengths, MUSSES2020J, soon after its occurrence by using the Hyper Suprime-Cam (HSC) mounted on the 8.2 m Subaru telescope. The rise time of about 5 days with an extremely high UV peak luminosity shares similarities to a handful of fast blue optical transients whose peak luminosities are compar…
▽ More
In this Letter, we report the discovery of an ultraluminous fast-evolving transient in rest-frame UV wavelengths, MUSSES2020J, soon after its occurrence by using the Hyper Suprime-Cam (HSC) mounted on the 8.2 m Subaru telescope. The rise time of about 5 days with an extremely high UV peak luminosity shares similarities to a handful of fast blue optical transients whose peak luminosities are comparable with the most luminous supernovae while their timescales are significantly shorter (hereafter "fast blue ultraluminous transient," FBUT). In addition, MUSSES2020J is located near the center of a normal low-mass galaxy at a redshift of 1.063, suggesting a possible connection between the energy source of MUSSES2020J and the central part of the host galaxy. Possible physical mechanisms powering this extreme transient such as a wind-driven tidal disruption event and an interaction between supernova and circumstellar material are qualitatively discussed based on the first multiband early-phase light curve of FBUTs, although whether the scenarios can quantitatively explain the early photometric behavior of MUSSES2020J requires systematical theoretical investigations. Thanks to the ultrahigh luminosity in UV and blue optical wavelengths of these extreme transients, a promising number of FBUTs from the local to the high-z universe can be discovered through deep wide-field optical surveys in the near future.
△ Less
Submitted 10 June, 2022; v1 submitted 30 May, 2022;
originally announced May 2022.
-
Semi-Markov Offline Reinforcement Learning for Healthcare
Authors:
Mehdi Fatemi,
Mary Wu,
Jeremy Petch,
Walter Nelson,
Stuart J. Connolly,
Alexander Benz,
Anthony Carnicelli,
Marzyeh Ghassemi
Abstract:
Reinforcement learning (RL) tasks are typically framed as Markov Decision Processes (MDPs), assuming that decisions are made at fixed time intervals. However, many applications of great importance, including healthcare, do not satisfy this assumption, yet they are commonly modelled as MDPs after an artificial resha** of the data. In addition, most healthcare (and similar) problems are offline by…
▽ More
Reinforcement learning (RL) tasks are typically framed as Markov Decision Processes (MDPs), assuming that decisions are made at fixed time intervals. However, many applications of great importance, including healthcare, do not satisfy this assumption, yet they are commonly modelled as MDPs after an artificial resha** of the data. In addition, most healthcare (and similar) problems are offline by nature, allowing for only retrospective studies. To address both challenges, we begin by discussing the Semi-MDP (SMDP) framework, which formally handles actions of variable timings. We next present a formal way to apply SMDP modifications to nearly any given value-based offline RL method. We use this theory to introduce three SMDP-based offline RL algorithms, namely, SDQN, SDDQN, and SBCQ. We then experimentally demonstrate that only these SMDP-based algorithms learn the optimal policy in variable-time environments, whereas their MDP counterparts do not. Finally, we apply our new algorithms to a real-world offline dataset pertaining to warfarin dosing for stroke prevention and demonstrate similar results.
△ Less
Submitted 20 March, 2022; v1 submitted 17 March, 2022;
originally announced March 2022.
-
Machine Learning and Cosmology
Authors:
Cora Dvorkin,
Siddharth Mishra-Sharma,
Brian Nord,
V. Ashley Villar,
Camille Avestruz,
Keith Bechtol,
Aleksandra Ćiprijanović,
Andrew J. Connolly,
Lehman H. Garrison,
Gautham Narayan,
Francisco Villaescusa-Navarro
Abstract:
Methods based on machine learning have recently made substantial inroads in many corners of cosmology. Through this process, new computational tools, new perspectives on data collection, model development, analysis, and discovery, as well as new communities and educational pathways have emerged. Despite rapid progress, substantial potential at the intersection of cosmology and machine learning rem…
▽ More
Methods based on machine learning have recently made substantial inroads in many corners of cosmology. Through this process, new computational tools, new perspectives on data collection, model development, analysis, and discovery, as well as new communities and educational pathways have emerged. Despite rapid progress, substantial potential at the intersection of cosmology and machine learning remains untapped. In this white paper, we summarize current and ongoing developments relating to the application of machine learning within cosmology and provide a set of recommendations aimed at maximizing the scientific impact of these burgeoning tools over the coming decade through both technical development as well as the fostering of emerging communities.
△ Less
Submitted 15 March, 2022;
originally announced March 2022.
-
Sifting Through the Static: Moving Object Detection in Difference Images
Authors:
Hayden Smotherman,
Andrew J. Connolly,
J. Bryce Kalmbach,
Stephen K. N. Portillo,
Dino Bektesevic,
Siegfried Eggl,
Mario Juric,
Joachim Moeyens,
Peter J. Whidden
Abstract:
Trans-Neptunian Objects (TNOs) provide a window into the history of the Solar System, but they can be challenging to observe due to their distance from the Sun and relatively low brightness. Here we report the detection of 75 moving objects that we could not link to any other known objects, the faintest of which has a VR magnitude of $25.02 \pm 0.93$ using the KBMOD platform. We recover an additio…
▽ More
Trans-Neptunian Objects (TNOs) provide a window into the history of the Solar System, but they can be challenging to observe due to their distance from the Sun and relatively low brightness. Here we report the detection of 75 moving objects that we could not link to any other known objects, the faintest of which has a VR magnitude of $25.02 \pm 0.93$ using the KBMOD platform. We recover an additional 24 sources with previously-known orbits. We place constraints on the barycentric distance, inclination, and longitude of ascending node of these objects. The unidentified objects have a median barycentric distance of 41.28 au, placing them in the outer Solar System. The observed inclination and magnitude distribution of all detected objects is consistent with previously published KBO distributions. We describe extensions to KBMOD, including a robust percentile-based lightcurve filter, an in-line graphics processing unit (GPU) filter, new coadded stamp generation, and a convolutional neural network (CNN) stamp filter, which allow KBMOD to take advantage of difference images. These enchancements mark a significant improvement in the readiness of KBMOD for deployment on future big data surveys such as LSST.
△ Less
Submitted 7 September, 2021;
originally announced September 2021.
-
Characterizing accelerated precipitation in proton irradiated steel
Authors:
Mark Laver,
Brian J. Connolly,
Christopher Cooper,
Joachim Kohlbrecher,
Stavros Samothrakitis,
Keith Wilford
Abstract:
Ion irradiation provides a promising substitute to neutron tests for investigating the effects of radiation on materials for fission and fusion reactor plants. Here we show proton irradiation can quantitatively reproduce precipitation that leads to embrittlement in reactor pressure vessel steels, at dose rates 10,000 times greater than experienced in fission reactor operation. Small-angle neutron…
▽ More
Ion irradiation provides a promising substitute to neutron tests for investigating the effects of radiation on materials for fission and fusion reactor plants. Here we show proton irradiation can quantitatively reproduce precipitation that leads to embrittlement in reactor pressure vessel steels, at dose rates 10,000 times greater than experienced in fission reactor operation. Small-angle neutron scattering (SANS) is used to characterize precipitate size distributions in copper-containing steels irradiated to average doses of approx. 7 mdpa with 5 MeV protons. Comparing our results with the literature on reactor pressure vessel steels containing at least 1 at.% nickel, we find a power-law scaling of dose with exponent 0.25--0.30 accounts for the effects of dose rate on precipitate volume fraction over 6 orders of magnitude in dose rate. In conjunction with dose rate, carbon is identified as performing a leading role in determining precipitate sizes, adding to the known effects of nickel, manganese and irradiation temperature. We discuss the composition of precipitates inferred from SANS, taking previous atom probe tomography studies into consideration.
△ Less
Submitted 20 August, 2021;
originally announced August 2021.
-
Optimization of the Observing Cadence for the Rubin Observatory Legacy Survey of Space and Time: a pioneering process of community-focused experimental design
Authors:
Federica B. Bianco,
Željko Ivezić,
R. Lynne Jones,
Melissa L. Graham,
Phil Marshall,
Abhijit Saha,
Michael A. Strauss,
Peter Yoachim,
Tiago Ribeiro,
Timo Anguita,
Franz E. Bauer,
Eric C. Bellm,
Robert D. Blum,
William N. Brandt,
Sarah Brough,
Màrcio Catelan,
William I. Clarkson,
Andrew J. Connolly,
Eric Gawiser,
John Gizis,
Renee Hlozek,
Sugata Kaviraj,
Charles T. Liu,
Michelle Lochner,
Ashish A. Mahabal
, et al. (21 additional authors not shown)
Abstract:
Vera C. Rubin Observatory is a ground-based astronomical facility under construction, a joint project of the National Science Foundation and the U.S. Department of Energy, designed to conduct a multi-purpose 10-year optical survey of the southern hemisphere sky: the Legacy Survey of Space and Time. Significant flexibility in survey strategy remains within the constraints imposed by the core scienc…
▽ More
Vera C. Rubin Observatory is a ground-based astronomical facility under construction, a joint project of the National Science Foundation and the U.S. Department of Energy, designed to conduct a multi-purpose 10-year optical survey of the southern hemisphere sky: the Legacy Survey of Space and Time. Significant flexibility in survey strategy remains within the constraints imposed by the core science goals of probing dark energy and dark matter, cataloging the Solar System, exploring the transient optical sky, and map** the Milky Way. The survey's massive data throughput will be transformational for many other astrophysics domains and Rubin's data access policy sets the stage for a huge potential users' community. To ensure that the survey science potential is maximized while serving as broad a community as possible, Rubin Observatory has involved the scientific community at large in the process of setting and refining the details of the observing strategy. The motivation, history, and decision-making process of this strategy optimization are detailed in this paper, giving context to the science-driven proposals and recommendations for the survey strategy included in this Focus Issue.
△ Less
Submitted 1 September, 2021; v1 submitted 3 August, 2021;
originally announced August 2021.
-
THOR: An Algorithm for Cadence-Independent Asteroid Discovery
Authors:
Joachim Moeyens,
Mario Juric,
Jes Ford,
Dino Bektesevic,
Andrew J. Connolly,
Siegfried Eggl,
Željko Ivezić,
R. Lynne Jones,
J. Bryce Kalmbach,
Hayden Smotherman
Abstract:
We present "Tracklet-less Heliocentric Orbit Recovery" (THOR), an algorithm for linking of observations of Solar System objects across multiple epochs that does not require intra-night tracklets or a predefined cadence of observations within a search window. By sparsely covering regions of interest in the phase space with "test orbits", transforming nearby observations over a few nights into the c…
▽ More
We present "Tracklet-less Heliocentric Orbit Recovery" (THOR), an algorithm for linking of observations of Solar System objects across multiple epochs that does not require intra-night tracklets or a predefined cadence of observations within a search window. By sparsely covering regions of interest in the phase space with "test orbits", transforming nearby observations over a few nights into the co-rotating frame of the test orbit at each epoch, and then performing a generalized Hough transform on the transformed detections followed by orbit determination (OD) filtering, candidate clusters of observations belonging to the same objects can be recovered at moderate computational cost and little to no constraints on cadence. We validate the effectiveness of this approach by running on simulations as well as on real data from the Zwicky Transient Facility (ZTF). Applied to a short, 2-week, slice of ZTF observations, we demonstrate THOR can recover 97.4% of all previously known and discoverable objects in the targeted ($a > 1.7$ au) population with 5 or more observations and with purity between 97.7% and 100%. This includes 10 likely new discoveries, and a recovery of an $e \sim 1$ comet C/2018 U1 (the comet would have been a ZTF discovery had THOR been running in 2018 when the data were taken). The THOR package and demo Jupyter notebooks are open source and available at https://github.com/moeyensj/thor.
△ Less
Submitted 3 May, 2021;
originally announced May 2021.
-
DESC DC2 Data Release Note
Authors:
LSST Dark Energy Science Collaboration,
Bela Abolfathi,
Robert Armstrong,
Humna Awan,
Yadu N. Babuji,
Franz Erik Bauer,
George Beckett,
Rahul Biswas,
Joanne R. Bogart,
Dominique Boutigny,
Kyle Chard,
James Chiang,
Johann Cohen-Tanugi,
Andrew J. Connolly,
Scott F. Daniel,
Seth W. Digel,
Alex Drlica-Wagner,
Richard Dubois,
Eric Gawiser,
Thomas Glanzman,
Salman Habib,
Andrew P. Hearin,
Katrin Heitmann,
Fabio Hernandez,
Renée Hložek
, et al. (32 additional authors not shown)
Abstract:
In preparation for cosmological analyses of the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST), the LSST Dark Energy Science Collaboration (LSST DESC) has created a 300 deg$^2$ simulated survey as part of an effort called Data Challenge 2 (DC2). The DC2 simulated sky survey, in six optical bands with observations following a reference LSST observing cadence, was processed with th…
▽ More
In preparation for cosmological analyses of the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST), the LSST Dark Energy Science Collaboration (LSST DESC) has created a 300 deg$^2$ simulated survey as part of an effort called Data Challenge 2 (DC2). The DC2 simulated sky survey, in six optical bands with observations following a reference LSST observing cadence, was processed with the LSST Science Pipelines (19.0.0). In this Note, we describe the public data release of the resulting object catalogs for the coadded images of five years of simulated observations along with associated truth catalogs. We include a brief description of the major features of the available data sets. To enable convenient access to the data products, we have developed a web portal connected to Globus data services. We describe how to access the data and provide example Jupyter Notebooks in Python to aid first interactions with the data. We welcome feedback and questions about the data release via a GitHub repository.
△ Less
Submitted 13 June, 2022; v1 submitted 12 January, 2021;
originally announced January 2021.
-
Recommended Target Fields for Commissioning the Vera C. Rubin Observatory
Authors:
A. Amon,
K. Bechtol,
A. J. Connolly,
S. W. Digel,
A. Drlica-Wagner,
E. Gawiser,
M. Jarvis,
S. W. Jha,
A. von der Linden,
M. Moniez,
G. Narayan,
N. Regnault,
I. Sevilla-Noarbe,
S. J. Schmidt,
S. H. Suyu,
C. W. Walter
Abstract:
The commissioning team for the Vera C. Rubin observatory is planning a set of engineering and science verification observations with the Legacy Survey of Space and Time (LSST) commissioning camera and then the Rubin Observatory LSST Camera. The time frame for these observations is not yet fixed, and the commissioning team will have flexibility in selecting fields to observe. In this document, the…
▽ More
The commissioning team for the Vera C. Rubin observatory is planning a set of engineering and science verification observations with the Legacy Survey of Space and Time (LSST) commissioning camera and then the Rubin Observatory LSST Camera. The time frame for these observations is not yet fixed, and the commissioning team will have flexibility in selecting fields to observe. In this document, the Dark Energy Science Collaboration (DESC) Commissioning Working Group presents a prioritized list of target fields appropriate for testing various aspects of DESC-relevant science performance, grouped by season for visibility from Rubin Observatory at Cerro Pachon. Our recommended fields include Deep-Drilling fields (DDFs) to full LSST depth for photo-$z$ and shape calibration purposes, HST imaging fields to full depth for deblending studies, and an $\sim$200 square degree area to 1-year depth in several filters for higher-level validation of wide-area science cases for DESC. We also anticipate that commissioning observations will be needed for template building for transient science over a broad RA range. We include detailed descriptions of our recommended fields along with associated references. We are optimistic that this document will continue to be useful during LSST operations, as it provides a comprehensive list of overlap** data-sets and the references describing them.
△ Less
Submitted 28 October, 2020;
originally announced October 2020.
-
The LSST DESC DC2 Simulated Sky Survey
Authors:
LSST Dark Energy Science Collaboration,
Bela Abolfathi,
David Alonso,
Robert Armstrong,
Éric Aubourg,
Humna Awan,
Yadu N. Babuji,
Franz Erik Bauer,
Rachel Bean,
George Beckett,
Rahul Biswas,
Joanne R. Bogart,
Dominique Boutigny,
Kyle Chard,
James Chiang,
Chuck F. Claver,
Johann Cohen-Tanugi,
Céline Combet,
Andrew J. Connolly,
Scott F. Daniel,
Seth W. Digel,
Alex Drlica-Wagner,
Richard Dubois,
Emmanuel Gangler,
Eric Gawiser
, et al. (55 additional authors not shown)
Abstract:
We describe the simulated sky survey underlying the second data challenge (DC2) carried out in preparation for analysis of the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST) by the LSST Dark Energy Science Collaboration (LSST DESC). Significant connections across multiple science domains will be a hallmark of LSST; the DC2 program represents a unique modeling effort that stresses…
▽ More
We describe the simulated sky survey underlying the second data challenge (DC2) carried out in preparation for analysis of the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST) by the LSST Dark Energy Science Collaboration (LSST DESC). Significant connections across multiple science domains will be a hallmark of LSST; the DC2 program represents a unique modeling effort that stresses this interconnectivity in a way that has not been attempted before. This effort encompasses a full end-to-end approach: starting from a large N-body simulation, through setting up LSST-like observations including realistic cadences, through image simulations, and finally processing with Rubin's LSST Science Pipelines. This last step ensures that we generate data products resembling those to be delivered by the Rubin Observatory as closely as is currently possible. The simulated DC2 sky survey covers six optical bands in a wide-fast-deep (WFD) area of approximately 300 deg^2 as well as a deep drilling field (DDF) of approximately 1 deg^2. We simulate 5 years of the planned 10-year survey. The DC2 sky survey has multiple purposes. First, the LSST DESC working groups can use the dataset to develop a range of DESC analysis pipelines to prepare for the advent of actual data. Second, it serves as a realistic testbed for the image processing software under development for LSST by the Rubin Observatory. In particular, simulated data provide a controlled way to investigate certain image-level systematic effects. Finally, the DC2 sky survey enables the exploration of new scientific ideas in both static and time-domain cosmology.
△ Less
Submitted 26 January, 2021; v1 submitted 12 October, 2020;
originally announced October 2020.
-
Learning Spectral Templates for Photometric Redshift Estimation from Broadband Photometry
Authors:
John Franklin Crenshaw,
Andrew J. Connolly
Abstract:
Estimating redshifts from broadband photometry is often limited by how accurately we can map the colors of galaxies to an underlying spectral template. Current techniques utilize spectrophotometric samples of galaxies or spectra derived from spectral synthesis models. Both of these approaches have their limitations, either the sample sizes are small and often not representative of the diversity of…
▽ More
Estimating redshifts from broadband photometry is often limited by how accurately we can map the colors of galaxies to an underlying spectral template. Current techniques utilize spectrophotometric samples of galaxies or spectra derived from spectral synthesis models. Both of these approaches have their limitations, either the sample sizes are small and often not representative of the diversity of galaxy colors or the model colors can be biased (often as a function of wavelength) which introduces systematics in the derived redshifts. In this paper we learn the underlying spectral energy distributions from an ensemble of $\sim$100K galaxies with measured redshifts and colors. We show that we are able to reconstruct emission and absorption lines at a significantly higher resolution than the broadband filters used to measure the photometry for a sample of 20 spectral templates. We find that our training algorithm reduces the fraction of outliers in the derived photometric redshifts by up to 28%, bias up to 91%, and scatter up to 25%, when compared to estimates using a standard set of spectral templates. We discuss the current limitations of this approach and its applicability for recovering the underlying properties of galaxies. Our derived templates and the code used to produce these results are publicly available in a dedicated Github repository: https://github.com/dirac-institute/photoz_template_learning.
△ Less
Submitted 10 August, 2020;
originally announced August 2020.
-
Leveraging Synthetic Subject Invariant EEG Signals for Zero Calibration BCI
Authors:
Nik Khadijah Nik Aznan,
Amir Atapour-Abarghouei,
Stephen Bonner,
Jason D. Connolly,
Toby P. Breckon
Abstract:
Recently, substantial progress has been made in the area of Brain-Computer Interface (BCI) using modern machine learning techniques to decode and interpret brain signals. While Electroencephalography (EEG) has provided a non-invasive method of interfacing with a human brain, the acquired data is often heavily subject and session dependent. This makes seamless incorporation of such data into real-w…
▽ More
Recently, substantial progress has been made in the area of Brain-Computer Interface (BCI) using modern machine learning techniques to decode and interpret brain signals. While Electroencephalography (EEG) has provided a non-invasive method of interfacing with a human brain, the acquired data is often heavily subject and session dependent. This makes seamless incorporation of such data into real-world applications intractable as the subject and session data variance can lead to long and tedious calibration requirements and cross-subject generalisation issues. Focusing on a Steady State Visual Evoked Potential (SSVEP) classification systems, we propose a novel means of generating highly-realistic synthetic EEG data invariant to any subject, session or other environmental conditions. Our approach, entitled the Subject Invariant SSVEP Generative Adversarial Network (SIS-GAN), produces synthetic EEG data from multiple SSVEP classes using a single network. Additionally, by taking advantage of a fixed-weight pre-trained subject classification network, we ensure that our generative model remains agnostic to subject-specific features and thus produces subject-invariant data that can be applied to new previously unseen subjects. Our extensive experimental evaluation demonstrates the efficacy of our synthetic data, leading to superior performance, with improvements of up to 16 percentage points in zero-calibration classification tasks when trained using our subject-invariant synthetic EEG signals.
△ Less
Submitted 23 October, 2020; v1 submitted 16 July, 2020;
originally announced July 2020.
-
Theory of random nanoparticle layers in photovoltaic devices applied to self-aggregated metal samples
Authors:
Christin David,
James P. Connolly,
Christian Chaverri Ramos,
F. Javier García de Abajo,
Guillermo Sánchez Plaza
Abstract:
Random Al and Ag nanoparticle distributions are studied on varying substrates, where we exploit the nanosphere self-aggregation method (NSA) for fabrication. Relying on the measured particle size distributions of these samples, we develop a theoretical model that can be applied to arbitrary random nanostructure layers as is demonstrated for several distinct NSA samples. As a proof of concept, the…
▽ More
Random Al and Ag nanoparticle distributions are studied on varying substrates, where we exploit the nanosphere self-aggregation method (NSA) for fabrication. Relying on the measured particle size distributions of these samples, we develop a theoretical model that can be applied to arbitrary random nanostructure layers as is demonstrated for several distinct NSA samples. As a proof of concept, the optical properties of the exact same particles distributions, made from the quasi random modeling input with electron beam lithography (EBL), are investigated from both theory and experiment. Our numerical procedure is based on rigorous solutions of Maxwell's equations and yields optical spectra of fully interacting randomly positioned nanoparticle arrays. These results constitute a new methodology for improving the optical performance of layers of nanoparticles with direct application to enhanced photovoltaics.
△ Less
Submitted 29 June, 2020;
originally announced June 2020.
-
Photometric Redshifts with the LSST II: The Impact of Near-Infrared and Near-Ultraviolet Photometry
Authors:
Melissa L. Graham,
Andrew J. Connolly,
Winnie Wang,
Samuel J. Schmidt,
Christopher B. Morrison,
Željko Ivezić,
Sébastien Fabbro,
Patrick Côté,
Scott F. Daniel,
R. Lynne Jones,
Mario Jurić,
Peter Yoachim,
J. Bryce Kalmbach
Abstract:
Accurate photometric redshift (photo-$z$) estimates are essential to the cosmological science goals of the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). In this work we use simulated photometry for mock galaxy catalogs to explore how LSST photo-$z$ estimates can be improved by the addition of near-infrared (NIR) and/or ultraviolet (UV) photometry from the Euclid, WFIRST, and/or…
▽ More
Accurate photometric redshift (photo-$z$) estimates are essential to the cosmological science goals of the Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST). In this work we use simulated photometry for mock galaxy catalogs to explore how LSST photo-$z$ estimates can be improved by the addition of near-infrared (NIR) and/or ultraviolet (UV) photometry from the Euclid, WFIRST, and/or CASTOR space telescopes. Generally, we find that deeper optical photometry can reduce the standard deviation of the photo-$z$ estimates more than adding NIR or UV filters, but that additional filters are the only way to significantly lower the fraction of galaxies with catastrophically under- or over-estimated photo-$z$. For Euclid, we find that the addition of ${JH}$ $5σ$ photometric detections can reduce the standard deviation for galaxies with $z>1$ ($z>0.3$) by ${\sim}20\%$ (${\sim}10\%$), and the fraction of outliers by ${\sim}40\%$ (${\sim}25\%$). For WFIRST, we show how the addition of deep ${YJHK}$ photometry could reduce the standard deviation by ${\gtrsim}50\%$ at $z>1.5$ and drastically reduce the fraction of outliers to just ${\sim}2\%$ overall. For CASTOR, we find that the addition of its ${UV}$ and $u$-band photometry could reduce the standard deviation by ${\sim}30\%$ and the fraction of outliers by ${\sim}50\%$ for galaxies with $z<0.5$. We also evaluate the photo-$z$ results within sky areas that overlap with both the NIR and UV surveys, and when spectroscopic training sets built from the surveys' small-area deep fields are used.
△ Less
Submitted 16 April, 2020;
originally announced April 2020.
-
Dimensionality Reduction of SDSS Spectra with Variational Autoencoders
Authors:
Stephen K. N. Portillo,
John K. Parejko,
Jorge R. Vergara,
Andrew J. Connolly
Abstract:
High resolution galaxy spectra contain much information about galactic physics, but the high dimensionality of these spectra makes it difficult to fully utilize the information they contain. We apply variational autoencoders (VAEs), a non-linear dimensionality reduction technique, to a sample of spectra from the Sloan Digital Sky Survey. In contrast to Principal Component Analysis (PCA), a widely…
▽ More
High resolution galaxy spectra contain much information about galactic physics, but the high dimensionality of these spectra makes it difficult to fully utilize the information they contain. We apply variational autoencoders (VAEs), a non-linear dimensionality reduction technique, to a sample of spectra from the Sloan Digital Sky Survey. In contrast to Principal Component Analysis (PCA), a widely used technique, VAEs can capture non-linear relationships between latent parameters and the data. We find that a VAE can reconstruct the SDSS spectra well with only six latent parameters, outperforming PCA with the same number of components. Different galaxy classes are naturally separated in this latent space, without class labels having been given to the VAE. The VAE latent space is interpretable because the VAE can be used to make synthetic spectra at any point in latent space. For example, making synthetic spectra along tracks in latent space yields sequences of realistic spectra that interpolate between two different types of galaxies. Using the latent space to find outliers may yield interesting spectra: in our small sample, we immediately find unusual data artifacts and stars misclassified as galaxies. In this exploratory work, we show that VAEs create compact, interpretable latent spaces that capture non-linear features of the data. While a VAE takes substantial time to train (~1 day for 48000 spectra), once trained, VAEs can enable the fast exploration of large astronomical data sets.
△ Less
Submitted 9 July, 2020; v1 submitted 24 February, 2020;
originally announced February 2020.
-
Evaluation of probabilistic photometric redshift estimation approaches for The Rubin Observatory Legacy Survey of Space and Time (LSST)
Authors:
S. J. Schmidt,
A. I. Malz,
J. Y. H. Soo,
I. A. Almosallam,
M. Brescia,
S. Cavuoti,
J. Cohen-Tanugi,
A. J. Connolly,
J. DeRose,
P. E. Freeman,
M. L. Graham,
K. G. Iyer,
M. J. Jarvis,
J. B. Kalmbach,
E. Kovacs,
A. B. Lee,
G. Longo,
C. B. Morrison,
J. A. Newman,
E. Nourbakhsh,
E. Nuss,
T. Pospisil,
H. Tranin,
R. H. Wechsler,
R. Zhou
, et al. (2 additional authors not shown)
Abstract:
Many scientific investigations of photometric galaxy surveys require redshift estimates, whose uncertainty properties are best encapsulated by photometric redshift (photo-z) posterior probability density functions (PDFs). A plethora of photo-z PDF estimation methodologies abound, producing discrepant results with no consensus on a preferred approach. We present the results of a comprehensive exper…
▽ More
Many scientific investigations of photometric galaxy surveys require redshift estimates, whose uncertainty properties are best encapsulated by photometric redshift (photo-z) posterior probability density functions (PDFs). A plethora of photo-z PDF estimation methodologies abound, producing discrepant results with no consensus on a preferred approach. We present the results of a comprehensive experiment comparing twelve photo-z algorithms applied to mock data produced for The Rubin Observatory Legacy Survey of Space and Time (LSST) Dark Energy Science Collaboration (DESC). By supplying perfect prior information, in the form of the complete template library and a representative training set as inputs to each code, we demonstrate the impact of the assumptions underlying each technique on the output photo-z PDFs. In the absence of a notion of true, unbiased photo-z PDFs, we evaluate and interpret multiple metrics of the ensemble properties of the derived photo-z PDFs as well as traditional reductions to photo-z point estimates. We report systematic biases and overall over/under-breadth of the photo-z PDFs of many popular codes, which may indicate avenues for improvement in the algorithms or implementations. Furthermore, we raise attention to the limitations of established metrics for assessing photo-z PDF accuracy; though we identify the conditional density estimate (CDE) loss as a promising metric of photo-z PDF performance in the case where true redshifts are available but true photo-z PDFs are not, we emphasize the need for science-specific performancemetrics.
△ Less
Submitted 31 July, 2021; v1 submitted 10 January, 2020;
originally announced January 2020.
-
Applying Information Theory to Design Optimal Filters for Photometric Redshifts
Authors:
J. Bryce Kalmbach,
Jacob T. VanderPlas,
Andrew J. Connolly
Abstract:
In this paper we apply ideas from information theory to create a method for the design of optimal filters for photometric redshift estimation. We show the method applied to a series of simple example filters in order to motivate an intuition for how photometric redshift estimators respond to the properties of photometric passbands. We then design a realistic set of six filters covering optical wav…
▽ More
In this paper we apply ideas from information theory to create a method for the design of optimal filters for photometric redshift estimation. We show the method applied to a series of simple example filters in order to motivate an intuition for how photometric redshift estimators respond to the properties of photometric passbands. We then design a realistic set of six filters covering optical wavelengths that optimize photometric redshifts for $z <= 2.3$ and $i < 25.3$. We create a simulated catalog for these optimal filters and use our filters with a photometric redshift estimation code to show that we can improve the standard deviation of the photometric redshift error by 7.1% overall and improve outliers 9.9% over the standard filters proposed for the Large Synoptic Survey Telescope (LSST). We compare features of our optimal filters to LSST and find that the LSST filters incorporate key features for optimal photometric redshift estimation. Finally, we describe how information theory can be applied to a range of optimization problems in astronomy.
△ Less
Submitted 5 January, 2020;
originally announced January 2020.
-
Algorithms and Statistical Models for Scientific Discovery in the Petabyte Era
Authors:
Brian Nord,
Andrew J. Connolly,
Jamie Kinney,
Jeremy Kubica,
Gautaum Narayan,
Joshua E. G. Peek,
Chad Schafer,
Erik J. Tollerud,
Camille Avestruz,
G. Jogesh Babu,
Simon Birrer,
Douglas Burke,
João Caldeira,
Douglas A. Caldwell,
Joleen K. Carlberg,
Yen-Chi Chen,
Chuanfei Dong,
Eric D. Feigelson,
V. Zach Golkhou,
Vinay Kashyap,
T. S. Li,
Thomas Loredo,
Luisa Lucie-Smith,
Kaisey S. Mandel,
J. R. Martínez-Galarza
, et al. (13 additional authors not shown)
Abstract:
The field of astronomy has arrived at a turning point in terms of size and complexity of both datasets and scientific collaboration. Commensurately, algorithms and statistical models have begun to adapt --- e.g., via the onset of artificial intelligence --- which itself presents new challenges and opportunities for growth. This white paper aims to offer guidance and ideas for how we can evolve our…
▽ More
The field of astronomy has arrived at a turning point in terms of size and complexity of both datasets and scientific collaboration. Commensurately, algorithms and statistical models have begun to adapt --- e.g., via the onset of artificial intelligence --- which itself presents new challenges and opportunities for growth. This white paper aims to offer guidance and ideas for how we can evolve our technical and collaborative frameworks to promote efficient algorithmic development and take advantage of opportunities for scientific discovery in the petabyte era. We discuss challenges for discovery in large and complex data sets; challenges and requirements for the next stage of development of statistical methodologies and algorithmic tool sets; how we might change our paradigms of collaboration and education; and the ethical implications of scientists' contributions to widely applicable algorithms and computational modeling. We start with six distinct recommendations that are supported by the commentary following them. This white paper is related to a larger corpus of effort that has taken place within and around the Petabytes to Science Workshops (https://petabytestoscience.github.io/).
△ Less
Submitted 4 November, 2019;
originally announced November 2019.
-
First inverse kinematics study of the $^{22}$Ne$(p,γ)^{23}$Na reaction and its role in AGB star and classical nova nucleosynthesis
Authors:
M. Williams,
A. Lennarz,
A. M. Laird,
U. Battino,
J. José D. Connolly,
C. Ruiz,
A. Chen,
B. Davids,
N. Esker,
B. R. Fulton,
R. Garg,
x M. Gay,
U. Greife,
U. Hager,
D. Hutcheon,
M. Lovely,
S. Lyons,
A. Psaltis,
J. E. Riley,
A. Tattersall
Abstract:
The abundances of sodium and oxygen are observed to be anti-correlated in all well-studied globular clusters. Asymptotic giant branch (AGB) stars undergoing hot bottom burning (HBB) are thought to be prime candidates for producing sodium-rich oxygen-poor material and expelling it into the cluster ISM. The 22Ne(p,gamma)23Na reaction has been shown to strongly influence the amount of 23Na produced d…
▽ More
The abundances of sodium and oxygen are observed to be anti-correlated in all well-studied globular clusters. Asymptotic giant branch (AGB) stars undergoing hot bottom burning (HBB) are thought to be prime candidates for producing sodium-rich oxygen-poor material and expelling it into the cluster ISM. The 22Ne(p,gamma)23Na reaction has been shown to strongly influence the amount of 23Na produced during HBB. This reaction is also important for classical novae nucleosynthesis, with sensitivity studies showing that the abundances of several isotopes in the Ne-Al region are significantly altered when varying the reaction rate between available compilations. Here we present the first inverse kinematics measurements of key resonances strengths as well as the direct capture S-factor. Together, this study represents the largest centre of mass energy range (149-1222 keV) over which this reaction has been measured in a single experiment. Our results for low-energy resonances at Ecm=149, 181 and 248 keV are in good agreement with recent forward kinematics results; we also find a direct capture S-factor consistent with the literature value of 62 keV.b. However, in the case of the important reference resonance at Ecm = 458 keV we find a strength value of wg=0.44 +/- 0.02 eV, which is significantly lower than recent results. Using our new recommended rate we explore the impact of these results on both AGB star and classical novae nucleosynthesis. In the case of AGB stars we see very little abundance changes with respect to the rate included in the STARLIB-2013. However, we observe changes of up to a factor of 2 in isotopes produced in both the carbon-oxygen (CO) and oxygen-neon (ONe) classical novae models considered here. The 22Ne(p,gamma)23Na reaction rate is now sufficiently well constrained to not significantly contribute toward abundance uncertainties from classical novae nucleosynthesis models.
△ Less
Submitted 3 October, 2019;
originally announced October 2019.
-
AXS: A framework for fast astronomical data processing based on Apache Spark
Authors:
Petar Zečević,
Colin T. Slater,
Mario Jurić,
Andrew J. Connolly,
Sven Lončarić,
Eric C. Bellm,
V. Zach Golkhou,
Krzysztof Suberlak
Abstract:
We introduce AXS (Astronomy eXtensions for Spark), a scalable open-source astronomical data analysis framework built on Apache Spark, a widely used industry-standard engine for big data processing. Building on capabilities present in Spark, AXS aims to enable querying and analyzing almost arbitrarily large astronomical catalogs using familiar Python/AstroPy concepts, DataFrame APIs, and SQL statem…
▽ More
We introduce AXS (Astronomy eXtensions for Spark), a scalable open-source astronomical data analysis framework built on Apache Spark, a widely used industry-standard engine for big data processing. Building on capabilities present in Spark, AXS aims to enable querying and analyzing almost arbitrarily large astronomical catalogs using familiar Python/AstroPy concepts, DataFrame APIs, and SQL statements. We achieve this by i) adding support to Spark for efficient on-line positional cross-matching and ii) supplying a Python library supporting commonly-used operations for astronomical data analysis. To support scalable cross-matching, we developed a variant of the ZONES algorithm (Gray et al. 2004) capable of operating in distributed, shared-nothing architecture. We couple this to a data partitioning scheme that enables fast catalog cross-matching and handles the data skew often present in deep all-sky data sets. The cross-match and other often-used functionalities are exposed to the end users through an easy-to-use Python API. We demonstrate AXS' technical and scientific performance on SDSS, ZTF, Gaia DR2, and AllWise catalogs. Using AXS we were able to perform on-the-fly cross-match of Gaia DR2 (1.8 billion rows) and AllWise (900 million rows) data sets in ~ 30 seconds. We discuss how cloud-ready distributed systems like AXS provide a natural way to enable comprehensive end-user analyses of large datasets such as LSST.
△ Less
Submitted 24 May, 2019; v1 submitted 22 May, 2019;
originally announced May 2019.
-
Petabytes to Science
Authors:
Amanda E. Bauer,
Eric C. Bellm,
Adam S. Bolton,
Surajit Chaudhuri,
A. J. Connolly,
Kelle L. Cruz,
Vandana Desai,
Alex Drlica-Wagner,
Frossie Economou,
Niall Gaffney,
J. Kavelaars,
J. Kinney,
Ting S. Li,
B. Lundgren,
R. Margutti,
G. Narayan,
B. Nord,
Dara J. Norman,
W. O'Mullane,
S. Padhi,
J. E. G. Peek,
C. Schafer,
Megan E. Schwamb,
Arfon M. Smith,
Erik J. Tollerud
, et al. (2 additional authors not shown)
Abstract:
A Kavli foundation sponsored workshop on the theme \emph{Petabytes to Science} was held 12$^{th}$ to 14$^{th}$ of February 2019 in Las Vegas. The aim of the this workshop was to discuss important trends and technologies which may support astronomy. We also tackled how to better shape the workforce for the new trends and how we should approach education and public outreach. This document was coauth…
▽ More
A Kavli foundation sponsored workshop on the theme \emph{Petabytes to Science} was held 12$^{th}$ to 14$^{th}$ of February 2019 in Las Vegas. The aim of the this workshop was to discuss important trends and technologies which may support astronomy. We also tackled how to better shape the workforce for the new trends and how we should approach education and public outreach. This document was coauthored during the workshop and edited in the weeks after. It comprises the discussions and highlights many recommendations which came out of the workshop.
We shall distill parts of this document and formulate potential white papers for the decadal survey.
△ Less
Submitted 17 November, 2019; v1 submitted 13 May, 2019;
originally announced May 2019.
-
Models and Simulations for the Photometric LSST Astronomical Time Series Classification Challenge (PLAsTiCC)
Authors:
R. Kessler,
G. Narayan,
A. Avelino,
E. Bachelet,
R. Biswas,
P. J. Brown,
D. F. Chernoff,
A. J. Connolly,
M. Dai,
S. Daniel,
R. Di Stefano,
M. R. Drout,
L. Galbany,
S. González-Gaitán,
M. L. Graham,
R. Hložek,
E. E. O. Ishida,
J. Guillochon,
S. W. Jha,
D. O. Jones,
K. S. Mandel,
D. Muthukrishna,
A. O'Grady,
C. M. Peters,
J. R. Pierel
, et al. (4 additional authors not shown)
Abstract:
We describe the simulated data sample for the "Photometric LSST Astronomical Time Series Classification Challenge" (PLAsTiCC), a publicly available challenge to classify transient and variable events that will be observed by the Large Synoptic Survey Telescope (LSST), a new facility expected to start in the early 2020s. The challenge was hosted by Kaggle, ran from 2018 September 28 to 2018 Decembe…
▽ More
We describe the simulated data sample for the "Photometric LSST Astronomical Time Series Classification Challenge" (PLAsTiCC), a publicly available challenge to classify transient and variable events that will be observed by the Large Synoptic Survey Telescope (LSST), a new facility expected to start in the early 2020s. The challenge was hosted by Kaggle, ran from 2018 September 28 to 2018 December 17, and included 1,094 teams competing for prizes. Here we provide details of the 18 transient and variable source models, which were not revealed until after the challenge, and release the model libraries at https://doi.org/10.5281/zenodo.2612896. We describe the LSST Operations Simulator used to predict realistic observing conditions, and we describe the publicly available SNANA simulation code used to transform the models into observed fluxes and uncertainties in the LSST passbands (ugrizy). Although PLAsTiCC has finished, the publicly available models and simulation tools are being used within the astronomy community to further improve classification, and to study contamination in photometrically identified samples of type Ia supernova used to measure properties of dark energy. Our simulation framework will continue serving as a platform to improve the PLAsTiCC models, and to develop new models.
△ Less
Submitted 10 July, 2019; v1 submitted 27 March, 2019;
originally announced March 2019.
-
Enabling Deep All-Sky Searches of Outer Solar System Objects
Authors:
Mario Jurić,
R. Lynne Jones,
J. Bryce Kalmbach,
Peter Whidden,
Dino Bektešević,
Hayden Smotherman,
Joachim Moeyens,
Andrew J. Connolly,
Michele T. Bannister,
Wesley Fraser,
David Gerdes,
Michael Mommert,
Darin Ragozzine,
Megan E. Schwamb,
David Trilling
Abstract:
A foundational goal of the Large Synoptic Survey Telescope (LSST) is to map the Solar System small body populations that provide key windows into understanding of its formation and evolution. This is especially true of the populations of the Outer Solar System -- objects at the orbit of Neptune $r > 30$AU and beyond. In this whitepaper, we propose a minimal change to the LSST cadence that can grea…
▽ More
A foundational goal of the Large Synoptic Survey Telescope (LSST) is to map the Solar System small body populations that provide key windows into understanding of its formation and evolution. This is especially true of the populations of the Outer Solar System -- objects at the orbit of Neptune $r > 30$AU and beyond. In this whitepaper, we propose a minimal change to the LSST cadence that can greatly enhance LSST's ability to discover faint distant Solar System objects across the entire wide-fast-deep (WFD) survey area. Specifically, we propose that the WFD cadence be constrained so as to deliver least one sequence of $\gtrsim 10$ visits per year taken in a $\sim 10$ day period in any combination of $g, r$, and $i$ bands. Combined with advanced shift-and-stack algorithms (Whidden et al. 2019) this modification would enable a nearly complete census of the outer Solar System to $\sim 25.5$ magnitude, yielding $4-8$x more KBO discoveries than with single-epoch baseline, and enabling rapid identification and follow-up of unusual distant Solar System objects in $\gtrsim 5$x greater volume of space. These increases would enhance the science cases discussed in Schwamb et al. (2018) whitepaper, including probing Neptune's past migration history as well as discovering hypothesized planet(s) beyond the orbit of Neptune (or at least placing significant constraints on their existence).
△ Less
Submitted 24 January, 2019;
originally announced January 2019.
-
Simulating Brain Signals: Creating Synthetic EEG Data via Neural-Based Generative Models for Improved SSVEP Classification
Authors:
Nik Khadijah Nik Aznan,
Amir Atapour-Abarghouei,
Stephen Bonner,
Jason Connolly,
Noura Al Moubayed,
Toby Breckon
Abstract:
Despite significant recent progress in the area of Brain-Computer Interface (BCI), there are numerous shortcomings associated with collecting Electroencephalography (EEG) signals in real-world environments. These include, but are not limited to, subject and session data variance, long and arduous calibration processes and predictive generalisation issues across different subjects or sessions. This…
▽ More
Despite significant recent progress in the area of Brain-Computer Interface (BCI), there are numerous shortcomings associated with collecting Electroencephalography (EEG) signals in real-world environments. These include, but are not limited to, subject and session data variance, long and arduous calibration processes and predictive generalisation issues across different subjects or sessions. This implies that many downstream applications, including Steady State Visual Evoked Potential (SSVEP) based classification systems, can suffer from a shortage of reliable data. Generating meaningful and realistic synthetic data can therefore be of significant value in circumventing this problem. We explore the use of modern neural-based generative models trained on a limited quantity of EEG data collected from different subjects to generate supplementary synthetic EEG signal vectors, subsequently utilised to train an SSVEP classifier. Extensive experimental analysis demonstrates the efficacy of our generated data, leading to improvements across a variety of evaluations, with the crucial task of cross-subject generalisation improving by over 35% with the use of such synthetic data.
△ Less
Submitted 3 April, 2019; v1 submitted 15 January, 2019;
originally announced January 2019.
-
Fast algorithms for slow moving asteroids: constraints on the distribution of Kuiper Belt Objects
Authors:
Peter J. Whidden,
J. Bryce Kalmbach,
Andrew J. Connolly,
R. Lynne Jones,
Hayden Smotherman,
Dino Bektesevic,
Colin Slater,
Andrew C. Becker,
Željko Ivezić,
Mario Jurić,
Bryce Bolin,
Joachim Moeyens,
Francisco Förster,
V. Zach Golkhou
Abstract:
We introduce a new computational technique for searching for faint moving sources in astronomical images. Starting from a maximum likelihood estimate for the probability of the detection of a source within a series of images, we develop a massively parallel algorithm for searching through candidate asteroid trajectories that utilizes Graphics Processing Units (GPU). This technique can search over…
▽ More
We introduce a new computational technique for searching for faint moving sources in astronomical images. Starting from a maximum likelihood estimate for the probability of the detection of a source within a series of images, we develop a massively parallel algorithm for searching through candidate asteroid trajectories that utilizes Graphics Processing Units (GPU). This technique can search over 10^10 possible asteroid trajectories in stacks of the order 10-15 4K x 4K images in under a minute using a single consumer grade GPU. We apply this algorithm to data from the 2015 campaign of the High Cadence Transient Survey (HiTS) obtained with the Dark Energy Camera (DECam). We find 39 previously unknown Kuiper Belt Objects in the 150 square degrees of the survey. Comparing these asteroids to an existing model for the inclination distribution of the Kuiper Belt we demonstrate that we recover a KBO population above our detection limit consistent with previous studies. Software used in this analysis is made available as an open source package.
△ Less
Submitted 8 January, 2019;
originally announced January 2019.
-
Using Variable Natural Environment Brain-Computer Interface Stimuli for Real-time Humanoid Robot Navigation
Authors:
Nik Khadijah Nik Aznan,
Jason D. Connolly,
Noura Al Moubayed,
Toby P. Breckon
Abstract:
This paper addresses the challenge of humanoid robot teleoperation in a natural indoor environment via a Brain-Computer Interface (BCI). We leverage deep Convolutional Neural Network (CNN) based image and signal understanding to facilitate both real-time bject detection and dry-Electroencephalography (EEG) based human cortical brain bio-signals decoding. We employ recent advances in dry-EEG techno…
▽ More
This paper addresses the challenge of humanoid robot teleoperation in a natural indoor environment via a Brain-Computer Interface (BCI). We leverage deep Convolutional Neural Network (CNN) based image and signal understanding to facilitate both real-time bject detection and dry-Electroencephalography (EEG) based human cortical brain bio-signals decoding. We employ recent advances in dry-EEG technology to stream and collect the cortical waveforms from subjects while they fixate on variable Steady State Visual Evoked Potential (SSVEP) stimuli generated directly from the environment the robot is navigating. To these ends, we propose the use of novel variable BCI stimuli by utilising the real-time video streamed via the on-board robot camera as visual input for SSVEP, where the CNN detected natural scene objects are altered and flickered with differing frequencies (10Hz, 12Hz and 15Hz). These stimuli are not akin to traditional stimuli - as both the dimensions of the flicker regions and their on-screen position changes depending on the scene objects detected. On-screen object selection via such a dry-EEG enabled SSVEP methodology, facilitates the on-line decoding of human cortical brain signals, via a specialised secondary CNN, directly into teleoperation robot commands (approach object, move in a specific direction: right, left or back). This SSVEP decoding model is trained via a priori offline experimental data in which very similar visual input is present for all subjects. The resulting classification demonstrates high performance with mean accuracy of 85% for the real-time robot navigation experiment across multiple test subjects.
△ Less
Submitted 1 March, 2019; v1 submitted 26 November, 2018;
originally announced November 2018.
-
A Framework for Telescope Schedulers: With Applications to the Large Synoptic Survey Telescope
Authors:
Elahesadat Naghib,
Peter Yoachim,
Robert J. Vanderbei,
Andrew J. Connolly,
R. Lynne Jones
Abstract:
How ground-based telescopes schedule their observations in response to competing science priorities and constraints, variations in the weather, and the visibility of a particular part of the sky can significantly impact their efficiency. In this paper we introduce the Feature-Based telescope scheduler that is an automated, proposal-free decision making algorithm that offers \textit{controllability…
▽ More
How ground-based telescopes schedule their observations in response to competing science priorities and constraints, variations in the weather, and the visibility of a particular part of the sky can significantly impact their efficiency. In this paper we introduce the Feature-Based telescope scheduler that is an automated, proposal-free decision making algorithm that offers \textit{controllability} of the behavior, \textit{adjustability} of the mission, and quick \textit{recoverability} from interruptions for large ground-based telescopes. By framing this scheduler in the context of a coherent mathematical model the functionality and performance of the algorithm is simple to interpret and adapt to a broad range of astronomical applications. This paper presents a generic version of the Feature-Based scheduler, with minimal manual tailoring, to demonstrate its potential and flexibility as a foundation for large ground-based telescope schedulers which can later be adjusted for other instruments. In addition, a modified version of the Feature-Based scheduler for the Large Synoptic Survey Telescope (LSST) is introduced and compared to previous LSST scheduler simulations.
△ Less
Submitted 10 October, 2018;
originally announced October 2018.
-
On the Classification of SSVEP-Based Dry-EEG Signals via Convolutional Neural Networks
Authors:
Nik Khadijah Nik Aznan,
Stephen Bonner,
Jason D. Connolly,
Noura Al Moubayed,
Toby P. Breckon
Abstract:
In this paper, we propose a novel Convolutional Neural Network (CNN) approach for the classification of raw dry-EEG signals without any data pre-processing. To illustrate the effectiveness of our approach, we utilise the Steady State Visual Evoked Potential (SSVEP) paradigm as our use case. SSVEP can be utilised to allow people with severe physical disabilities such as Complete Locked-In Syndrome…
▽ More
In this paper, we propose a novel Convolutional Neural Network (CNN) approach for the classification of raw dry-EEG signals without any data pre-processing. To illustrate the effectiveness of our approach, we utilise the Steady State Visual Evoked Potential (SSVEP) paradigm as our use case. SSVEP can be utilised to allow people with severe physical disabilities such as Complete Locked-In Syndrome or Amyotrophic Lateral Sclerosis to be aided via BCI applications, as it requires only the subject to fixate upon the sensory stimuli of interest. Here we utilise SSVEP flicker frequencies between 10 to 30 Hz, which we record as subject cortical waveforms via the dry-EEG headset. Our proposed end-to-end CNN allows us to automatically and accurately classify SSVEP stimulation directly from the dry-EEG waveforms. Our CNN architecture utilises a common SSVEP Convolutional Unit (SCU), comprising of a 1D convolutional layer, batch normalization and max pooling. Furthermore, we compare several deep learning neural network variants with our primary CNN architecture, in addition to traditional machine learning classification approaches. Experimental evaluation shows our CNN architecture to be significantly better than competing approaches, achieving a classification accuracy of 96% whilst demonstrating superior cross-subject performance and even being able to generalise well to unseen subjects whose data is entirely absent from the training process.
△ Less
Submitted 2 August, 2018; v1 submitted 10 May, 2018;
originally announced May 2018.
-
APO Time Resolved Color Photometry of Highly-Elongated Interstellar Object 1I/'Oumuamua
Authors:
Bryce T. Bolin,
Harold A. Weaver,
Yanga R. Fernandez,
Carey M. Lisse,
Daniela Huppenkothen,
R. Lynne Jones,
Mario Juric,
Joachim Moeyens,
Charles A. Schambeau,
Colin T. Slater,
Zeljko Ivezic,
Andrew J. Connolly
Abstract:
We report on $g$, $r$ and $i$ band observations of the Interstellar Object 'Oumuamua (1I) taken on 2017 October 29 from 04:28 to 08:40 UTC by the Apache Point Observatory (APO) 3.5m telescope's ARCTIC camera. We find that 1I's colors are $g-r=0.41\pm0.24$ and $r-i=0.23\pm0.25$, consistent with the visible spectra of Masiero (2017), Ye et al. (2017) and Fitzsimmons et al. (2017), and most comparabl…
▽ More
We report on $g$, $r$ and $i$ band observations of the Interstellar Object 'Oumuamua (1I) taken on 2017 October 29 from 04:28 to 08:40 UTC by the Apache Point Observatory (APO) 3.5m telescope's ARCTIC camera. We find that 1I's colors are $g-r=0.41\pm0.24$ and $r-i=0.23\pm0.25$, consistent with the visible spectra of Masiero (2017), Ye et al. (2017) and Fitzsimmons et al. (2017), and most comparable to the population of Solar System C/D asteroids, Trojans, or comets. We find no evidence of any cometary activity at a heliocentric distance of 1.46 au, approximately 1.5 months after 1I's closest approach distance to the Sun. Significant brightness variability was seen in the $r$ observations, with the object becoming notably brighter towards the end of the run. By combining our APO photometric time series data with the Discovery Channel Telescope (DCT) data of Knight et al. (2017), taken 20 h later on 2017 October 30, we construct an almost complete light curve with a most probable lightcurve period of $P \simeq 4~{\rm h}$. Our results imply a double peaked rotation period of 8.1 $\pm$ 0.02 h, with a peak-to-peak amplitude of 1.5 - 2.1 mags. Assuming that 1I's shape can be approximated by an ellipsoid, the amplitude constraint implies that 1I has an axial ratio of 3.5 to 10.3, which is strikingly elongated. Assuming that 1I is rotating above its critical break up limit, our results are compatible with 1I having having modest cohesive strength and may have obtained its elongated shape during a tidal disruption event before being ejected from its home system. Astrometry useful for constraining 1I's orbit was also obtained and published in Weaver et al. (2017).
△ Less
Submitted 29 January, 2018; v1 submitted 13 November, 2017;
originally announced November 2017.
-
Estimating Spectra from Photometry
Authors:
J. Bryce Kalmbach,
Andrew J. Connolly
Abstract:
Measuring the physical properties of galaxies such as redshift frequently requires the use of Spectral Energy Distributions (SEDs). SED template sets are, however, often small in number and cover limited portions of photometric color space. Here we present a new method to estimate SEDs as a function of color from a small training set of template SEDs. We first cover the mathematical background beh…
▽ More
Measuring the physical properties of galaxies such as redshift frequently requires the use of Spectral Energy Distributions (SEDs). SED template sets are, however, often small in number and cover limited portions of photometric color space. Here we present a new method to estimate SEDs as a function of color from a small training set of template SEDs. We first cover the mathematical background behind the technique before demonstrating our ability to reconstruct spectra based upon colors and then compare to other common interpolation and extrapolation methods. When the photometric filters and spectra overlap we show reduction of error in the estimated spectra of over 65% compared to the more commonly used techniques. We also show an expansion of the method to wavelengths beyond the range of the photometric filters. Finally, we demonstrate the usefulness of our technique by generating 50 additional SED templates from an original set of 10 and applying the new set to photometric redshift estimation. We are able to reduce the photometric redshifts standard deviation by at least 22.0% and the outlier rejected bias by over 86.2% compared to original set for z $\leq$ 3.
△ Less
Submitted 6 November, 2017;
originally announced November 2017.
-
Scientific Synergy Between LSST and Euclid
Authors:
Jason Rhodes,
Robert C. Nichol,
Éric Aubourg,
Rachel Bean,
Dominique Boutigny,
Malcolm N. Bremer,
Peter Capak,
Vincenzo Cardone,
Benoît Carry,
Christopher J. Conselice,
Andrew J. Connolly,
Jean-Charles Cuillandre,
N. A. Hatch,
George Helou,
Shoubaneh Hemmati,
Hendrik Hildebrandt,
Renée Hložek,
Lynne Jones,
Steven Kahn,
Alina Kiessling,
Thomas Kitching,
Robert Lupton,
Rachel Mandelbaum,
Katarina Markovic,
Phil Marshall
, et al. (12 additional authors not shown)
Abstract:
Euclid and the Large Synoptic Survey Telescope (LSST) are poised to dramatically change the astronomy landscape early in the next decade. The combination of high cadence, deep, wide-field optical photometry from LSST with high resolution, wide-field optical photometry and near-infrared photometry and spectroscopy from Euclid will be powerful for addressing a wide range of astrophysical questions.…
▽ More
Euclid and the Large Synoptic Survey Telescope (LSST) are poised to dramatically change the astronomy landscape early in the next decade. The combination of high cadence, deep, wide-field optical photometry from LSST with high resolution, wide-field optical photometry and near-infrared photometry and spectroscopy from Euclid will be powerful for addressing a wide range of astrophysical questions. We explore Euclid/LSST synergy, ignoring the political issues associated with data access to focus on the scientific, technical, and financial benefits of coordination. We focus primarily on dark energy cosmology, but also discuss galaxy evolution, transient objects, solar system science, and galaxy cluster studies. We concentrate on synergies that require coordination in cadence or survey overlap, or would benefit from pixel-level co-processing that is beyond the scope of what is currently planned, rather than scientific programs that could be accomplished only at the catalog level without coordination in data processing or survey strategies. We provide two quantitative examples of scientific synergies: the decrease in photo-z errors (benefitting many science cases) when high resolution Euclid data are used for LSST photo-z determination, and the resulting increase in weak lensing signal-to-noise ratio from smaller photo-z errors. We briefly discuss other areas of coordination, including high performance computing resources and calibration data. Finally, we address concerns about the loss of independence and potential cross-checks between the two missions and potential consequences of not collaborating.
△ Less
Submitted 29 November, 2017; v1 submitted 23 October, 2017;
originally announced October 2017.
-
A hybrid type Ia supernova with an early flash triggered by helium-shell detonation
Authors:
Ji-an Jiang,
Mamoru Doi,
Keiichi Maeda,
Toshikazu Shigeyama,
Ken'ichi Nomoto,
Naoki Yasuda,
Saurabh W. Jha,
Masaomi Tanaka,
Tomoki Morokuma,
Nozomu Tominaga,
Željko Ivezić,
Pilar Ruiz-Lapuente,
Maximilian D. Stritzinger,
Paolo A. Mazzali,
Christopher Ashall,
Jeremy Mould,
Dietrich Baade,
Nao Suzuki,
Andrew J. Connolly,
Ferdinando Patat,
Lifan Wang,
Peter Yoachim,
David Jones,
Hisanori Furusawa,
Satoshi Miyazaki
Abstract:
Type Ia supernovae (SNe Ia) arise from the thermonuclear explosion of carbon-oxygen white dwarfs. Though the uniformity of their light curves makes them powerful cosmological distance indicators, long-standing issues remain regarding their progenitors and explosion mechanisms. Recent detection of the early ultraviolet pulse of a peculiar subluminous SN Ia has been claimed as new evidence for the c…
▽ More
Type Ia supernovae (SNe Ia) arise from the thermonuclear explosion of carbon-oxygen white dwarfs. Though the uniformity of their light curves makes them powerful cosmological distance indicators, long-standing issues remain regarding their progenitors and explosion mechanisms. Recent detection of the early ultraviolet pulse of a peculiar subluminous SN Ia has been claimed as new evidence for the companion-ejecta interaction through the single-degenerate channel. Here, we report the discovery of a prominent but red optical flash at $\sim$ 0.5 days after the explosion of a SN Ia which shows hybrid features of different SN Ia sub-classes: a light curve typical of normal-brightness SNe Ia, but with strong titanium absorptions, commonly seen in the spectra of subluminous ones. We argue that the early flash of such a hybrid SN Ia is different from predictions of previously suggested scenarios such as the companion-ejecta interaction. Instead it can be naturally explained by a SN explosion triggered by a detonation of a thin helium shell either on a near-Chandrasekhar-mass white dwarf ($\gtrsim$ 1.3 M$_{\odot}$) with low-yield $^{56}$Ni or on a sub-Chandrasekhar-mass white dwarf ($\sim$ 1.0 M$_{\odot}$) merging with a less massive white dwarf. This finding provides compelling evidence that one branch of the previously proposed explosion models, the helium-ignition scenario, does exist in nature, and such a scenario may account for explosions of white dwarfs in a wider mass range in contrast to what was previously supposed.
△ Less
Submitted 4 October, 2017;
originally announced October 2017.
-
The Discovery of a Five-Image Lensed Quasar at z = 3.34 using PanSTARRS1 and Gaia
Authors:
Fernanda Ostrovski,
Cameron A. Lemon,
Matthew W. Auger,
Richard G. McMahon,
Christopher D. Fassnacht,
Geoff C. -F. Chen,
Andrew J. Connolly,
Sergey E. Koposov,
Estelle Pons,
Sophie L. Reed,
Cristian E. Rusu
Abstract:
We report the discovery, spectroscopic confirmation, and mass modelling of the gravitationally lensed quasar system PS J0630-1201. The lens was discovered by matching a photometric quasar catalogue compiled from Pan-STARRS and WISE photometry to the Gaia DR1 catalogue, exploiting the high spatial resolution of the latter (FWHM $\sim $0.1") to identify the three brightest components of the lens. Fo…
▽ More
We report the discovery, spectroscopic confirmation, and mass modelling of the gravitationally lensed quasar system PS J0630-1201. The lens was discovered by matching a photometric quasar catalogue compiled from Pan-STARRS and WISE photometry to the Gaia DR1 catalogue, exploiting the high spatial resolution of the latter (FWHM $\sim $0.1") to identify the three brightest components of the lens. Follow-up spectroscopic observations with the WHT confirm the multiple objects are quasars at redshift $z_{q}=3.34$. Further follow-up with Keck AO high-resolution imaging reveals that the system is composed of two lensing galaxies and the quasar is lensed into a $\sim$2.8" separation four-image cusp configuration with a fifth image clearly visible, and a 1.0" arc due to the lensed quasar host galaxy. The system is well-modelled with two singular isothermal ellipsoids, reproducing the position of the fifth image. We discuss future prospects for measuring time delays between the images and constraining any offset between mass and light using the faintly detected Einstein arcs associated with the quasar host galaxy.
△ Less
Submitted 16 October, 2017; v1 submitted 26 September, 2017;
originally announced September 2017.
-
Robust period estimation using mutual information for multi-band light curves in the synoptic survey era
Authors:
Pablo Huijse,
Pablo A. Estevez,
Francisco Forster,
Scott F. Daniel,
Andrew J. Connolly,
Pavlos Protopapas,
Rodrigo Carrasco,
Jose C. Principe
Abstract:
The Large Synoptic Survey Telescope (LSST) will produce an unprecedented amount of light curves using six optical bands. Robust and efficient methods that can aggregate data from multidimensional sparsely-sampled time series are needed. In this paper we present a new method for light curve period estimation based on the quadratic mutual information (QMI). The proposed method does not assume a part…
▽ More
The Large Synoptic Survey Telescope (LSST) will produce an unprecedented amount of light curves using six optical bands. Robust and efficient methods that can aggregate data from multidimensional sparsely-sampled time series are needed. In this paper we present a new method for light curve period estimation based on the quadratic mutual information (QMI). The proposed method does not assume a particular model for the light curve nor its underlying probability density and it is robust to non-Gaussian noise and outliers. By combining the QMI from several bands the true period can be estimated even when no single-band QMI yields the period. Period recovery performance as a function of average magnitude and sample size is measured using 30,000 synthetic multi-band light curves of RR Lyrae and Cepheid variables generated by the LSST Operations and Catalog simulators. The results show that aggregating information from several bands is highly beneficial in LSST sparsely-sampled time series, obtaining an absolute increase in period recovery rate up to 50%. We also show that the QMI is more robust to noise and light curve length (sample size) than the multiband generalizations of the Lomb Scargle and Analysis of Variance periodograms, recovering the true period in 10-30% more cases than its competitors. A python package containing efficient Cython implementations of the QMI and other methods is provided.
△ Less
Submitted 11 September, 2017;
originally announced September 2017.
-
Photometric Redshifts with the LSST: Evaluating Survey Observing Strategies
Authors:
Melissa L. Graham,
Andrew J. Connolly,
Željko Ivezić,
Samuel J. Schmidt,
R. Lynne Jones,
Mario Jurić,
Scott F. Daniel,
Peter Yoachim
Abstract:
In this paper we present and characterize a nearest-neighbors color-matching photometric redshift estimator that features a direct relationship between the precision and accuracy of the input magnitudes and the output photometric redshifts. This aspect makes our estimator an ideal tool for evaluating the impact of changes to LSST survey parameters that affect the measurement errors of the photomet…
▽ More
In this paper we present and characterize a nearest-neighbors color-matching photometric redshift estimator that features a direct relationship between the precision and accuracy of the input magnitudes and the output photometric redshifts. This aspect makes our estimator an ideal tool for evaluating the impact of changes to LSST survey parameters that affect the measurement errors of the photometry, which is the main motivation of our work (i.e., it is not intended to provide the "best" photometric redshifts for LSST data). We show how the photometric redshifts will improve with time over the 10-year LSST survey and confirm that the nominal distribution of visits per filter provides the most accurate photo-$z$ results. The LSST survey strategy naturally produces observations over a range of airmass, which offers the opportunity of using an SED- and $z$-dependent atmospheric affect on the observed photometry as a color-independent redshift indicator. We show that measuring this airmass effect and including it as a prior has the potential to improve the photometric redshifts and can ameliorate extreme outliers, but that it will only be adequately measured for the brightest galaxies, which limits its overall impact on LSST photometric redshifts. We furthermore demonstrate how this airmass effect can induce a bias in the photo-$z$ results, and caution against survey strategies that prioritize high-airmass observations for the purpose of improving this prior. Ultimately, we intend for this work to serve as a guide for the expectations and preparations of the LSST science community with regards to the minimum quality of photo-$z$ as the survey progresses.
△ Less
Submitted 6 December, 2017; v1 submitted 28 June, 2017;
originally announced June 2017.
-
Everything we'd like to do with LSST data, but we don't know (yet) how
Authors:
Željko Ivezić,
Andrew J. Connolly,
Mario Jurić
Abstract:
The Large Synoptic Survey Telescope (LSST), the next-generation optical imaging survey sited at Cerro Pachon in Chile, will provide an unprecedented database of astronomical measurements. The LSST design, with an 8.4m (6.7m effective) primary mirror, a 9.6 sq. deg. field of view, and a 3.2 Gigapixel camera, will allow about 10,000 sq. deg. of sky to be covered twice per night, every three to four…
▽ More
The Large Synoptic Survey Telescope (LSST), the next-generation optical imaging survey sited at Cerro Pachon in Chile, will provide an unprecedented database of astronomical measurements. The LSST design, with an 8.4m (6.7m effective) primary mirror, a 9.6 sq. deg. field of view, and a 3.2 Gigapixel camera, will allow about 10,000 sq. deg. of sky to be covered twice per night, every three to four nights on average, with typical 5-sigma depth for point sources of $r$=24.5 (AB). With over 800 observations in $ugrizy$ bands over a 10-year period, these data will enable a deep stack reaching $r$=27.5 (about 5 magnitudes deeper than SDSS) and faint time-domain astronomy. The measured properties of newly discovered and known astrometric and photometric transients will be publicly reported within 60 sec after observation. The vast database of about 30 trillion observations of 40 billion objects will be mined for the unexpected and used for precision experiments in astrophysics. In addition to a brief introduction to LSST, we discuss a number of astro-statistical challenges that need to be overcome to extract maximum information and science results from LSST dataset.
△ Less
Submitted 14 December, 2016;
originally announced December 2016.
-
VDES J2325-5229 a z=2.7 gravitationally lensed quasar discovered using morphology independent supervised machine learning
Authors:
Fernanda Ostrovski,
Richard G. McMahon,
Andrew J. Connolly,
Cameron A. Lemon,
Matthew W. Auger,
Manda Banerji,
Johnathan M. Hung,
Sergey E. Koposov,
Christopher E. Lidman,
Sophie L. Reed,
Sahar Allam,
Aurélien Benoit-Lévy,
Emmanuel Bertin,
David Brooks,
Elizabeth Buckley-Geer,
Aurelio Carnero Rosell,
Matias Carrasco Kind,
Jorge Carretero,
Carlos E. Cunha,
Luiz N. da Costa,
Shantanu Desai,
H. Thomas Diehl,
Jörg P. Dietrich,
August E. Evrard,
David A. Finley
, et al. (34 additional authors not shown)
Abstract:
We present the discovery and preliminary characterization of a gravitationally lensed quasar with a source redshift $z_{s}=2.74$ and image separation of $2.9"$ lensed by a foreground $z_{l}=0.40$ elliptical galaxy. Since the images of gravitationally lensed quasars are the superposition of multiple point sources and a foreground lensing galaxy, we have developed a morphology independent multi-wave…
▽ More
We present the discovery and preliminary characterization of a gravitationally lensed quasar with a source redshift $z_{s}=2.74$ and image separation of $2.9"$ lensed by a foreground $z_{l}=0.40$ elliptical galaxy. Since the images of gravitationally lensed quasars are the superposition of multiple point sources and a foreground lensing galaxy, we have developed a morphology independent multi-wavelength approach to the photometric selection of lensed quasar candidates based on Gaussian Mixture Models (GMM) supervised machine learning. Using this technique and $gi$ multicolour photometric observations from the Dark Energy Survey (DES), near IR $JK$ photometry from the VISTA Hemisphere Survey (VHS) and WISE mid IR photometry, we have identified a candidate system with two catalogue components with $i_{AB}=18.61$ and $i_{AB}=20.44$ comprised of an elliptical galaxy and two blue point sources. Spectroscopic follow-up with NTT and the use of an archival AAT spectrum show that the point sources can be identified as a lensed quasar with an emission line redshift of $z=2.739\pm0.003$ and a foreground early type galaxy with $z=0.400\pm0.002$. We model the system as a single isothermal ellipsoid and find the Einstein radius $θ_E \sim 1.47"$, enclosed mass $M_{enc} \sim 4 \times 10^{11}$M$_{\odot}$ and a time delay of $\sim$52 days. The relatively wide separation, month scale time delay duration and high redshift make this an ideal system for constraining the expansion rate beyond a redshift of 1.
△ Less
Submitted 15 November, 2016; v1 submitted 5 July, 2016;
originally announced July 2016.
-
Quantum Wells in Photovoltaic Cells
Authors:
C Rohr,
P Abbott,
I M Ballard,
D B Bushnell,
J P Connolly,
N J Ekins- Daukes,
K W J Barnham
Abstract:
The fundamental efficiency limit of a single bandgap solar cell is about 31% at one sun with a bandgap of about Eg = 1.35 eV (1), determined by the trade-off of maximising current with a smaller bandgap and voltage with a larger bandgap. Multiple bandgaps can be introduced to absorb the broad solar spectrum more efficiently. This can be realised in multi- junction cells, for example, where two or…
▽ More
The fundamental efficiency limit of a single bandgap solar cell is about 31% at one sun with a bandgap of about Eg = 1.35 eV (1), determined by the trade-off of maximising current with a smaller bandgap and voltage with a larger bandgap. Multiple bandgaps can be introduced to absorb the broad solar spectrum more efficiently. This can be realised in multi- junction cells, for example, where two or more cells are stacked on top of each other either mechanically or monolithically connected by a tunnel junction. An alternative or complementary (see section 1.4) approach is the quantum well cell (QWC).
△ Less
Submitted 16 June, 2016;
originally announced June 2016.
-
Short circuit current enhancement in GaAs/AlGaAs MQW solar cells
Authors:
James P. Connolly,
Keith W. J. Barnham,
Jenny Nelson,
Christine Roberts,
Malcolm Pate,
John S. Roberts
Abstract:
The GaAs/AlGaAs quantum well solar cell (QWSC) shows promise as a novel approach to higher efficiency solar cells but suffers from a poor short circuit current Jsc. We report on efforts to reduce this problem with the use of compositional grading and back surface mirroring. We present experimental quantum efficiency (QE) data on a range of compositionally graded QWSCs and devices in which the back…
▽ More
The GaAs/AlGaAs quantum well solar cell (QWSC) shows promise as a novel approach to higher efficiency solar cells but suffers from a poor short circuit current Jsc. We report on efforts to reduce this problem with the use of compositional grading and back surface mirroring. We present experimental quantum efficiency (QE) data on a range of compositionally graded QWSCs and devices in which the back surface of the cell is coated with a mirror, increasing the optical thickness of the quantum well layer in the long wavelength range. The experimental QE spectra are reproduced by a model which deals with arbitrary compositional profiles and optical cavities formed in the mirrored cells. The model is used to design an optimised QWSC, and projected Jsc values given. Applications including II-VI and tandem solar cells are considered.
△ Less
Submitted 11 June, 2016; v1 submitted 9 June, 2016;
originally announced June 2016.
-
Optimisation of High Efficiency AlGaAs MQW Solar Cells
Authors:
J. P. Connolly,
K. W. J. Barnham,
J. Nelson,
P. Griffin,
G. Haarpaintner,
C. Roberts,
M. Pate,
J. S. Roberts
Abstract:
The GaAs/AlGaAs materials system is well suited to multi-bandgap applications such as the multiple quantum well solar cell. GaAs quantum wells are inserted in the undoped AlGaAs active region of a pin structure to extend the absorption range while retaining a higher open circuit voltage than would be provided by a cell made of the well material alone. Unfortunately aluminium gallium arsenide (AlGa…
▽ More
The GaAs/AlGaAs materials system is well suited to multi-bandgap applications such as the multiple quantum well solar cell. GaAs quantum wells are inserted in the undoped AlGaAs active region of a pin structure to extend the absorption range while retaining a higher open circuit voltage than would be provided by a cell made of the well material alone. Unfortunately aluminium gallium arsenide (AlGaAs) suffers from poor transport characteristics due to DX centres and oxygen contamination during growth, which degrade the spectral response. We investigate three mechanisms for improving the spectral response of the MQW solar cell while an experimental study of the open circuit voltage examines the voltage enhancement. An optimised structure for a high efficiency GaAs/AlGaAs solar cell is proposed.
△ Less
Submitted 11 June, 2016; v1 submitted 9 June, 2016;
originally announced June 2016.