
DeepSurveySim: Simulation Software and Benchmark Challenges for Astronomical Observation Scheduling

Abstract: Modern astronomical surveys have multiple competing scientific goals. Optimizing the observation schedule for these goals presents significant computational and theoretical challenges, and state-of-the-art methods rely on expensive human inspection of simulated telescope schedules. Automated methods, such as reinforcement learning, have recently been explored to accelerate scheduling. However, there do not yet exist benchmark data sets or user-friendly software frameworks for testing and comparing these methods. We present DeepSurveySim -- a high-fidelity and flexible simulation tool for use in telescope scheduling. DeepSurveySim provides methods for tracking and approximating sky conditions for a set of observations from a user-supplied telescope configuration. We envision this tool being used to produce benchmark data sets and for evaluating the efficacy of ground-based telescope scheduling algorithms, particularly for machine learning algorithms that would suffer in efficacy if limited to real data for training. We introduce three example survey configurations and related code implementations as benchmark problems that can be simulated with DeepSurveySim.

Submitted 14 December, 2023; originally announced December 2023.

Report number: FERMILAB-CONF-23-643-CSAID

arXiv:2311.18094

Self-Driving Telescopes: Autonomous Scheduling of Astronomical Observation Campaigns with Offline Reinforcement Learning

Authors: Franco Terranova, M. Voetberg, Brian Nord, Amanda Pagul

Abstract: Modern astronomical experiments are designed to achieve multiple scientific goals, from studies of galaxy evolution to cosmic acceleration. These goals require data of many different classes of night-sky objects, each of which has a particular set of observational needs. These observational needs are typically in strong competition with one another. This poses a challenging multi-objective optimization problem that remains unsolved. The effectiveness of Reinforcement Learning (RL) as a valuable paradigm for training autonomous systems has been well-demonstrated, and it may provide the basis for self-driving telescopes capable of optimizing the scheduling for astronomy campaigns. Simulated datasets containing examples of interactions between a telescope and a discrete set of sky locations on the celestial sphere can be used to train an RL model to sequentially gather data from these several locations to maximize a cumulative reward as a measure of the quality of the data gathered. We use simulated data to test and compare multiple implementations of a Deep Q-Network (DQN) for the task of optimizing the schedule of observations from the Stone Edge Observatory (SEO). We combine multiple improvements on the DQN and adjustments to the dataset, showing that DQNs can achieve an average reward of 87% ± 6% of the maximum achievable reward in each state on the test set. This is the first comparison of offline RL algorithms for a particular astronomical challenge and the first open-source framework for performing such a comparison and assessment task.

Submitted 29 November, 2023; originally announced November 2023.

Comments: Accepted in Machine Learning and the Physical Sciences Workshop at NeurIPS 2023; 6 pages, 5 figures

Report number: FERMILAB-CONF-23-654-CSAID

arXiv:2311.17238

Domain Adaptation for Measurements of Strong Gravitational Lenses

Authors: Paxson Swierc, Megan Zhao, Aleksandra Ćiprijanović, Brian Nord

Abstract: Upcoming surveys are predicted to discover galaxy-scale strong lenses on the order of $10^5$, making deep learning methods necessary in lensing data analysis. Currently, there is insufficient real lensing data to train deep learning algorithms, but the alternative of training only on simulated data results in poor performance on real data. Domain Adaptation may be able to bridge the gap between simulated and real datasets. We utilize domain adaptation for the estimation of Einstein radius ($Θ_E$) in simulated galaxy-scale gravitational lensing images with different levels of observational realism. We evaluate two domain adaptation techniques - Domain Adversarial Neural Networks (DANN) and Maximum Mean Discrepancy (MMD). We train on a source domain of simulated lenses and apply it to a target domain of lenses simulated to emulate noise conditions in the Dark Energy Survey (DES). We show that both domain adaptation techniques can significantly improve the model performance on the more complex target domain dataset. This work is the first application of domain adaptation for a regression task in strong lensing imaging analysis. Our results show the potential of using domain adaptation to perform analysis of future survey data with a deep neural network trained on simulated data.

Submitted 28 November, 2023; originally announced November 2023.

Comments: Accepted in Machine Learning and the Physical Sciences Workshop at NeurIPS 2023; 9 pages, 2 figures, 2 tables

Report number: FERMILAB-CONF-23-645-CSAID

arXiv:2311.01588

Domain Adaptive Graph Neural Networks for Constraining Cosmological Parameters Across Multiple Data Sets

Authors: Andrea Roncoli, Aleksandra Ćiprijanović, Maggie Voetberg, Francisco Villaescusa-Navarro, Brian Nord

Abstract: Deep learning models have been shown to outperform methods that rely on summary statistics, like the power spectrum, in extracting information from complex cosmological data sets. However, due to differences in the subgrid physics implementation and numerical approximations across different simulation suites, models trained on data from one cosmological simulation show a drop in performance when tested on another. Similarly, models trained on any of the simulations would also likely experience a drop in performance when applied to observational data. Training on data from two different suites of the CAMELS hydrodynamic cosmological simulations, we examine the generalization capabilities of Domain Adaptive Graph Neural Networks (DA-GNNs). By utilizing GNNs, we capitalize on their capacity to capture structured scale-free cosmological information from galaxy distributions. Moreover, by including unsupervised domain adaptation via Maximum Mean Discrepancy (MMD), we enable our models to extract domain-invariant features. We demonstrate that DA-GNN achieves higher accuracy and robustness on cross-dataset tasks (up to $28\%$ better relative error and up to almost an order of magnitude better $χ^2$). Using data visualizations, we show the effects of domain adaptation on proper latent space data alignment. This shows that DA-GNNs are a promising method for extracting domain-independent cosmological information, a vital step toward robust deep learning for real cosmic survey data.

Submitted 15 April, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

Comments: Accepted in Machine Learning and the Physical Sciences Workshop at NeurIPS 2023; 9 pages, 2 figures, 1 table

Report number: FERMILAB-CONF-23-644-CSAID

arXiv:2310.12528

Constructing Impactful Machine Learning Research for Astronomy: Best Practices for Researchers and Reviewers

Authors: D. Huppenkothen, M. Ntampaka, M. Ho, M. Fouesneau, B. Nord, J. E. G. Peek, M. Walmsley, J. F. Wu, C. Avestruz, T. Buck, M. Brescia, D. P. Finkbeiner, A. D. Goulding, T. Kacprzak, P. Melchior, M. Pasquato, N. Ramachandra, Y. -S. Ting, G. van de Ven, S. Villar, V. A. Villar, E. Zinger

Abstract: Machine learning has rapidly become a tool of choice for the astronomical community. It is being applied across a wide range of wavelengths and problems, from the classification of transients to neural network emulators of cosmological simulations, and is shifting paradigms about how we generate and report scientific results. At the same time, this class of method comes with its own set of best practices, challenges, and drawbacks, which, at present, are often reported on incompletely in the astrophysical literature. With this paper, we aim to provide a primer to the astronomical community, including authors, reviewers, and editors, on how to implement machine learning models and report their results in a way that ensures the accuracy of the results, reproducibility of the findings, and usefulness of the method.

Submitted 19 October, 2023; originally announced October 2023.

Comments: 14 pages, 3 figures; submitted to the Bulletin of the American Astronomical Society

arXiv:2305.01695

The Dark Energy Survey Six-Year Calibration Star Catalog

Authors: E. S. Rykoff, D. L. Tucker, D. L. Burke, S. S. Allam, K. Bechtol, G. M. Bernstein, D. Brout, R. A. Gruendl, J. Lasker, J. A. Smith, W. C. Wester, B. Yanny, T. M. C. Abbott, M. Aguena, O. Alves, F. Andrade-Oliveira, J. Annis, D. Bacon, E. Bertin, D. Brooks, A. Carnero Rosell, J. Carretero, F. J. Castander, A. Choi, L. N. da Costa , et al. (42 additional authors not shown)

Abstract: This Technical Note presents a catalog of calibrated reference stars that was generated by the Forward Global Calibration Method (FGCM) pipeline (arXiv:1706.01542) as part of the FGCM photometric calibration of the full Dark Energy Survey (DES) 6-Year data set (Y6). This catalog provides DES grizY magnitudes for 17 million stars with i-band magnitudes mostly in the range 16 < i < 21 spread over the full DES footprint covering 5000 square degrees over the Southern Galactic Cap at galactic latitudes b < -20 degrees (plus a few outlying fields disconnected from the main survey footprint). These stars are calibrated to a uniformity of better than 1.8 milli-mag (0.18%) RMS over the survey area. The absolute calibration of the catalog is computed with reference to the STISNIC.007 spectrum of the Hubble Space Telescope CalSpec standard star C26202; including systematic errors, the absolute flux system is known at the approximately 1% level. As such, these stars provide a useful reference catalog for calibrating grizY-band or grizY-like band photometry in the Southern Hemisphere, particularly for observations within the DES footprint.

Submitted 2 May, 2023; originally announced May 2023.

Comments: 21 pages, 15 figures, Fermilab Technical Note. Official Data Access Site: https://des.ncsa.illinois.edu/releases/other ; Temporary Data Access Site: https://data.darkenergysurvey.org/public_calib/DES_6yr_CalibStarCat/index.html

Report number: FERMILAB-TM-2784-PPD-SCD

arXiv:2302.02005

DeepAstroUDA: Semi-Supervised Universal Domain Adaptation for Cross-Survey Galaxy Morphology Classification and Anomaly Detection

Authors: A. Ćiprijanović, A. Lewis, K. Pedro, S. Madireddy, B. Nord, G. N. Perdue, S. M. Wild

Abstract: Artificial intelligence methods show great promise in increasing the quality and speed of work with large astronomical datasets, but the high complexity of these methods leads to the extraction of dataset-specific, non-robust features. Therefore, such methods do not generalize well across multiple datasets. We present a universal domain adaptation method, \textit{DeepAstroUDA}, as an approach to overcome this challenge. This algorithm performs semi-supervised domain adaptation and can be applied to datasets with different data distributions and class overlaps. Non-overlapping classes can be present in any of the two datasets (the labeled source domain, or the unlabeled target domain), and the method can even be used in the presence of unknown classes. We apply our method to three examples of galaxy morphology classification tasks of different complexities ($3$-class and $10$-class problems), with anomaly detection: 1) datasets created after different numbers of observing years from a single survey (LSST mock data of $1$ and $10$ years of observations); 2) data from different surveys (SDSS and DECaLS); and 3) data from observing fields with different depths within one survey (wide field and Stripe 82 deep field of SDSS). For the first time, we demonstrate the successful use of domain adaptation between very discrepant observational datasets. \textit{DeepAstroUDA} is capable of bridging the gap between two astronomical surveys, increasing classification accuracy in both domains (up to $40\%$ on the unlabeled data), and making model performance consistent across datasets. Furthermore, our method also performs well as an anomaly detection algorithm and successfully clusters unknown class samples even in the unlabeled target dataset.

Submitted 22 March, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

Comments: Accepted in Machine Learning Science and Technology (MLST); 24 pages, 14 figures

Report number: FERMILAB-PUB-23-034-CSAID

arXiv:2211.10305

Neural Inference of Gaussian Processes for Time Series Data of Quasars

Authors: Egor Danilov, Aleksandra Ćiprijanović, Brian Nord

Abstract: The study of quasar light curves poses two problems: inference of the power spectrum and interpolation of an irregularly sampled time series. A baseline approach to these tasks is to interpolate a time series with a Damped Random Walk (DRW) model, in which the spectrum is inferred using Maximum Likelihood Estimation (MLE). However, the DRW model does not describe the smoothness of the time series, and MLE faces many problems in terms of optimization and numerical precision. In this work, we introduce a new stochastic model that we call $\textit{Convolved Damped Random Walk}$ (CDRW). This model introduces a concept of smoothness to a DRW, which enables it to describe quasar spectra completely. We also introduce a new method of inference of Gaussian process parameters, which we call $\textit{Neural Inference}$. This method uses the powers of state-of-the-art neural networks to improve the conventional MLE inference technique. In our experiments, the Neural Inference method results in significant improvement over the baseline MLE (RMSE: $0.318 \rightarrow 0.205$, $0.464 \rightarrow 0.444$). Moreover, the combination of both the CDRW model and Neural Inference significantly outperforms the baseline DRW and MLE in interpolating a typical quasar light curve ($χ^2$: $0.333 \rightarrow 0.998$, $2.695 \rightarrow 0.981$). The code is published on GitHub.

Submitted 17 November, 2022; originally announced November 2022.

Comments: Machine Learning and the Physical Sciences workshop, NeurIPS 2022

arXiv:2211.09126

doi 10.1088/2632-2153/ac98f4

DIGS: Deep Inference of Galaxy Spectra with Neural Posterior Estimation

Authors: Gourav Khullar, Brian Nord, Aleksandra Ciprijanovic, Jason Poh, Fei Xu

Abstract: With the advent of billion-galaxy surveys with complex data, the need of the hour is to efficiently model galaxy spectral energy distributions (SEDs) with robust uncertainty quantification. The combination of Simulation-Based inference (SBI) and amortized Neural Posterior Estimation (NPE) has been successfully used to analyse simulated and real galaxy photometry both precisely and efficiently. In this work, we utilise this combination and build on existing literature to analyse simulated noisy galaxy spectra. Here, we demonstrate a proof-of-concept study of spectra that is a) an efficient analysis of galaxy SEDs and inference of galaxy parameters with physically interpretable uncertainties; and b) amortized calculations of posterior distributions of said galaxy parameters at the modest cost of a few galaxy fits with MCMC methods. We utilise the SED generator and inference framework Prospector to generate simulated spectra, and train a dataset of 2$\times$10$^6$ spectra (corresponding to a 5-parameter SED model) with NPE. We show that SBI -- with its combination of fast and amortized posterior estimations -- is capable of inferring accurate galaxy stellar masses and metallicities. Our uncertainty constraints are comparable to or moderately weaker than traditional inverse-modeling with Bayesian MCMC methods (e.g., 0.17 and 0.26 dex in stellar mass and metallicity for a given galaxy, respectively). We also find that our inference framework conducts rapid SED inference (0.9-1.2$\times$10$^5$ galaxy spectra via SBI/SNPE at the cost of 1 MCMC-based fit). With this work, we set the stage for further work that focuses on SED fitting of galaxy spectra with SBI, in the era of JWST galaxy survey programs and the wide-field Roman Space Telescope spectroscopic surveys.

Submitted 16 November, 2022; originally announced November 2022.

Comments: Manuscript accepted in Machine Learning: Science and Technology (MLST) as a Letter (October 10th, 2022); 12 Pages, 6 Figures and 1 Table; Data and code can be found in published github repository

Report number: FERMILAB-PUB-22-557-PPD-SCD

arXiv:2211.05836

Strong Lensing Parameter Estimation on Ground-Based Imaging Data Using Simulation-Based Inference

Authors: Jason Poh, Ashwin Samudre, Aleksandra Ćiprijanović, Brian Nord, Gourav Khullar, Dimitrios Tanoglidis, Joshua A. Frieman

Abstract: Current ground-based cosmological surveys, such as the Dark Energy Survey (DES), are predicted to discover thousands of galaxy-scale strong lenses, while future surveys, such as the Vera Rubin Observatory Legacy Survey of Space and Time (LSST) will increase that number by 1-2 orders of magnitude. The large number of strong lenses discoverable in future surveys will make strong lensing a highly competitive and complementary cosmic probe. To leverage the increased statistical power of the lenses that will be discovered through upcoming surveys, automated lens analysis techniques are necessary. We present two Simulation-Based Inference (SBI) approaches for lens parameter estimation of galaxy-galaxy lenses. We demonstrate the successful application of Neural Posterior Estimation (NPE) to automate the inference of a 12-parameter lens mass model for DES-like ground-based imaging data. We compare our NPE constraints to a Bayesian Neural Network (BNN) and find that it outperforms the BNN, producing posterior distributions that are for the most part both more accurate and more precise; in particular, several source-light model parameters are systematically biased in the BNN implementation.

Submitted 10 November, 2022; originally announced November 2022.

Comments: Accepted to the Workshop on Machine Learning and the Physical Sciences at the 36th Conference on Neural Information Processing Systems 2022 (NeurIPS 2022)

arXiv:2211.00677

Semi-Supervised Domain Adaptation for Cross-Survey Galaxy Morphology Classification and Anomaly Detection

Authors: Aleksandra Ćiprijanović, Ashia Lewis, Kevin Pedro, Sandeep Madireddy, Brian Nord, Gabriel N. Perdue, Stefan M. Wild

Abstract: In the era of big astronomical surveys, our ability to leverage artificial intelligence algorithms simultaneously for multiple datasets will open new avenues for scientific discovery. Unfortunately, simply training a deep neural network on images from one data domain often leads to very poor performance on any other dataset. Here we develop a Universal Domain Adaptation method DeepAstroUDA, capable of performing semi-supervised domain alignment that can be applied to datasets with different types of class overlap. Extra classes can be present in any of the two datasets, and the method can even be used in the presence of unknown classes. For the first time, we demonstrate the successful use of domain adaptation on two very different observational datasets (from SDSS and DECaLS). We show that our method is capable of bridging the gap between two astronomical surveys, and also performs well for anomaly detection and clustering of unknown data in the unlabeled dataset. We apply our model to two examples of galaxy morphology classification tasks with anomaly detection: 1) classifying spiral and elliptical galaxies with detection of merging galaxies (three classes including one unknown anomaly class); 2) a more granular problem where the classes describe more detailed morphological properties of galaxies, with the detection of gravitational lenses (ten classes including one unknown anomaly class).

Submitted 11 November, 2022; v1 submitted 1 November, 2022; originally announced November 2022.

Comments: 3 figures, 1 table; accepted to Machine Learning and the Physical Sciences - Workshop at the 36th conference on Neural Information Processing Systems (NeurIPS)

Report number: FERMILAB-CONF-22-791-SCD

arXiv:2211.00024

doi 10.1088/2632-2153/acc444

A robust estimator of mutual information for deep learning interpretability

Authors: Davide Piras, Hiranya V. Peiris, Andrew Pontzen, Luisa Lucie-Smith, Ningyuan Guo, Brian Nord

Abstract: We develop the use of mutual information (MI), a well-established metric in information theory, to interpret the inner workings of deep learning models. To accurately estimate MI from a finite number of samples, we present GMM-MI (pronounced "Jimmie"), an algorithm based on Gaussian mixture models that can be applied to both discrete and continuous settings. GMM-MI is computationally efficient, robust to the choice of hyperparameters and provides the uncertainty on the MI estimate due to the finite sample size. We extensively validate GMM-MI on toy data for which the ground truth MI is known, comparing its performance against established mutual information estimators. We then demonstrate the use of our MI estimator in the context of representation learning, working with synthetic data and physical datasets describing highly non-linear processes. We train deep learning models to encode high-dimensional data within a meaningful compressed (latent) representation, and use GMM-MI to quantify both the level of disentanglement between the latent variables, and their association with relevant physical quantities, thus unlocking the interpretability of the latent representation. We make GMM-MI publicly available.

Submitted 23 March, 2023; v1 submitted 31 October, 2022; originally announced November 2022.

Comments: 30 pages, 8 figures. Minor changes to match version accepted for publication in Machine Learning: Science and Technology. GMM-MI available at https://github.com/dpiras/GMM-MI

Journal ref: Machine Learning: Science and Technology, Volume 4, Number 2, 025006, April 2023

arXiv:2209.07071

doi 10.3847/2515-5172/ac9140

Noise2Astro: Astronomical Image Denoising With Self-Supervised NeuralNetworks

Authors: Yunchong Zhang, Brian Nord, Amanda Pagul, Michael Lepori

Abstract: In observational astronomy, noise obscures signals of interest. Large-scale astronomical surveys are growing in size and complexity, which will produce more data and increase the workload of data processing. Developing automated tools, such as convolutional neural networks (CNN), for denoising has become a promising area of research. We investigate the feasibility of CNN-based self-supervised learning algorithms (e.g., Noise2Noise) for denoising astronomical images. We experimented with Noise2Noise on simulated noisy astronomical data. We evaluate the results based on the accuracy of recovering flux and morphology. This algorithm can well recover the flux for Poisson noise ($98.13^{+0.77}_{-0.90}\%$) and for Gaussian noise when image data has a smooth signal profile ($96.45^{+0.80}_{-0.96}\%$).

Submitted 15 September, 2022; originally announced September 2022.

Journal ref: Res. Notes AAS 6 187 (2022)

arXiv:2208.00134

Estimating Cosmological Constraints from Galaxy Cluster Abundance using Simulation-Based Inference

Authors: Moonzarin Reza, Yuanyuan Zhang, Brian Nord, Jason Poh, Aleksandra Ciprijanovic, Louis Strigari

Abstract: Inferring the values and uncertainties of cosmological parameters in a cosmology model is of paramount importance for modern cosmic observations. In this paper, we use the simulation-based inference (SBI) approach to estimate cosmological constraints from a simplified galaxy cluster observation analysis. Using data generated from the Quijote simulation suite and analytical models, we train a machine learning algorithm to learn the probability function between cosmological parameters and the possible galaxy cluster observables. The posterior distribution of the cosmological parameters at a given observation is then obtained by sampling the predictions from the trained algorithm. Our results show that the SBI method can successfully recover the truth values of the cosmological parameters within the 2σ limit for this simplified galaxy cluster analysis, and acquires similar posterior constraints obtained with a likelihood-based Markov Chain Monte Carlo method, the current state-of-the-art method used in similar cosmological studies.

Submitted 29 July, 2022; originally announced August 2022.

Comments: Accepted at the ICML 2022 Workshop on Machine Learning for Astrophysics

Report number: FERMILAB-CONF-22-487-SCD

arXiv:2204.05924

doi 10.3847/1538-4357/ac721b

DeepZipper II: Searching for Lensed Supernovae in Dark Energy Survey Data with Deep Learning

Authors: Robert Morgan, B. Nord, K. Bechtol, A. Möller, W. G. Hartley, S. Birrer, S. J. González, M. Martinez, R. A. Gruendl, E. J. Buckley-Geer, A. J. Shajib, A. Carnero Rosell, C. Lidman, T. Collett, T. M. C. Abbott, M. Aguena, F. Andrade-Oliveira, J. Annis, D. Bacon, S. Bocquet, D. Brooks, D. L. Burke, M. Carrasco Kind, J. Carretero, F. J. Castander , et al. (42 additional authors not shown)

Abstract: Gravitationally lensed supernovae (LSNe) are important probes of cosmic expansion, but they remain rare and difficult to find. Current cosmic surveys likely contain 5-10 LSNe in total while next-generation experiments are expected to contain several hundreds to a few thousands of these systems. We search for these systems in observed Dark Energy Survey (DES) 5-year SN fields -- ten 3-sq. deg. regions of sky imaged in the $griz$ bands approximately every six nights over five years. To perform the search, we utilize the DeepZipper approach: a multi-branch deep learning architecture trained on image-level simulations of LSNe that simultaneously learns spatial and temporal relationships from time series of images. We find that our method obtains a LSN recall of 61.13% and a false positive rate of 0.02% on the DES SN field data. DeepZipper selected 2,245 candidates from a magnitude-limited ($m_i$ $<$ 22.5) catalog of 3,459,186 systems. We employ human visual inspection to review systems selected by the network and find three candidate LSNe in the DES SN fields.

Submitted 20 May, 2022; v1 submitted 12 April, 2022; originally announced April 2022.

Comments: Accepted by ApJ

arXiv:2204.00285 [pdf, other]

doi 10.1093/mnras/stac2342

KilonovaNet: Surrogate Models of Kilonova Spectra with Conditional Variational Autoencoders

Authors: Kamilė Lukošiūtė, Geert Raaijmakers, Zoheyr Doctor, Marcelle Soares-Santos, Brian Nord

Abstract: Detailed radiative transfer simulations of kilonova spectra play an essential role in multimessenger astrophysics. Using the simulation results in parameter inference studies requires building a surrogate model from the simulation outputs to use in algorithms requiring sampling. In this work, we present KilonovaNet, an implementation of conditional variational autoencoders (cVAEs) for the construction of surrogate models of kilonova spectra. This method can be trained on spectra directly, removing overhead time of pre-processing spectra, and greatly speeds up parameter inference time. We build surrogate models of three state-of-the-art kilonova simulation data sets and present in-depth surrogate error evaluation methods, which can in general be applied to any surrogate construction method. By creating synthetic photometric observations from the spectral surrogate, we perform parameter inference for the observed light curve data of GW170817 and compare the results with previous analyses. Given the speed with which KilonovaNet performs during parameter inference, it will serve as a useful tool in future gravitational wave observing runs to quickly analyze potential kilonova candidates.

Submitted 1 April, 2022; originally announced April 2022.

arXiv:2203.08827 [pdf, other]

doi 10.1103/PhysRevD.105.103533

Discovering the building blocks of dark matter halo density profiles with neural networks

Authors: Luisa Lucie-Smith, Hiranya V. Peiris, Andrew Pontzen, Brian Nord, Jeyan Thiyagalingam, Davide Piras

Abstract: The density profiles of dark matter halos are typically modeled using empirical formulae fitted to the density profiles of relaxed halo populations. We present a neural network model that is trained to learn the mapping from the raw density field containing each halo to the dark matter density profile. We show that the model recovers the widely-used Navarro-Frenk-White (NFW) profile out to the virial radius, and can additionally describe the variability in the outer profile of the halos. The neural network architecture consists of a supervised encoder-decoder framework, which first compresses the density inputs into a low-dimensional latent representation, and then outputs $ρ(r)$ for any desired value of radius $r$. The latent representation contains all the information used by the model to predict the density profiles. This allows us to interpret the latent representation by quantifying the mutual information between the representation and the halos' ground-truth density profiles. A two-dimensional representation is sufficient to accurately model the density profiles up to the virial radius; however, a three-dimensional representation is required to describe the outer profiles beyond the virial radius. The additional dimension in the representation contains information about the infalling material in the outer profiles of dark matter halos, thus discovering the splashback boundary of halos without prior knowledge of the halos' dynamical history.

Submitted 13 May, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

Comments: 12 pages, 6 figures. Minor changes to match version accepted for publication in PRD

arXiv:2203.08113 [pdf, ps, other]

Data Preservation for Cosmology

Authors: Marcelo Alvarez, Stephen Bailey, Deborah Bard, Lisa Gerhardt, Julien Guy, Stéphanie Juneau, Anthony Kremin, Brian Nord, David Schlegel, Laurie Stephey, Rollin Thomas, Benjamin Weaver

Abstract: We describe the needs and opportunities for preserving cosmology datasets and simulations, and facilitating their joint analysis beyond the lifetime of individual projects. We recommend that DOE fund a new cosmology data archive center to coordinate this work across the multiple DOE computing facilities.

Submitted 15 March, 2022; originally announced March 2022.

Comments: Submitted to the Proceedings of the US Community Study on the Future of Particle Physics (Snowmass 2021). Feedback and additional co-signers are welcome

arXiv:2203.08056 [pdf, ps, other]

Machine Learning and Cosmology

Authors: Cora Dvorkin, Siddharth Mishra-Sharma, Brian Nord, V. Ashley Villar, Camille Avestruz, Keith Bechtol, Aleksandra Ćiprijanović, Andrew J. Connolly, Lehman H. Garrison, Gautham Narayan, Francisco Villaescusa-Navarro

Abstract: Methods based on machine learning have recently made substantial inroads in many corners of cosmology. Through this process, new computational tools, new perspectives on data collection, model development, analysis, and discovery, as well as new communities and educational pathways have emerged. Despite rapid progress, substantial potential at the intersection of cosmology and machine learning remains untapped. In this white paper, we summarize current and ongoing developments relating to the application of machine learning within cosmology and provide a set of recommendations aimed at maximizing the scientific impact of these burgeoning tools over the coming decade through both technical development as well as the fostering of emerging communities.

Submitted 15 March, 2022; originally announced March 2022.

Comments: Contribution to Snowmass 2021. 32 pages

arXiv:2203.08024 [pdf, other]

Snowmass 2021 CMB-S4 White Paper

Authors: Kevork Abazajian, Arwa Abdulghafour, Graeme E. Addison, Peter Adshead, Zeeshan Ahmed, Marco Ajello, Daniel Akerib, Steven W. Allen, David Alonso, Marcelo Alvarez, Mustafa A. Amin, Mandana Amiri, Adam Anderson, Behzad Ansarinejad, Melanie Archipley, Kam S. Arnold, Matt Ashby, Han Aung, Carlo Baccigalupi, Carina Baker, Abhishek Bakshi, Debbie Bard, Denis Barkats, Darcy Barron, Peter S. Barry, et al. (331 additional authors not shown)

Abstract: This Snowmass 2021 White Paper describes the Cosmic Microwave Background Stage 4 project CMB-S4, which is designed to cross critical thresholds in our understanding of the origin and evolution of the Universe, from the highest energies at the dawn of time through the growth of structure to the present day. We provide an overview of the science case, the technical design, and project plan.

Submitted 15 March, 2022; originally announced March 2022.

Comments: Contribution to Snowmass 2021. arXiv admin note: substantial text overlap with arXiv:1908.01062, arXiv:1907.04473

arXiv:2112.14299 [pdf, other]

DeepAdversaries: Examining the Robustness of Deep Learning Models for Galaxy Morphology Classification

Authors: Aleksandra Ćiprijanović, Diana Kafkes, Gregory Snyder, F. Javier Sánchez, Gabriel Nathan Perdue, Kevin Pedro, Brian Nord, Sandeep Madireddy, Stefan M. Wild

Abstract: With increased adoption of supervised deep learning methods for processing and analysis of cosmological survey data, the assessment of data perturbation effects (that can naturally occur in the data processing and analysis pipelines) and the development of methods that increase model robustness are increasingly important. In the context of morphological classification of galaxies, we study the effects of perturbations in imaging data. In particular, we examine the consequences of using neural networks when training on baseline data and testing on perturbed data. We consider perturbations associated with two primary sources: 1) increased observational noise as represented by higher levels of Poisson noise and 2) data processing noise incurred by steps such as image compression or telescope errors as represented by one-pixel adversarial attacks. We also test the efficacy of domain adaptation techniques in mitigating the perturbation-driven errors. We use classification accuracy, latent space visualizations, and latent space distance to assess model robustness. Without domain adaptation, we find that processing pixel-level errors easily flip the classification into an incorrect class and that higher observational noise makes the model trained on low-noise data unable to classify galaxy morphologies. On the other hand, we show that training with domain adaptation improves model robustness and mitigates the effects of these perturbations, improving the classification accuracy by 23% on data with higher observational noise. Domain adaptation also increases by a factor of ~2.3 the latent space distance between the baseline and the incorrectly classified one-pixel perturbed image, making the model more robust to inadvertent perturbations.

Submitted 6 July, 2022; v1 submitted 28 December, 2021; originally announced December 2021.

Comments: 20 pages, 6 figures, 5 tables; accepted in MLST

Report number: FERMILAB-PUB-21-767-SCD

arXiv:2112.01541 [pdf, other]

doi 10.3847/1538-4357/ac5178

DeepZipper: A Novel Deep Learning Architecture for Lensed Supernovae Identification

Authors: Robert Morgan, B. Nord, K. Bechtol, S. J. González, E. Buckley-Geer, A. Möller, J. W. Park, A. G. Kim, S. Birrer, M. Aguena, J. Annis, S. Bocquet, D. Brooks, A. Carnero Rosell, M. Carrasco Kind, J. Carretero, R. Cawthon, L. N. da Costa, T. M. Davis, J. De Vicente, P. Doel, I. Ferrero, D. Friedel, J. Frieman, J. García-Bellido, et al. (26 additional authors not shown)

Abstract: Large-scale astronomical surveys have the potential to capture data on large numbers of strongly gravitationally lensed supernovae (LSNe). To facilitate timely analysis and spectroscopic follow-up before the supernova fades, an LSN needs to be identified soon after it begins. To quickly identify LSNe in optical survey datasets, we designed ZipperNet, a multi-branch deep neural network that combines convolutional layers (traditionally used for images) with long short-term memory (LSTM) layers (traditionally used for time series). We tested ZipperNet on the task of classifying objects from four categories -- no lens, galaxy-galaxy lens, lensed type Ia supernova, lensed core-collapse supernova -- within high-fidelity simulations of three cosmic survey data sets -- the Dark Energy Survey (DES), Rubin Observatory's Legacy Survey of Space and Time (LSST), and a Dark Energy Spectroscopic Instrument (DESI) imaging survey. Among our results, we find that for the LSST-like dataset, ZipperNet classifies LSNe with a receiver operating characteristic area under the curve of 0.97, predicts the spectroscopic type of the lensed supernovae with 79% accuracy, and demonstrates similarly high performance for LSNe 1-2 epochs after first detection. We anticipate that a model like ZipperNet, which simultaneously incorporates spatial and temporal information, can play a significant role in the rapid identification of lensed transient systems in cosmic survey experiments.

Submitted 19 May, 2022; v1 submitted 2 December, 2021; originally announced December 2021.

Comments: Published in ApJ

Report number: FERMILAB-PUB-21-392-E-SCD

arXiv:2111.14566 [pdf, ps, other]

Building Trustworthy Machine Learning Models for Astronomy

Authors: Michelle Ntampaka, Matthew Ho, Brian Nord

Abstract: Astronomy is entering an era of data-driven discovery, due in part to modern machine learning (ML) techniques enabling powerful new ways to interpret observations. This shift in our scientific approach requires us to consider whether we can trust the black box. Here, we overview methods for an often-overlooked step in the development of ML models: building community trust in the algorithms. Trust is an essential ingredient not just for creating more robust data analysis techniques, but also for building confidence within the astronomy community to embrace machine learning methods and results.

Submitted 29 November, 2021; originally announced November 2021.

Comments: Prepared for the Astronomical Data Analysis Software and Systems (ADASS) XXXI Proceedings

arXiv:2111.00961 [pdf, other]

Robustness of deep learning algorithms in astronomy -- galaxy morphology studies

Authors: A. Ćiprijanović, D. Kafkes, G. N. Perdue, K. Pedro, G. Snyder, F. J. Sánchez, S. Madireddy, S. M. Wild, B. Nord

Abstract: Deep learning models are being increasingly adopted in a wide array of scientific domains, especially to handle the high dimensionality and volume of scientific data. However, these models tend to be brittle due to their complexity and overparametrization, especially to the inadvertent adversarial perturbations that can appear due to common image processing such as compression or blurring that are often seen with real scientific data. It is crucial to understand this brittleness and develop models robust to these adversarial perturbations. To this end, we study the effect of observational noise from the exposure time, as well as the worst-case scenario of a one-pixel attack as a proxy for compression or telescope errors, on the performance of ResNet18 trained to distinguish between galaxies of different morphologies in LSST mock data. We also explore how domain adaptation techniques can help improve model robustness against this type of naturally occurring attack and help scientists build more trustworthy and stable models.

Submitted 2 November, 2021; v1 submitted 1 November, 2021; originally announced November 2021.

Comments: Accepted in: Fourth Workshop on Machine Learning and the Physical Sciences (35th Conference on Neural Information Processing Systems; NeurIPS2021); final version

Report number: FERMILAB-CONF-21-561-SCD

arXiv:2110.02418 [pdf, other]

doi 10.3847/1538-4365/ac470b

The DES Bright Arcs Survey: Candidate Strongly Lensed Galaxy Systems from the Dark Energy Survey 5,000 Sq. Deg. Footprint

Authors: J. H. O'Donnell, R. D. Wilkinson, H. T. Diehl, C. Aros-Bunster, K. Bechtol, S. Birrer, E. J. Buckley-Geer, A. Carnero Rosell, M. Carrasco Kind, L. N. da Costa, S. J. Gonzalez Lozano, R. A. Gruendl, M. Hilton, H. Lin, K. A. Lindgren, J. Martin, A. Pieres, E. S. Rykoff, I. Sevilla-Noarbe, E. Sheldon, C. Sifón, D. L. Tucker, B. Yanny, T. M. C. Abbott, M. Aguena, et al. (57 additional authors not shown)

Abstract: We report the combined results of eight searches for strong gravitational lens systems in the full 5,000 sq. deg. of Dark Energy Survey (DES) observations. The observations accumulated by the end of the third observing season fully covered the DES footprint in 5 filters (grizY), with an $i$-band limiting magnitude (at $10σ$) of 23.44. In four searches, a list of potential candidates was identified using a color and magnitude selection from the object catalogs created from the first three observing seasons. Three other searches were conducted at the locations of previously identified galaxy clusters. Cutout images of potential candidates were then visually scanned using an object viewer. An additional set of candidates came from a data-quality check of a subset of the color-coadd "tiles" created from the full DES six-season data set. A short list of the most promising strong lens candidates was then numerically ranked according to whether or not we judged them to be bona fide strong gravitational lens systems. These searches discovered a diverse set of 247 strong lens candidate systems, of which 81 are identified for the first time. We provide the coordinates, magnitudes, and photometric properties of the lens and source objects, and an estimate of the Einstein radius for 81 new systems and 166 previously reported. This catalog will be of use for selecting interesting systems for detailed follow-up, studies of galaxy cluster and group mass profiles, as well as a training/validation set for automated strong lens searches.

Submitted 3 January, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

Comments: 38 pages, 17 figures, 4 tables, accepted by ApJS

arXiv:2109.09781 [pdf, other]

doi 10.1093/mnras/stac925

Finding quadruply imaged quasars with machine learning. I. Methods

Authors: A. Akhazhanov, A. More, A. Amini, C. Hazlett, T. Treu, S. Birrer, A. Shajib, P. Schechter, C. Lemon, B. Nord, M. Aguena, S. Allam, F. Andrade-Oliveira, J. Annis, D. Brooks, E. Buckley-Geer, D. L. Burke, A. Carnero Rosell, M. Carrasco Kind, J. Carretero, A. Choi, C. Conselice, M. Costanzi, L. N. da Costa, M. E. S. Pereira, et al. (46 additional authors not shown)

Abstract: Strongly lensed quadruply imaged quasars (quads) are extraordinary objects. They are very rare in the sky -- only a few tens are known to date -- and yet they provide unique information about a wide range of topics, including the expansion history and the composition of the Universe, the distribution of stars and dark matter in galaxies, the host galaxies of quasars, and the stellar initial mass function. Finding them in astronomical images is a classic "needle in a haystack" problem, as they are outnumbered by other (contaminant) sources by many orders of magnitude. To solve this problem, we develop state-of-the-art deep learning methods and train them on realistic simulated quads based on real images of galaxies taken from the Dark Energy Survey, with realistic source and deflector models, including the chromatic effects of microlensing. The performance of the best methods on a mixture of simulated and real objects is excellent, yielding area under the receiver operating curve in the range 0.86 to 0.89. Recall is close to 100% down to total magnitude i~21 indicating high completeness, while precision declines from 85% to 70% in the range i~17-21. The methods are extremely fast: training on 2 million samples takes 20 hours on a GPU machine, and 10^8 multi-band cutouts can be evaluated per GPU-hour. The speed and performance of the method pave the way to apply it to large samples of astronomical sources, bypassing the need for photometric pre-selection that is likely to be a major cause of incompleteness in current samples of known quads.

Submitted 20 September, 2021; originally announced September 2021.

Comments: 17 pages, 14 figures, submitted to MNRAS

arXiv:2109.08246 [pdf, other]

DeepGhostBusters: Using Mask R-CNN to Detect and Mask Ghosting and Scattered-Light Artifacts from Optical Survey Images

Authors: Dimitrios Tanoglidis, Aleksandra Ćiprijanović, Alex Drlica-Wagner, Brian Nord, Michael H. L. S. Wang, Ariel Jacob Amsellem, Kathryn Downey, Sydney Jenkins, Diana Kafkes, Zhuoqi Zhang

Abstract: Wide-field astronomical surveys are often affected by the presence of undesirable reflections (often known as "ghosting artifacts" or "ghosts") and scattered-light artifacts. The identification and mitigation of these artifacts is important for rigorous astronomical analyses of faint and low-surface-brightness systems. However, the identification of ghosts and scattered-light artifacts is challenging due to a) the complex morphology of these features and b) the large data volume of current and near-future surveys. In this work, we use images from the Dark Energy Survey (DES) to train, validate, and test a deep neural network (Mask R-CNN) to detect and localize ghosts and scattered-light artifacts. We find that the ability of the Mask R-CNN model to identify affected regions is superior to that of conventional algorithms and traditional convolutional neural network methods. We propose that a multi-step pipeline combining Mask R-CNN segmentation with a classical CNN classifier provides a powerful technique for the automated detection of ghosting and scattered-light artifacts in current and near-future surveys.

Submitted 16 September, 2021; originally announced September 2021.

Comments: 24 pages, 18 figures. Code and data related to this work can be found at: https://github.com/dtanoglidis/DeepGhostBusters

Report number: FERMILAB-PUB-21-374-AE

arXiv:2109.06172 [pdf, other]

SkyPy: A package for modelling the Universe

Authors: Adam Amara, Lucia F. de la Bella, Simon Birrer, Sarah Bridle, Juan Pablo Cordero, Ginevra Favole, Ian Harrison, Ian W. Harry, William G. Hartley, Coleman Krawczyk, Andrew Lundgren, Brian Nord, Laura K. Nuttall, Richard P. Rollins, Philipp Sudek, Sut-Ieng Tam, Nicolas Tessore, Arthur E. Tolley, Keiichi Umetsu, Andrew R. Williamson, Laura Wolz

Abstract: SkyPy is an open-source Python package for simulating the astrophysical sky. It comprises a library of physical and empirical models across a range of observables and a command-line script to run end-to-end simulations. The library provides functions that sample realisations of sources and their associated properties from probability distributions. Simulation pipelines are constructed from these models using a YAML-based configuration syntax, while task scheduling and data dependencies are handled internally and the modular design allows users to interface with external software. SkyPy is developed and maintained by a diverse community of domain experts with a focus on software sustainability and interoperability. By fostering development, it provides a framework for correlated simulations of a range of cosmological probes including galaxy populations, large scale structure, the cosmic microwave background, supernovae and gravitational waves. Version 0.4 implements functions that model various properties of galaxies including luminosity functions, redshift distributions and optical photometry from spectral energy distribution templates. Future releases will provide additional modules, for example, to simulate populations of dark matter halos and model the galaxy-halo connection, making use of existing software packages from the astrophysics community where appropriate.

Submitted 11 September, 2021; originally announced September 2021.

Comments: Published by JOSS. The package is available at https://github.com/skypyproject/skypy. Comments, issues and pull requests are welcome

arXiv:2106.11315 [pdf, other]

doi 10.3847/1538-4357/ac3760

Expediting DECam Multimessenger Counterpart Searches with Convolutional Neural Networks

Authors: Adam Shandonay, Robert Morgan, Keith Bechtol, Clecio R. Bom, Brian Nord, Alyssa Garcia, Ben Henghes, Kenneth Herner, Megan Tabbutt, Antonella Palmese, Luidhy Santana-Silva, Marcelle Soares-Santos, Mandeep S. S. Gill, Juan Garcia-Bellido

Abstract: Searches for counterparts to multimessenger events with optical imagers use difference imaging to detect new transient sources. However, even with existing artifact detection algorithms, this process simultaneously returns several classes of false positives: false detections from poor quality image subtractions, false detections from low signal-to-noise images, and detections of pre-existing variable sources. Currently, human visual inspection to remove the false positives is a central part of multimessenger follow-up observations, but when next generation gravitational wave and neutrino detectors come online and increase the rate of multimessenger events, the visual inspection process will be prohibitively expensive. We approach this problem with two convolutional neural networks operating on the difference imaging outputs. The first network focuses on removing false detections and demonstrates an accuracy of 92 percent on our dataset. The second network focuses on sorting all real detections by the probability of being a transient source within a host galaxy and distinguishes between various classes of images that previously required additional human inspection. We find the number of images requiring human inspection will decrease by a factor of 1.5 using our approach alone and a factor of 3.6 using our approach in combination with existing algorithms, facilitating rapid multimessenger counterpart identification by the astronomical community.

Submitted 20 May, 2022; v1 submitted 21 June, 2021; originally announced June 2021.

Comments: Published in ApJ

Report number: FERMILAB-PUB-21-268-AE

arXiv:2106.09761 [pdf, other]

Unsupervised Resource Allocation with Graph Neural Networks

Authors: Miles Cranmer, Peter Melchior, Brian Nord

Abstract: We present an approach for maximizing a global utility function by learning how to allocate resources in an unsupervised way. We expect interactions between allocation targets to be important and therefore propose to learn the reward structure for near-optimal allocation policies with a GNN. By relaxing the resource constraint, we can employ gradient-based optimization in contrast to more standard evolutionary algorithms. Our algorithm is motivated by a problem in modern astronomy, where one needs to select, based on limited initial information, among $10^9$ galaxies those whose detailed measurement will lead to optimal inference of the composition of the universe. Our technique presents a way of flexibly learning an allocation strategy by only requiring forward simulators for the physics of interest and the measurement process. We anticipate that our technique will also find applications in a range of resource allocation problems.

Submitted 17 June, 2021; originally announced June 2021.

Comments: Accepted to PMLR/contributed oral at NeurIPS 2020 Pre-registration Workshop. Code at https://github.com/MilesCranmer/gnn_resource_allocation

arXiv:2105.10524 [pdf, other]

doi 10.1016/j.ascom.2021.100474

A Machine Learning Approach to the Detection of Ghosting and Scattered Light Artifacts in Dark Energy Survey Images

Authors: Chihway Chang, Alex Drlica-Wagner, Stephen M. Kent, Brian Nord, Donah Michelle Wang, Michael H. L. S. Wang

Abstract: Astronomical images are often plagued by unwanted artifacts that arise from a number of sources including imperfect optics, faulty image sensors, cosmic ray hits, and even airplanes and artificial satellites. Spurious reflections (known as "ghosts") and the scattering of light off the surfaces of a camera and/or telescope are particularly difficult to avoid. Detecting ghosts and scattered light efficiently in large cosmological surveys that will acquire petabytes of data can be a daunting task. In this paper, we use data from the Dark Energy Survey to develop, train, and validate a machine learning model to detect ghosts and scattered light using convolutional neural networks. The model architecture and training procedure are discussed in detail, and the performance on the training and validation set is presented. Testing is performed on data and results are compared with those from a ray-tracing algorithm. As a proof of principle, we have shown that our method is promising for the Rubin Observatory and beyond.

Submitted 21 May, 2021; originally announced May 2021.

Comments: accepted for publication in "Astronomy and Computing"

Report number: FERMILAB-TM-2723-E-SCD

arXiv:2103.13932 [pdf, other]

QUOTAS: A new research platform for the data-driven investigation of black holes

Authors: Priyamvada Natarajan, Kwok Sun Tang, Robert McGibbon, Sadegh Khochfar, Brian Nord, Steinn Sigurdsson, Joe Tricot, Nico Cappelluti, Daniel George, Jack Hidary

Abstract: We present QUOTAS, a novel research platform for the data-driven investigation of super-massive black hole (SMBH) populations. While SMBH data sets -- observations and simulations -- have grown rapidly in complexity and abundance, our computational environments and analysis tools have not matured commensurately to exhaust opportunities for discovery. Motivated to explore BH host galaxy and the parent dark matter halo connection, in this pilot version of QUOTAS, we assemble and co-locate the high-redshift, luminous quasar population at $z \geq 3$ alongside simulated data of the same epochs. Leveraging machine learning (ML) algorithms, we expand simulation volumes that successfully replicate halo populations beyond the training set. Training ML on the Illustris-TNG300 simulation that includes baryonic physics, we populate the larger LEGACY Expanse dark matter-only box with quasars. Our first science results comparing observational and ML simulated quasars at $z \sim 3$ reveal that while the recovered Black Hole Mass Functions and clustering are in good agreement, simulated SMBHs fail to accrete, shine and grow at high enough rates to match observed quasars. We conclude that sub-grid models of mass accretion and SMBH feedback implemented in Illustris-TNG300 do not reproduce their observed mass growth. QUOTAS demonstrates the power of ML, both for analyzing large complex datasets, and offering a unique opportunity to interrogate our theoretical model assumptions. We deploy ML again to derive and devise an optimal survey strategy for bringing the undetected lower luminosity quasar population into view. QUOTAS and all related materials are publicly available at the Google Kaggle platform.

Submitted 14 April, 2023; v1 submitted 25 March, 2021; originally announced March 2021.

Comments: Revised version: 38 pages, 4 tables and 14 figures, accepted for publication in ApJ

arXiv:2103.01373 [pdf, other]

doi 10.1093/mnras/stab1677

DeepMerge II: Building Robust Deep Learning Algorithms for Merging Galaxy Identification Across Domains

Authors: A. Ćiprijanović, D. Kafkes, K. Downey, S. Jenkins, G. N. Perdue, S. Madireddy, T. Johnston, G. F. Snyder, B. Nord

Abstract: In astronomy, neural networks are often trained on simulation data with the prospect of being used on telescope observations. Unfortunately, training a model on simulation data and then applying it to instrument data leads to a substantial and potentially even detrimental decrease in model accuracy on the new target dataset. Simulated and instrument data represent different data domains, and for an algorithm to work in both, domain-invariant learning is necessary. Here we employ domain adaptation techniques -- Maximum Mean Discrepancy (MMD) as an additional transfer loss and Domain Adversarial Neural Networks (DANNs) -- and demonstrate their viability to extract domain-invariant features within the astronomical context of classifying merging and non-merging galaxies. Additionally, we explore the use of Fisher loss and entropy minimization to enforce better in-domain class discriminability. We show that the addition of each domain adaptation technique improves the performance of a classifier when compared to conventional deep learning algorithms. We demonstrate this on two examples: between two Illustris-1 simulated datasets of distant merging galaxies, and between Illustris-1 simulated data of nearby merging galaxies and observed data from the Sloan Digital Sky Survey. The use of domain adaptation techniques in our experiments leads to an increase of target domain classification accuracy of up to ${\sim}20\%$. With further development, these techniques will allow astronomers to successfully implement neural network models trained on simulation data to efficiently detect and study astrophysical objects in current and future large-scale astronomical surveys.

Submitted 1 March, 2021; originally announced March 2021.

Comments: Submitted to MNRAS; 21 pages, 9 figures, 9 tables

Report number: FERMILAB-PUB-21-072-SCD

Journal ref: MNRAS, Volume 506, Issue 1, September 2021, Page 677

arXiv:2102.13123 [pdf, other]

doi 10.1093/mnras/stab2229

DeepSZ: Identification of Sunyaev-Zel'dovich Galaxy Clusters using Deep Learning

Authors: Zhen Lin, Nicholas Huang, Camille Avestruz, W. L. Kimmy Wu, Shubhendu Trivedi, João Caldeira, Brian Nord

Abstract: Galaxy clusters identified from the Sunyaev-Zel'dovich (SZ) effect are a key ingredient in multi-wavelength cluster-based cosmology. We present a comparison between two methods of cluster identification: the standard Matched Filter (MF) method in SZ cluster finding and a method using Convolutional Neural Networks (CNN). We further implement and show results for a 'combined' identifier. We apply the methods to simulated millimeter maps for several observing frequencies for an SPT-3G-like survey. There are some key differences between the methods. The MF method requires image pre-processing to remove point sources and a model for the noise, while the CNN method requires very little pre-processing of images. Additionally, the CNN requires tuning of hyperparameters in the model and takes as input cutout images of the sky. Specifically, we use the CNN to classify whether or not an 8 arcmin $\times$ 8 arcmin cutout of the sky contains a cluster. We compare differences in purity and completeness. The MF signal-to-noise ratio depends on both mass and redshift. Our CNN, trained for a given mass threshold, captures a different set of clusters than the MF, some of which have SNR below the MF detection threshold. However, the CNN tends to mis-classify cutouts whose clusters are located near the edge of the cutout, which can be mitigated with staggered cutouts. We leverage the complementarity of the two methods, combining the scores from each method for identification. The purity and completeness of the MF alone are both 0.61, assuming a standard detection threshold. The purity and completeness of the CNN alone are 0.59 and 0.61. The combined classification method yields 0.60 and 0.77, a significant increase for completeness with a modest decrease in purity. We advocate for combined methods that increase the confidence of many lower signal-to-noise clusters.

Submitted 8 March, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

Report number: FERMILAB-PUB-21-077-SCD

arXiv:2102.02830 [pdf, other]

doi 10.21105/joss.02854

deeplenstronomy: A dataset simulation package for strong gravitational lensing

Authors: Robert Morgan, Brian Nord, Simon Birrer, Joshua Yao-Yu Lin, Jason Poh

Abstract: Automated searches for strong gravitational lensing in optical imaging survey datasets often employ machine learning and deep learning approaches. These techniques require more example systems to train the algorithms than have presently been discovered, which creates a need for simulated images as training dataset supplements. This work introduces and summarizes deeplenstronomy, an open-source Python package that enables efficient, large-scale, and reproducible simulation of images of astronomical systems. A full suite of unit tests, documentation, and example notebooks are available at https://deepskies.github.io/deeplenstronomy/ .

Submitted 4 February, 2021; originally announced February 2021.

Comments: Published in the Journal of Open Source Software

Journal ref: Journal of Open Source Software, 6(58), 2854 (2021)

arXiv:2011.10577 [pdf, other]

doi 10.1103/PhysRevD.109.063524

Deep learning insights into cosmological structure formation

Authors: Luisa Lucie-Smith, Hiranya V. Peiris, Andrew Pontzen, Brian Nord, Jeyan Thiyagalingam

Abstract: The evolution of linear initial conditions present in the early universe into extended halos of dark matter at late times can be computed using cosmological simulations. However, a theoretical understanding of this complex process remains elusive; in particular, the role of anisotropic information in the initial conditions in establishing the final mass of dark matter halos remains a long-standing puzzle. Here, we build a deep learning framework to investigate this question. We train a three-dimensional convolutional neural network (CNN) to predict the mass of dark matter halos from the initial conditions, and quantify in full generality the amounts of information in the isotropic and anisotropic aspects of the initial density field about final halo masses. We find that anisotropies add a small, albeit statistically significant amount of information over that contained within spherical averages of the density field about final halo mass. However, the overall scatter in the final mass predictions does not change qualitatively with this additional information, only decreasing from 0.9 dex to 0.7 dex. Given such a small improvement, our results demonstrate that isotropic aspects of the initial density field essentially saturate the relevant information about final halo mass. Therefore, instead of searching for information directly encoded in initial conditions anisotropies, a more promising route to accurate, fast halo mass predictions is to add approximate dynamical information based e.g. on perturbation theory. More broadly, our results indicate that deep learning frameworks can provide a powerful tool for extracting physical insight into cosmological structure formation.

Submitted 1 March, 2024; v1 submitted 20 November, 2020; originally announced November 2020.

Comments: 17 pages, 10 figures. Accepted in PRD

Journal ref: Phys. Rev. D 109, 063524 (2024)

arXiv:2011.03591 [pdf, other]

Domain adaptation techniques for improved cross-domain study of galaxy mergers

Authors: A. Ćiprijanović, D. Kafkes, S. Jenkins, K. Downey, G. N. Perdue, S. Madireddy, T. Johnston, B. Nord

Abstract: In astronomy, neural networks are often trained on simulated data with the prospect of being applied to real observations. Unfortunately, simply training a deep neural network on images from one domain does not guarantee satisfactory performance on new images from a different domain. The ability to share cross-domain knowledge is the main advantage of modern deep domain adaptation techniques. Here we demonstrate the use of two techniques - Maximum Mean Discrepancy (MMD) and adversarial training with Domain Adversarial Neural Networks (DANN) - for the classification of distant galaxy mergers from the Illustris-1 simulation, where the two domains presented differ only due to inclusion of observational noise. We show how the addition of either MMD or adversarial training greatly improves the performance of the classifier on the target domain when compared to conventional machine learning algorithms, thereby demonstrating great promise for their use in astronomy.

Submitted 13 November, 2020; v1 submitted 6 November, 2020; originally announced November 2020.

Comments: Accepted in: Machine Learning and the Physical Sciences - Workshop at the 34th Conference on Neural Information Processing Systems (NeurIPS); final version

Report number: FERMILAB-CONF-20-582-SCD

arXiv:2008.12619 [pdf, other]

doi 10.3847/1538-4357/ac1596

CMB-S4: Forecasting Constraints on Primordial Gravitational Waves

Authors: CMB-S4 Collaboration, Kevork Abazajian, Graeme E. Addison, Peter Adshead, Zeeshan Ahmed, Daniel Akerib, Aamir Ali, Steven W. Allen, David Alonso, Marcelo Alvarez, Mustafa A. Amin, Adam Anderson, Kam S. Arnold, Peter Ashton, Carlo Baccigalupi, Debbie Bard, Denis Barkats, Darcy Barron, Peter S. Barry, James G. Bartlett, Ritoban Basu Thakur, Nicholas Battaglia, Rachel Bean, Chris Bebek, et al. (212 additional authors not shown)

Abstract: CMB-S4---the next-generation ground-based cosmic microwave background (CMB) experiment---is set to significantly advance the sensitivity of CMB measurements and enhance our understanding of the origin and evolution of the Universe, from the highest energies at the dawn of time through the growth of structure to the present day. Among the science cases pursued with CMB-S4, the quest for detecting primordial gravitational waves is a central driver of the experimental design. This work details the development of a forecasting framework that includes a power-spectrum-based semi-analytic projection tool, targeted explicitly towards optimizing constraints on the tensor-to-scalar ratio, $r$, in the presence of Galactic foregrounds and gravitational lensing of the CMB. This framework is unique in its direct use of information from the achieved performance of current Stage 2--3 CMB experiments to robustly forecast the science reach of upcoming CMB-polarization endeavors. The methodology allows for rapid iteration over experimental configurations and offers a flexible way to optimize the design of future experiments given a desired scientific goal. To form a closed-loop process, we couple this semi-analytic tool with map-based validation studies, which allow for the injection of additional complexity and verification of our forecasts with several independent analysis methods. We document multiple rounds of forecasts for CMB-S4 using this process and the resulting establishment of the current reference design of the primordial gravitational-wave component of the Stage-4 experiment, optimized to achieve our science goals of detecting primordial gravitational waves for $r > 0.003$ at greater than $5\sigma$, or, in the absence of a detection, of reaching an upper limit of $r < 0.001$ at $95\%$ CL.

Submitted 27 August, 2020; originally announced August 2020.

Comments: 24 pages, 8 figures, 9 tables, submitted to ApJ. arXiv admin note: text overlap with arXiv:1907.04473

arXiv:2005.07710 [pdf, other]

doi 10.3847/1538-3881/abac0a

Flare Statistics for Young Stars from a Convolutional Neural Network Analysis of $\textit{TESS}$ Data

Authors: Adina D. Feinstein, Benjamin T. Montet, Megan Ansdell, Brian Nord, Jacob L. Bean, Maximilian N. Günther, Michael A. Gully-Santiago, Joshua E. Schlieder

Abstract: All-sky photometric time-series missions have allowed for the monitoring of thousands of young stars ($t_{\rm age} < 800$ Myr) to understand the evolution of stellar activity. Here we developed a convolutional neural network (CNN), $\texttt{stella}$, specifically trained to find flares in $\textit{Transiting Exoplanet Survey Satellite}$ ($\textit{TESS}$) short-cadence data. We applied the network to 3200 young stars to evaluate flare rates as a function of age and spectral type. The CNN takes a few seconds to identify flares on a single light curve. We also measured rotation periods for 1500 of our targets and find that flares of all amplitudes are present across all spot phases, suggesting high spot coverage across the entire surface. Additionally, flare rates and amplitudes decrease for stars $t_{\rm age} > 50$ Myr across all temperatures $T_{\rm eff} \geq 4000$ K, while stars from $2300 \leq T_{\rm eff} < 4000$ K show no evolution across 800 Myr. Stars of $T_{\rm eff} \leq 4000$ K also show higher flare rates and amplitudes across all ages. We investigate the effects of high flare rates on photoevaporative atmospheric mass loss for young planets. In the presence of flares, planets lose 4-7% more atmosphere over the first 1 Gyr. $\texttt{stella}$ is an open-source Python toolkit hosted on GitHub and PyPI.

Submitted 29 July, 2020; v1 submitted 15 May, 2020; originally announced May 2020.

Comments: 21 pages, 17 figures, 1 table, AJ accepted

arXiv:2004.11981 [pdf, other]

DeepMerge: Classifying High-redshift Merging Galaxies with Deep Neural Networks

Authors: A. Ćiprijanović, G. F. Snyder, B. Nord, J. E. G. Peek

Abstract: We investigate and demonstrate the use of convolutional neural networks (CNNs) for the task of distinguishing between merging and non-merging galaxies in simulated images, and for the first time at high redshifts (i.e. $z=2$). We extract images of merging and non-merging galaxies from the Illustris-1 cosmological simulation and apply observational and experimental noise that mimics that from the Hubble Space Telescope; the data without noise form a "pristine" data set and that with noise form a "noisy" data set. The test set classification accuracy of the CNN is $79\%$ for pristine and $76\%$ for noisy. The CNN outperforms a Random Forest classifier, which was shown to be superior to conventional one- or two-dimensional statistical methods (Concentration, Asymmetry, the Gini, $M_{20}$ statistics etc.), which are commonly used when classifying merging galaxies. We also investigate the selection effects of the classifier with respect to merger state and star formation rate, finding no bias. Finally, we extract Grad-CAMs (Gradient-weighted Class Activation Mapping) from the results to further assess and interrogate the fidelity of the classification model.

Submitted 24 April, 2020; originally announced April 2020.

Comments: 17 pages, 8 figures, submitted to Astronomy & Computing

arXiv:2002.11124 [pdf, other]

doi 10.1103/PhysRevD.102.023509

Dark Energy Survey Year 1 Results: Cosmological Constraints from Cluster Abundances and Weak Lensing

Authors: DES Collaboration, Tim Abbott, Michel Aguena, Alex Alarcon, Sahar Allam, Steve Allen, James Annis, Santiago Avila, David Bacon, Alberto Bermeo, Gary Bernstein, Emmanuel Bertin, Sunayana Bhargava, Sebastian Bocquet, David Brooks, Dillon Brout, Elizabeth Buckley-Geer, David Burke, Aurelio Carnero Rosell, Matias Carrasco Kind, Jorge Carretero, Francisco Javier Castander, Ross Cawthon, Chihway Chang, Xinyi Chen, et al. (107 additional authors not shown)

Abstract: We perform a joint analysis of the counts and weak lensing signal of redMaPPer clusters selected from the Dark Energy Survey (DES) Year 1 dataset. Our analysis uses the same shear and source photometric redshift estimates as were used in the DES combined probes analysis. Our analysis results in surprisingly low values for $S_8 = \sigma_8(\Omega_{\rm m}/0.3)^{0.5} = 0.65 \pm 0.04$, driven by a low matter density parameter, $\Omega_{\rm m}=0.179^{+0.031}_{-0.038}$, with $\sigma_8-\Omega_{\rm m}$ posteriors in $2.4\sigma$ tension with the DES Y1 3x2pt results, and in $5.6\sigma$ with the Planck CMB analysis. These results include the impact of post-unblinding changes to the analysis, which did not improve the level of consistency with other data sets compared to the results obtained at the unblinding. The fact that multiple cosmological probes (supernovae, baryon acoustic oscillations, cosmic shear, galaxy clustering and CMB anisotropies), and other galaxy cluster analyses all favor significantly higher matter densities suggests the presence of systematic errors in the data or an incomplete modeling of the relevant physics. Cross checks with X-ray and microwave data, as well as independent constraints on the observable--mass relation from SZ selected clusters, suggest that the discrepancy resides in our modeling of the weak lensing signal rather than the cluster abundance. Repeating our analysis using a higher richness threshold ($\lambda \ge 30$) significantly reduces the tension with other probes, and points to one or more richness-dependent effects not captured by our model.

Submitted 25 February, 2020; originally announced February 2020.

Comments: 35 pages, 20 figures, submitted to Physical Review D

Journal ref: Phys. Rev. D 102, 023509 (2020)

arXiv:2001.00674 [pdf]

Reframing astronomical research through an anticolonial lens -- for TMT and beyond

Authors: Chanda Prescod-Weinstein, Lucianne M. Walkowicz, Sarah Tuttle, Brian Nord, Hilding R. Neilson

Abstract: This white paper explains that professional astronomy has benefited from settler colonial white supremacist patriarchy. We explicate the impact that this has had on communities which are not the beneficiaries of colonialism and white supremacy. We advocate for astronomers to reject these benefits in the future, and we make proposals regarding the steps involved in rejecting colonialist white supremacy's benefits. We center ten recommendations on the timely issue of what to do about the Thirty Meter Telescope (TMT) on Maunakea in Hawaii. This paper is written in solidarity with and support of efforts by Native Hawaiian scientists (e.g. Kahanamoku et al. 2019).

Submitted 2 January, 2020; originally announced January 2020.

Comments: 8 pages, APC (State of the Profession) White Paper submitted to the Astro2020 Decadal Survey

arXiv:1911.06341 [pdf, other]

Deep Learning in Wide-field Surveys: Fast Analysis of Strong Lenses in Ground-based Cosmic Experiments

Authors: Clecio Bom, Jason Poh, Brian Nord, Manuel Blanco-Valentin, Luciana Dias

Abstract: Searches and analyses of strong gravitational lenses are challenging due to the rarity and image complexity of these astronomical objects. Next-generation surveys (both ground- and space-based) will provide more opportunities to derive science from these objects, but only if they can be analyzed on realistic time-scales. Currently, these analyses are expensive. In this work, we present a regression analysis with uncertainty estimates using deep learning models to measure four parameters of strong gravitational lenses in simulated Dark Energy Survey data. Using only $gri$-band images, we predict Einstein radius, lens velocity dispersion, and lens redshift to within $10-15\%$ of truth values and source redshift to $30\%$ of truth values, along with predictive uncertainties. This work helps to take a step along the path of faster analyses of strong lenses with deep learning frameworks.

Submitted 14 November, 2019; originally announced November 2019.

arXiv:1911.06259 [pdf, other]

Restricted Boltzmann Machines for galaxy morphology classification with a quantum annealer

Authors: João Caldeira, Joshua Job, Steven H. Adachi, Brian Nord, Gabriel N. Perdue

Abstract: We present the application of Restricted Boltzmann Machines (RBMs) to the task of astronomical image classification using a quantum annealer built by D-Wave Systems. Morphological analysis of galaxies provides critical information for studying their formation and evolution across cosmic time scales. We compress galaxy images using principal component analysis to fit a representation on the quantum hardware. Then, we train RBMs with discriminative and generative algorithms, including contrastive divergence and hybrid generative-discriminative approaches, to classify different galaxy morphologies. The methods we compare include Quantum Annealing (QA), Markov Chain Monte Carlo (MCMC) Gibbs Sampling, and Simulated Annealing (SA) as well as machine learning algorithms like gradient boosted decision trees. We find that RBMs implemented on D-Wave hardware perform well, and that they show some classification performance advantages on small datasets, but they don't offer a broadly strategic advantage for this task. During this exploration, we analyzed the steps required for Boltzmann sampling with the D-Wave 2000Q, including a study of temperature estimation, and examined the impact of qubit noise by comparing and contrasting the original D-Wave 2000Q to the lower-noise version recently made available. While these analyses ultimately had minimal impact on the performance of the RBMs, we include them for reference.

Submitted 13 February, 2020; v1 submitted 14 November, 2019; originally announced November 2019.

Comments: 15 pages; LaTeX; 14 figures

Report number: FERMILAB-PUB-19-546-QIS-SCD

arXiv:1911.05796 [pdf, ps, other]

Response to NITRD, NCO, NSF Request for Information on "Update to the 2016 National Artificial Intelligence Research and Development Strategic Plan"

Authors: J. Amundson, J. Annis, C. Avestruz, D. Bowring, J. Caldeira, G. Cerati, C. Chang, S. Dodelson, D. Elvira, A. Farahi, K. Genser, L. Gray, O. Gutsche, P. Harris, J. Kinney, J. B. Kowalkowski, R. Kutschke, S. Mrenna, B. Nord, A. Para, K. Pedro, G. N. Perdue, A. Scheinker, P. Spentzouris, J. St. John, et al. (5 additional authors not shown)

Abstract: We present a response to the 2018 Request for Information (RFI) from the NITRD, NCO, NSF regarding the "Update to the 2016 National Artificial Intelligence Research and Development Strategic Plan." Through this document, we provide a response to the question of whether and how the National Artificial Intelligence Research and Development Strategic Plan (NAIRDSP) should be updated from the perspective of Fermilab, America's premier national laboratory for High Energy Physics (HEP). We believe the NAIRDSP should be extended in light of the rapid pace of development and innovation in the field of Artificial Intelligence (AI) since 2016, and present our recommendations below. AI has profoundly impacted many areas of human life, promising to dramatically reshape society --- e.g., economy, education, science --- in the coming years. We are still early in this process. It is critical to invest now in this technology to ensure it is safe and deployed ethically. Science and society both have a strong need for accuracy, efficiency, transparency, and accountability in algorithms, making investments in scientific AI particularly valuable. Thus far the US has been a leader in AI technologies, and we believe as a national laboratory it is crucial to help maintain and extend this leadership. Moreover, investments in AI will be important for maintaining US leadership in the physical sciences.

Submitted 4 November, 2019; originally announced November 2019.

Report number: FERMILAB-FN-1092-SCD

arXiv:1911.02479 [pdf, ps, other]

Algorithms and Statistical Models for Scientific Discovery in the Petabyte Era

Authors: Brian Nord, Andrew J. Connolly, Jamie Kinney, Jeremy Kubica, Gautaum Narayan, Joshua E. G. Peek, Chad Schafer, Erik J. Tollerud, Camille Avestruz, G. Jogesh Babu, Simon Birrer, Douglas Burke, João Caldeira, Douglas A. Caldwell, Joleen K. Carlberg, Yen-Chi Chen, Chuanfei Dong, Eric D. Feigelson, V. Zach Golkhou, Vinay Kashyap, T. S. Li, Thomas Loredo, Luisa Lucie-Smith, Kaisey S. Mandel, J. R. Martínez-Galarza, et al. (13 additional authors not shown)

Abstract: The field of astronomy has arrived at a turning point in terms of size and complexity of both datasets and scientific collaboration. Commensurately, algorithms and statistical models have begun to adapt (e.g., via the onset of artificial intelligence), which itself presents new challenges and opportunities for growth. This white paper aims to offer guidance and ideas for how we can evolve our technical and collaborative frameworks to promote efficient algorithmic development and take advantage of opportunities for scientific discovery in the petabyte era. We discuss challenges for discovery in large and complex data sets; challenges and requirements for the next stage of development of statistical methodologies and algorithmic tool sets; how we might change our paradigms of collaboration and education; and the ethical implications of scientists' contributions to widely applicable algorithms and computational modeling. We start with six distinct recommendations that are supported by the commentary following them. This white paper is related to a larger corpus of effort that has taken place within and around the Petabytes to Science Workshops (https://petabytestoscience.github.io/).

Submitted 4 November, 2019; originally announced November 2019.

Comments: arXiv admin note: substantial text overlap with arXiv:1905.05116

Report number: FERMILAB-FN-1093-A-AE-SCD

arXiv:1911.02162 [pdf, other]

doi 10.1093/mnras/staa200

Observation and Confirmation of Nine Strong Lensing Systems in Dark Energy Survey Year 1 Data

Authors: B. Nord, E. Buckley-Geer, H. Lin, N. Kuropatkin, T. Collett, D. L. Tucker, H. T. Diehl, A. Agnello, A. Amara, T. M. C. Abbott, S. Allam, J. Annis, S. Avila, K. Bechtol, D. Brooks, D. L. Burke, A. Carnero Rosell, M. Carrasco Kind, J. Carretero, C. E. Cunha, L. N. da Costa, C. Davis, J. De Vicente, P. Doel, T. F. Eifler , et al. (42 additional authors not shown)

Abstract: We describe the observation and confirmation of nine new strong gravitational lenses discovered in Year 1 data from the Dark Energy Survey (DES). We created candidate lists based on a) galaxy group and cluster samples and b) photometrically selected galaxy samples. We selected 46 candidates through visual inspection and then used the Gemini Multi-Object Spectrograph (GMOS) at the Gemini South telescope to acquire spectroscopic follow-up of 21 of these candidates. Through analysis of this spectroscopic follow-up data, we confirmed nine new lensing systems and rejected 2 candidates, but the analysis was inconclusive on 10 candidates. For each of the confirmed systems, we report measured spectroscopic properties, estimated Einstein radii, and estimated enclosed masses. The sources that we targeted have an i-band surface brightness range of iSB ~ 22 - 24 mag arcsec^-2 and a spectroscopic redshift range of zspec ~ 0.8 - 2.6. The lens galaxies have a photometric redshift range of zlens ~ 0.3 - 0.7. The lensing systems range in image-lens separation 2 - 9 arcsec and in enclosed mass 10^12 - 10^13 Msol.

Submitted 5 November, 2019; originally announced November 2019.

Report number: FERMILAB-PUB-17-042-PPD

arXiv:1910.14088 [pdf]

A Need for Dedicated Outreach Expertise and Online Programming: Astro2020 Science White Paper

Authors: Amanda E Bauer, Britt Lundgren, William O'Mullane, Lauren Corlies, Megan E Schwamb, Brian Nord, Dara J Norman

Abstract: Maximizing the public impact of astronomy projects in the next decade requires NSF-funded centers to support the development of online, mobile-friendly outreach and education activities. EPO teams with astronomy, education, and web development expertise should be in place to build accessible programs at scale and support astronomers doing outreach.

Submitted 30 October, 2019; originally announced October 2019.

Comments: 9 pages, Astro2020 Science White Paper. arXiv admin note: substantial text overlap with arXiv:1905.05116

arXiv:1910.08376 [pdf]

The Growing Importance of a Tech Savvy Astronomy and Astrophysics Workforce

Authors: Dara Norman, Kelle Cruz, Vandana Desai, Britt Lundgren, Eric Bellm, Frossie Economou, Arfon Smith, Amanda Bauer, Brian Nord, Chad Schafer, Gautham Narayan, Ting Li, Erik Tollerud, Brigitta Sipocz, Heloise Stevance, Timothy Pickering, Manodeep Sinha, Joseph Harrington, Jeyhan Kartaltepe, Dany Vohl, Adrian Price-Whelan, Brian Cherinka, Chi-kwan Chan, Benjamin Weiner, Maryam Modjaz , et al. (4 additional authors not shown)

Abstract: Fundamental coding and software development skills are increasingly necessary for success in nearly every aspect of astronomical and astrophysical research as large surveys and high resolution simulations become the norm. However, professional training in these skills is inaccessible or impractical for many members of our community. Students and professionals alike have been expected to acquire these skills on their own, apart from formal classroom curriculum or on-the-job training. Despite the recognized importance of these skills, there is little opportunity to develop them - even for interested researchers. To ensure a workforce capable of taking advantage of the computational resources and the large volumes of data coming in the next decade, we must identify and support ways to make software development training widely accessible to community members, regardless of affiliation or career level. To develop and sustain a technology capable astronomical and astrophysical workforce, we recommend that agencies make funding and other resources available in order to encourage, support and, in some cases, require progress on necessary training, infrastructure and policies. In this white paper, we focus on recommendations for how funding agencies can lead in the promotion of activities to support the astronomy and astrophysical workforce in the 2020s.

Submitted 17 October, 2019; originally announced October 2019.

Comments: Submitted as an ASTRO2020 Decadal Survey APC position paper. arXiv admin note: substantial text overlap with arXiv:1905.05116

arXiv:1907.06981 [pdf]

Astro2020 APC White Paper: Elevating the Role of Software as a Product of the Research Enterprise

Authors: Arfon M. Smith, Dara Norman, Kelle Cruz, Vandana Desai, Eric Bellm, Britt Lundgren, Frossie Economou, Brian D. Nord, Chad Schafer, Gautham Narayan, Joseph Harrington, Erik Tollerud, Brigitta Sipőcz, Timothy Pickering, Molly S. Peeples, Bruce Berriman, Peter Teuben, David Rodriguez, Andre Gradvohl, Lior Shamir, Alice Allen, Joel R. Brownstein, Adam Ginsburg, Manodeep Sinha, Cameron Hummels , et al. (20 additional authors not shown)

Abstract: Software is a critical part of modern research, and yet there are insufficient mechanisms in the scholarly ecosystem to acknowledge, cite, and measure the impact of research software. The majority of academic fields rely on a one-dimensional credit model whereby academic articles (and their associated citations) are the dominant factor in the success of a researcher's career. In the petabyte era of astronomical science, citing software and measuring its impact enables academia to retain and reward researchers that make significant software contributions. These highly skilled researchers must be retained to maximize the scientific return from petabyte-scale datasets. Evolving beyond the one-dimensional credit model requires overcoming several key challenges, including the current scholarly ecosystem and scientific culture issues. This white paper will present these challenges and suggest practical solutions for elevating the role of software as a product of the research enterprise.

Submitted 14 July, 2019; originally announced July 2019.

Comments: arXiv admin note: substantial text overlap with arXiv:1905.05116

Showing 1–50 of 164 results for author: Nord, B