Search | arXiv e-print repository

Domain Adaptation for Measurements of Strong Gravitational Lenses

Authors: Paxson Swierc, Megan Zhao, Aleksandra Ćiprijanović, Brian Nord

Abstract: Upcoming surveys are predicted to discover galaxy-scale strong lenses on the order of $10^5$, making deep learning methods necessary in lensing data analysis. Currently, there is insufficient real lensing data to train deep learning algorithms, but the alternative of training only on simulated data results in poor performance on real data. Domain Adaptation may be able to bridge the gap between simulated and real datasets. We utilize domain adaptation for the estimation of Einstein radius ($Θ_E$) in simulated galaxy-scale gravitational lensing images with different levels of observational realism. We evaluate two domain adaptation techniques: Domain Adversarial Neural Networks (DANN) and Maximum Mean Discrepancy (MMD). We train on a source domain of simulated lenses and apply the model to a target domain of lenses simulated to emulate noise conditions in the Dark Energy Survey (DES). We show that both domain adaptation techniques can significantly improve the model performance on the more complex target domain dataset. This work is the first application of domain adaptation for a regression task in strong lensing imaging analysis. Our results show the potential of using domain adaptation to perform analysis of future survey data with a deep neural network trained on simulated data.

Submitted 28 November, 2023; originally announced November 2023.

Comments: Accepted in Machine Learning and the Physical Sciences Workshop at NeurIPS 2023; 9 pages, 2 figures, 2 tables

Report number: FERMILAB-CONF-23-645-CSAID

arXiv:2311.01588 [pdf, other]

Domain Adaptive Graph Neural Networks for Constraining Cosmological Parameters Across Multiple Data Sets

Authors: Andrea Roncoli, Aleksandra Ćiprijanović, Maggie Voetberg, Francisco Villaescusa-Navarro, Brian Nord

Abstract: Deep learning models have been shown to outperform methods that rely on summary statistics, like the power spectrum, in extracting information from complex cosmological data sets. However, due to differences in the subgrid physics implementation and numerical approximations across different simulation suites, models trained on data from one cosmological simulation show a drop in performance when tested on another. Similarly, models trained on any of the simulations would also likely experience a drop in performance when applied to observational data. Training on data from two different suites of the CAMELS hydrodynamic cosmological simulations, we examine the generalization capabilities of Domain Adaptive Graph Neural Networks (DA-GNNs). By utilizing GNNs, we capitalize on their capacity to capture structured scale-free cosmological information from galaxy distributions. Moreover, by including unsupervised domain adaptation via Maximum Mean Discrepancy (MMD), we enable our models to extract domain-invariant features. We demonstrate that DA-GNN achieves higher accuracy and robustness on cross-dataset tasks (up to $28\%$ better relative error and up to almost an order of magnitude better $χ^2$). Using data visualizations, we show the effects of domain adaptation on proper latent space data alignment. This shows that DA-GNNs are a promising method for extracting domain-independent cosmological information, a vital step toward robust deep learning for real cosmic survey data.

Submitted 15 April, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

Comments: Accepted in Machine Learning and the Physical Sciences Workshop at NeurIPS 2023; 9 pages, 2 figures, 1 table

Report number: FERMILAB-CONF-23-644-CSAID
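
The Maximum Mean Discrepancy (MMD) named in the two abstracts above measures how far apart two feature distributions are, and it is typically added to the task loss so that source and target features align. The following is a minimal NumPy sketch of the biased squared-MMD estimator with a Gaussian (RBF) kernel. It is not the authors' implementation: the function names `rbf_kernel` and `mmd2_biased`, the single fixed `bandwidth`, and the random stand-in features are all assumptions made for illustration.

```python
import numpy as np


def rbf_kernel(a, b, bandwidth=1.0):
    """Gaussian kernel matrix between the rows of a (n, d) and b (m, d)."""
    # Pairwise squared Euclidean distances via the expansion |a|^2 + |b|^2 - 2 a.b
    sq_dist = (
        np.sum(a**2, axis=1)[:, None]
        + np.sum(b**2, axis=1)[None, :]
        - 2.0 * a @ b.T
    )
    sq_dist = np.maximum(sq_dist, 0.0)  # clip tiny negatives caused by rounding
    return np.exp(-sq_dist / (2.0 * bandwidth**2))


def mmd2_biased(source, target, bandwidth=1.0):
    """Biased estimate of squared MMD between two samples of feature vectors."""
    k_ss = rbf_kernel(source, source, bandwidth)
    k_tt = rbf_kernel(target, target, bandwidth)
    k_st = rbf_kernel(source, target, bandwidth)
    return k_ss.mean() + k_tt.mean() - 2.0 * k_st.mean()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    source_features = rng.normal(size=(200, 4))        # hypothetical source-domain features
    aligned_target = rng.normal(size=(200, 4))         # same distribution as the source
    shifted_target = rng.normal(size=(200, 4)) + 2.0   # shifted distribution
    print("aligned:", mmd2_biased(source_features, aligned_target))
    print("shifted:", mmd2_biased(source_features, shifted_target))
```

In a training loop, a term of this kind would be evaluated on latent features of a source batch and a target batch and weighted against the supervised loss; the weighting and the choice of kernel bandwidths are tuning decisions that the abstracts do not specify.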

arXiv:2307.04072 [pdf, other]

The LSST AGN Data Challenge: Selection methods

Authors: Đorđe V. Savić, Isidora Jankov, Weixiang Yu, Vincenzo Petrecca, Matthew J. Temple, Qingling Ni, Raphael Shirley, Andjelka B. Kovacevic, Mladen Nikolic, Dragana Ilic, Luka C. Popovic, Maurizio Paolillo, Swayamtrupta Panda, Aleksandra Ciprijanovic, Gordon T. Richards

Abstract: Development of the Rubin Observatory Legacy Survey of Space and Time (LSST) includes a series of Data Challenges (DC) arranged by various LSST Scientific Collaborations (SC) that are taking place during the project's pre-operational phase. The AGN Science Collaboration Data Challenge (AGNSC-DC) is a partial prototype of the expected LSST AGN data, aimed at validating machine learning approaches for AGN selection and characterization in large surveys like LSST. The AGNSC-DC took place in 2021, focusing on accuracy, robustness, and scalability. The training and the blinded datasets were constructed to mimic the future LSST release catalogs using the data from the Sloan Digital Sky Survey Stripe 82 region and the XMM-Newton Large Scale Structure Survey region. Data features were divided into astrometry, photometry, color, morphology, redshift and class label with the addition of variability features and images. We present the results of four DC submitted solutions using both classical and machine learning methods. We systematically test the performance of supervised (support vector machine, random forest, extreme gradient boosting, artificial neural network, convolutional neural network) and unsupervised (deep embedding clustering) models when applied to the problem of classifying/clustering sources as stars, galaxies or AGNs. On a blinded dataset, we obtained a classification accuracy of 97.5% for supervised models, a clustering accuracy of 96.0% for unsupervised models, and 95.0% with a classic approach. We find that variability features significantly improve the accuracy of the trained models and that correlation analysis among different bands enables a fast and inexpensive first-order selection of quasar candidates.

Submitted 8 July, 2023; originally announced July 2023.

Comments: Accepted by ApJ. 21 pages, 14 figures, 5 tables

Report number: FERMILAB-PUB-22-735-SCD

arXiv:2302.02005 [pdf, other]

DeepAstroUDA: Semi-Supervised Universal Domain Adaptation for Cross-Survey Galaxy Morphology Classification and Anomaly Detection

Authors: A. Ćiprijanović, A. Lewis, K. Pedro, S. Madireddy, B. Nord, G. N. Perdue, S. M. Wild

Abstract: Artificial intelligence methods show great promise in increasing the quality and speed of work with large astronomical datasets, but the high complexity of these methods leads to the extraction of dataset-specific, non-robust features. Therefore, such methods do not generalize well across multiple datasets. We present a universal domain adaptation method, \textit{DeepAstroUDA}, as an approach to overcome this challenge. This algorithm performs semi-supervised domain adaptation and can be applied to datasets with different data distributions and class overlaps. Non-overlapping classes can be present in any of the two datasets (the labeled source domain, or the unlabeled target domain), and the method can even be used in the presence of unknown classes. We apply our method to three examples of galaxy morphology classification tasks of different complexities ($3$-class and $10$-class problems), with anomaly detection: 1) datasets created after different numbers of observing years from a single survey (LSST mock data of $1$ and $10$ years of observations); 2) data from different surveys (SDSS and DECaLS); and 3) data from observing fields with different depths within one survey (wide field and Stripe 82 deep field of SDSS). For the first time, we demonstrate the successful use of domain adaptation between very discrepant observational datasets. \textit{DeepAstroUDA} is capable of bridging the gap between two astronomical surveys, increasing classification accuracy in both domains (up to $40\%$ on the unlabeled data), and making model performance consistent across datasets. Furthermore, our method also performs well as an anomaly detection algorithm and successfully clusters unknown class samples even in the unlabeled target dataset.

Submitted 22 March, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

Comments: Accepted in Machine Learning Science and Technology (MLST); 24 pages, 14 figures

Report number: FERMILAB-PUB-23-034-CSAID

arXiv:2211.10305 [pdf, other]

Neural Inference of Gaussian Processes for Time Series Data of Quasars

Authors: Egor Danilov, Aleksandra Ćiprijanović, Brian Nord

Abstract: The study of quasar light curves poses two problems: inference of the power spectrum and interpolation of an irregularly sampled time series. A baseline approach to these tasks is to interpolate a time series with a Damped Random Walk (DRW) model, in which the spectrum is inferred using Maximum Likelihood Estimation (MLE). However, the DRW model does not describe the smoothness of the time series, and MLE faces many problems in terms of optimization and numerical precision. In this work, we introduce a new stochastic model that we call $\textit{Convolved Damped Random Walk}$ (CDRW). This model introduces a concept of smoothness to a DRW, which enables it to describe quasar spectra completely. We also introduce a new method of inference of Gaussian process parameters, which we call $\textit{Neural Inference}$. This method uses the powers of state-of-the-art neural networks to improve the conventional MLE inference technique. In our experiments, the Neural Inference method results in significant improvement over the baseline MLE (RMSE: $0.318 \rightarrow 0.205$, $0.464 \rightarrow 0.444$). Moreover, the combination of both the CDRW model and Neural Inference significantly outperforms the baseline DRW and MLE in interpolating a typical quasar light curve ($χ^2$: $0.333 \rightarrow 0.998$, $2.695 \rightarrow 0.981$). The code is published on GitHub.

Submitted 17 November, 2022; originally announced November 2022.

Comments: Machine Learning and the Physical Sciences workshop, NeurIPS 2022
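
The baseline Damped Random Walk (DRW) in the abstract above is a Gaussian process with an exponential covariance, $\sigma^2 \exp(-|\Delta t|/\tau)$, which can be sampled exactly on an irregular time grid. The sketch below illustrates only that baseline, not the paper's CDRW model or its Neural Inference method. The function names `simulate_drw` and `drw_covariance` and the parameters `tau` (damping timescale) and `sigma` (long-run standard deviation) are assumptions chosen for illustration.

```python
import numpy as np


def drw_covariance(times, tau, sigma):
    """DRW (Ornstein-Uhlenbeck) covariance: sigma^2 * exp(-|t_i - t_j| / tau)."""
    times = np.asarray(times, dtype=float)
    lag = np.abs(times[:, None] - times[None, :])
    return sigma**2 * np.exp(-lag / tau)


def simulate_drw(times, tau, sigma, mean=0.0, seed=None):
    """Draw one DRW light curve at sorted, possibly irregular, observation times."""
    rng = np.random.default_rng(seed)
    times = np.asarray(times, dtype=float)
    values = np.empty_like(times)
    # Start from the stationary distribution N(mean, sigma^2)
    values[0] = mean + sigma * rng.standard_normal()
    for i in range(1, len(times)):
        # Exact update over an arbitrary gap: correlation decays as exp(-dt / tau)
        decay = np.exp(-(times[i] - times[i - 1]) / tau)
        step_std = sigma * np.sqrt(1.0 - decay**2)
        values[i] = mean + decay * (values[i - 1] - mean) + step_std * rng.standard_normal()
    return values


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    obs_times = np.sort(rng.uniform(0.0, 1000.0, size=150))  # irregular sampling
    light_curve = simulate_drw(obs_times, tau=100.0, sigma=0.2, seed=2)
    print(light_curve[:5])
```

Because each update is exact for any gap length, no resampling onto a regular grid is needed. The covariance matrix from `drw_covariance` is the quantity that would enter a Gaussian-process likelihood for MLE-style fitting.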

arXiv:2211.09126 [pdf, other]

doi 10.1088/2632-2153/ac98f4

DIGS: Deep Inference of Galaxy Spectra with Neural Posterior Estimation

Authors: Gourav Khullar, Brian Nord, Aleksandra Ciprijanovic, Jason Poh, Fei Xu

Abstract: With the advent of billion-galaxy surveys with complex data, the need of the hour is to efficiently model galaxy spectral energy distributions (SEDs) with robust uncertainty quantification. The combination of Simulation-Based Inference (SBI) and amortized Neural Posterior Estimation (NPE) has been successfully used to analyse simulated and real galaxy photometry both precisely and efficiently. In this work, we utilise this combination and build on existing literature to analyse simulated noisy galaxy spectra. Here, we demonstrate a proof-of-concept study of spectra that is a) an efficient analysis of galaxy SEDs and inference of galaxy parameters with physically interpretable uncertainties; and b) amortized calculations of posterior distributions of said galaxy parameters at the modest cost of a few galaxy fits with MCMC methods. We utilise the SED generator and inference framework Prospector to generate simulated spectra, and train a dataset of 2$\times$10$^6$ spectra (corresponding to a 5-parameter SED model) with NPE. We show that SBI -- with its combination of fast and amortized posterior estimations -- is capable of inferring accurate galaxy stellar masses and metallicities. Our uncertainty constraints are comparable to or moderately weaker than traditional inverse-modeling with Bayesian MCMC methods (e.g., 0.17 and 0.26 dex in stellar mass and metallicity for a given galaxy, respectively). We also find that our inference framework conducts rapid SED inference (0.9-1.2$\times$10$^5$ galaxy spectra via SBI/SNPE at the cost of 1 MCMC-based fit). With this work, we set the stage for further work that focuses on SED fitting of galaxy spectra with SBI, in the era of JWST galaxy survey programs and the wide-field Roman Space Telescope spectroscopic surveys.

Submitted 16 November, 2022; originally announced November 2022.

Comments: Manuscript accepted in Machine Learning: Science and Technology (MLST) as a Letter (October 10th, 2022); 12 Pages, 6 Figures and 1 Table; Data and code can be found in published github repository

Report number: FERMILAB-PUB-22-557-PPD-SCD

arXiv:2211.05836 [pdf, other]

Strong Lensing Parameter Estimation on Ground-Based Imaging Data Using Simulation-Based Inference

Authors: Jason Poh, Ashwin Samudre, Aleksandra Ćiprijanović, Brian Nord, Gourav Khullar, Dimitrios Tanoglidis, Joshua A. Frieman

Abstract: Current ground-based cosmological surveys, such as the Dark Energy Survey (DES), are predicted to discover thousands of galaxy-scale strong lenses, while future surveys, such as the Vera Rubin Observatory Legacy Survey of Space and Time (LSST) will increase that number by 1-2 orders of magnitude. The large number of strong lenses discoverable in future surveys will make strong lensing a highly competitive and complementary cosmic probe. To leverage the increased statistical power of the lenses that will be discovered through upcoming surveys, automated lens analysis techniques are necessary. We present two Simulation-Based Inference (SBI) approaches for lens parameter estimation of galaxy-galaxy lenses. We demonstrate the successful application of Neural Posterior Estimation (NPE) to automate the inference of a 12-parameter lens mass model for DES-like ground-based imaging data. We compare our NPE constraints to a Bayesian Neural Network (BNN) and find that it outperforms the BNN, producing posterior distributions that are for the most part both more accurate and more precise; in particular, several source-light model parameters are systematically biased in the BNN implementation.

Submitted 10 November, 2022; originally announced November 2022.

Comments: Accepted to the Workshop on Machine Learning and the Physical Sciences at the 36th Conference on Neural Information Processing Systems 2022 (NeurIPS 2022)

arXiv:2211.00677 [pdf, other]

Semi-Supervised Domain Adaptation for Cross-Survey Galaxy Morphology Classification and Anomaly Detection

Authors: Aleksandra Ćiprijanović, Ashia Lewis, Kevin Pedro, Sandeep Madireddy, Brian Nord, Gabriel N. Perdue, Stefan M. Wild

Abstract: In the era of big astronomical surveys, our ability to leverage artificial intelligence algorithms simultaneously for multiple datasets will open new avenues for scientific discovery. Unfortunately, simply training a deep neural network on images from one data domain often leads to very poor performance on any other dataset. Here we develop a Universal Domain Adaptation method DeepAstroUDA, capable of performing semi-supervised domain alignment that can be applied to datasets with different types of class overlap. Extra classes can be present in any of the two datasets, and the method can even be used in the presence of unknown classes. For the first time, we demonstrate the successful use of domain adaptation on two very different observational datasets (from SDSS and DECaLS). We show that our method is capable of bridging the gap between two astronomical surveys, and also performs well for anomaly detection and clustering of unknown data in the unlabeled dataset. We apply our model to two examples of galaxy morphology classification tasks with anomaly detection: 1) classifying spiral and elliptical galaxies with detection of merging galaxies (three classes including one unknown anomaly class); 2) a more granular problem where the classes describe more detailed morphological properties of galaxies, with the detection of gravitational lenses (ten classes including one unknown anomaly class).

Submitted 11 November, 2022; v1 submitted 1 November, 2022; originally announced November 2022.

Comments: 3 figures, 1 table; accepted to Machine Learning and the Physical Sciences - Workshop at the 36th conference on Neural Information Processing Systems (NeurIPS)

Report number: FERMILAB-CONF-22-791-SCD

arXiv:2208.00134 [pdf, other]

Estimating Cosmological Constraints from Galaxy Cluster Abundance using Simulation-Based Inference

Authors: Moonzarin Reza, Yuanyuan Zhang, Brian Nord, Jason Poh, Aleksandra Ciprijanovic, Louis Strigari

Abstract: Inferring the values and uncertainties of cosmological parameters in a cosmology model is of paramount importance for modern cosmic observations. In this paper, we use the simulation-based inference (SBI) approach to estimate cosmological constraints from a simplified galaxy cluster observation analysis. Using data generated from the Quijote simulation suite and analytical models, we train a machine learning algorithm to learn the probability function between cosmological parameters and the possible galaxy cluster observables. The posterior distribution of the cosmological parameters at a given observation is then obtained by sampling the predictions from the trained algorithm. Our results show that the SBI method can successfully recover the truth values of the cosmological parameters within the 2σ limit for this simplified galaxy cluster analysis, and acquires posterior constraints similar to those obtained with a likelihood-based Markov Chain Monte Carlo method, the current state-of-the-art method used in similar cosmological studies.

Submitted 29 July, 2022; originally announced August 2022.

Comments: Accepted at the ICML 2022 Workshop on Machine Learning for Astrophysics

Report number: FERMILAB-CONF-22-487-SCD

arXiv:2207.03471 [pdf, other]

Inferring Structural Parameters of Low-Surface-Brightness-Galaxies with Uncertainty Quantification using Bayesian Neural Networks

Authors: Dimitrios Tanoglidis, Aleksandra Ćiprijanović, Alex Drlica-Wagner

Abstract: Measuring the structural parameters (size, total brightness, light concentration, etc.) of galaxies is a significant first step towards a quantitative description of different galaxy populations. In this work, we demonstrate that a Bayesian Neural Network (BNN) can be used for the inference, with uncertainty quantification, of such morphological parameters from simulated low-surface-brightness galaxy images. Compared to traditional profile-fitting methods, we show that the uncertainties obtained using BNNs are comparable in magnitude, well-calibrated, and the point estimates of the parameters are closer to the true values. Our method is also significantly faster, which is very important with the advent of the era of large galaxy surveys and big data in astrophysics.

Submitted 7 July, 2022; originally announced July 2022.

Comments: 9 pages, 7 figures. Accepted to the ICML 2022 Machine Learning for Astrophysics workshop

Report number: FERMILAB-CONF-22-477-SCD

arXiv:2203.08056 [pdf, ps, other]

Machine Learning and Cosmology

Authors: Cora Dvorkin, Siddharth Mishra-Sharma, Brian Nord, V. Ashley Villar, Camille Avestruz, Keith Bechtol, Aleksandra Ćiprijanović, Andrew J. Connolly, Lehman H. Garrison, Gautham Narayan, Francisco Villaescusa-Navarro

Abstract: Methods based on machine learning have recently made substantial inroads in many corners of cosmology. Through this process, new computational tools, new perspectives on data collection, model development, analysis, and discovery, as well as new communities and educational pathways have emerged. Despite rapid progress, substantial potential at the intersection of cosmology and machine learning remains untapped. In this white paper, we summarize current and ongoing developments relating to the application of machine learning within cosmology and provide a set of recommendations aimed at maximizing the scientific impact of these burgeoning tools over the coming decade through both technical development as well as the fostering of emerging communities.

Submitted 15 March, 2022; originally announced March 2022.

Comments: Contribution to Snowmass 2021. 32 pages

arXiv:2112.14299 [pdf, other]

DeepAdversaries: Examining the Robustness of Deep Learning Models for Galaxy Morphology Classification

Authors: Aleksandra Ćiprijanović, Diana Kafkes, Gregory Snyder, F. Javier Sánchez, Gabriel Nathan Perdue, Kevin Pedro, Brian Nord, Sandeep Madireddy, Stefan M. Wild

Abstract: With increased adoption of supervised deep learning methods for processing and analysis of cosmological survey data, the assessment of data perturbation effects (that can naturally occur in the data processing and analysis pipelines) and the development of methods that increase model robustness are increasingly important. In the context of morphological classification of galaxies, we study the effects of perturbations in imaging data. In particular, we examine the consequences of using neural networks when training on baseline data and testing on perturbed data. We consider perturbations associated with two primary sources: 1) increased observational noise as represented by higher levels of Poisson noise and 2) data processing noise incurred by steps such as image compression or telescope errors as represented by one-pixel adversarial attacks. We also test the efficacy of domain adaptation techniques in mitigating the perturbation-driven errors. We use classification accuracy, latent space visualizations, and latent space distance to assess model robustness. Without domain adaptation, we find that processing pixel-level errors easily flip the classification into an incorrect class and that higher observational noise makes the model trained on low-noise data unable to classify galaxy morphologies. On the other hand, we show that training with domain adaptation improves model robustness and mitigates the effects of these perturbations, improving the classification accuracy by 23% on data with higher observational noise. Domain adaptation also increases by a factor of ~2.3 the latent space distance between the baseline and the incorrectly classified one-pixel perturbed image, making the model more robust to inadvertent perturbations.

Submitted 6 July, 2022; v1 submitted 28 December, 2021; originally announced December 2021.

Comments: 20 pages, 6 figures, 5 tables; accepted in MLST

Report number: FERMILAB-PUB-21-767-SCD

arXiv:2111.00961 [pdf, other]

Robustness of deep learning algorithms in astronomy -- galaxy morphology studies

Authors: A. Ćiprijanović, D. Kafkes, G. N. Perdue, K. Pedro, G. Snyder, F. J. Sánchez, S. Madireddy, S. M. Wild, B. Nord

Abstract: Deep learning models are being increasingly adopted in a wide array of scientific domains, especially to handle the high dimensionality and volume of scientific data. However, these models tend to be brittle due to their complexity and overparametrization, especially to the inadvertent adversarial perturbations that can appear due to common image processing such as compression or blurring that are often seen with real scientific data. It is crucial to understand this brittleness and develop models robust to these adversarial perturbations. To this end, we study the effect of observational noise from the exposure time, as well as the worst-case scenario of a one-pixel attack as a proxy for compression or telescope errors, on the performance of ResNet18 trained to distinguish between galaxies of different morphologies in LSST mock data. We also explore how domain adaptation techniques can help improve model robustness in the case of these naturally occurring attacks and help scientists build more trustworthy and stable models.

Submitted 2 November, 2021; v1 submitted 1 November, 2021; originally announced November 2021.

Comments: Accepted in: Fourth Workshop on Machine Learning and the Physical Sciences (35th Conference on Neural Information Processing Systems; NeurIPS2021); final version

Report number: FERMILAB-CONF-21-561-SCD

arXiv:2109.08246 [pdf, other]

DeepGhostBusters: Using Mask R-CNN to Detect and Mask Ghosting and Scattered-Light Artifacts from Optical Survey Images

Authors: Dimitrios Tanoglidis, Aleksandra Ćiprijanović, Alex Drlica-Wagner, Brian Nord, Michael H. L. S. Wang, Ariel Jacob Amsellem, Kathryn Downey, Sydney Jenkins, Diana Kafkes, Zhuoqi Zhang

Abstract: Wide-field astronomical surveys are often affected by the presence of undesirable reflections (often known as "ghosting artifacts" or "ghosts") and scattered-light artifacts. The identification and mitigation of these artifacts is important for rigorous astronomical analyses of faint and low-surface-brightness systems. However, the identification of ghosts and scattered-light artifacts is challenging due to a) the complex morphology of these features and b) the large data volume of current and near-future surveys. In this work, we use images from the Dark Energy Survey (DES) to train, validate, and test a deep neural network (Mask R-CNN) to detect and localize ghosts and scattered-light artifacts. We find that the ability of the Mask R-CNN model to identify affected regions is superior to that of conventional algorithms and traditional convolutional neural network methods. We propose that a multi-step pipeline combining Mask R-CNN segmentation with a classical CNN classifier provides a powerful technique for the automated detection of ghosting and scattered-light artifacts in current and near-future surveys.

Submitted 16 September, 2021; originally announced September 2021.

Comments: 24 pages, 18 figures. Code and data related to this work can be found at: https://github.com/dtanoglidis/DeepGhostBusters

Report number: FERMILAB-PUB-21-374-AE

arXiv:2103.01373 [pdf, other]

doi 10.1093/mnras/stab1677

DeepMerge II: Building Robust Deep Learning Algorithms for Merging Galaxy Identification Across Domains

Authors: A. Ćiprijanović, D. Kafkes, K. Downey, S. Jenkins, G. N. Perdue, S. Madireddy, T. Johnston, G. F. Snyder, B. Nord

Abstract: In astronomy, neural networks are often trained on simulation data with the prospect of being used on telescope observations. Unfortunately, training a model on simulation data and then applying it to instrument data leads to a substantial and potentially even detrimental decrease in model accuracy on the new target dataset. Simulated and instrument data represent different data domains, and for an algorithm to work in both, domain-invariant learning is necessary. Here we employ domain adaptation techniques, namely Maximum Mean Discrepancy (MMD) as an additional transfer loss and Domain Adversarial Neural Networks (DANNs), and demonstrate their viability to extract domain-invariant features within the astronomical context of classifying merging and non-merging galaxies. Additionally, we explore the use of Fisher loss and entropy minimization to enforce better in-domain class discriminability. We show that the addition of each domain adaptation technique improves the performance of a classifier when compared to conventional deep learning algorithms. We demonstrate this on two examples: between two Illustris-1 simulated datasets of distant merging galaxies, and between Illustris-1 simulated data of nearby merging galaxies and observed data from the Sloan Digital Sky Survey. The use of domain adaptation techniques in our experiments leads to an increase of target domain classification accuracy of up to ${\sim}20\%$. With further development, these techniques will allow astronomers to successfully implement neural network models trained on simulation data to efficiently detect and study astrophysical objects in current and future large-scale astronomical surveys.

Submitted 1 March, 2021; originally announced March 2021.

Comments: Submitted to MNRAS; 21 pages, 9 figures, 9 tables

Report number: FERMILAB-PUB-21-072-SCD

Journal ref: MNRAS, Volume 506, Issue 1, September 2021, Page 677

arXiv:2011.12437 [pdf, other]

DeepShadows: Separating Low Surface Brightness Galaxies from Artifacts using Deep Learning

Authors: Dimitrios Tanoglidis, Aleksandra Ćiprijanović, Alex Drlica-Wagner

Abstract: Searches for low-surface-brightness galaxies (LSBGs) in galaxy surveys are plagued by the presence of a large number of artifacts (e.g., objects blended in the diffuse light from stars and galaxies, Galactic cirrus, star-forming regions in the arms of spiral galaxies, etc.) that have to be rejected through time-consuming visual inspection. In future surveys, which are expected to collect hundreds of petabytes of data and detect billions of objects, such an approach will not be feasible. We investigate the use of convolutional neural networks (CNNs) for the problem of separating LSBGs from artifacts in survey images. We take advantage of the fact that, for the first time, we have available a large number of labeled LSBGs and artifacts from the Dark Energy Survey, which we use to train, validate, and test a CNN model. That model, which we call DeepShadows, achieves a test accuracy of $92.0\%$, a significant improvement relative to feature-based machine learning models. We also study the ability to use transfer learning to adapt this model to classify objects from the deeper Hyper-Suprime-Cam survey, and we show that after the model is retrained on a very small sample from the new survey, it can reach an accuracy of $87.6\%$. These results demonstrate that CNNs offer a very promising path in the quest to study the low-surface-brightness universe.

Submitted 24 November, 2020; originally announced November 2020.

Comments: 22 pages, 11 figures. Code and data related to this work can be found at: https://github.com/dtanoglidis/DeepShadows

arXiv:2011.03591 [pdf, other]

Domain adaptation techniques for improved cross-domain study of galaxy mergers

Authors: A. Ćiprijanović, D. Kafkes, S. Jenkins, K. Downey, G. N. Perdue, S. Madireddy, T. Johnston, B. Nord

Abstract: In astronomy, neural networks are often trained on simulated data with the prospect of being applied to real observations. Unfortunately, simply training a deep neural network on images from one domain does not guarantee satisfactory performance on new images from a different domain. The ability to share cross-domain knowledge is the main advantage of modern deep domain adaptation techniques. Here we demonstrate the use of two techniques - Maximum Mean Discrepancy (MMD) and adversarial training with Domain Adversarial Neural Networks (DANN) - for the classification of distant galaxy mergers from the Illustris-1 simulation, where the two domains presented differ only due to inclusion of observational noise. We show how the addition of either MMD or adversarial training greatly improves the performance of the classifier on the target domain when compared to conventional machine learning algorithms, thereby demonstrating great promise for their use in astronomy.

Submitted 13 November, 2020; v1 submitted 6 November, 2020; originally announced November 2020.

Comments: Accepted in: Machine Learning and the Physical Sciences - Workshop at the 34th Conference on Neural Information Processing Systems (NeurIPS); final version

Report number: FERMILAB-CONF-20-582-SCD

arXiv:2004.11981 [pdf, other]

DeepMerge: Classifying High-redshift Merging Galaxies with Deep Neural Networks

Authors: A. Ćiprijanović, G. F. Snyder, B. Nord, J. E. G. Peek

Abstract: We investigate and demonstrate the use of convolutional neural networks (CNNs) for the task of distinguishing between merging and non-merging galaxies in simulated images, and for the first time at high redshifts (i.e. $z=2$). We extract images of merging and non-merging galaxies from the Illustris-1 cosmological simulation and apply observational and experimental noise that mimics that from the Hubble Space Telescope; the data without noise form a "pristine" data set and that with noise form a "noisy" data set. The test set classification accuracy of the CNN is $79\%$ for pristine and $76\%$ for noisy. The CNN outperforms a Random Forest classifier, which was shown to be superior to conventional one- or two-dimensional statistical methods (Concentration, Asymmetry, the Gini, $M_{20}$ statistics etc.), which are commonly used when classifying merging galaxies. We also investigate the selection effects of the classifier with respect to merger state and star formation rate, finding no bias. Finally, we extract Grad-CAMs (Gradient-weighted Class Activation Mapping) from the results to further assess and interrogate the fidelity of the classification model.

Submitted 24 April, 2020; originally announced April 2020.

Comments: 17 pages, 8 figures, submitted to Astronomy & Computing

arXiv:1912.06379 [pdf, ps, other]

doi 10.2298/SAJ1999023V

Updated radio $Σ-D$ relation for Galactic supernova remnants -- II

Authors: B. Vukotić, A. Ćiprijanović, M. M. Vučetić, D. Onić, D. Urošević

Abstract: In this paper we present the updated empirical radio surface-brightness-to-diameter ($Σ$--$D$) relation for Galactic supernova remnants (SNRs) calibrated using $110$ SNRs with reliable distances. We apply an orthogonal fitting procedure and kernel density smoothing in the $Σ-D$ plane and compare the results with the latest theoretical $Σ-D$ relations derived from simulations of radio evolution of SNRs. We argue that the best agreement between the empirical and simulated $Σ-D$ relations is achieved if the mixed-morphology SNRs and SNRs of both low brightness and small diameter are filtered out from the calibration sample. The distances to $5$ newly discovered remnants and $27$ new candidates for shell SNRs are estimated from our full and filtered calibration samples.

Submitted 13 December, 2019; originally announced December 2019.

Comments: 20 pages, 2 figures, 4 tables

Journal ref: Serbian Astronomical Journal, 2019, volume 199

arXiv:1902.08671 [pdf, ps, other]

doi 10.2298/SAJ190131003V

Optical observations of the nearby galaxy NGC 2366 through narrowband H$α$ and SII filters. Supernova remnants status

Authors: M. M. Vučetić, D. Onić, N. Petrov, A. Ćiprijanović, M. Z. Pavlović

Abstract: We present the detection of 67 HII regions and two optical supernova remnant (SNR) candidates in the nearby irregular galaxy NGC 2366. The SNR candidates were detected by applying the [SII]/H$α$ ratio criterion to observations made with the 2-m RCC telescope at Rozhen National Astronomical Observatory in Bulgaria. In this paper we report coordinates, diameters, H$α$ and [SII] fluxes for detected objects across the two fields of view in the NGC 2366 galaxy. Using archival XMM-Newton observations we suggest possible X-ray counterparts of two optical SNR candidates. Also, we discard the classification of two previous radio SNR candidates in this galaxy, since they appear to be background galaxies.

Submitted 22 February, 2019; originally announced February 2019.

Comments: 12 pages, 6 figures, accepted for publication in Serbian Astronomical Journal

arXiv:1809.01045 [pdf]

doi 10.1007/978-3-030-13876-9_67

Women Scientists Who Made Nuclear Astrophysics

Authors: C. V. Hampton, M. Lugaro, P. Papakonstantinou, P. G. Isar, B. Nordström, N. Özkan, M. Aliotta, A. Ćiprijanović, S. Curtis, M. Di Criscienzo, J. den Hartogh, A. S. Font, A. Kankainen, C. Kobayashi, C. Lederer-Woods, E. Niemczura, T. Rauscher, A. Spyrou, S. Van Eck, M. Yavahchova, W. Chantereau, S. E. de Mink, E. Kaiser, F.-K. Thielemann, C. Travaglio, et al. (2 additional authors not shown)

Abstract: Female role models reduce the impact on women of stereotype threat, i.e., of being at risk of conforming to a negative stereotype about one's social, gender, or racial group. This can lead women scientists to underperform or to leave their scientific career because of negative stereotypes such as not being as talented or as interested in science as men. Sadly, history rarely provides role models for women scientists; instead, it often renders these women invisible. In response to this situation, we present a selection of twelve outstanding women who helped to develop nuclear astrophysics.

Submitted 25 August, 2018; originally announced September 2018.

Comments: 5 pages; to appear in Springer Proceedings in Physics (Proc. of Intl. Conf. "Nuclei in the Cosmos XV", LNGS Assergi, Italy, June 2018)

arXiv:1802.08939 [pdf, ps, other]

Constraining the Collective Radio Emission of Large Scale Accretion Shocks

Authors: A. Ćiprijanović, T. Prodanović, M. Z. Pavlović

Abstract: Accretion of gas onto already virialized structures like galaxy clusters should give rise to accretion shocks which can potentially accelerate cosmic rays. Here, we use the radio emission detected from the Coma cluster and models of evolution of cosmic accretion shocks to constrain the possible contribution of unresolved galaxy clusters to the cosmic radio background. We assume that Coma is a typical galaxy cluster and that its entire radio emission is produced by cosmic rays accelerated in accretion shocks, making our prediction an upper limit. Our models predict that at lower frequencies accretion shocks can have a potentially large contribution to the cosmic radio background, while at higher frequencies, e.g. 5 GHz, their contribution must be lower than 2-35%, depending on the models of evolution of accretion shocks that we use.

Submitted 24 February, 2018; originally announced February 2018.

Comments: 18th Serbian Astronomical Conference, 17-21. October 2017, Belgrade, Serbia

arXiv:1609.08344 [pdf, ps, other]

doi 10.1016/j.astropartphys.2016.09.004

Galactic Cosmic-Ray Induced Production of Lithium in the Small Magellanic Cloud

Authors: A. Ćiprijanović

Abstract: Recently, the first lithium detection outside of the Milky Way was made in low-metallicity gas of the Small Magellanic Cloud, which was at the level of the expected primordial value. Part of the observed lithium in any environment has primordial origin, but there is always some post-BBN (Big Bang Nucleosynthesis) contamination, since lithium can also be produced in cosmic-ray interactions with the interstellar medium. Using the fact that processes involving cosmic rays produce lithium, but also gamma rays through neutral pion decay, we use the Small Magellanic Cloud gamma-ray observations by Fermi-LAT to make predictions on the amount of lithium in this galaxy that was produced by galactic cosmic rays accelerated in supernova remnants. By including both fusion processes, as well as spallation of heavier nuclei, we find that galactic cosmic rays could produce a very small amount of lithium. In the case of the 6Li isotope (which should only be produced by cosmic rays) we can only explain 0.16% of the measured abundance. If these cosmic rays are indeed responsible for such small lithium production, observed abundances could be the result of some other sources, which are discussed in the paper.

Submitted 27 September, 2016; originally announced September 2016.

Comments: 9 pages, 1 figure, accepted in Astroparticle Physics

arXiv:1511.05307 [pdf, ps, other]

doi 10.2298/SAJ150911002V

Optical observations of the nearby galaxy IC342 with narrow band [SII] and H$α$ filters. II - Detection of 16 Optically-Identified Supernova Remnant Candidates

Authors: M. M. Vucetic, A. Ciprijanovic, M. Z. Pavlovic, T. G. Pannuti, N. Petrov, U. D. Goker, E. N. Ercan

Abstract: We present the detection of 16 optical supernova remnant (SNR) candidates in the nearby spiral galaxy IC342. The candidates were detected by applying the [SII]/H$α$ ratio criterion to observations made with the 2 m RCC telescope at Rozhen National Astronomical Observatory in Bulgaria. In this paper, we report the coordinates, diameters, H$α$ and [SII] fluxes for 16 SNRs detected in two fields of view in the IC342 galaxy. Also, we estimate that the contamination of total H$α$ flux from SNRs in the observed portion of IC342 is 1.4%. This would represent the fractional error when the star formation rate (SFR) for this galaxy is derived from the total galaxy's H$α$ emission.

Submitted 17 November, 2015; originally announced November 2015.

Comments: 8 pages, 2 figures, 2 tables; to be published in Serbian Astronomical Journal

Showing 1–24 of 24 results for author: Ciprijanovic, A