
Gamma-ray Bursts as Distance Indicators by a Statistical Learning Approach

Authors: Maria Giovanna Dainotti, Aditya Narendra, Agnieszka Pollo, Vahe Petrosian, Malgorzata Bogdan, Kazunari Iwasaki, Jason Xavier Prochaska, Enrico Rinaldi, David Zhou

Abstract: Gamma-ray bursts (GRBs) can be probes of the early universe, but currently, only 26% of GRBs observed by the Neil Gehrels Swift Observatory have known redshifts ($z$) due to observational limitations. To address this, we estimated the GRB redshift (distance) via a supervised statistical learning model that uses optical afterglow observed by Swift and ground-based telescopes. The inferred redshifts are strongly correlated (a Pearson coefficient of 0.93) with the observed redshifts, thus proving the reliability of this method. The inferred and observed redshifts allow us to estimate the number of GRBs occurring at a given redshift (GRB rate) to be 8.47-9 $yr^{-1} Gpc^{-1}$ for $1.9<z<2.3$. Since GRBs come from the collapse of massive stars, we compared this rate with the star formation rate, highlighting a discrepancy of a factor of 3 at $z<1$.

Submitted 2 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

Comments: 10 figures. Submitted for publication at The Astrophysical Journal Letters. arXiv admin note: text overlap with arXiv:1907.05074. Passed second reviewer response

arXiv:2401.03589 [pdf, other]

Inferring the redshift of more than 150 GRBs with a Machine Learning Ensemble model

Authors: Maria Giovanna Dainotti, Elias Taira, Eric Wang, Elias Lehman, Aditya Narendra, Agnieszka Pollo, Grzegorz M. Madejski, Vahe Petrosian, Malgorzata Bogdan, Apratim Dey, Shubham Bhardwaj

Abstract: Gamma-Ray Bursts (GRBs), due to their high luminosities, are detected up to redshift 10, and thus have the potential to be vital cosmological probes of early processes in the universe. Fulfilling this potential requires a large sample of GRBs with known redshifts, but due to observational limitations, only 11% have known redshifts ($z$). There have been numerous attempts to estimate redshifts via correlation studies, most of which have led to inaccurate predictions. To overcome this, we estimated GRB redshift via an ensemble supervised machine learning model that uses X-ray afterglows of long-duration GRBs observed by the Neil Gehrels Swift Observatory. The estimated redshifts are strongly correlated with the observed redshifts (a Pearson coefficient of 0.93) and have a root mean square error, namely the square root of the average squared error $\langle \Delta z^2 \rangle$, of 0.46, showing the reliability of this method. The addition of GRB afterglow parameters improves the predictions considerably, by 63% compared to previous results in peer-reviewed literature. Finally, we use our machine learning model to infer the redshifts of 154 GRBs, which increases the known redshifts of long GRBs with plateaus by 94%, a significant milestone for enhancing GRB population studies that require large samples with redshift.

Submitted 7 January, 2024; originally announced January 2024.

Comments: 12 Figures, 24 pages. Accepted for publication at The Astrophysical Journal Supplement Series
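
The two agreement metrics quoted in the abstract above (the Pearson coefficient and the root mean square error, defined there as the square root of the average squared error) can be illustrated with a minimal sketch. The function names and the sample redshift values below are hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import math


def pearson_r(observed, predicted):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(observed)
    mean_o = sum(observed) / n
    mean_p = sum(predicted) / n
    cov = sum((o - mean_o) * (p - mean_p) for o, p in zip(observed, predicted))
    var_o = sum((o - mean_o) ** 2 for o in observed)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov / math.sqrt(var_o * var_p)


def rmse(observed, predicted):
    """Root mean square error: sqrt of the mean of (predicted - observed)^2."""
    squared = [(p - o) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical observed and predicted redshifts (illustrative values only).
z_obs = [0.5, 1.2, 2.0, 3.1]
z_pred = [0.7, 1.0, 2.3, 2.9]

print("Pearson r:", pearson_r(z_obs, z_pred))
print("RMSE:", rmse(z_obs, z_pred))
```

A high Pearson coefficient alone does not guarantee small errors, which is presumably why the abstract reports both quantities together.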

arXiv:2308.14288 [pdf, other]

doi 10.1093/mnras/stad2593

GRB Optical and X-ray Plateau Properties Classifier Using Unsupervised Machine Learning

Authors: Shubham Bhardwaj, Maria G. Dainotti, Sachin Venkatesh, Aditya Narendra, Anish Kalsi, Enrico Rinaldi, Agnieszka Pollo

Abstract: The division of Gamma-ray bursts (GRBs) into different classes, other than the "short" and "long", has been an active field of research. We investigate whether GRBs can be classified based on a broader set of parameters, including prompt and plateau emission ones. Observational evidence suggests the existence of more GRB sub-classes, but results so far are either conflicting or not statistically significant. The novelty here is producing a machine-learning-based classification of GRBs using their observed X-ray and optical properties. We used two data samples: the first, composed of 203 GRBs, is from the Neil Gehrels Swift Observatory (Swift/XRT), and the second, composed of 134 GRBs, is from ground-based telescopes and Swift/UVOT. Both samples possess the plateau emission (a flat part of the light curve happening after the prompt emission, the main GRB event). We have applied the Gaussian Mixture Model (GMM) to explore multiple parameter spaces and sub-class combinations to reveal if there is a match between the current observational sub-classes and the statistical classification. With these samples and the algorithm, we spot a few micro-trends in certain cases, but we cannot conclude that any clear trend exists in classifying GRBs. These micro-trends could point towards a deeper understanding of the physical meaning of these classes (e.g., a different environment of the same progenitor or different progenitors). However, a larger sample and different algorithms could achieve such goals. Thus, this methodology can lead to deeper insights in the future.

Submitted 6 September, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

Comments: 20 pages, 10 figures (one has 4 panels, two have a single panel, six have 8 panels, one has 6 panels), 4 tables. Accepted for publication in MNRAS

Report number: RIKEN-iTHEMS-Report-23

Journal ref: MNRAS, Volume 525, Issue 4, pp.5204-5223, November 2023

arXiv:2307.11945 [pdf, other]

doi 10.1093/mnras/stad2193

Fermi LAT AGN classification using supervised machine learning

Authors: Nathaniel Cooper, Maria Giovanna Dainotti, Aditya Narendra, Ioannis Liodakis, Malgorzata Bogdan

Abstract: Classifying Active Galactic Nuclei (AGN) is a challenge, especially for BL Lac Objects (BLLs), which are identified by their weak emission line spectra. To address the problem of classification, we use data from the 4th Fermi Catalog, Data Release 3. Missing data hinders the use of machine learning to classify AGN. A previous paper found that Multivariate Imputation by Chained Equations (MICE) is useful for estimating missing values. Since many AGN have missing redshift and highest energy values, we use data imputation with MICE and the K-nearest neighbor (kNN) algorithm to fill in these missing variables. Then, we classify AGN into BLLs or Flat Spectrum Radio Quasars (FSRQs) using the SuperLearner, an ensemble method that includes several classification algorithms such as logistic regression, support vector classifiers, Random Forests, Ranger Random Forests, multivariate adaptive regression spline (MARS), Bayesian regression, and Extreme Gradient Boosting. We find that a SuperLearner model using MARS regression and Random Forests algorithms is 91.1% accurate for kNN-imputed data and 91.2% for MICE-imputed data. Furthermore, the kNN-imputed SuperLearner model predicts that 892 of the 1519 unclassified blazars are BLLs and 627 are FSRQs, while the MICE-imputed SuperLearner model predicts 890 BLLs and 629 FSRQs in the unclassified set. Thus, we can conclude that both imputation methods work efficiently and with high accuracy and that our methodology paves the way for using SuperLearner as a novel classification method in the AGN community and, in general, in the astrophysics community.

Submitted 21 July, 2023; originally announced July 2023.

Comments: 15 pages, 8 figures, to be published in Monthly Notices of the Royal Astronomical Society

Journal ref: 2023, MNRAS, 525, 1731

arXiv:2305.12126 [pdf, other]

doi 10.3847/1538-4365/acdd07

A Stochastic Approach To Reconstruct Gamma Ray Burst Lightcurves

Authors: Maria G. Dainotti, Ritwik Sharma, Aditya Narendra, Delina Levine, Enrico Rinaldi, Agnieszka Pollo, Gopal Bhatta

Abstract: Gamma-Ray Bursts (GRBs), being observed at high redshift (z = 9.4), are vital to cosmological studies and to investigating Population III stars. To tackle these studies, we need correlations among relevant GRB variables with the requirement of small uncertainties on their variables. Thus, we must have good coverage of GRB light curves (LCs). However, gaps in the LC hinder the precise determination of GRB properties and are often unavoidable. Therefore, extensive categorization of GRB LCs remains a hurdle. We address LC gaps using a 'stochastic reconstruction,' wherein we fit two pre-existing models (Willingale 2007; W07 and Broken Power Law; BPL) to the observed LC, then use the distribution of flux residuals from the original data to generate data to fill in the temporal gaps. We also demonstrate a model-independent LC reconstruction via Gaussian Processes. At 10% noise, the uncertainties of the end time of the plateau, its corresponding flux, and the temporal decay index after the plateau decrease, on average, by 33.3%, 35.03%, and 43.32%, respectively, for the W07, and by 33.3%, 30.78%, and 43.9% for the BPL. The uncertainty of the slope of the plateau decreases by 14.76% in the BPL. After using the Gaussian Process technique, we see similar trends of a decrease in uncertainty for all model parameters for both the W07 and BPL models. These improvements are essential for the application of GRBs as standard candles in cosmology, for the investigation of theoretical models and for inferring the redshift of GRBs with future machine learning analysis.

Submitted 22 May, 2023; v1 submitted 20 May, 2023; originally announced May 2023.

Comments: 20 pages, 6 tables, 11 figures

Report number: RIKEN-iTHEMS-Report-23; Accepted for publication at APJSS
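
The 'stochastic reconstruction' idea summarized in the abstract above (fit a model to the observed light curve, then use the flux residuals to generate points inside temporal gaps) can be sketched roughly as follows. This is only an illustration under strong assumptions: a single power law fitted in log-log space stands in for the W07 and BPL models, residuals are resampled empirically rather than drawn from a fitted distribution, and all function names and data values are hypothetical.

```python
import math
import random


def fit_power_law(times, fluxes):
    """Least-squares straight line in log10(time)-log10(flux) space.

    Assumption: a single power law replaces the W07/BPL models of the paper.
    Returns (slope, intercept, residuals), with residuals in log10 flux.
    """
    xs = [math.log10(t) for t in times]
    ys = [math.log10(f) for f in fluxes]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    residuals = [y - (slope * x + intercept) for x, y in zip(xs, ys)]
    return slope, intercept, residuals


def fill_gap(times, fluxes, gap_times, seed=0):
    """Generate fluxes at gap_times: model value plus a resampled residual."""
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    slope, intercept, residuals = fit_power_law(times, fluxes)
    filled = []
    for t in gap_times:
        log_model = slope * math.log10(t) + intercept
        # Assumption: scatter is drawn from the empirical residuals.
        filled.append(10 ** (log_model + rng.choice(residuals)))
    return filled


# Hypothetical light curve with a gap between 400 s and 3000 s.
times = [100.0, 200.0, 400.0, 3000.0, 6000.0, 12000.0]
fluxes = [1.0e-9, 4.6e-10, 1.8e-10, 1.7e-11, 7.0e-12, 3.4e-12]
gap_times = [800.0, 1200.0, 2000.0]

for t, f in zip(gap_times, fill_gap(times, fluxes, gap_times, seed=1)):
    print(t, f)
```

Resampling the observed residuals keeps the generated points consistent with the scatter already present in the data; a fitted residual distribution or a Gaussian Process, as mentioned in the abstract, would be alternative choices.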

arXiv:2203.00087 [pdf, other]

Using Multivariate Imputation by Chained Equations to Predict Redshifts of Active Galactic Nuclei

Authors: Spencer James Gibson, Aditya Narendra, Maria Giovanna Dainotti, Malgorzata Bogdan, Agnieszka Pollo, Artem Poliszczuk, Enrico Rinaldi, Ioannis Liodakis

Abstract: Redshift measurement of active galactic nuclei (AGNs) remains a time-consuming and challenging task, as it requires follow-up spectroscopic observations and detailed analysis. Hence, there exists an urgent requirement for alternative redshift estimation techniques. The use of machine learning (ML) for this purpose has been growing over the last few years, primarily due to the availability of large-scale galactic surveys. However, due to observational errors, a significant fraction of these data sets often have missing entries, rendering that fraction unusable for ML regression applications. In this study, we demonstrate the performance of an imputation technique called Multivariate Imputation by Chained Equations (MICE), which rectifies the issue of missing data entries by imputing them using the available information in the catalog. We use the Fermi-LAT Fourth Data Release Catalog (4LAC) and impute 24% of the catalog. Subsequently, we follow the methodology described in Dainotti et al. (2021) and create an ML model for estimating the redshift of 4LAC AGNs. We present results which highlight the positive impact of the MICE imputation technique on the machine learning model's performance and the resulting redshift estimation accuracy.

Submitted 28 February, 2022; originally announced March 2022.

Comments: 21 Pages, 11 Figures, 3 Tables

arXiv:2201.05374 [pdf, other]

doi 10.3847/1538-4365/ac545a

Predicting the redshift of gamma-ray loud AGNs using Supervised Machine Learning: Part 2

Authors: Aditya Narendra, Spencer James Gibson, Maria Giovanna Dainotti, Malgorzata Bogdan, Agnieszka Pollo, Ioannis Liodakis, Artem Poliszczuk

Abstract: Measuring the redshift of active galactic nuclei (AGNs) requires the use of time-consuming and expensive spectroscopic analysis. However, obtaining redshift measurements of AGNs is crucial as it can enable AGN population studies and provide insight into the star formation rate, the luminosity function, and the density rate evolution. Hence, there is a requirement for alternative redshift measurement techniques. In this project, we aim to use the Fermi gamma-ray space telescope's 4LAC Data Release (DR2) catalog to train a machine learning model capable of predicting the redshift reliably. In addition, this project aims to use the new 4LAC Catalog to improve and extend the predictive capabilities of the machine learning (ML) methodology published in Dainotti et al. (2021). Furthermore, we implement feature engineering to expand the parameter space and apply a bias correction technique to our final results. This study uses additional machine learning techniques inside the ensemble method, the SuperLearner, previously used in Dainotti et al. (2021). Additionally, we test a novel ML model called Sorted L-One Penalized Estimation (SLOPE). Using these methods, we provide a catalog of estimated redshift values for those AGNs that do not have a spectroscopic redshift measurement. These estimates can serve as a redshift reference for the community to verify as updated Fermi catalogs are released with more redshift measurements.

Submitted 14 January, 2022; originally announced January 2022.

Comments: 26 pages, 16 figures, 3 tables

arXiv:2107.10952 [pdf, other]

doi 10.3847/1538-4357/ac1748

Predicting the redshift of gamma-ray loud AGNs using supervised machine learning

Authors: Maria Giovanna Dainotti, Malgorzata Bogdan, Aditya Narendra, Spencer James Gibson, Blazej Miasojedow, Ioannis Liodakis, Agnieszka Pollo, Trevor Nelson, Kamil Wozniak, Zooey Nguyen, Johan Larrson

Abstract: AGNs are very powerful galaxies characterized by extremely bright emissions coming out from their central massive black holes. Knowing the redshifts of AGNs provides us with an opportunity to determine their distance to investigate important astrophysical problems such as the evolution of the early stars, their formation along with the structure of early galaxies. The redshift determination is challenging because it requires detailed follow-up of multi-wavelength observations, often involving various astronomical facilities. Here, we employ machine learning algorithms to estimate redshifts from the observed gamma-ray properties and photometric data of gamma-ray loud AGN from the Fourth Fermi-LAT Catalog. The prediction is obtained with the Superlearner algorithm, using a LASSO-selected set of predictors. We obtain a tight correlation, with a Pearson Correlation Coefficient of 71.3% between the inferred and the observed redshifts and an average Δz_norm = 11.6 x 10^-4. We stress that notwithstanding the small sample of gamma-ray loud AGNs, we obtain a reliable predictive model using Superlearner, which is an ensemble of several machine learning models.

Submitted 22 July, 2021; originally announced July 2021.

Comments: 29 pages, 19 Figures with a total of 39 panels

Showing 1–8 of 8 results for author: Narendra, A