-
Gamma-ray Bursts as Distance Indicators by a Statistical Learning Approach
Authors:
Maria Giovanna Dainotti,
Aditya Narendra,
Agnieszka Pollo,
Vahe Petrosian,
Malgorzata Bogdan,
Kazunari Iwasaki,
Jason Xavier Prochaska,
Enrico Rinaldi,
David Zhou
Abstract:
Gamma-ray bursts (GRBs) can be probes of the early universe, but currently, only 26% of GRBs observed by the Neil Gehrels Swift Observatory GRBs have known redshifts ($z$) due to observational limitations. To address this, we estimated the GRB redshift (distance) via a supervised statistical learning model that uses optical afterglow observed by Swift and ground-based telescopes. The inferred reds…
▽ More
Gamma-ray bursts (GRBs) can be probes of the early universe, but currently, only 26% of GRBs observed by the Neil Gehrels Swift Observatory GRBs have known redshifts ($z$) due to observational limitations. To address this, we estimated the GRB redshift (distance) via a supervised statistical learning model that uses optical afterglow observed by Swift and ground-based telescopes. The inferred redshifts are strongly correlated (a Pearson coefficient of 0.93) with the observed redshifts, thus proving the reliability of this method. The inferred and observed redshifts allow us to estimate the number of GRBs occurring at a given redshift (GRB rate) to be 8.47-9 $yr^{-1} Gpc^{-1}$ for $1.9<z<2.3$. Since GRBs come from the collapse of massive stars, we compared this rate with the star formation rate highlighting a discrepancy of a factor of 3 at $z<1$.
△ Less
Submitted 2 May, 2024; v1 submitted 6 February, 2024;
originally announced February 2024.
-
Inferring the redshift of more than 150 GRBs with a Machine Learning Ensemble model
Authors:
Maria Giovanna Dainotti,
Elias Taira,
Eric Wang,
Elias Lehman,
Aditya Narendra,
Agnieszka Pollo,
Grzegorz M. Madejski,
Vahe Petrosian,
Malgorzata Bogdan,
Apratim Dey,
Shubham Bhardwaj
Abstract:
Gamma-Ray Bursts (GRBs), due to their high luminosities are detected up to redshift 10, and thus have the potential to be vital cosmological probes of early processes in the universe. Fulfilling this potential requires a large sample of GRBs with known redshifts, but due to observational limitations, only 11\% have known redshifts ($z$). There have been numerous attempts to estimate redshifts via…
▽ More
Gamma-Ray Bursts (GRBs), due to their high luminosities are detected up to redshift 10, and thus have the potential to be vital cosmological probes of early processes in the universe. Fulfilling this potential requires a large sample of GRBs with known redshifts, but due to observational limitations, only 11\% have known redshifts ($z$). There have been numerous attempts to estimate redshifts via correlation studies, most of which have led to inaccurate predictions. To overcome this, we estimated GRB redshift via an ensemble supervised machine learning model that uses X-ray afterglows of long-duration GRBs observed by the Neil Gehrels Swift Observatory. The estimated redshifts are strongly correlated (a Pearson coefficient of 0.93) and have a root mean square error, namely the square root of the average squared error $\langleΔz^2\rangle$, of 0.46 with the observed redshifts showing the reliability of this method. The addition of GRB afterglow parameters improves the predictions considerably by 63\% compared to previous results in peer-reviewed literature. Finally, we use our machine learning model to infer the redshifts of 154 GRBs, which increase the known redshifts of long GRBs with plateaus by 94\%, a significant milestone for enhancing GRB population studies that require large samples with redshift.
△ Less
Submitted 7 January, 2024;
originally announced January 2024.
-
GRB Optical and X-ray Plateau Properties Classifier Using Unsupervised Machine Learning
Authors:
Shubham Bhardwaj,
Maria G. Dainotti,
Sachin Venkatesh,
Aditya Narendra,
Anish Kalsi,
Enrico Rinaldi,
Agnieszka Pollo
Abstract:
The division of Gamma-ray bursts (GRBs) into different classes, other than the "short" and "long", has been an active field of research. We investigate whether GRBs can be classified based on a broader set of parameters, including prompt and plateau emission ones. Observational evidence suggests the existence of more GRB sub-classes, but results so far are either conflicting or not statistically s…
▽ More
The division of Gamma-ray bursts (GRBs) into different classes, other than the "short" and "long", has been an active field of research. We investigate whether GRBs can be classified based on a broader set of parameters, including prompt and plateau emission ones. Observational evidence suggests the existence of more GRB sub-classes, but results so far are either conflicting or not statistically significant. The novelty here is producing a machine-learning-based classification of GRBs using their observed X-rays and optical properties. We used two data samples: the first, composed of 203 GRBs, is from the Neil Gehrels Swift Observatory (Swift/XRT), and the latter, composed of 134 GRBs, is from the ground-based Telescopes and Swift/UVOT. Both samples possess the plateau emission (a flat part of the light curve happening after the prompt emission, the main GRB event). We have applied the Gaussian Mixture Model (GMM) to explore multiple parameter spaces and sub-class combinations to reveal if there is a match between the current observational sub-classes and the statistical classification. With these samples and the algorithm, we spot a few micro-trends in certain cases, but we cannot conclude that any clear trend exists in classifying GRBs. These microtrends could point towards a deeper understanding of the physical meaning of these classes (e.g., a different environment of the same progenitor or different progenitors). However, a larger sample and different algorithms could achieve such goals. Thus, this methodology can lead to deeper insights in the future.
△ Less
Submitted 6 September, 2023; v1 submitted 28 August, 2023;
originally announced August 2023.
-
Fermi LAT AGN classification using supervised machine learning
Authors:
Nathaniel Cooper,
Maria Giovanna Dainotti,
Aditya Narendra,
Ioannis Liodakis,
Malgorzata Bogdan
Abstract:
Classifying Active Galactic Nuclei (AGN) is a challenge, especially for BL Lac Objects (BLLs), which are identified by their weak emission line spectra. To address the problem of classification, we use data from the 4th Fermi Catalog, Data Release 3. Missing data hinders the use of machine learning to classify AGN. A previous paper found that Multiple Imputation by Chain Equations (MICE) imputatio…
▽ More
Classifying Active Galactic Nuclei (AGN) is a challenge, especially for BL Lac Objects (BLLs), which are identified by their weak emission line spectra. To address the problem of classification, we use data from the 4th Fermi Catalog, Data Release 3. Missing data hinders the use of machine learning to classify AGN. A previous paper found that Multiple Imputation by Chain Equations (MICE) imputation is useful for estimating missing values. Since many AGN have missing redshift and the highest energy, we use data imputation with MICE and K-nearest neighbor (kNN) algorithm to fill in these missing variables. Then, we classify AGN into the BLLs or the Flat Spectrum Radio Quasars (FSRQs) using the SuperLearner, an ensemble method that includes several classification algorithms like logistic regression, support vector classifiers, Random Forests, Ranger Random Forests, multivariate adaptive regression spline (MARS), Bayesian regression, Extreme Gradient Boosting. We find that a SuperLearner model using MARS regression and Random Forests algorithms is 91.1% accurate for kNN imputed data and 91.2% for MICE imputed data. Furthermore, the kNN-imputed SuperLearner model predicts that 892 of the 1519 unclassified blazars are BLLs and 627 are Flat Spectrum Radio Quasars (FSRQs), while the MICE-imputed SuperLearner model predicts 890 BLLs and 629 FSRQs in the unclassified set. Thus, we can conclude that both imputation methods work efficiently and with high accuracy and that our methodology ushers the way for using SuperLearner as a novel classification method in the AGN community and, in general, in the astrophysics community.
△ Less
Submitted 21 July, 2023;
originally announced July 2023.
-
A Stochastic Approach To Reconstruct Gamma Ray Burst Lightcurves
Authors:
Maria G. Dainotti,
Ritwik Sharma,
Aditya Narendra,
Delina Levine,
Enrico Rinaldi,
Agnieszka Pollo,
Gopal Bhatta
Abstract:
Gamma-Ray Bursts (GRBs), being observed at high redshift (z = 9.4), vital to cosmological studies and investigating Population III stars. To tackle these studies, we need correlations among relevant GRB variables with the requirement of small uncertainties on their variables. Thus, we must have good coverage of GRB light curves (LCs). However, gaps in the LC hinder the precise determination of GRB…
▽ More
Gamma-Ray Bursts (GRBs), being observed at high redshift (z = 9.4), vital to cosmological studies and investigating Population III stars. To tackle these studies, we need correlations among relevant GRB variables with the requirement of small uncertainties on their variables. Thus, we must have good coverage of GRB light curves (LCs). However, gaps in the LC hinder the precise determination of GRB properties and are often unavoidable. Therefore, extensive categorization of GRB LCs remains a hurdle. We address LC gaps using a 'stochastic reconstruction,' wherein we fit two pre-existing models (Willingale 2007; W07 and Broken Power Law; BPL) to the observed LC, then use the distribution of flux residuals from the original data to generate data to fill in the temporal gaps. We also demonstrate a model-independent LC reconstruction via Gaussian Processes. At 10% noise, the uncertainty of the end time of the plateau, its correspondent flux, and the temporal decay index after the plateau decreases, on average, by 33.3% 35.03%, and 43.32%, respectively for the W07, and by 33.3%, 30.78%, 43.9% for the BPL. The slope of the plateau decreases by 14.76% in the BPL. After using the Gaussian Process technique, we see similar trends of a decrease in uncertainty for all model parameters for both the W07 and BPL models. These improvements are essential for the application of GRBs as standard candles in cosmology, for the investigation of theoretical models and for inferring the redshift of GRBs with future machine learning analysis.
△ Less
Submitted 22 May, 2023; v1 submitted 20 May, 2023;
originally announced May 2023.
-
Using Multivariate Imputation by Chained Equations to Predict Redshifts of Active Galactic Nuclei
Authors:
Spencer James Gibson,
Aditya Narendra,
Maria Giovanna Dainotti,
Malgorzata Bogdan,
Agniezska Pollo,
Artem Poliszczuk,
Enrico Rinaldi,
Ioannis Liodakis
Abstract:
Redshift measurement of active galactic nuclei (AGNs) remains a time-consuming and challenging task, as it requires follow up spectroscopic observations and detailed analysis. Hence, there exists an urgent requirement for alternative redshift estimation techniques. The use of machine learning (ML) for this purpose has been growing over the last few years, primarily due to the availability of large…
▽ More
Redshift measurement of active galactic nuclei (AGNs) remains a time-consuming and challenging task, as it requires follow up spectroscopic observations and detailed analysis. Hence, there exists an urgent requirement for alternative redshift estimation techniques. The use of machine learning (ML) for this purpose has been growing over the last few years, primarily due to the availability of large-scale galactic surveys. However, due to observational errors, a significant fraction of these data sets often have missing entries, rendering that fraction unusable for ML regression applications. In this study, we demonstrate the performance of an imputation technique called Multivariate Imputation by Chained Equations (MICE), which rectifies the issue of missing data entries by imputing them using the available information in the catalog. We use the Fermi-LAT Fourth Data Release Catalog (4LAC) and impute 24% of the catalog. Subsequently, we follow the methodology described in Dainotti et al. (2021) and create an ML model for estimating the redshift of 4LAC AGNs. We present results which highlight positive impact of MICE imputation technique on the machine learning models performance and obtained redshift estimation accuracy.
△ Less
Submitted 28 February, 2022;
originally announced March 2022.
-
Predicting the redshift of gamma-ray loud AGNs using Supervised Machine Learning: Part 2
Authors:
Aditya Narendra,
Spencer James Gibson,
Maria Giovanna Dainotti,
Malgorzata Bogdan,
Agnieszka Pollo,
Ioannis Liodakis,
Artem Poliszczuk
Abstract:
Measuring the redshift of active galactic nuclei (AGNs) requires the use of time-consuming and expensive spectroscopic analysis. However, obtaining redshift measurements of AGNs is crucial as it can enable AGN population studies, provide insight into the star formation rate, the luminosity function, and the density rate evolution. Hence, there is a requirement for alternative redshift measurement…
▽ More
Measuring the redshift of active galactic nuclei (AGNs) requires the use of time-consuming and expensive spectroscopic analysis. However, obtaining redshift measurements of AGNs is crucial as it can enable AGN population studies, provide insight into the star formation rate, the luminosity function, and the density rate evolution. Hence, there is a requirement for alternative redshift measurement techniques. In this project, we aim to use the Fermi gamma-ray space telescope's 4LAC Data Release (DR2) catalog to train a machine learning model capable of predicting the redshift reliably. In addition, this project aims at improving and extending with the new 4LAC Catalog the predictive capabilities of the machine learning (ML) methodology published in Dainotti et al. (2021). Furthermore, we implement feature engineering to expand the parameter space and a bias correction technique to our final results. This study uses additional machine learning techniques inside the ensemble method, the SuperLearner, previously used in Dainotti et al.(2021). Additionally, we also test a novel ML model called Sorted L-One Penalized Estimation (SLOPE). Using these methods we provide a catalog of estimated redshift values for those AGNs that do not have a spectroscopic redshift measurement. These estimates can serve as a redshift reference for the community to verify as updated Fermi catalogs are released with more redshift measurements.
△ Less
Submitted 14 January, 2022;
originally announced January 2022.
-
Predicting the redshift of gamma-ray loud AGNs using supervised machine learning
Authors:
Maria Giovanna Dainotti,
Malgorzata Bogdan,
Aditya Narendra,
Spencer James Gibson,
Blazej Miasojedow,
Ioannis Liodakis,
Agnieszka Pollo,
Trevor Nelson,
Kamil Wozniak,
Zooey Nguyen,
Johan Larrson
Abstract:
AGNs are very powerful galaxies characterized by extremely bright emissions coming out from their central massive black holes. Knowing the redshifts of AGNs provides us with an opportunity to determine their distance to investigate important astrophysical problems such as the evolution of the early stars, their formation along with the structure of early galaxies. The redshift determination is cha…
▽ More
AGNs are very powerful galaxies characterized by extremely bright emissions coming out from their central massive black holes. Knowing the redshifts of AGNs provides us with an opportunity to determine their distance to investigate important astrophysical problems such as the evolution of the early stars, their formation along with the structure of early galaxies. The redshift determination is challenging because it requires detailed follow-up of multi-wavelength observations, often involving various astronomical facilities. Here, we employ machine learning algorithms to estimate redshifts from the observed gamma-ray properties and photometric data of gamma-ray loud AGN from the Fourth Fermi-LAT Catalog. The prediction is obtained with the Superlearner algorithm, using LASSO selected set of predictors. We obtain a tight correlation, with a Pearson Correlation Coefficient of 71.3% between the inferred and the observed redshifts, an average Δz_norm = 11.6 x 10^-4. We stress that notwithstanding the small sample of gamma-ray loud AGNs, we obtain a reliable predictive model using Superlearner, which is an ensemble of several machine learning models.
△ Less
Submitted 22 July, 2021;
originally announced July 2021.