-
Neural Networks Assisted Metropolis-Hastings for Bayesian Estimation of Critical Exponent on Elliptic Black Hole Solution in 4D Using Quantum Perturbation Theory
Authors:
Armin Hatefi,
Ehsan Hatefi,
Roberto J. López-Sastre
Abstract:
It is well known that critical gravitational collapse produces continuous self-similar solutions characterized by the Choptuik critical exponent, $\gamma$. We examine the solutions in the domains of the linear perturbation equations, considering the numerical measurement errors. Specifically, we study quantum perturbation theory for the four-dimensional Einstein-axion-dilaton system of the elliptic class of $\text{SL}(2,\mathbb{R})$ transformations. We develop a novel artificial neural network-assisted Metropolis-Hastings algorithm based on quantum perturbation theory to find the distribution of the critical exponent in a Bayesian framework. Unlike existing methods, this new probabilistic approach identifies the available deterministic solution and explores the range of physically distinguishable critical exponents that may arise due to numerical measurement errors.
Submitted 19 June, 2024; v1 submitted 6 June, 2024;
originally announced June 2024.
-
Liu-type Shrinkage Estimators for Mixture of Poisson Regressions with Experts: A Heart Disease Study
Authors:
Elsayed Ghanem,
Moein Yoosefi,
Armin Hatefi
Abstract:
Count data play a critical role in medical research, such as heart disease studies. The Poisson regression model is a common technique for evaluating the impact of a set of covariates on the count responses. The mixture of Poisson regression models with experts is a practical tool to exploit the covariates, not only to handle the heterogeneity in the Poisson regressions but also to learn the mixing structure of the population. Multicollinearity is one of the most common challenges with regression models, leading to ill-conditioned design matrices of Poisson regression components and expert classes. Under multicollinearity, the maximum likelihood (ML) method produces unreliable and misleading estimates for the effects of the covariates. In this research, we develop Ridge and Liu-type methods as two shrinkage approaches to cope with the ill-conditioned design matrices of the mixture of Poisson regression models with experts. Through various numerical studies, we demonstrate that the shrinkage methods offer more reliable estimates for the coefficients of the mixture model under multicollinearity while maintaining the classification performance of the ML method. Finally, the shrinkage methods are applied to a heart study to analyze the heart disease rate stages.
Submitted 11 September, 2023;
originally announced September 2023.
-
Sequential Monte Carlo with Cross-validated Neural Networks for Complexity of Hyperbolic Black Hole Solutions in 4D
Authors:
Armin Hatefi,
Ehsan Hatefi
Abstract:
This paper investigates the self-similar solutions of the Einstein-axion-dilaton configuration from type IIB string theory and the global SL(2,R) symmetry. We consider the Continuous Self Similarity (CSS), where the scale transformation is controlled by an SL(2,R) boost or hyperbolic translation. The solutions stay invariant under the combination of space-time dilation with internal SL(2,R) transformations. We develop a new formalism based on Sequential Monte Carlo (SMC) and artificial neural networks (NNs) to estimate the self-similar solutions to the equations of motion in the hyperbolic class in four dimensions. Due to the complex and highly nonlinear patterns, researchers typically have to use various constraints and numerical approximation methods to estimate the equations of motion; thus, they have to overlook the measurement errors in parameter estimation. Through a Bayesian framework, we incorporate measurement errors into our models to find the solutions to the hyperbolic equations of motion. It is well known that the hyperbolic class suffers from multiple solutions where the critical collapse functions have overlapping domains for these solutions. To deal with this complexity, for the first time in the literature on the axion-dilaton system, we propose the SMC approach to obtain the multi-modal posterior distributions. Through a probabilistic perspective, we confirm the deterministic $\alpha$ and $\beta$ solutions available in the literature and determine all possible solutions that may occur due to measurement errors. Finally, we propose the penalized Leave-One-Out Cross-validation (LOOCV) to combine the Bayesian NN-based estimates optimally. The approach enables us to determine the optimum weights while dealing with the collinearity issue in the NN-based estimates and to better predict the critical functions corresponding to multiple solutions of the equations of motion.
Submitted 24 November, 2023; v1 submitted 15 August, 2023;
originally announced August 2023.
-
Modeling the complexity of Elliptic Black Hole Solution In 4D Using Hamiltonian Monte Carlo with Stacked Neural Networks
Authors:
Armin Hatefi,
Ehsan Hatefi,
Roberto J. López-Sastre
Abstract:
In this paper, we study the black hole solution of self-similar gravitational collapse in the Einstein-axion-dilaton system for the elliptic class in four dimensions. The solution is invariant under space-time dilation, which is combined with internal SL(2,R) transformations. Due to the complex and highly nonlinear pattern of the equations of motion in the physics of black holes, researchers typically have to use various numerical techniques to make the equations tractable to estimate the parameters and the critical solutions. To this end, they have to ignore the numerical measurement errors in estimating the parameters. To our knowledge, for the first time in the literature on axion-dilaton systems, we propose to estimate the critical collapse functions in a Bayesian framework. We develop a novel methodology to translate the modelling of the complexity of the elliptic black hole to a sampling problem using Hamiltonian Monte Carlo with stacked neural networks. Unlike methods in the literature, this probabilistic approach enables us not only to recover the available deterministic solution but also to explore possibly all physically distinguishable self-similar solutions that may occur due to numerical measurement errors.
Submitted 28 September, 2023; v1 submitted 26 July, 2023;
originally announced July 2023.
-
Bayesian Mixture Modelling with Ranked Set Samples
Authors:
Amirhossein Alvandi,
Sedigheh Omidvar,
Armin Hatefi,
Mohammad Jafari Jozani,
Omer Ozturk,
Nader Nematollahi
Abstract:
We consider the Bayesian estimation of the parameters of a finite mixture model from independent order statistics arising from imperfect ranked set sampling (RSS) designs. As a cost-effective method, ranked set sampling enables us to incorporate easily attainable characteristics, as ranking information, into data collection and Bayesian estimation. To handle the special structure of the ranked set samples, we develop a Bayesian estimation approach that uses the Expectation-Maximization (EM) algorithm to estimate the ranking parameters and Metropolis-within-Gibbs sampling to estimate the parameters of the underlying mixture model. Our findings show that the proposed RSS-based Bayesian estimation method outperforms the commonly used Bayesian counterpart using simple random sampling. The developed method is finally applied to estimate the bone disorder status of women aged 50 and older.
Submitted 13 June, 2023;
originally announced June 2023.
-
Analysis of Black Hole Solutions in Parabolic Class Using Neural Networks
Authors:
Ehsan Hatefi,
Armin Hatefi,
Roberto J. López-Sastre
Abstract:
In this paper, we introduce a numerical method based on Artificial Neural Networks (ANNs) for the analysis of black hole solutions to the Einstein-axion-dilaton system in a high-dimensional parabolic class. Leveraging a profile root-finding technique based on General Relativity, we describe an ANN solver to directly tackle the system of ordinary differential equations. Through our extensive numerical analysis, we demonstrate, for the first time, that there is no self-similar critical solution for the parabolic class in higher space-time dimensions. Specifically, we develop $95\%$ ANN-based confidence intervals for all the solutions in their domains. At the $95\%$ confidence level, our ANN estimators confirm that there is no black hole solution in higher dimensions; hence, the gravitational collapse does not occur. These results cast some doubt on the universality of the Choptuik phenomena. Therefore, we conclude that the fastest-growing mode of the perturbations, which determines the critical exponent, does not exist for the parabolic class in higher dimensions.
Submitted 23 July, 2023; v1 submitted 9 February, 2023;
originally announced February 2023.
-
Unsupervised Liu-type Shrinkage Estimators for Mixture of Regression Models
Authors:
Elsayed Ghanem,
Armin Hatefi,
Hamid Usefi
Abstract:
In many applications (e.g., medical studies), the population of interest (e.g., disease status) comprises heterogeneous subpopulations. The mixture of probabilistic regression models is one of the most common techniques to incorporate the information of covariates into learning of the population heterogeneity. Despite its flexibility, the model may lead to unreliable estimates in the presence of multicollinearity. In this paper, we develop Liu-type shrinkage methods through an unsupervised learning approach to estimate the model coefficients under multicollinearity. The performance of the developed methods is evaluated via classification and stochastic versions of EM algorithms. The numerical studies show that the proposed methods outperform their Ridge and maximum likelihood counterparts. Finally, the developed methods are applied to analyze the bone mineral data of women aged 50 and older.
Submitted 10 September, 2022;
originally announced September 2022.
-
Liu-type Shrinkage Estimators for Mixture of Logistic Regressions: An Osteoporosis Study
Authors:
Elsayed Ghanem,
Armin Hatefi,
Hamid Usefi
Abstract:
The logistic regression model is one of the most powerful statistical methods for the analysis of binary data. Logistic regression allows a set of covariates to be used to explain the binary responses. The mixture of logistic regression models is used to fit heterogeneous populations through an unsupervised learning approach. Multicollinearity is one of the most common problems in logistic regression and in mixtures of logistic regressions where the covariates are highly correlated. This problem results in unreliable maximum likelihood estimates for the regression coefficients. In this research, we develop shrinkage methods to deal with multicollinearity in a mixture of logistic regression models. These shrinkage methods include ridge and Liu-type estimators. Through extensive numerical studies, we show that the developed methods provide more reliable results in estimating the coefficients of the mixture. Finally, we apply the shrinkage methods to analyze the bone disorder status of women aged 50 and older.
Submitted 7 September, 2023; v1 submitted 4 September, 2022;
originally announced September 2022.
-
Nonlinear Statistical Spline Smoothers for Critical Spherical Black Hole Solutions in 4-dimension
Authors:
Ehsan Hatefi,
Armin Hatefi
Abstract:
This paper focuses on self-similar gravitational collapse solutions of the Einstein-axion-dilaton configuration for two conjugacy classes of SL(2, R) transformations. These solutions are invariant under spacetime dilation, combined with internal transformations. For the first time in the Einstein-axion-dilaton literature, we apply nonlinear statistical spline regression methods to estimate the critical spherical black hole solutions in four dimensions. These spline methods include truncated power basis, natural cubic spline and penalized B-spline. The prediction errors of the statistical models, on average, are almost less than $10^{-2}$, so all the developed models can be considered unbiased estimators for the critical collapse functions over their entire domains. In addition, we derive closed-form, continuously differentiable estimators for all the critical collapse functions.
Submitted 5 September, 2022; v1 submitted 3 January, 2022;
originally announced January 2022.
-
Multiple Observers Ranked Set Samples for Shrinkage Estimators
Authors:
Andrew David Pearce,
Armin Hatefi
Abstract:
Ranked set sampling (RSS) is used as a powerful data collection technique for situations where measuring the study variable requires a costly and/or tedious process while the sampling units can be ranked easily (e.g., osteoporosis research). In this paper, we develop ridge and Liu-type shrinkage estimators under RSS data from multiple observers to handle the collinearity problem in estimating coefficients of linear regression, stochastic restricted regression and logistic regression. Through extensive numerical studies, we show that shrinkage methods with the multi-observer RSS result in more efficient coefficient estimates. The developed methods are finally applied to bone mineral data for analysis of bone disorder status of women aged 50 and older.
Submitted 15 October, 2021;
originally announced October 2021.
-
Estimation of Critical Collapse Solutions to Black Holes with Nonlinear Statistical Models
Authors:
Ehsan Hatefi,
Armin Hatefi
Abstract:
The self-similar gravitational collapse solutions to the Einstein-axion-dilaton system have already been found. These solutions are invariant under the combination of spacetime dilation with internal SL(2, R) transformations. We apply nonlinear statistical models to estimate the functions that appear in the physics of black holes of the axion-dilaton system in four dimensions. These statistical models include parametric polynomial regression, nonparametric kernel regression and semi-parametric local polynomial regression models. Through various numerical studies, we obtain accurate numerical and closed-form, continuously differentiable estimates for the functions appearing in the metric and equations of motion.
Submitted 28 November, 2022; v1 submitted 14 October, 2021;
originally announced October 2021.
-
Analysis of Ordinal Populations from Judgment Post-Stratification
Authors:
Amirhossein Alvandi,
Armin Hatefi
Abstract:
In surveys requiring cost efficiency, such as medical research, measuring the variable of interest (e.g., disease status) is expensive and/or time-consuming; however, we often have access to easily attainable characteristics of the sampling units. These characteristics are not typically employed in the data collection process. Judgment post-stratification (JPS) sampling enables us to supplement the random samples from the population of interest with these characteristics as ranking information. In this paper, we develop methods based on JPS samples for the estimation of categorical ordinal populations. We develop various estimators from JPS data, even for situations where JPS suffers from empty strata. We also propose JPS estimators using multiple ranking resources. Through extensive numerical studies, we evaluate the performance of the methods in the estimation of the population. Finally, the developed estimation methods are applied to bone mineral data to estimate the bone disorder status of women aged 50 and older.
Submitted 21 February, 2023; v1 submitted 23 September, 2021;
originally announced September 2021.
-
Statistical Inference, Learning and Models in Big Data
Authors:
Beate Franke,
Jean-François Plante,
Ribana Roscher,
Annie Lee,
Cathal Smyth,
Armin Hatefi,
Fuqi Chen,
Einat Gil,
Alexander Schwing,
Alessandro Selvitella,
Michael M. Hoffman,
Roger Grosse,
Dieter Hendricks,
Nancy Reid
Abstract:
The need for new methods to deal with big data is a common theme in most scientific fields, although its definition tends to vary with the context. Statistical ideas are an essential part of this, and as a partial response, a thematic program on statistical inference, learning, and models in big data was held in 2015 in Canada, under the general direction of the Canadian Statistical Sciences Institute, with major funding from, and most activities located at, the Fields Institute for Research in Mathematical Sciences. This paper gives an overview of the topics covered, describing challenges and strategies that seem common to many different areas of application, and including some examples of applications to make these challenges and strategies more concrete.
Submitted 28 January, 2016; v1 submitted 9 September, 2015;
originally announced September 2015.
-
Information content of partially rank-ordered set samples
Authors:
Armin Hatefi,
Mohammad Jafari Jozani
Abstract:
Partially rank-ordered set (PROS) sampling is a generalization of ranked set sampling in which rankers are not required to fully rank the sampling units in each set, hence having more flexibility to perform the necessary judgemental ranking process. PROS sampling has a wide range of applications in different fields, ranging from environmental and ecological studies to medical research, and it has been shown to be superior to ranked set sampling and simple random sampling for estimating the population mean. In this paper, we study the Fisher information (FI) content and uncertainty structure of the PROS samples and compare them with those of simple random sample (SRS) and ranked set sample (RSS) counterparts of the same size from the underlying population. We study the uncertainty structure in terms of the Shannon entropy, Rényi entropy and Kullback-Leibler (KL) discrimination measures. Several examples, including the FI of PROS samples from the location-scale family of distributions as well as a regression model, are discussed.
Submitted 27 April, 2015;
originally announced April 2015.
-
Proportion estimation based on a partially rank ordered set sample with multiple concomitants in a breast cancer study
Authors:
Armin Hatefi,
Mohammad Jafari Jozani
Abstract:
In this paper, we use a partially rank-ordered set (PROS) sampling design with multiple concomitants in a breast cancer study and propose a method to estimate the proportion of patients with malignant (cancerous) breast tumours in a given population. Through extensive numerical studies, the performance of the estimator is evaluated under various concomitants with different ranking potentials (i.e., good, intermediate and bad) and tie-structures. The ranking information for the PROS estimator with multiple concomitants comes from easy-to-obtain cytological characteristics that are associated with the malignancy of breast tumours. We show that this estimator performs better than its counterparts under simple random sampling (SRS) and ranked set sampling (RSS) designs with logistic regression models. As opposed to available RSS-based methods in the literature, our proposed methodology allows ties to be declared among the ranks and does not rely on any specific regression model assumptions.
Submitted 10 November, 2014;
originally announced November 2014.