-
The effect of ISM absorption on stellar activity measurements and its relevance for exoplanet studies
Authors:
L. Fossati,
S. E. Marcelja,
D. Staab,
P. E. Cubillos,
K. France,
C. A. Haswell,
S. Ingrassia,
J. S. Jenkins,
T. Koskinen,
A. F. Lanza,
S. Redfield,
A. Youngblood,
G. Pelzmann
Abstract:
Past UV and optical observations of stars hosting hot Jupiters have shown that some of these stars present an anomalously low chromospheric activity, significantly below the basal level. For WASP-13, observations have shown that the apparent lack of activity is possibly caused by absorption from the intervening ISM. Inspired by this result, we study the effect of ISM absorption on activity measurements (S and logR'$_{\rm HK}$ indices) for main-sequence late-type stars. To this end, we employ synthetic stellar photospheric spectra combined with varying amounts of chromospheric emission and ISM absorption. We present the effect of ISM absorption on activity measurements by varying several instrumental, stellar, and ISM parameters. We find that for relative velocities between the stellar and ISM lines smaller than 30-40 km/s and for ISM CaII column densities logN$_{\rm CaII}$>12, the ISM absorption has a significant influence on activity measurements. Direct measurements and three dimensional maps of the Galactic ISM absorption indicate that an ISM CaII column density of logN$_{\rm CaII}$=12 is typically reached by a distance of about 100 pc along most sight lines. In particular, for a Sun-like star lying at a distance greater than 100 pc, we expect a depression (bias) in the logR'$_{\rm HK}$ value larger than 0.05-0.1 dex, about the same size as the typical measurement and calibration uncertainties on this parameter. This work shows that the bias introduced by ISM absorption must always be considered when measuring activity for stars lying beyond 100 pc. We also consider the effect of multiple ISM absorption components. We discuss the relevance of this result for exoplanet studies.
Submitted 9 February, 2017;
originally announced February 2017.
-
A bimodal correlation between host star chromospheric emission and the surface gravity of hot Jupiters
Authors:
L. Fossati,
S. Ingrassia,
A. F. Lanza
Abstract:
The chromospheric activity index logR'HK of stars hosting transiting hot Jupiters appears to be correlated with the planets' surface gravity. One of the possible explanations is based on the presence of condensations of planetary evaporated material located in a circumstellar cloud that absorbs the CaII H&K and MgII h&k resonance line emission flux, used to measure chromospheric activity. A larger column density in the condensations, or equivalently a stronger absorption in the chromospheric lines, is obtained when the evaporation rate of the planet is larger, which occurs for a lower gravity of the planet. We analyze here a sample of stars hosting transiting hot Jupiters tuned in order to minimize systematic effects (e.g., interstellar medium absorption). Using a mixture model, we find that the data are best fit by a two-linear-regression model. We interpret this result in terms of the Vaughan-Preston gap. We use a Monte Carlo approach to best take into account the uncertainties, finding that the two intercepts fit the observed peaks of the distribution of logR'HK for main-sequence solar-like stars. We also find that the intercepts are correlated with the slopes, as predicted by the model based on the condensations of planetary evaporated material. Our findings bring further support to this model, although we cannot firmly exclude different explanations. A precise determination of the slopes of the two linear components would allow one to estimate the average effective stellar flux powering planetary evaporation, which can then be used for theoretical population and evolution studies of close-in planets.
Submitted 15 October, 2015;
originally announced October 2015.
-
Robust estimation for mixtures of Gaussian factor analyzers, based on trimming and constraints
Authors:
L. A. García-Escudero,
A. Gordaliza,
F. Greselin,
S. Ingrassia,
A. Mayo-Iscar
Abstract:
Mixtures of Gaussian factors are powerful tools for modeling an unobserved heterogeneous population, offering - at the same time - dimension reduction and model-based clustering. Unfortunately, the high prevalence of spurious solutions and the disturbing effects of outlying observations, along maximum likelihood estimation, open serious issues. In this paper we consider restrictions for the component covariances, to avoid spurious solutions, and trimming, to provide robustness against violations of normality assumptions of the underlying latent factors. A detailed AECM algorithm for this new approach is presented. Simulation results and an application to the AIS dataset show the aim and effectiveness of the proposed methodology.
Submitted 21 March, 2015;
originally announced March 2015.
-
Robust estimation of mixtures of regressions with random covariates, via trimming and constraints
Authors:
L. A. Garcia-Escudero,
A. Gordaliza,
F. Greselin,
S. Ingrassia,
A. Mayo-Iscar
Abstract:
A robust estimator for a wide family of mixtures of linear regression is presented. Robustness is based on the joint adoption of the Cluster Weighted Model and of an estimator based on trimming and restrictions. The selected model provides the conditional distribution of the response for each group, as in mixtures of regression, and further supplies local distributions for the explanatory variables. A novel version of the restrictions has been devised, under this model, for separately controlling the two sources of variability identified in it. This proposal avoids singularities in the log-likelihood, caused by approximate local collinearity in the explanatory variables or local exact fit in regressions, and reduces the occurrence of spurious local maximizers. In a natural way, due to the interaction between the model and the estimator, the procedure is able to resist the harmful influence of bad leverage points along the estimation of the mixture of regressions, which is still an open issue in the literature. The given methodology defines a well-posed statistical problem, whose estimator exists and is consistent to the corresponding solution of the population optimum, under widely general conditions. A feasible EM algorithm has also been provided to obtain the corresponding estimation. Many simulated examples and two real datasets have been chosen to show the ability of the procedure, on the one hand, to detect anomalous data, and, on the other hand, to identify the real cluster regressions without the influence of contamination.
Submitted 4 February, 2015;
originally announced February 2015.
-
Multivariate response and parsimony for Gaussian cluster-weighted models
Authors:
Utkarsh J. Dang,
Antonio Punzo,
Paul D. McNicholas,
Salvatore Ingrassia,
Ryan P. Browne
Abstract:
A family of parsimonious Gaussian cluster-weighted models is presented. This family concerns a multivariate extension to cluster-weighted modelling that can account for correlations between multivariate responses. Parsimony is attained by constraining parts of an eigen-decomposition imposed on the component covariance matrices. A sufficient condition for identifiability is provided and an expectation-maximization algorithm is presented for parameter estimation. Model performance is investigated on both synthetic and classical real data sets and compared with some popular approaches. Finally, accounting for linear dependencies in the presence of a linear regression structure is shown to offer better performance, vis-à-vis clustering, over existing methodologies.
Submitted 26 February, 2016; v1 submitted 3 November, 2014;
originally announced November 2014.
-
Fitting Bivariate Mixed-Type Data via the Generalized Linear Exponential Cluster-Weighted Model
Authors:
Salvatore Ingrassia,
Antonio Punzo
Abstract:
The cluster-weighted model (CWM) is a mixture model with random covariates which allows for flexible clustering and density estimation of a random vector composed by a response variable and by a set of covariates. In this class of models, the generalized linear exponential CWM is here introduced especially for modeling bivariate data of mixed-type. Its natural counterpart, in the family of latent class models, is also defined. Maximum likelihood parameter estimates are derived using the EM algorithm and model selection is carried out using the Bayesian information criterion (BIC). Artificial and real data are finally considered to exemplify and appreciate the proposed model.
Submitted 5 August, 2013; v1 submitted 30 March, 2013;
originally announced April 2013.
-
Maximum likelihood estimation in constrained parameter spaces for mixtures of factor analyzers
Authors:
Francesca Greselin,
Salvatore Ingrassia
Abstract:
Mixtures of factor analyzers are becoming more and more popular in the area of model based clustering of high-dimensional data. According to the likelihood approach in data modeling, it is well known that the unconstrained log-likelihood function may present spurious maxima and singularities and this is due to specific patterns of the estimated covariance structure, when their determinant approaches 0. To reduce such drawbacks, in this paper we introduce a procedure for the parameter estimation of mixtures of factor analyzers, which maximizes the likelihood function in a constrained parameter space. We then analyze and measure its performance, compared to the usual non-constrained approach, via some simulations and applications to real data sets.
Submitted 8 January, 2013;
originally announced January 2013.
-
Modeling high energy cosmic rays mass composition data via mixtures of multivariate skew-t distributions
Authors:
S. Riggi,
S. Ingrassia
Abstract:
We consider multivariate skew-t distributions for modeling composition data of high energy cosmic rays. The model has been validated with simulated data for different primary nuclei and hadronic models focusing on the depth of maximum Xmax and number of muons Nμ observables. Further, we consider mixtures of multivariate skew-t distributions for cosmic ray mass composition determination and event-by-event classification. With respect to other approaches in the field, it is based on analytical calculations and allows to incorporate different sets of constraints provided by the present hadronic models. We present some applications to simulated data sets generated with different nuclear abundances assumptions. As it does not fully rely on the hadronic model predictions, the method is particularly suited to the current experimental scenario in which evidences of discrepancies of the measured data with respect to the models have been reported for some shower observables, such as the number of muons at ground level.
Submitted 7 January, 2013;
originally announced January 2013.
-
Generalized Linear Gaussian Cluster-Weighted Modeling
Authors:
Salvatore Ingrassia,
Simona C. Minotti,
Antonio Punzo,
Giorgio Vittadini
Abstract:
Cluster-Weighted Modeling (CWM) is a flexible mixture approach for modeling the joint probability of data coming from a heterogeneous population as a weighted sum of the products of marginal distributions and conditional distributions. In this paper, we introduce a wide family of Cluster Weighted models in which the conditional distributions are assumed to belong to the exponential family with canonical links which will be referred to as Generalized Linear Gaussian Cluster Weighted Models. Moreover, we show that, in a suitable sense, mixtures of generalized linear models can be considered as nested in Generalized Linear Gaussian Cluster Weighted Models. The proposal is illustrated through many numerical studies based on both simulated and real data sets.
Submitted 19 December, 2012; v1 submitted 6 November, 2012;
originally announced November 2012.
-
Clustering and Classification via Cluster-Weighted Factor Analyzers
Authors:
Sanjeena Subedi,
Antonio Punzo,
Salvatore Ingrassia,
Paul D. McNicholas
Abstract:
In model-based clustering and classification, the cluster-weighted model constitutes a convenient approach when the random vector of interest constitutes a response variable Y and a set p of explanatory variables X. However, its applicability may be limited when p is high. To overcome this problem, this paper assumes a latent factor structure for X in each mixture component. This leads to the cluster-weighted factor analyzers (CWFA) model. By imposing constraints on the variance of Y and the covariance matrix of X, a novel family of sixteen CWFA models is introduced for model-based clustering and classification. The alternating expectation-conditional maximization algorithm, for maximum likelihood estimation of the parameters of all the models in the family, is described; to initialize the algorithm, a 5-step hierarchical procedure is proposed, which uses the nested structures of the models within the family and thus guarantees the natural ranking among the sixteen likelihoods. Artificial and real data show that these models have very good clustering and classification performance and that the algorithm is able to recover the parameters very well.
Submitted 28 September, 2012;
originally announced September 2012.
-
Maximum Likelihood Estimation of Gaussian Cluster Weighted Models and Relationships with Mixtures of Regression
Authors:
Salvatore Ingrassia,
Simona C. Minotti
Abstract:
Cluster-weighted modeling (CWM) is a mixture approach for modeling the joint probability of a response variable and a set of explanatory variables. The parameters are estimated by means of the expectation-maximization algorithm according to the maximum likelihood approach. Under Gaussian assumptions, we analyse the complete-data likelihood function of cluster weighted models. Further, under suitable hypotheses we show that the maximization of the likelihood function of Gaussian cluster weighted models leads to the same parameter estimates of finite mixtures of regression and finite mixtures of regression with concomitant variables. In this sense, the latter ones can be considered as nested models of Gaussian cluster weighted models.
Submitted 8 August, 2013; v1 submitted 12 July, 2012;
originally announced July 2012.
-
Model-based clustering via linear cluster-weighted models
Authors:
Salvatore Ingrassia,
Simona C. Minotti,
Antonio Punzo
Abstract:
A novel family of twelve mixture models with random covariates, nested in the linear $t$ cluster-weighted model (CWM), is introduced for model-based clustering. The linear $t$ CWM was recently presented as a robust alternative to the better known linear Gaussian CWM. The proposed family of models provides a unified framework that also includes the linear Gaussian CWM as a special case. Maximum likelihood parameter estimation is carried out within the EM framework, and both the BIC and the ICL are used for model selection. A simple and effective hierarchical random initialization is also proposed for the EM algorithm. The novel model-based clustering technique is illustrated in some applications to real data. Finally, a simulation study for evaluating the performance of the BIC and the ICL is presented.
Submitted 9 March, 2015; v1 submitted 18 June, 2012;
originally announced June 2012.
-
Local statistical modeling by cluster-weighted
Authors:
Salvatore Ingrassia,
Simona C. Minotti,
Giorgio Vittadini
Abstract:
We investigate statistical properties of Cluster-Weighted Modeling, which is a framework for supervised learning originally developed in order to recreate a digital violin with traditional inputs and realistic sound. The analysis is carried out in comparison with Finite Mixtures of Regression models. Based on some geometrical arguments, we highlight that Cluster-Weighted Modeling provides a quite general framework for local statistical modeling. Theoretical results are illustrated on the ground of some numerical simulations.
Submitted 15 June, 2011; v1 submitted 13 November, 2009;
originally announced November 2009.