
Tackling the infinite likelihood problem when fitting mixtures of shifted asymmetric Laplace distributions

Authors: Yuan Fang, Brian C. Franczak, Sanjeena Subedi

Abstract: Mixtures of shifted asymmetric Laplace distributions were introduced as a tool for model-based clustering that allowed for the direct parameterization of skewness in addition to location and scale. Following common practices, an expectation-maximization algorithm was developed to fit these mixtures. However, adaptations to account for the 'infinite likelihood problem' led to fits that gave good classification performance at the expense of parameter recovery. In this paper, we propose a more valuable solution to this problem by developing a novel Bayesian parameter estimation scheme for mixtures of shifted asymmetric Laplace distributions. Through simulation studies, we show that the proposed parameter estimation scheme gives better parameter estimates compared to the expectation-maximization based scheme. In addition, we also show that the classification performance is as good as, and in some cases better than, that of the expectation-maximization based scheme. The performance of both schemes is also assessed using well-known real data sets.

Submitted 24 March, 2023; originally announced March 2023.

arXiv:1706.08927 [pdf, other]

Subspace Clustering with the Multivariate-t Distribution

Authors: Angelina Pesevski, Brian C. Franczak, Paul D. McNicholas

Abstract: Clustering procedures suitable for the analysis of very high-dimensional data are needed for many modern data sets. In model-based clustering, a method called high-dimensional data clustering (HDDC) uses a family of Gaussian mixture models for clustering. HDDC is based on the idea that high-dimensional data usually exists in lower-dimensional subspaces; as such, an intrinsic dimension for each sub-population of the observed data can be estimated and cluster analysis can be performed in this lower-dimensional subspace. As a result, only a fraction of the total number of parameters need to be estimated and a computationally efficient parameter estimation scheme based on the EM algorithm was developed. This family of models has gained attention due to its superior classification performance compared to other families of mixture models; however, it still suffers from the usual limitations of Gaussian mixture model-based approaches. In this paper, a robust analogue of the HDDC approach is proposed. This approach, which extends the HDDC procedure to include the multivariate-t distribution, encompasses 28 models that rectify the aforementioned shortcomings of the HDDC procedure. Our tHDDC procedure is fitted to both simulated and real data sets and is compared to the HDDC procedure using an image reconstruction problem that arose from satellite imagery of Mars' surface.

Submitted 27 June, 2017; originally announced June 2017.

Comments: 16 pages, 2 figures

arXiv:1507.04470 [pdf, ps, other]

Direct mass measurements of Cd isotopes show strong shell gap at N=82

Authors: R. Knöbel, M. Diwisch, F. Bosch, D. Boutin, L. Chen, C. Dimopoulou, A. Dolinskii, B. Franczak, B. Franzke, H. Geissel, M. Hausmann, C. Kozhuharov, J. Kurcewicz, S. A. Litvinova, G. Martínez-Pinedo, M. Matoš, M. Mazzocco, G. Münzenberg, S. Nakajima, C. Nociforo, F. Nolden, T. Ohtsubo, A. Ozawa, Z. Patyk, W. R. Plaß, et al. (10 additional authors not shown)

Abstract: A $^{238}$U projectile beam was used to create cadmium isotopes via abrasion-fission at 410 MeV/u in a beryllium target at the entrance of the in-flight separator FRS at GSI. The fission fragments were separated with the FRS and injected into the isochronous storage ring ESR for mass measurements. The Isochronous Mass Spectrometry (IMS) was performed under two different experimental conditions, with and without B$ρ$-tagging at the dispersive central focal plane of the FRS. In the experiment with B$ρ$-tagging the magnetic rigidity of the injected fragments was determined with an accuracy of $2\times 10^{-4}$. A new method of data analysis, using a correlation matrix for the combined data set from both experiments, has provided mass values for 25 different isotopes for the first time. The high selectivity and sensitivity of the experiment and analysis have given access even to rare isotopes detected with a few atoms per week. In this letter we present for the $^{129,130,131}$Cd isotopes mass values directly measured for the first time. The Cd results clearly show a very pronounced shell effect at $N=82$ which is in agreement with the conclusion from $γ$-ray spectroscopy of $^{130}$Cd and confirms the assumptions of modern shell-model calculations.

Submitted 16 July, 2015; originally announced July 2015.

Comments: 7 pages, 5 figures, 3 tables

arXiv:1403.2332 [pdf, other]

A Mixture of Coalesced Generalized Hyperbolic Distributions

Authors: Cristina Tortora, Brian C. Franczak, Ryan P. Browne, Paul D. McNicholas

Abstract: A mixture of multiple scaled generalized hyperbolic distributions (MMSGHDs) is introduced. Then, a coalesced generalized hyperbolic distribution (CGHD) is developed by joining a generalized hyperbolic distribution with a multiple scaled generalized hyperbolic distribution. After detailing the development of the MMSGHDs, which arises via implementation of a multi-dimensional weight function, the density of the mixture of CGHDs is developed. A parameter estimation scheme is developed using the ever-expanding class of MM algorithms and the Bayesian information criterion is used for model selection. The issue of cluster convexity is examined and a special case of the MMSGHDs is developed that is guaranteed to have convex clusters. These approaches are illustrated and compared using simulated and real data. The identifiability of the MMSGHDs and the mixture of CGHDs is discussed in an appendix.

Submitted 27 October, 2018; v1 submitted 10 March, 2014; originally announced March 2014.

arXiv:1403.2285 [pdf, other]

doi 10.1016/j.patrec.2015.02.011

Unsupervised Learning via Mixtures of Skewed Distributions with Hypercube Contours

Authors: Brian C. Franczak, Cristina Tortora, Ryan P. Browne, Paul D. McNicholas

Abstract: Mixture models whose components have skewed hypercube contours are developed via a generalization of the multivariate shifted asymmetric Laplace density. Specifically, we develop mixtures of multiple scaled shifted asymmetric Laplace distributions. The component densities have two unique features: they include a multivariate weight function, and the marginal distributions are also asymmetric Laplace. We use these mixtures of multiple scaled shifted asymmetric Laplace distributions for clustering applications, but they could equally well be used in the supervised or semi-supervised paradigms. The expectation-maximization algorithm is used for parameter estimation and the Bayesian information criterion is used for model selection. Simulated and real data sets are used to illustrate the approach and, in some cases, to visualize the skewed hypercube structure of the components.

Submitted 17 September, 2014; v1 submitted 10 March, 2014; originally announced March 2014.

Journal ref: Pattern Recognition Letters, 58, 69-76 (2015)

arXiv:1311.0317 [pdf, other]

Parsimonious Shifted Asymmetric Laplace Mixtures

Authors: Brian C. Franczak, Paul D. McNicholas, Ryan P. Browne, Paula M. Murray

Abstract: A family of parsimonious shifted asymmetric Laplace mixture models is introduced. We extend the mixture of factor analyzers model to the shifted asymmetric Laplace distribution. Imposing constraints on the constituent parts of the resulting decomposed component scale matrices leads to a family of parsimonious models. An explicit two-stage parameter estimation procedure is described, and the Bayesian information criterion and the integrated completed likelihood are compared for model selection. This novel family of models is applied to real data, where it is compared to its Gaussian analogue within clustering and classification paradigms.

Submitted 1 November, 2013; originally announced November 2013.

arXiv:1207.1727 [pdf, other]

doi 10.1109/TPAMI.2013.216

Mixtures of Shifted Asymmetric Laplace Distributions

Authors: Brian C. Franczak, Ryan P. Browne, Paul D. McNicholas

Abstract: A mixture of shifted asymmetric Laplace distributions is introduced and used for clustering and classification. A variant of the EM algorithm is developed for parameter estimation by exploiting the relationship with the generalized inverse Gaussian distribution. This approach is mathematically elegant and relatively computationally straightforward. Our novel mixture modelling approach is demonstrated on both simulated and real data to illustrate clustering and classification applications. In these analyses, our mixture of shifted asymmetric Laplace distributions performs favourably when compared to the popular Gaussian approach. This work, which marks an important step in the non-Gaussian model-based clustering and classification direction, concludes with discussion as well as suggestions for future work.

Submitted 21 December, 2012; v1 submitted 6 July, 2012; originally announced July 2012.

Showing 1–7 of 7 results for author: Franczak, B