-
Sampling the Swadesh List to Identify Similar Languages with Tree Spaces
Authors:
Garett Ordway,
Vic Patrangenaru
Abstract:
Communication plays a vital role in human interaction. Studying language is a worthwhile task and has more recently become quantitative in nature, with the development of fields like quantitative comparative linguistics and lexicostatistics. With respect to the authors' own native languages, the ancestry of the English language and the Latin alphabet are of primary interest. The Indo-European Tree traces many modern languages back to the Proto-Indo-European root. Swadesh's cognates played a large role in developing that historical perspective, where some of the primary branches are Germanic, Celtic, Italic, and Balto-Slavic. This paper will use data analysis on open books, where the simplest singular space is the 3-spider, a union T3 of three rays with their endpoints glued at a point 0, which can represent these tree spaces for language clustering. These trees are built using a single linkage method for clustering based on distances between samples from languages which use the Latin script. Taking three languages at a time, the barycenter is determined. Some initial results have found both non-sticky and sticky sample means. If the mean exhibits non-sticky properties, then one language may come from a different ancestor than the other two. If the mean is considered sticky, then the languages may share a common ancestor or all languages may have different ancestry.
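As a rough illustration of the barycenter computation mentioned in this abstract, the sketch below computes a sample Fréchet mean on the 3-spider. It is not the authors' code. It assumes each observation is encoded as a hypothetical pair `(leg, distance)`, with `leg` in {0, 1, 2} and `distance` measured from the glued point 0. On leg k the squared-distance sum is minimized at (sum of distances on leg k minus sum of distances on the other two legs) / n; if that value is positive the mean lies on leg k, and if it is non-positive for all three legs the sample mean sits at the origin.

```python
def spider_frechet_mean(points):
    """Sample Fréchet mean on the 3-spider.

    points: list of (leg, distance) pairs, leg in {0, 1, 2}, distance >= 0.
    Returns (leg, distance); leg is None when the mean is the origin.
    The (leg, distance) encoding is an illustrative assumption.
    """
    n = len(points)
    if n == 0:
        raise ValueError("need at least one observation")
    totals = [0.0, 0.0, 0.0]
    for leg, dist in points:
        if leg not in (0, 1, 2) or dist < 0:
            raise ValueError("leg must be 0, 1 or 2 and distance non-negative")
        totals[leg] += dist
    grand_total = sum(totals)
    for leg in range(3):
        # Points on this leg count positively, points on the other
        # two legs count negatively (they are on the far side of 0).
        folded_mean = (2.0 * totals[leg] - grand_total) / n
        if folded_mean > 0:
            return leg, folded_mean  # at most one leg can satisfy this
    return None, 0.0  # sample mean is at the origin
```

For example, `spider_frechet_mean([(0, 1.1), (1, 1.0), (2, 1.0)])` still returns the origin even though leg 0 is slightly favored, which is the kind of behavior the abstract calls sticky.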
Submitted 10 May, 2024;
originally announced May 2024.
-
Two Sample Test for Extrinsic Antimeans on Planar Kendall Shape Spaces with an Application to Medical Imaging
Authors:
Aaid Algahtani,
Vic Patrangenaru
Abstract:
In this paper one develops nonparametric inference procedures for comparing two extrinsic antimeans on compact manifolds. Based on recent central limit theorems for extrinsic sample antimeans w.r.t. an arbitrary embedding of a compact manifold in a Euclidean space, one derives an asymptotic chi-square test for the equality of two extrinsic antimeans. Applications are given to distributions on complex projective space $CP^{k-2}$ w.r.t. the Veronese-Whitney embedding, which is a submanifold representation of the Kendall planar shape space. Two medical imaging analysis applications are also given.
Submitted 13 July, 2021; v1 submitted 9 July, 2021;
originally announced July 2021.
-
A Phylogenetic Trees Analysis of SARS-CoV-2
Authors:
Chen Shen,
Vic Patrangenaru,
Roland Moore
Abstract:
One regards spaces of trees as stratified spaces, to study distributions of phylogenetic trees. Stratified spaces may have cycles; however, spaces of trees with a fixed number of leaves are contractible. Spaces of trees with three leaves, in particular, are spiders with three legs. One gives an elementary proof of the stickiness of intrinsic sample means on spiders. One also represents four-leaf tree data in terms of an associated Petersen graph. One applies such ideas to analyze RNA sequences of SARS-CoV-2 from multiple sources, by building samples of trees and running nonparametric statistics for intrinsic means on tree spaces with three and four leaves. SARS-CoV-2 sequences are also used to build trees whose leaves additionally include other related coronaviruses.
Submitted 14 June, 2021; v1 submitted 13 June, 2021;
originally announced June 2021.
-
Nonparametric Data Analysis on the Space of Perceived Colors
Authors:
Vic Patrangenaru,
Yifang Deng
Abstract:
Moving around in a 3D world requires the visual system of a living individual to rely on three channels of image recognition, which is done through three types of retinal cones. Newton, Grassmann, Helmholtz and Schrödinger laid down the basic assumptions needed to understand color vision. Such concepts were furthered by Resnikoff, who imagined the space of perceived colors as a 3D homogeneous space.
This article is concerned with perceived colors regarded as random objects on a Resnikoff 3D homogeneous space model. Two applications to color differentiation in machine vision are illustrated for the proposed statistical methodology, applied to the Euclidean model for perceived colors.
Submitted 5 April, 2020;
originally announced April 2020.
-
Extrinsic Kernel Ridge Regression Classifier for Planar Kendall Shape Space
Authors:
Hwiyoung Lee,
Vic Patrangenaru
Abstract:
Kernel methods have had great success in Statistics and Machine Learning. Despite their growing popularity, however, less effort has been devoted to developing kernel-based classification methods on Riemannian manifolds, due to the difficulty of dealing with non-Euclidean geometry. In this paper, motivated by the extrinsic framework of manifold-valued data analysis, we propose a new positive definite kernel on planar Kendall shape space $Σ_2^k$, called the extrinsic Veronese-Whitney Gaussian kernel. We show that our approach can be extended to develop Gaussian kernels on any embedded manifold. Furthermore, a kernel ridge regression classifier (KRRC) is implemented to address the shape classification problem on $Σ_2^k$, and its promising performance is illustrated through real data analysis.
Submitted 1 October, 2020; v1 submitted 17 December, 2019;
originally announced December 2019.
-
Anti-MANOVA on Compact Manifolds with Applications to 3D Projective Shape Analysis
Authors:
Hwiyoung Lee,
Vic Patrangenaru
Abstract:
Methods of hypothesis testing for equality of extrinsic antimeans on compact manifolds are unveiled in this paper. The two-sample and multiple-sample problems for antimeans on compact manifolds are addressed for large samples via asymptotic distributions, as well as for small samples using the nonparametric bootstrap. An example of face differentiation using 3D VW antimean projective shape analysis for data extracted from digital camera images is also given.
Submitted 1 September, 2019;
originally announced September 2019.
-
Nonparametric Confidence Regions for Veronese-Whitney Means and Antimeans on Planar Kendall Shape Spaces
Authors:
Yunfan Wang,
Vic Patrangenaru
Abstract:
In this paper, after a brief review of VW-means, which are extrinsic means on real and complex projective spaces relative to the Veronese-Whitney embeddings, we give two examples of sample VW-mean computations on planar Kendall shape spaces. Here we derive large sample and pivotal nonparametric bootstrap confidence regions for VW-antimeans, using VW-anti-covariance matrices and their sample counterparts.
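To make the notion of a sample VW mean on a planar Kendall shape space concrete, here is a minimal sketch, not taken from the paper. It relies on the standard characterization that the VW sample mean is represented by a unit eigenvector for the largest eigenvalue of the average of the matrices z z*, where each z is a centered, unit-norm complex landmark vector. The function names, the encoding of landmarks as Python complex numbers, and the use of plain power iteration (which assumes the largest eigenvalue is simple) are all illustrative choices.

```python
import math

def preshape(landmarks):
    """Center a planar k-ad (list of complex numbers) and scale to unit norm."""
    k = len(landmarks)
    centroid = sum(landmarks) / k
    centered = [z - centroid for z in landmarks]
    norm = math.sqrt(sum(abs(z) ** 2 for z in centered))
    if norm == 0:
        raise ValueError("all landmarks coincide")
    return [z / norm for z in centered]

def vw_sample_mean(configs, iterations=200):
    """Representative of the VW sample mean shape (defined up to a unit phase).

    configs: list of k-ads, each a list of k complex landmarks.
    Uses power iteration on the k x k Hermitian matrix (1/n) * sum z z^*.
    """
    shapes = [preshape(c) for c in configs]
    n, k = len(shapes), len(shapes[0])
    mat = [[sum(s[a] * s[b].conjugate() for s in shapes) / n
            for b in range(k)] for a in range(k)]
    # Start from the first preshape; generically it is not orthogonal
    # to the leading eigenvector.
    v = list(shapes[0])
    for _ in range(iterations):
        w = [sum(mat[a][b] * v[b] for b in range(k)) for a in range(k)]
        norm = math.sqrt(sum(abs(x) ** 2 for x in w))
        v = [x / norm for x in w]
    return v
```

If every configuration is a rotated, rescaled and translated copy of one triangle, the returned vector agrees with that triangle's preshape up to a unit complex factor. In practice a library eigen-solver would be the more robust choice than power iteration.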
Submitted 13 October, 2018; v1 submitted 20 June, 2018;
originally announced June 2018.
-
Topological Data Analysis for Object Data
Authors:
Vic Patrangenaru,
Peter Bubenik,
Robert L. Paige,
Daniel Osborne
Abstract:
Statistical analysis on object data presents many challenges. Basic summaries such as means and variances are difficult to compute. We apply ideas from topology to study object data. We present a framework for using persistence landscapes to vectorize object data and perform statistical analysis. We apply this pipeline to some biological images that were previously shown to be challenging to study using shape theory. Surprisingly, the most persistent features are shown to be "topological noise" and the statistical analysis depends on the less persistent features which we refer to as the "geometric signal". We also describe the first steps to a new approach to using topology for object data analysis, which applies topology to distributions on object spaces.
Submitted 26 April, 2018;
originally announced April 2018.
-
3D mean Projective Shape Difference for Face Differentiation from Multiple Digital Camera Images
Authors:
K. D. Yao,
V. Patrangenaru,
D. Lester
Abstract:
We give a nonparametric methodology for hypothesis testing for equality of extrinsic mean objects on a manifold embedded in a numerical space. The results obtained in the general setting are detailed further in the case of 3D projective shapes represented in a space of symmetric matrices via the quadratic Veronese-Whitney (VW) embedding. Large sample and nonparametric bootstrap confidence regions are derived for the common VW-mean of random projective shapes for finite 3D configurations. As an example, the VW MANOVA testing methodology is applied to the multi-sample mean problem for independent projective shapes of 3D facial configurations retrieved from digital images, via Agisoft PhotoScan technology.
Submitted 27 April, 2017; v1 submitted 10 April, 2017;
originally announced April 2017.
-
Testing for the Equality of two Distributions on High Dimensional Object Spaces
Authors:
Ruite Guo,
Vic Patrangenaru
Abstract:
Energy statistics are estimators of the energy distance that depend on the distances between observations. The idea behind energy statistics is to consider a statistical potential energy that would parallel Newton's gravitational potential energy. This statistical potential energy is zero if and only if a certain null hypothesis relating two distributions holds true. In Szekely and Rizzo (2004), a nonparametric test for equality of two multivariate distributions was given, based on the Euclidean distance between observations. This test was shown to be effective for high dimensional multivariate data, and was implemented by an appropriate distribution-free permutation test. As an extension of Szekely and Rizzo (2013), here we consider the energy distance between two independent random objects X and Y on an object space M that admits an embedding into a Euclidean space. In the case of a Kendall shape space, we can use its VW-embedding into a Euclidean space of matrices and define the extrinsic distance between two shapes as their VW associated distance. The corresponding energy distance between two distributions of Kendall shapes of k-ads will be called the VW-energy distance. We test our methodology by comparing the distributions of Kendall shapes of the contour of the midsagittal section of the corpus callosum (CC) in normal vs. ADHD-diagnosed individuals. Here we use the VW distance between the shapes of two children's CC midsections. Using the CC data coming originally from http://fcon_1000.projects.nitrc.org/indi/adhd200/, it appears that the two Kendall shape distributions are not significantly different.
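The two-sample procedure described in this abstract can be sketched generically as follows. This is an illustrative outline rather than the authors' implementation: it uses the plug-in form of the energy distance, 2 E d(X, Y) - E d(X, X') - E d(Y, Y'), with a user-supplied distance function `dist`, and calibrates it with a permutation test. The function names, the number of permutations and the fixed seed are assumptions made for the example.

```python
import math
import random

def energy_distance(xs, ys, dist):
    """Plug-in estimate of 2 E d(X,Y) - E d(X,X') - E d(Y,Y')."""
    n, m = len(xs), len(ys)
    between = sum(dist(x, y) for x in xs for y in ys) / (n * m)
    within_x = sum(dist(a, b) for a in xs for b in xs) / (n * n)
    within_y = sum(dist(a, b) for a in ys for b in ys) / (m * m)
    return 2.0 * between - within_x - within_y

def permutation_p_value(xs, ys, dist, n_perm=199, seed=0):
    """Permutation p-value for the null hypothesis of equal distributions."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    observed = energy_distance(xs, ys, dist)
    pooled = list(xs) + list(ys)
    n = len(xs)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)  # random relabelling of the pooled sample
        if energy_distance(pooled[:n], pooled[n:], dist) >= observed:
            hits += 1
    # Count the observed split as one of the permutations.
    return (hits + 1) / (n_perm + 1)
```

With Euclidean data one could pass `math.dist` (Python 3.8+) as `dist`. For Kendall shapes, `dist` would instead be a function returning the VW distance between two shapes, for instance the Frobenius norm of the difference of their embedded matrices; that function is not shown here.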
Submitted 22 March, 2017;
originally announced March 2017.
-
Nonparametric Estimation of Means on Hilbert Manifolds and Extrinsic Analysis of Mean Shapes of Contours
Authors:
Leif Ellingson,
Vic Patrangenaru,
Frits Ruymgaart
Abstract:
Motivated by the problem of nonparametric inference in high-level digital image analysis, we introduce a general extrinsic approach for data analysis on Hilbert manifolds with a focus on means of probability distributions on such sample spaces. To perform inference on these means, we appeal to the concept of neighborhood hypotheses from functional data analysis and derive a one-sample test. We then consider analysis of shapes of contours lying in the plane. By embedding the corresponding sample space of such shapes, which is a Hilbert manifold, into a space of Hilbert-Schmidt operators, we can define extrinsic mean shapes of planar contours and their sample analogues. We apply the general methods to this problem while considering the computational restrictions faced when utilizing digital imaging data. The computational cost is compared with that of another method for analyzing shapes of contours.
Submitted 8 February, 2013;
originally announced February 2013.
-
Sticky central limit theorems on open books
Authors:
Thomas Hotz,
Sean Skwerer,
Stephan Huckemann,
Huiling Le,
J. S. Marron,
Jonathan C. Mattingly,
Ezra Miller,
James Nolen,
Megan Owen,
Vic Patrangenaru
Abstract:
Given a probability distribution on an open book (a metric space obtained by gluing a disjoint union of copies of a half-space along their boundary hyperplanes), we define a precise concept of when the Fréchet mean (barycenter) is sticky. This nonclassical phenomenon is quantified by a law of large numbers (LLN) stating that the empirical mean eventually almost surely lies on the (codimension $1$ and hence measure $0$) spine that is the glued hyperplane, and a central limit theorem (CLT) stating that the limiting distribution is Gaussian and supported on the spine. We also state versions of the LLN and CLT for the cases where the mean is nonsticky (i.e., not lying on the spine) and partly sticky (i.e., on the spine but not sticky).
Submitted 3 December, 2013; v1 submitted 20 February, 2012;
originally announced February 2012.
-
On Chapter XII in Cartan's "Leçons sur la Géométrie des Espaces de Riemann"
Authors:
Vic Patrangenaru
Abstract:
One shows that Cartan's method of adapted frames in Chapter XII of his famous treatise on Riemannian geometry leads to a classification theorem for homogeneous Riemannian manifolds. Examples of classification in three dimensions obtained by Cartan are given using this powerful method.
Submitted 7 April, 2009;
originally announced April 2009.
-
A Nonparametric Approach to 3D Shape Analysis from Digital Camera Images - I. In Memory of W.P. Dayawansa
Authors:
V. Patrangenaru,
X. Liu,
S. Sugathadasa
Abstract:
In this article, for the first time, one develops a nonparametric methodology for an analysis of shapes of configurations of landmarks on real 3D objects from regular camera photographs, thus making 3D shape analysis very accessible. A fundamental result in computer vision by Faugeras (1992), Hartley, Gupta and Chang (1992) is that generically, a finite 3D configuration of points can be retrieved up to a projective transformation, from corresponding configurations in a pair of camera images. Consequently, the projective shape of a 3D configuration can be retrieved from two of its planar views. Given the inherent registration errors, the 3D projective shape can be estimated from a sample of photos of the scene containing that configuration. Projective shapes are here regarded as points on projective shape manifolds. Using large sample and nonparametric bootstrap methodology for extrinsic means on manifolds, one gives confidence regions and tests for the mean projective shape of a 3D configuration from its 2D camera images.
Submitted 5 June, 2008;
originally announced June 2008.
-
Directions and projective shapes
Authors:
Kanti V. Mardia,
Vic Patrangenaru
Abstract:
This paper deals with projective shape analysis, which is a study of finite configurations of points modulo projective transformations. The topic has various applications in machine vision. We introduce a convenient projective shape space, as well as an appropriate coordinate system for this shape space. For generic configurations of k points in m dimensions, the resulting projective shape space is identified as a product of k-m-2 copies of axial spaces RP^m. This identification leads to the need for developing multivariate directional and multivariate axial analysis and we propose parametric models, as well as nonparametric methods, for these areas. In particular, we investigate the Frechet extrinsic mean for the multivariate axial case. Asymptotic distributions of the appropriate parametric and nonparametric tests are derived. We illustrate our methodology with examples from machine vision.
Submitted 16 August, 2005;
originally announced August 2005.
-
Large sample theory of intrinsic and extrinsic sample means on manifolds--II
Authors:
Rabi Bhattacharya,
Vic Patrangenaru
Abstract:
This article develops nonparametric inference procedures for estimation and testing problems for means on manifolds. A central limit theorem for Frechet sample means is derived leading to an asymptotic distribution theory of intrinsic sample means on Riemannian manifolds. Central limit theorems are also obtained for extrinsic sample means w.r.t. an arbitrary embedding of a differentiable manifold in a Euclidean space. Bootstrap methods particularly suitable for these problems are presented. Applications are given to distributions on the sphere S^d (directional spaces), real projective space RP^{N-1} (axial spaces), complex projective space CP^{k-2} (planar shape spaces) w.r.t. Veronese-Whitney embeddings and a three-dimensional shape space Σ_3^4.
Submitted 21 July, 2005;
originally announced July 2005.