Showing 1–10 of 10 results for author: Trosset, M W

  1. arXiv:2402.04436  [pdf, other]

    stat.ML cs.LG

    Continuous Multidimensional Scaling

    Authors: Michael W. Trosset, Carey E. Priebe

    Abstract: Multidimensional scaling (MDS) is the act of embedding proximity information about a set of $n$ objects in $d$-dimensional Euclidean space. As originally conceived by the psychometric community, MDS was concerned with embedding a fixed set of proximities associated with a fixed set of objects. Modern concerns, e.g., that arise in developing asymptotic theories for statistical inference on random g…

    Submitted 8 February, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: 15 pages. Modified a sentence in the Abstract for greater clarity

    MSC Class: 62H99

  2. Semisupervised regression in latent structure networks on unknown manifolds

    Authors: Aranyak Acharyya, Joshua Agterberg, Michael W. Trosset, Youngser Park, Carey E. Priebe

    Abstract: Random graphs are increasingly becoming objects of interest for modeling networks in a wide range of applications. Latent position random graph models posit that each node is associated with a latent position vector, and that these vectors follow some geometric structure in the latent space. In this paper, we consider random dot product graphs, in which an edge is formed between two nodes with pro…

    Submitted 3 May, 2023; originally announced May 2023.

    Journal ref: Applied Network Science 8 (2023) 75

  3. Popularity Adjusted Block Models are Generalized Random Dot Product Graphs

    Authors: John Koo, Minh Tang, Michael W. Trosset

    Abstract: We connect two random graph models, the Popularity Adjusted Block Model (PABM) and the Generalized Random Dot Product Graph (GRDPG), by demonstrating that the PABM is a special case of the GRDPG in which communities correspond to mutually orthogonal subspaces of latent vectors. This insight allows us to construct new algorithms for community detection and parameter estimation for the PABM, as well…

    Submitted 9 June, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: 36 pages, 9 figures

  4. arXiv:2006.10858  [pdf, other]

    stat.ML cs.LG

    Rehabilitating Isomap: Euclidean Representation of Geodesic Structure

    Authors: Michael W. Trosset, Gokcen Buyukbas

    Abstract: Manifold learning techniques for nonlinear dimension reduction assume that high-dimensional feature vectors lie on a low-dimensional manifold, then attempt to exploit manifold structure to obtain useful low-dimensional Euclidean representations of the data. Isomap, a seminal manifold learning technique, is an elegant synthesis of two simple ideas: the approximation of Riemannian distances with sho…

    Submitted 21 October, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: 27 pages, 4 figures

    MSC Class: 62H99

  5. arXiv:2004.07348  [pdf, other]

    stat.ML cs.LG

    Learning 1-Dimensional Submanifolds for Subsequent Inference on Random Dot Product Graphs

    Authors: Michael W. Trosset, Mingyue Gao, Minh Tang, Carey E. Priebe

    Abstract: A random dot product graph (RDPG) is a generative model for networks in which vertices correspond to positions in a latent Euclidean space and edge probabilities are determined by the dot products of the latent positions. We consider RDPGs for which the latent positions are randomly sampled from an unknown $1$-dimensional submanifold of the latent space. In principle, restricted inference, i.e., p…

    Submitted 24 December, 2021; v1 submitted 15 April, 2020; originally announced April 2020.

    Comments: 29 pages

    MSC Class: 62H99

  6. arXiv:1903.08656  [pdf, other]

    math.ST stat.ME

    Approximate Information Tests on Statistical Submanifolds

    Authors: Michael W. Trosset, Carey E. Priebe

    Abstract: Parametric inference posits a statistical model that is a specified family of probability distributions. Restricted inference, e.g., restricted likelihood ratio testing, attempts to exploit the structure of a statistical submodel that is a subset of the specified family. We consider the problem of testing a simple hypothesis against alternatives from such a submodel. In the case of an unknown subm…

    Submitted 20 March, 2019; originally announced March 2019.

    Comments: 26 pages

    MSC Class: 62H15 (Primary) 62F03; 62G10 (Secondary)

  7. arXiv:1801.00038  [pdf, ps, other]

    math.ST

    Identifiability of two-component skew normal mixtures with one known component

    Authors: Shantanu Jain, Michael Levine, Predrag Radivojac, Michael W. Trosset

    Abstract: We give sufficient identifiability conditions for estimating mixing proportions in two-component mixtures of skew normal distributions with one known component. We consider the univariate case as well as two multivariate extensions: a multivariate skew normal distribution (MSN) by Azzalini and Dalla Valle (1996) and the canonical fundamental skew normal distribution (CFUSN) by Arellano-Valle and G…

    Submitted 29 December, 2017; originally announced January 2018.

  8. arXiv:1608.00032  [pdf, other]

    math.ST

    On the Power of Likelihood Ratio Tests in Dimension-Restricted Submodels

    Authors: Michael W. Trosset, Mingyue Gao, Carey E. Priebe

    Abstract: Likelihood ratio tests are widely used to test statistical hypotheses about parametric families of probability distributions. If interest is restricted to a subfamily of distributions, then it is natural to inquire if the restricted LRT is superior to the unrestricted LRT. Marden's general LRT conjecture posits that any restriction placed on the alternative hypothesis will increase power. The only…

    Submitted 29 July, 2016; originally announced August 2016.

    Comments: 21 pages, 1 figure

    MSC Class: 62F30

  9. arXiv:1601.01944  [pdf, other]

    stat.ML cs.LG

    Nonparametric semi-supervised learning of class proportions

    Authors: Shantanu Jain, Martha White, Michael W. Trosset, Predrag Radivojac

    Abstract: The problem of developing binary classifiers from positive and unlabeled data is often encountered in machine learning. A common requirement in this setting is to approximate posterior probabilities of positive and negative classes for a previously unseen data point. This problem can be decomposed into two steps: (i) the development of accurate predictors that discriminate between positive and unl…

    Submitted 8 January, 2016; originally announced January 2016.

  10. arXiv:1502.03391  [pdf, other]

    stat.ML stat.ME

    Fast Embedding for JOFC Using the Raw Stress Criterion

    Authors: Vince Lyzinski, Youngser Park, Carey E. Priebe, Michael W. Trosset

    Abstract: The Joint Optimization of Fidelity and Commensurability (JOFC) manifold matching methodology embeds an omnibus dissimilarity matrix consisting of multiple dissimilarities on the same set of objects. One approach to this embedding optimizes the preservation of fidelity to each individual dissimilarity matrix together with commensurability of each given observation across modalities via iterative ma…

    Submitted 31 October, 2016; v1 submitted 11 February, 2015; originally announced February 2015.

    Comments: 43 pages, 10 figures, 3 tables