Skip to main content

Showing 1–24 of 24 results for author: Murphy, J M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.00837  [pdf, other

    cs.LG eess.SP math.OC stat.ML

    Locality Regularized Reconstruction: Structured Sparsity and Delaunay Triangulations

    Authors: Marshall Mueller, James M. Murphy, Abiy Tasissa

    Abstract: Linear representation learning is widely studied due to its conceptual simplicity and empirical utility in tasks such as compression, classification, and feature extraction. Given a set of points $[\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_n] = \mathbf{X} \in \mathbb{R}^{d \times n}$ and a vector $\mathbf{y} \in \mathbb{R}^d$, the goal is to find coefficients $\mathbf{w} \in \mathbb{R}^n$ so… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 26 pages, 8 figures

  2. arXiv:2312.15447  [pdf, other

    cs.CV cs.LG stat.AP

    Superpixel-based and Spatially-regularized Diffusion Learning for Unsupervised Hyperspectral Image Clustering

    Authors: Kangning Cui, Ruoning Li, Sam L. Polk, Yinyi Lin, Hongsheng Zhang, James M. Murphy, Robert J. Plemmons, Raymond H. Chan

    Abstract: Hyperspectral images (HSIs) provide exceptional spatial and spectral resolution of a scene, crucial for various remote sensing applications. However, the high dimensionality, presence of noise and outliers, and the need for precise labels of HSIs present significant challenges to HSIs analysis, motivating the development of performant HSI clustering algorithms. This paper introduces a novel unsupe… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: 27 pages, 9 figures, and 2 tables

  3. arXiv:2311.11934  [pdf, other

    stat.ML cs.IT cs.LG math.ST

    Estimation of entropy-regularized optimal transport maps between non-compactly supported measures

    Authors: Matthew Werenski, James M. Murphy, Shuchin Aeron

    Abstract: This paper addresses the problem of estimating entropy-regularized optimal transport (EOT) maps with squared-Euclidean cost between source and target measures that are subGaussian. In the case that the target measure is compactly supported or strongly log-concave, we show that for a recently proposed in-sample estimator, the expected squared $L^2$-error decays at least as fast as $O(n^{-1/3})$ whe… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 30 pages, 7 figures

  4. arXiv:2307.05750  [pdf, other

    stat.ML cs.DS cs.LG math.DG

    Fermat Distances: Metric Approximation, Spectral Convergence, and Clustering Algorithms

    Authors: Nicolás García Trillos, Anna Little, Daniel McKenzie, James M. Murphy

    Abstract: We analyze the convergence properties of Fermat distances, a family of density-driven metrics defined on Riemannian manifolds with an associated probability measure. Fermat distances may be defined either on discrete samples from the underlying measure, in which case they are random, or in the continuum setting, in which they are induced by geodesics under a density-distorted Riemannian metric. We… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  5. arXiv:2302.07964  [pdf, other

    stat.ML cs.LG

    On Rank Energy Statistics via Optimal Transport: Continuity, Convergence, and Change Point Detection

    Authors: Matthew Werenski, Shoaib Bin Masud, James M. Murphy, Shuchin Aeron

    Abstract: This paper considers the use of recently proposed optimal transport-based multivariate test statistics, namely rank energy and its variant the soft rank energy derived from entropically regularized optimal transport, for the unsupervised nonparametric change point detection (CPD) problem. We show that the soft rank energy enjoys both fast rates of statistical convergence and robust continuity prop… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: 36 pages, 5 figures

  6. arXiv:2210.12135  [pdf, other

    cs.LG cs.CV eess.SP math.OC math.PR math.ST

    Geometric Sparse Coding in Wasserstein Space

    Authors: Marshall Mueller, Shuchin Aeron, James M. Murphy, Abiy Tasissa

    Abstract: Wasserstein dictionary learning is an unsupervised approach to learning a collection of probability distributions that generate observed distributions as Wasserstein barycentric combinations. Existing methods for Wasserstein dictionary learning optimize an objective that seeks a dictionary with sufficient representation capacity via barycentric interpolation to approximate the observed training da… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 24 pages

  7. arXiv:2204.13497  [pdf, ps, other

    cs.CV cs.LG stat.AP

    Unsupervised Spatial-spectral Hyperspectral Image Reconstruction and Clustering with Diffusion Geometry

    Authors: Kangning Cui, Ruoning Li, Sam L. Polk, James M. Murphy, Robert J. Plemmons, Raymond H. Chan

    Abstract: Hyperspectral images, which store a hundred or more spectral bands of reflectance, have become an important data source in natural and social sciences. Hyperspectral images are often generated in large quantities at a relatively coarse spatial resolution. As such, unsupervised machine learning algorithms incorporating known structure in hyperspectral imagery are needed to analyze these images auto… ▽ More

    Submitted 28 April, 2022; originally announced April 2022.

    Comments: 7 pages, 1 figure

  8. arXiv:2204.09041  [pdf, other

    cs.CV cs.LG stat.AP

    Unsupervised detection of ash dieback disease (Hymenoscyphus fraxineus) using diffusion-based hyperspectral image clustering

    Authors: Sam L. Polk, Aland H. Y. Chan, Kangning Cui, Robert J. Plemmons, David A. Coomes, James M. Murphy

    Abstract: Ash dieback (Hymenoscyphus fraxineus) is an introduced fungal disease that is causing the widespread death of ash trees across Europe. Remote sensing hyperspectral images encode rich structure that has been exploited for the detection of dieback disease in ash trees using supervised machine learning techniques. However, to understand the state of forest health at landscape-scale, accurate unsuperv… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: (6 pages, 2 figures). Accepted to Proceedings of IEEE IGARSS 2022

  9. arXiv:2204.06298  [pdf, other

    cs.CV cs.LG stat.AP

    Active Diffusion and VCA-Assisted Image Segmentation of Hyperspectral Images

    Authors: Sam L. Polk, Kangning Cui, Robert J. Plemmons, James M. Murphy

    Abstract: Hyperspectral images encode rich structure that can be exploited for material discrimination by machine learning algorithms. This article introduces the Active Diffusion and VCA-Assisted Image Segmentation (ADVIS) for active material discrimination. ADVIS selects high-purity, high-density pixels that are far in diffusion distance (a data-dependent metric) from other high-purity, high-density pixel… ▽ More

    Submitted 13 April, 2022; originally announced April 2022.

    Comments: (6 pages, 2 figures). Accepted to Proceedings of IEEE IGARSS 2022

  10. arXiv:2203.09992  [pdf, other

    cs.CV cs.LG stat.AP

    Unsupervised Diffusion and Volume Maximization-Based Clustering of Hyperspectral Images

    Authors: Sam L. Polk, Kangning Cui, Aland H. Y. Chan, David A. Coomes, Robert J. Plemmons, James M. Murphy

    Abstract: Hyperspectral images taken from aircraft or satellites contain information from hundreds of spectral bands, within which lie latent lower-dimensional structures that can be exploited for classifying vegetation and other materials. A disadvantage of working with hyperspectral images is that, due to an inherent trade-off between spectral and spatial resolution, they have a relatively coarse spatial… ▽ More

    Submitted 19 February, 2023; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: 28 pages, 11 figures

    Journal ref: Remote Sens. 2023, 15(4), 1053

  11. arXiv:2201.12195  [pdf, other

    stat.ML cs.DS cs.LG math.PR math.ST

    Measure Estimation in the Barycentric Coding Model

    Authors: Matthew Werenski, Ruijie Jiang, Abiy Tasissa, Shuchin Aeron, James M. Murphy

    Abstract: This paper considers the problem of measure estimation under the barycentric coding model (BCM), in which an unknown measure is assumed to belong to the set of Wasserstein-2 barycenters of a finite set of known measures. Estimating a measure under this model is equivalent to estimating the unknown barycentric coordinates. We provide novel geometrical, statistical, and computational insights for me… ▽ More

    Submitted 27 June, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: ICML 2022

  12. arXiv:2112.10708  [pdf, other

    cs.SI cs.DM math.FA stat.AP

    Measuring Segregation via Analysis on Graphs

    Authors: Moon Duchin, James M. Murphy, Thomas Weighill

    Abstract: In this paper, we use analysis on graphs to study quantitative measures of segregation. We focus on a classical statistic from the geography and urban sociology literature known as Moran's I, which in our language is a score associated to a real-valued function on a graph, computed with respect to a spatial weight matrix such as the adjacency matrix associated to the geographic units that tile a c… ▽ More

    Submitted 18 May, 2022; v1 submitted 20 December, 2021; originally announced December 2021.

    Comments: 33 pages, 15 figures, 3 tables

    MSC Class: 05C50; 65D18; 68T09; 91D30

  13. arXiv:2111.00043  [pdf, other

    stat.ML cs.LG

    Multivariate rank via entropic optimal transport: sample efficiency and generative modeling

    Authors: Shoaib Bin Masud, Matthew Werenski, James M. Murphy, Shuchin Aeron

    Abstract: The framework of optimal transport has been leveraged to extend the notion of rank to the multivariate setting while preserving desirable properties of the resulting goodness-of-fit (GoF) statistics. In particular, the rank energy (RE) and rank maximum mean discrepancy (RMMD) are distribution-free under the null, exhibit high power in statistical testing, and are robust to outliers. In this paper,… ▽ More

    Submitted 25 November, 2022; v1 submitted 29 October, 2021; originally announced November 2021.

    Comments: 46 pages, 10 figures. Replacement note: Substantial revision over V2: Title change, first authors contribution change, new improved theoretical results relaxing compactness assumptions

  14. arXiv:2103.15783  [pdf, other

    cs.LG cs.CV stat.ML

    Multiscale Clustering of Hyperspectral Images Through Spectral-Spatial Diffusion Geometry

    Authors: Sam L. Polk, James M. Murphy

    Abstract: Clustering algorithms partition a dataset into groups of similar points. The primary contribution of this article is the Multiscale Spatially-Regularized Diffusion Learning (M-SRDL) clustering algorithm, which uses spatially-regularized diffusion distances to efficiently and accurately learn multiple scales of latent structure in hyperspectral images. The M-SRDL clustering algorithm extracts clust… ▽ More

    Submitted 7 April, 2022; v1 submitted 29 March, 2021; originally announced March 2021.

    Comments: (6 pages, 2 figures). Proceedings of IEEE IGARSS 2021

  15. arXiv:2102.00500  [pdf, other

    cs.LG cs.CV math.PR stat.ML

    A Multiscale Environment for Learning by Diffusion

    Authors: James M. Murphy, Sam L. Polk

    Abstract: Clustering algorithms partition a dataset into groups of similar points. The clustering problem is very general, and different partitions of the same dataset could be considered correct and useful. To fully understand such data, it must be considered at a variety of scales, ranging from coarse to fine. We introduce the Multiscale Environment for Learning by Diffusion (MELD) data model, which is a… ▽ More

    Submitted 31 January, 2021; originally announced February 2021.

    Comments: 35 pages, 10 figures

  16. arXiv:2012.02134  [pdf, other

    cs.LG cs.IT eess.SP math.OC

    K-Deep Simplex: Deep Manifold Learning via Local Dictionaries

    Authors: Pranay Tankala, Abiy Tasissa, James M. Murphy, Demba Ba

    Abstract: We propose K-Deep Simplex (KDS) which, given a set of data points, learns a dictionary comprising synthetic landmarks, along with representation coefficients supported on a simplex. KDS integrates manifold learning and sparse coding/dictionary learning: reconstruction term, as in classical dictionary learning, and a novel local weighted $\ell_1$ penalty that encourages each data point to represent… ▽ More

    Submitted 14 January, 2023; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: 14 pages, 8 figures

  17. arXiv:2004.05048  [pdf, other

    cs.CV cs.LG stat.AP

    Hyperspectral Image Clustering with Spatially-Regularized Ultrametrics

    Authors: Shukun Zhang, James M. Murphy

    Abstract: We propose a method for the unsupervised clustering of hyperspectral images based on spatially regularized spectral clustering with ultrametric path distances. The proposed method efficiently combines data density and geometry to distinguish between material classes in the data, without the need for training labels. The proposed method is efficient, with quasilinear scaling in the number of data p… ▽ More

    Submitted 10 April, 2020; originally announced April 2020.

    Comments: 5 pages, 2 columns, 9 figures

  18. arXiv:2003.03616  [pdf, other

    stat.ML cs.CV cs.LG math.PR

    Diffusion State Distances: Multitemporal Analysis, Fast Algorithms, and Applications to Biological Networks

    Authors: Lenore Cowen, Kapil Devkota, Xiaozhe Hu, James M. Murphy, Kaiyi Wu

    Abstract: Data-dependent metrics are powerful tools for learning the underlying structure of high-dimensional data. This article develops and analyzes a data-dependent metric known as diffusion state distance (DSD), which compares points using a data-driven diffusion process. Unlike related diffusion methods, DSDs incorporate information across time scales, which allows for the intrinsic data structure to b… ▽ More

    Submitted 7 March, 2020; originally announced March 2020.

    Comments: 28 pages

  19. arXiv:1911.02155  [pdf, other

    cs.LG cs.CV stat.ME stat.ML

    Spatially regularized active diffusion learning for high-dimensional images

    Authors: James M. Murphy

    Abstract: An active learning algorithm for the classification of high-dimensional images is proposed in which spatially-regularized nonlinear diffusion geometry is used to characterize cluster cores. The proposed method samples from estimated cluster cores in order to generate a small but potent set of training labels which propagate to the remainder of the dataset via the underlying diffusion process. By s… ▽ More

    Submitted 5 November, 2019; originally announced November 2019.

    Comments: 17 pages

  20. arXiv:1905.12989  [pdf, other

    cs.LG math.ST stat.ML

    Learning by Active Nonlinear Diffusion

    Authors: Mauro Maggioni, James M. Murphy

    Abstract: This article proposes an active learning method for high dimensional data, based on intrinsic data geometries learned through diffusion processes on graphs. Diffusion distances are used to parametrize low-dimensional structures on the dataset, which allow for high-accuracy labelings of the dataset with only a small number of carefully chosen labels. The geometric structure of the data suggests reg… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

    Comments: 20 pages, 10 figures

  21. arXiv:1902.05402  [pdf, other

    cs.CV cs.LG stat.ML

    Spectral-Spatial Diffusion Geometry for Hyperspectral Image Clustering

    Authors: James M. Murphy, Mauro Maggioni

    Abstract: An unsupervised learning algorithm to cluster hyperspectral image (HSI) data is proposed that exploits spatially-regularized random walks. Markov diffusions are defined on the space of HSI spectra with transitions constrained to near spatial neighbors. The explicit incorporation of spatial regularity into the diffusion construction leads to smoother random processes that are more adapted for unsup… ▽ More

    Submitted 8 February, 2019; originally announced February 2019.

  22. arXiv:1810.06702  [pdf, other

    stat.ML cs.LG

    Learning by Unsupervised Nonlinear Diffusion

    Authors: Mauro Maggioni, James M. Murphy

    Abstract: This paper proposes and analyzes a novel clustering algorithm that combines graph-based diffusion geometry with techniques based on density and mode estimation. The proposed method is suitable for data generated from mixtures of distributions with densities that are both multimodal and have nonlinear shapes. A crucial aspect of this algorithm is the use of time of a data-adapted diffusion process… ▽ More

    Submitted 29 December, 2018; v1 submitted 15 October, 2018; originally announced October 2018.

    Comments: 40 Pages, 17 Figures

  23. arXiv:1704.07961  [pdf, other

    cs.CV

    Unsupervised Clustering and Active Learning of Hyperspectral Images with Nonlinear Diffusion

    Authors: James M. Murphy, Mauro Maggioni

    Abstract: The problem of unsupervised learning and segmentation of hyperspectral images is a significant challenge in remote sensing. The high dimensionality of hyperspectral data, presence of substantial noise, and overlap of classes all contribute to the difficulty of automatically clustering and segmenting hyperspectral images. We propose an unsupervised learning technique called spectral-spatial diffusi… ▽ More

    Submitted 15 October, 2018; v1 submitted 25 April, 2017; originally announced April 2017.

    Comments: 17 pages, 22 figures, 3 tables. IEEE accepted version

  24. arXiv:1602.08575  [pdf, other

    cs.CV

    Superresolution of Noisy Remotely Sensed Images Through Directional Representations

    Authors: Wojciech Czaja, James M. Murphy, Daniel Weinberg

    Abstract: We develop an algorithm for single-image superresolution of remotely sensed data, based on the discrete shearlet transform. The shearlet transform extracts directional features of signals, and is known to provide near-optimally sparse representations for a broad class of images. This often leads to superior performance in edge detection and image representation when compared to isotropic frames. W… ▽ More

    Submitted 4 September, 2018; v1 submitted 27 February, 2016; originally announced February 2016.

    Comments: 5 pages (double column). IEEE copyright added