Showing 1–11 of 11 results for author: Ross, B L

  1. arXiv:2406.03537  [pdf, other]

    cs.LG cs.AI stat.ML

    A Geometric View of Data Complexity: Efficient Local Intrinsic Dimension Estimation with Diffusion Models

    Authors: Hamidreza Kamkari, Brendan Leigh Ross, Rasa Hosseinzadeh, Jesse C. Cresswell, Gabriel Loaiza-Ganem

    Abstract: High-dimensional data commonly lies on low-dimensional submanifolds, and estimating the local intrinsic dimension (LID) of a datum -- i.e. the dimension of the submanifold it belongs to -- is a longstanding problem. LID can be understood as the number of local factors of variation: the more factors of variation a datum has, the more complex it tends to be. Estimating this quantity has proven usefu…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 10 pages

  2. arXiv:2404.02954  [pdf, other]

    cs.LG cs.AI stat.ML

    Deep Generative Models through the Lens of the Manifold Hypothesis: A Survey and New Connections

    Authors: Gabriel Loaiza-Ganem, Brendan Leigh Ross, Rasa Hosseinzadeh, Anthony L. Caterini, Jesse C. Cresswell

    Abstract: In recent years there has been increased interest in understanding the interplay between deep generative models (DGMs) and the manifold hypothesis. Research in this area focuses on understanding the reasons why commonly-used DGMs succeed or fail at learning distributions supported on unknown low-dimensional manifolds, as well as developing new models explicitly designed to account for manifold-sup…

    Submitted 3 April, 2024; originally announced April 2024.

  3. arXiv:2403.18910  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    A Geometric Explanation of the Likelihood OOD Detection Paradox

    Authors: Hamidreza Kamkari, Brendan Leigh Ross, Jesse C. Cresswell, Anthony L. Caterini, Rahul G. Krishnan, Gabriel Loaiza-Ganem

    Abstract: Likelihood-based deep generative models (DGMs) commonly exhibit a puzzling behaviour: when trained on a relatively complex dataset, they assign higher likelihood values to out-of-distribution (OOD) data from simpler sources. Adding to the mystery, OOD samples are never generated by these DGMs despite having higher likelihoods. This two-pronged paradox has yet to be conclusively explained, making l…

    Submitted 11 June, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: ICML 2024

  4. arXiv:2306.04675  [pdf, other]

    cs.LG cs.CV stat.ML

    Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models

    Authors: George Stein, Jesse C. Cresswell, Rasa Hosseinzadeh, Yi Sui, Brendan Leigh Ross, Valentin Villecroze, Zhaoyan Liu, Anthony L. Caterini, J. Eric T. Taylor, Gabriel Loaiza-Ganem

    Abstract: We systematically study a wide variety of generative models spanning semantically-diverse image datasets to understand and improve the feature extractors and metrics used to evaluate them. Using best practices in psychophysics, we measure human perception of image realism for generated samples by conducting the largest experiment evaluating generative models to date, and find that no existing metr…

    Submitted 30 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023. 53 pages, 29 figures, 12 tables. Code at https://github.com/layer6ai-labs/dgm-eval, reviews at https://openreview.net/forum?id=08zf7kTOoh

    Journal ref: Thirty-seventh Conference on Neural Information Processing Systems (2023)

  5. arXiv:2212.01265  [pdf, other]

    cs.LG cs.AI

    Denoising Deep Generative Models

    Authors: Gabriel Loaiza-Ganem, Brendan Leigh Ross, Luhuan Wu, John P. Cunningham, Jesse C. Cresswell, Anthony L. Caterini

    Abstract: Likelihood-based deep generative models have recently been shown to exhibit pathological behaviour under the manifold hypothesis as a consequence of using high-dimensional densities to model data with low-dimensional structure. In this paper we propose two methodologies aimed at addressing this problem. Both are based on adding Gaussian noise to the data to remove the dimensionality mismatch durin…

    Submitted 4 January, 2023; v1 submitted 30 November, 2022; originally announced December 2022.

    Comments: NeurIPS 2022 ICBINB workshop (spotlight)

  6. arXiv:2211.15380  [pdf, other]

    hep-ph cs.LG hep-ex physics.data-an physics.ins-det

    CaloMan: Fast generation of calorimeter showers with density estimation on learned manifolds

    Authors: Jesse C. Cresswell, Brendan Leigh Ross, Gabriel Loaiza-Ganem, Humberto Reyes-Gonzalez, Marco Letizia, Anthony L. Caterini

    Abstract: Precision measurements and new physics searches at the Large Hadron Collider require efficient simulations of particle propagation and interactions within the detectors. The most computationally expensive simulations involve calorimeter showers. Advances in deep generative modelling - particularly in the realm of high-dimensional data - have opened the possibility of generating realistic calorimet…

    Submitted 23 November, 2022; originally announced November 2022.

    Comments: Accepted to the Machine Learning and the Physical Sciences Workshop at NeurIPS 2022

  7. arXiv:2210.06597  [pdf, other]

    cs.LG

    Find Your Friends: Personalized Federated Learning with the Right Collaborators

    Authors: Yi Sui, Junfeng Wen, Yenson Lau, Brendan Leigh Ross, Jesse C. Cresswell

    Abstract: In the traditional federated learning setting, a central server coordinates a network of clients to train one global model. However, the global model may serve many clients poorly due to data heterogeneity. Moreover, there may not exist a trusted central party that can coordinate the clients to ensure that each of them can benefit from others. To address these concerns, we present a novel decentra…

    Submitted 14 October, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

  8. arXiv:2207.02862  [pdf, other]

    stat.ML cs.AI cs.LG

    Verifying the Union of Manifolds Hypothesis for Image Data

    Authors: Bradley C. A. Brown, Anthony L. Caterini, Brendan Leigh Ross, Jesse C. Cresswell, Gabriel Loaiza-Ganem

    Abstract: Deep learning has had tremendous success at learning low-dimensional representations of high-dimensional data. This success would be impossible if there was no hidden low-dimensional structure in data of interest; this existence is posited by the manifold hypothesis, which states that the data lies on an unknown manifold of low intrinsic dimension. In this paper, we argue that this hypothesis does…

    Submitted 2 March, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

    Comments: ICLR 2023

  9. arXiv:2206.11267  [pdf, other]

    stat.ML cs.LG

    Neural Implicit Manifold Learning for Topology-Aware Density Estimation

    Authors: Brendan Leigh Ross, Gabriel Loaiza-Ganem, Anthony L. Caterini, Jesse C. Cresswell

    Abstract: Natural data observed in $\mathbb{R}^n$ is often constrained to an $m$-dimensional manifold $\mathcal{M}$, where $m < n$. This work focuses on the task of building theoretically principled generative models for such data. Current generative models learn $\mathcal{M}$ by mapping an $m$-dimensional latent variable through a neural network $f_\theta: \mathbb{R}^m \to \mathbb{R}^n$. These procedures, which…

    Submitted 21 December, 2023; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Accepted to TMLR in 2023. Code: https://github.com/layer6ai-labs/implicit-manifolds

  10. arXiv:2204.07172  [pdf, other]

    stat.ML cs.AI cs.LG stat.ME

    Diagnosing and Fixing Manifold Overfitting in Deep Generative Models

    Authors: Gabriel Loaiza-Ganem, Brendan Leigh Ross, Jesse C. Cresswell, Anthony L. Caterini

    Abstract: Likelihood-based, or explicit, deep generative models use neural networks to construct flexible high-dimensional densities. This formulation directly contradicts the manifold hypothesis, which states that observed data lies on a low-dimensional manifold embedded in high-dimensional ambient space. In this paper we investigate the pathologies of maximum-likelihood training in the presence of this di…

    Submitted 28 November, 2022; v1 submitted 14 April, 2022; originally announced April 2022.

    Comments: Accepted for publication in TMLR

  11. arXiv:2106.05275  [pdf, other]

    stat.ML cs.LG

    Tractable Density Estimation on Learned Manifolds with Conformal Embedding Flows

    Authors: Brendan Leigh Ross, Jesse C. Cresswell

    Abstract: Normalizing flows are generative models that provide tractable density estimation via an invertible transformation from a simple base distribution to a complex target distribution. However, this technique cannot directly model data supported on an unknown low-dimensional manifold, a common occurrence in real-world domains such as image data. Recent attempts to remedy this limitation have introduce…

    Submitted 11 November, 2021; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 Camera-Ready. Code: https://github.com/layer6ai-labs/CEF