Showing 1–13 of 13 results for author: Niles-Weed, J

Searching in archive stat.
  1. arXiv:2406.05061  [pdf, other]

    stat.ML cs.LG

    Progressive Entropic Optimal Transport Solvers

    Authors: Parnian Kassraie, Aram-Alexandre Pooladian, Michal Klein, James Thornton, Jonathan Niles-Weed, Marco Cuturi

    Abstract: Optimal transport (OT) has profoundly impacted machine learning by providing theoretical and computational tools to realign datasets. In this context, given two large point clouds of sizes $n$ and $m$ in $\mathbb{R}^d$, entropic OT (EOT) solvers have emerged as the most reliable tool to either solve the Kantorovich problem and output a $n\times m$ coupling matrix, or to solve the Monge problem and…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 18 pages, 7 figures

  2. arXiv:2306.11895  [pdf, other]

    stat.ML cs.LG

    Learning Elastic Costs to Shape Monge Displacements

    Authors: Michal Klein, Aram-Alexandre Pooladian, Pierre Ablin, Eugène Ndiaye, Jonathan Niles-Weed, Marco Cuturi

    Abstract: Given a source and a target probability measure supported on $\mathbb{R}^d$, the Monge problem asks to find the most efficient way to map one distribution to the other. This efficiency is quantified by defining a \textit{cost} function between source and target data. Such a cost is often set by default in the machine learning literature to the squared-Euclidean distance,…

    Submitted 23 May, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

  3. arXiv:2301.11302  [pdf, other]

    math.ST stat.ML

    Minimax estimation of discontinuous optimal transport maps: The semi-discrete case

    Authors: Aram-Alexandre Pooladian, Vincent Divol, Jonathan Niles-Weed

    Abstract: We consider the problem of estimating the optimal transport map between two probability distributions, $P$ and $Q$ in $\mathbb R^d$, on the basis of i.i.d. samples. All existing statistical analyses of this problem require the assumption that the transport map is Lipschitz, a strong requirement that, in particular, excludes any examples where the transport map is discontinuous. As a first step tow…

    Submitted 24 May, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: 35 pages

    MSC Class: 62G05

  4. arXiv:2212.03722  [pdf, ps, other]

    math.ST stat.ML

    Optimal transport map estimation in general function spaces

    Authors: Vincent Divol, Jonathan Niles-Weed, Aram-Alexandre Pooladian

    Abstract: We study the problem of estimating a function $T$ given independent samples from a distribution $P$ and from the pushforward distribution $T_\sharp P$. This setting is motivated by applications in the sciences, where $T$ represents the evolution of a physical system over time, and in machine learning, where, for example, $T$ may represent a transformation learned by a deep neural network trained f…

    Submitted 2 January, 2024; v1 submitted 7 December, 2022; originally announced December 2022.

    Comments: 68 pages

    MSC Class: 62G05

  5. arXiv:2206.12768  [pdf, other]

    math.ST stat.ML

    Estimation and inference for the Wasserstein distance between mixing measures in topic models

    Authors: Xin Bing, Florentina Bunea, Jonathan Niles-Weed

    Abstract: The Wasserstein distance between mixing measures has come to occupy a central place in the statistical analysis of mixture models. This work proposes a new canonical interpretation of this distance and provides tools to perform inference on the Wasserstein distance between mixing measures in topic models. We consider the general setting of an identifiable mixture model consisting of mixtures of…

    Submitted 17 March, 2023; v1 submitted 25 June, 2022; originally announced June 2022.

  6. arXiv:2112.07465  [pdf, other]

    stat.ME stat.CO

    The multirank likelihood for semiparametric canonical correlation analysis

    Authors: Jordan G. Bryan, Jonathan Niles-Weed, Peter D. Hoff

    Abstract: Many analyses of multivariate data focus on evaluating the dependence between two sets of variables, rather than the dependence among individual variables within each set. Canonical correlation analysis (CCA) is a classical data analysis technique that estimates parameters describing the dependence between such sets. However, inference procedures based on traditional CCA rely on the assumption tha…

    Submitted 22 April, 2024; v1 submitted 14 December, 2021; originally announced December 2021.

  7. arXiv:2111.10734  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Deep Probability Estimation

    Authors: Sheng Liu, Aakash Kaku, Weicheng Zhu, Matan Leibovich, Sreyas Mohan, Boyang Yu, Haoxiang Huang, Laure Zanna, Narges Razavian, Jonathan Niles-Weed, Carlos Fernandez-Granda

    Abstract: Reliable probability estimation is of crucial importance in many real-world applications where there is inherent (aleatoric) uncertainty. Probability-estimation models are trained on observed outcomes (e.g. whether it has rained or not, or whether a patient has died or not), because the ground-truth probabilities of the events of interest are typically unknown. The problem is therefore analogous t…

    Submitted 11 October, 2022; v1 submitted 20 November, 2021; originally announced November 2021.

    Comments: SL, AK, WZ, ML, SM contributed equally to this work; 36 pages, 17 figures, 12 tables

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:13746-13781, 2022

  8. arXiv:2109.12004  [pdf, other]

    math.ST stat.ML

    Entropic estimation of optimal transport maps

    Authors: Aram-Alexandre Pooladian, Jonathan Niles-Weed

    Abstract: We develop a computationally tractable method for estimating the optimal map between two distributions over $\mathbb{R}^d$ with rigorous finite-sample guarantees. Leveraging an entropic version of Brenier's theorem, we show that our estimator -- the \emph{barycentric projection} of the optimal entropic plan -- is easy to compute using Sinkhorn's algorithm. As a result, unlike current approaches fo…

    Submitted 12 May, 2024; v1 submitted 24 September, 2021; originally announced September 2021.

    Comments: 38 pages, 8 figures

    MSC Class: 62G05

  9. arXiv:2107.12364  [pdf, other]

    math.ST stat.ML

    Plugin Estimation of Smooth Optimal Transport Maps

    Authors: Tudor Manole, Sivaraman Balakrishnan, Jonathan Niles-Weed, Larry Wasserman

    Abstract: We analyze a number of natural estimators for the optimal transport map between two distributions and show that they are minimax optimal. We adopt the plugin approach: our estimators are simply optimal couplings between measures derived from our observations, appropriately extended so that they define functions on $\mathbb{R}^d$. When the underlying map is assumed to be Lipschitz, we show that com…

    Submitted 16 June, 2024; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: To appear in the Annals of Statistics

  10. arXiv:2007.00151  [pdf, other]

    cs.LG cs.CV stat.ML

    Early-Learning Regularization Prevents Memorization of Noisy Labels

    Authors: Sheng Liu, Jonathan Niles-Weed, Narges Razavian, Carlos Fernandez-Granda

    Abstract: We propose a novel framework to perform classification via deep learning in the presence of noisy annotations. When trained on noisy labels, deep neural networks have been observed to first fit the training data with clean labels during an "early learning" phase, before eventually memorizing the examples with false labels. We prove that early learning and memorization are fundamental phenomena in…

    Submitted 22 October, 2020; v1 submitted 30 June, 2020; originally announced July 2020.

  11. arXiv:2006.16548  [pdf, other]

    stat.ML cs.LG stat.CO stat.ME

    Sinkhorn EM: An Expectation-Maximization algorithm based on entropic optimal transport

    Authors: Gonzalo Mena, Amin Nejatbakhsh, Erdem Varol, Jonathan Niles-Weed

    Abstract: We study Sinkhorn EM (sEM), a variant of the expectation maximization (EM) algorithm for mixtures based on entropic optimal transport. sEM differs from the classic EM algorithm in the way responsibilities are computed during the expectation step: rather than assign data points to clusters independently, sEM uses optimal transport to compute responsibilities by incorporating prior information about…

    Submitted 30 June, 2020; originally announced June 2020.

    Comments: Under review

  12. arXiv:2002.03229  [pdf, other]

    cs.LG stat.ML

    Supervised Quantile Normalization for Low-rank Matrix Approximation

    Authors: Marco Cuturi, Olivier Teboul, Jonathan Niles-Weed, Jean-Philippe Vert

    Abstract: Low rank matrix factorization is a fundamental building block in machine learning, used for instance to summarize gene expression profile data or word-document counts. To be robust to outliers and differences in scale across features, a matrix factorization step is usually preceded by ad-hoc feature normalization steps, such as \texttt{tf-idf} scaling or data whitening. We propose in this work to…

    Submitted 3 July, 2020; v1 submitted 8 February, 2020; originally announced February 2020.

    Comments: new version with genomics experiments

    Journal ref: ICML 2020

  13. arXiv:1812.05189  [pdf, other]

    stat.ML cs.DS cs.LG math.OC

    Massively scalable Sinkhorn distances via the Nyström method

    Authors: Jason Altschuler, Francis Bach, Alessandro Rudi, Jonathan Niles-Weed

    Abstract: The Sinkhorn "distance", a variant of the Wasserstein distance with entropic regularization, is an increasingly popular tool in machine learning and statistical inference. However, the time and memory requirements of standard algorithms for computing this distance grow quadratically with the size of the data, making them prohibitively expensive on massive data sets. In this work, we show that this…

    Submitted 26 October, 2019; v1 submitted 12 December, 2018; originally announced December 2018.

    Comments: to appear in NeurIPS 2019

    Journal ref: Advances in Neural Information Processing Systems 32 (NeurIPS 2019)