Skip to main content

Showing 1–6 of 6 results for author: Genevay, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2106.01954  [pdf, other

    cs.LG

    Do Neural Optimal Transport Solvers Work? A Continuous Wasserstein-2 Benchmark

    Authors: Alexander Korotin, Lingxiao Li, Aude Genevay, Justin Solomon, Alexander Filippov, Evgeny Burnaev

    Abstract: Despite the recent popularity of neural network-based solvers for optimal transport (OT), there is no standard quantitative way to evaluate their performance. In this paper, we address this issue for quadratic-cost transport -- specifically, computation of the Wasserstein-2 distance, a commonly-used formulation of optimal transport in machine learning. To overcome the challenge of computing ground… ▽ More

    Submitted 25 October, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

  2. arXiv:2106.00736  [pdf, other

    cs.LG

    Large-Scale Wasserstein Gradient Flows

    Authors: Petr Mokrov, Alexander Korotin, Lingxiao Li, Aude Genevay, Justin Solomon, Evgeny Burnaev

    Abstract: Wasserstein gradient flows provide a powerful means of understanding and solving many diffusion equations. Specifically, Fokker-Planck equations, which model the diffusion of probability measures, can be understood as gradient descent over entropy functionals in Wasserstein space. This equivalence, introduced by Jordan, Kinderlehrer and Otto, inspired the so-called JKO scheme to approximate these… ▽ More

    Submitted 25 October, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

  3. arXiv:2102.12731  [pdf, other

    cs.LG stat.ML

    Improving Approximate Optimal Transport Distances using Quantization

    Authors: Gaspard Beugnot, Aude Genevay, Kristjan Greenewald, Justin Solomon

    Abstract: Optimal transport (OT) is a popular tool in machine learning to compare probability measures geometrically, but it comes with substantial computational burden. Linear programming algorithms for computing OT distances scale cubically in the size of the input, making OT impractical in the large-sample regime. We introduce a practical algorithm, which relies on a quantization step, to estimate OT dis… ▽ More

    Submitted 23 March, 2022; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: Published in the proceedings of the Conference on Uncertainty in Artificial Intelligence 2021 (UAI)

    Journal ref: PMLR 161:290-300, 2021

  4. arXiv:2008.12534  [pdf, other

    cs.LG stat.ML

    Continuous Regularized Wasserstein Barycenters

    Authors: Lingxiao Li, Aude Genevay, Mikhail Yurochkin, Justin Solomon

    Abstract: Wasserstein barycenters provide a geometrically meaningful way to aggregate probability distributions, built on the theory of optimal transport. They are difficult to compute in practice, however, leading previous work to restrict their supports to finite sets of points. Leveraging a new dual formulation for the regularized Wasserstein barycenter problem, we introduce a stochastic algorithm that c… ▽ More

    Submitted 24 October, 2020; v1 submitted 28 August, 2020; originally announced August 2020.

  5. arXiv:1910.09036  [pdf, other

    cs.LG stat.ML

    Differentiable Deep Clustering with Cluster Size Constraints

    Authors: Aude Genevay, Gabriel Dulac-Arnold, Jean-Philippe Vert

    Abstract: Clustering is a fundamental unsupervised learning approach. Many clustering algorithms -- such as $k$-means -- rely on the euclidean distance as a similarity measure, which is often not the most relevant metric for high dimensional data such as images. Learning a lower-dimensional embedding that can better reflect the geometry of the dataset is therefore instrumental for performance. We propose a… ▽ More

    Submitted 20 October, 2019; originally announced October 2019.

  6. arXiv:1805.07412  [pdf, other

    stat.ML cs.LG

    Wasserstein Measure Coresets

    Authors: Sebastian Claici, Aude Genevay, Justin Solomon

    Abstract: The proliferation of large data sets and Bayesian inference techniques motivates demand for better data sparsification. Coresets provide a principled way of summarizing a large dataset via a smaller one that is guaranteed to match the performance of the full data set on specific problems. Classical coresets, however, neglect the underlying data distribution, which is often continuous. We address t… ▽ More

    Submitted 2 March, 2020; v1 submitted 18 May, 2018; originally announced May 2018.