Skip to main content

Showing 1–7 of 7 results for author: Claici, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2007.06168  [pdf, other

    cs.LG stat.ML

    Model Fusion with Kullback--Leibler Divergence

    Authors: Sebastian Claici, Mikhail Yurochkin, Soumya Ghosh, Justin Solomon

    Abstract: We propose a method to fuse posterior distributions learned from heterogeneous datasets. Our algorithm relies on a mean field assumption for both the fused model and the individual dataset posteriors and proceeds using a simple assign-and-average approach. The components of the dataset posteriors are assigned to the proposed global model components by solving a regularized variant of the assignmen… ▽ More

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: ICML 2020

  2. arXiv:1912.07729  [pdf, other

    cs.LG stat.ML

    Incorporating Unlabeled Data into Distributionally Robust Learning

    Authors: Charlie Frogner, Sebastian Claici, Edward Chien, Justin Solomon

    Abstract: We study a robust alternative to empirical risk minimization called distributionally robust learning (DRL), in which one learns to perform against an adversary who can choose the data distribution from a specified set of distributions. We illustrate a problem with current DRL formulations, which rely on an overly broad definition of allowed distributions for the adversary, leading to learned class… ▽ More

    Submitted 17 December, 2019; v1 submitted 16 December, 2019; originally announced December 2019.

  3. arXiv:1911.02053  [pdf, other

    cs.LG stat.ML

    Alleviating Label Switching with Optimal Transport

    Authors: Pierre Monteiller, Sebastian Claici, Edward Chien, Farzaneh Mirzazadeh, Justin Solomon, Mikhail Yurochkin

    Abstract: Label switching is a phenomenon arising in mixture model posterior inference that prevents one from meaningfully assessing posterior statistics using standard Monte Carlo procedures. This issue arises due to invariance of the posterior under actions of a group; for example, permuting the ordering of mixture components has no effect on the likelihood. We propose a resolution to label switching that… ▽ More

    Submitted 10 November, 2019; v1 submitted 5 November, 2019; originally announced November 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada

  4. arXiv:1906.10827  [pdf, other

    cs.LG cs.CL cs.IR stat.ML

    Hierarchical Optimal Transport for Document Representation

    Authors: Mikhail Yurochkin, Sebastian Claici, Edward Chien, Farzaneh Mirzazadeh, Justin Solomon

    Abstract: The ability to measure similarity between documents enables intelligent summarization and analysis of large corpora. Past distances between documents suffer from either an inability to incorporate semantic similarities between words or from scalability issues. As an alternative, we introduce hierarchical optimal transport as a meta-distance between documents, where documents are modeled as distrib… ▽ More

    Submitted 1 November, 2019; v1 submitted 25 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2019

  5. arXiv:1805.07412  [pdf, other

    stat.ML cs.LG

    Wasserstein Measure Coresets

    Authors: Sebastian Claici, Aude Genevay, Justin Solomon

    Abstract: The proliferation of large data sets and Bayesian inference techniques motivates demand for better data sparsification. Coresets provide a principled way of summarizing a large dataset via a smaller one that is guaranteed to match the performance of the full data set on specific problems. Classical coresets, however, neglect the underlying data distribution, which is often continuous. We address t… ▽ More

    Submitted 2 March, 2020; v1 submitted 18 May, 2018; originally announced May 2018.

  6. arXiv:1802.05757  [pdf, other

    cs.LG math.OC stat.ML

    Stochastic Wasserstein Barycenters

    Authors: Sebastian Claici, Edward Chien, Justin Solomon

    Abstract: We present a stochastic algorithm to compute the barycenter of a set of probability distributions under the Wasserstein metric from optimal transport. Unlike previous approaches, our method extends to continuous input distributions and allows the support of the barycenter to be adjusted in each iteration. We tackle the problem without regularization, allowing us to recover a sharp output whose sup… ▽ More

    Submitted 6 June, 2018; v1 submitted 15 February, 2018; originally announced February 2018.

    Comments: ICML 2018

  7. arXiv:1705.07443  [pdf, other

    cs.LG math.OC stat.CO stat.ML

    Parallel Streaming Wasserstein Barycenters

    Authors: Matthew Staib, Sebastian Claici, Justin Solomon, Stefanie Jegelka

    Abstract: Efficiently aggregating data from different sources is a challenging problem, particularly when samples from each source are distributed differently. These differences can be inherent to the inference task or present for other reasons: sensors in a sensor network may be placed far apart, affecting their individual measurements. Conversely, it is computationally advantageous to split Bayesian infer… ▽ More

    Submitted 13 November, 2017; v1 submitted 21 May, 2017; originally announced May 2017.

    Comments: NIPS 2017