Skip to main content

Showing 1–11 of 11 results for author: Simon-Gabriel, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2403.13134  [pdf, other

    cs.LG cs.AI stat.ML

    Robust NAS under adversarial training: benchmark, theory, and beyond

    Authors: Yongtao Wu, Fanghui Liu, Carl-Johann Simon-Gabriel, Grigorios G Chrysos, Volkan Cevher

    Abstract: Recent developments in neural architecture search (NAS) emphasize the significance of considering robust architectures against malicious data. However, there is a notable absence of benchmark evaluations and theoretical guarantees for searching these robust architectures, especially when adversarial training is considered. In this work, we aim to address these two challenges, making twofold contri… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  2. arXiv:2209.12835  [pdf, ps, other

    stat.ML cs.LG math.ST

    Targeted Separation and Convergence with Kernel Discrepancies

    Authors: Alessandro Barp, Carl-Johann Simon-Gabriel, Mark Girolami, Lester Mackey

    Abstract: Maximum mean discrepancies (MMDs) like the kernel Stein discrepancy (KSD) have grown central to a wide range of applications, including hypothesis testing, sampler selection, distribution approximation, and variational inference. In each setting, these kernel-based discrepancy measures are required to (i) separate a target P from other probability measures or even (ii) control weak convergence to… ▽ More

    Submitted 6 December, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

  3. arXiv:2207.09239  [pdf, other

    cs.LG stat.ML

    Assaying Out-Of-Distribution Generalization in Transfer Learning

    Authors: Florian Wenzel, Andrea Dittadi, Peter Vincent Gehler, Carl-Johann Simon-Gabriel, Max Horn, Dominik Zietlow, David Kernert, Chris Russell, Thomas Brox, Bernt Schiele, Bernhard Schölkopf, Francesco Locatello

    Abstract: Since out-of-distribution generalization is a generally ill-posed problem, various proxy targets (e.g., calibration, adversarial robustness, algorithmic corruptions, invariance across shifts) were studied across different research programs resulting in different recommendations. While sharing the same aspirational goal, these approaches have never been tested under the same experimental conditions… ▽ More

    Submitted 21 October, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

  4. arXiv:2106.07445  [pdf, other

    cs.LG cs.CR cs.CV math.OC stat.ML

    PopSkipJump: Decision-Based Attack for Probabilistic Classifiers

    Authors: Carl-Johann Simon-Gabriel, Noman Ahmed Sheikh, Andreas Krause

    Abstract: Most current classifiers are vulnerable to adversarial examples, small input perturbations that change the classification output. Many existing attack algorithms cover various settings, from white-box to black-box classifiers, but typically assume that the answers are deterministic and often fail when they are not. We therefore propose a new adversarial decision-based attack specifically designed… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: ICML'21. Code available at https://github.com/cjsg/PopSkipJump . 9 pages & 7 figures in main part, 14 pages & 10 figures in appendix

  5. arXiv:2006.09268  [pdf, ps, other

    cs.LG math.PR math.ST stat.ML

    Metrizing Weak Convergence with Maximum Mean Discrepancies

    Authors: Carl-Johann Simon-Gabriel, Alessandro Barp, Bernhard Schölkopf, Lester Mackey

    Abstract: This paper characterizes the maximum mean discrepancies (MMD) that metrize the weak convergence of probability measures for a wide class of kernels. More precisely, we prove that, on a locally compact, non-compact, Hausdorff space, the MMD of a bounded continuous Borel measurable kernel k, whose reproducing kernel Hilbert space (RKHS) functions vanish at infinity, metrizes the weak convergence of… ▽ More

    Submitted 3 September, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: 14 pages. Corrects in particular Thm.12 of Simon-Gabriel and Schölkopf, JMLR, 19(44):1-29, 2018. See http://jmlr.org/papers/v19/16-291.html

    MSC Class: 60B10 (Primary) 60F05; 60-08; 28-08 (Secondary) ACM Class: G.3; I.2.6; I.5.0

  6. arXiv:1802.01421  [pdf, other

    stat.ML cs.CV cs.LG

    First-order Adversarial Vulnerability of Neural Networks and Input Dimension

    Authors: Carl-Johann Simon-Gabriel, Yann Ollivier, Léon Bottou, Bernhard Schölkopf, David Lopez-Paz

    Abstract: Over the past few years, neural networks were proven vulnerable to adversarial images: targeted but imperceptible image perturbations lead to drastically different predictions. We show that adversarial vulnerability increases with the gradients of the training objective when viewed as a function of the inputs. Surprisingly, vulnerability does not depend on network topology: for many standard netwo… ▽ More

    Submitted 16 June, 2019; v1 submitted 5 February, 2018; originally announced February 2018.

    Comments: Paper previously called: "Adversarial Vulnerability of Neural Networks Increases with Input Dimension". 9 pages main text and references, 11 pages appendix, 14 figures

    MSC Class: 68T45 ACM Class: I.2.6

    Journal ref: Proceedings of ICML 2019

  7. arXiv:1705.07642  [pdf, other

    stat.ML

    From optimal transport to generative modeling: the VEGAN cookbook

    Authors: Olivier Bousquet, Sylvain Gelly, Ilya Tolstikhin, Carl-Johann Simon-Gabriel, Bernhard Schoelkopf

    Abstract: We study unsupervised generative modeling in terms of the optimal transport (OT) problem between true (but unknown) data distribution $P_X$ and the latent variable model distribution $P_G$. We show that the OT problem can be equivalently written in terms of probabilistic encoders, which are constrained to match the posterior and prior distributions over the latent space. When relaxed, this constra… ▽ More

    Submitted 22 May, 2017; originally announced May 2017.

  8. arXiv:1701.02386  [pdf, other

    stat.ML cs.LG

    AdaGAN: Boosting Generative Models

    Authors: Ilya Tolstikhin, Sylvain Gelly, Olivier Bousquet, Carl-Johann Simon-Gabriel, Bernhard Schölkopf

    Abstract: Generative Adversarial Networks (GAN) (Goodfellow et al., 2014) are an effective method for training generative models of complex data such as natural images. However, they are notoriously hard to train and can suffer from the problem of missing modes where the model is not able to produce examples in certain regions of the space. We propose an iterative procedure, called AdaGAN, where at every st… ▽ More

    Submitted 24 May, 2017; v1 submitted 9 January, 2017; originally announced January 2017.

    Comments: Updated with MNIST pictures and discussions + Unrolled GAN experiments

  9. arXiv:1610.05950  [pdf, other

    stat.ML

    Consistent Kernel Mean Estimation for Functions of Random Variables

    Authors: Carl-Johann Simon-Gabriel, Adam Ścibior, Ilya Tolstikhin, Bernhard Schölkopf

    Abstract: We provide a theoretical foundation for non-parametric estimation of functions of random variables using kernel mean embeddings. We show that for any continuous function $f$, consistent estimators of the mean embedding of a random variable $X$ lead to consistent estimators of the mean embedding of $f(X)$. For Matérn kernels and sufficiently smooth functions we also provide rates of convergence. Ou… ▽ More

    Submitted 19 October, 2016; originally announced October 2016.

    Comments: 17 pages including appendix

    Journal ref: NIPS 2016 Proceedings (p. 1732-1740)

  10. arXiv:1604.05251  [pdf, ps, other

    stat.ML math.FA math.PR

    Kernel Distribution Embeddings: Universal Kernels, Characteristic Kernels and Kernel Metrics on Distributions

    Authors: Carl-Johann Simon-Gabriel, Bernhard Schölkopf

    Abstract: Kernel mean embeddings have recently attracted the attention of the machine learning community. They map measures $μ$ from some set $M$ to functions in a reproducing kernel Hilbert space (RKHS) with kernel $k$. The RKHS distance of two mapped measures is a semi-metric $d_k$ over $M$. We study three questions. (I) For a given kernel, what sets $M$ can be embedded? (II) When is the embedding injecti… ▽ More

    Submitted 17 December, 2019; v1 submitted 18 April, 2016; originally announced April 2016.

    Comments: Old and longer version of the JMLR paper with same title (published 2018). Please start with the JMLR version. 55 pages (33 pages main text, 22 pages appendix), 2 tables, 1 figure (in appendix)

    MSC Class: G.3 ACM Class: G.3

    Journal ref: Journal of Machine Learning Research, 19(44):1-29, 2018

  11. arXiv:1505.03036  [pdf, other

    stat.ML astro-ph.EP astro-ph.IM cs.LG

    Removing systematic errors for exoplanet search via latent causes

    Authors: Bernhard Schölkopf, David W. Hogg, Dun Wang, Daniel Foreman-Mackey, Dominik Janzing, Carl-Johann Simon-Gabriel, Jonas Peters

    Abstract: We describe a method for removing the effect of confounders in order to reconstruct a latent quantity of interest. The method, referred to as half-sibling regression, is inspired by recent work in causal inference using additive noise models. We provide a theoretical justification and illustrate the potential of the method in a challenging astronomy application.

    Submitted 12 May, 2015; originally announced May 2015.

    Comments: Extended version of a paper appearing in the Proceedings of the 32nd International Conference on Machine Learning, Lille, France, 2015

    ACM Class: G.3; I.2.6; J.2