Skip to main content

Showing 1–9 of 9 results for author: Aubin, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.02347  [pdf, other

    cs.CV cs.AI cs.LG

    Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

    Authors: Clement Chadebec, Onur Tasar, Eyal Benaroche, Benjamin Aubin

    Abstract: In this paper, we propose an efficient, fast, and versatile distillation method to accelerate the generation of pre-trained diffusion models: Flash Diffusion. The method reaches state-of-the-art performances in terms of FID and CLIP-Score for few steps image generation on the COCO2014 and COCO2017 datasets, while requiring only several GPU hours of training and fewer trainable parameters than exis… ▽ More

    Submitted 5 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Comments: 16 pages + 16 pages appendices

  2. arXiv:2103.05945  [pdf, other

    cond-mat.dis-nn cs.LG

    Mean-field methods and algorithmic perspectives for high-dimensional machine learning

    Authors: Benjamin Aubin

    Abstract: The main difficulty that arises in the analysis of most machine learning algorithms is to handle, analytically and numerically, a large number of interacting random variables. In this Ph.D manuscript, we revisit an approach based on the tools of statistical physics of disordered systems. Developed through a rich literature, they have been precisely designed to infer the macroscopic behavior of a l… ▽ More

    Submitted 10 March, 2021; originally announced March 2021.

    Comments: Ph.D manuscript

  3. arXiv:2102.10867  [pdf, other

    cs.LG cs.AI

    Linear unit-tests for invariance discovery

    Authors: Benjamin Aubin, Agnieszka Słowik, Martin Arjovsky, Leon Bottou, David Lopez-Paz

    Abstract: There is an increasing interest in algorithms to learn invariant correlations across training environments. A big share of the current proposals find theoretical support in the causality literature but, how useful are they in practice? The purpose of this note is to propose six linear low-dimensional problems -- unit tests -- to evaluate different types of out-of-distribution generalization in a p… ▽ More

    Submitted 22 February, 2021; originally announced February 2021.

    Comments: 5 pages, Causal Discovery & Causality-Inspired Machine Learning Workshop at Neural Information Processing Systems

  4. arXiv:2006.06560  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG math.ST

    Generalization error in high-dimensional perceptrons: Approaching Bayes error with convex optimization

    Authors: Benjamin Aubin, Florent Krzakala, Yue M. Lu, Lenka Zdeborová

    Abstract: We consider a commonly studied supervised classification of a synthetic dataset whose labels are generated by feeding a one-layer neural network with random iid inputs. We study the generalization performances of standard classifiers in the high-dimensional regime where $α=n/d$ is kept finite in the limit of a high dimension $d$ and number of samples $n$. Our contribution is three-fold: First, we… ▽ More

    Submitted 7 November, 2020; v1 submitted 11 June, 2020; originally announced June 2020.

    Comments: 11 pages + 45 pages Supplementary Material / 5 figures, v2 revised and accepted at NeurIPS

    Journal ref: Advances in Neural Information Processing Systems, v33, pages 12199--12210, 2020

  5. arXiv:2004.01571  [pdf, other

    stat.ML cond-mat.dis-nn cs.LG eess.SP math.ST stat.CO

    Tree-AMP: Compositional Inference with Tree Approximate Message Passing

    Authors: Antoine Baker, Benjamin Aubin, Florent Krzakala, Lenka Zdeborová

    Abstract: We introduce Tree-AMP, standing for Tree Approximate Message Passing, a python package for compositional inference in high-dimensional tree-structured models. The package provides a unifying framework to study several approximate message passing algorithms previously derived for a variety of machine learning tasks such as generalized linear models, inference in multi-layer networks, matrix factori… ▽ More

    Submitted 11 December, 2021; v1 submitted 3 April, 2020; originally announced April 2020.

    Comments: Source code available at https://github.com/sphinxteam/tramp and documentation at https://sphinxteam.github.io/tramp.docs

    Journal ref: Journal of Machine Learning Research 24 (2023) 1-89

  6. arXiv:1912.02729  [pdf, ps, other

    cond-mat.dis-nn cond-mat.stat-mech cs.LG stat.ML

    Rademacher complexity and spin glasses: A link between the replica and statistical theories of learning

    Authors: Alia Abbara, Benjamin Aubin, Florent Krzakala, Lenka Zdeborová

    Abstract: Statistical learning theory provides bounds of the generalization gap, using in particular the Vapnik-Chervonenkis dimension and the Rademacher complexity. An alternative approach, mainly studied in the statistical physics literature, is the study of generalization in simple synthetic-data models. Here we discuss the connections between these approaches and focus on the link between the Rademacher… ▽ More

    Submitted 15 June, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: 15 + 10 pages, v2 revised and accepted at MSML

    Journal ref: Proceedings of The First Mathematical and Scientific Machine Learning Conference, PMLR 107:27-54, 2020

  7. arXiv:1912.02008  [pdf, other

    math.ST cond-mat.dis-nn cs.LG eess.SP stat.ML

    Exact asymptotics for phase retrieval and compressed sensing with random generative priors

    Authors: Benjamin Aubin, Bruno Loureiro, Antoine Baker, Florent Krzakala, Lenka Zdeborová

    Abstract: We consider the problem of compressed sensing and of (real-valued) phase retrieval with random measurement matrix. We derive sharp asymptotics for the information-theoretically optimal performance and for the best known polynomial algorithm for an ensemble of generative priors consisting of fully connected deep neural networks with random weight matrices and arbitrary activations. We compare the p… ▽ More

    Submitted 12 June, 2020; v1 submitted 4 December, 2019; originally announced December 2019.

    Comments: 13+3 pages, 7 figures, v2 revised and accepted at MSML

    Journal ref: Proceedings of The First Mathematical and Scientific Machine Learning Conference, PMLR 107:55-73, 2020

  8. arXiv:1905.12385  [pdf, other

    math.ST cs.LG eess.SP math.PR stat.ML

    The spiked matrix model with generative priors

    Authors: Benjamin Aubin, Bruno Loureiro, Antoine Maillard, Florent Krzakala, Lenka Zdeborová

    Abstract: Using a low-dimensional parametrization of signals is a generic and powerful way to enhance performance in signal processing and statistical inference. A very popular and widely explored type of dimensionality reduction is sparsity; another type is generative modelling of signal distributions. Generative models based on neural networks, such as GANs or variational auto-encoders, are particularly p… ▽ More

    Submitted 30 May, 2019; v1 submitted 29 May, 2019; originally announced May 2019.

    Comments: 12 + 56, 8 figures, v2 lighter jpeg figures

    Journal ref: Advances in Neural Information Processing Systems, pp. 8364-8375. 2019

  9. arXiv:1806.05451  [pdf, other

    cs.LG cond-mat.dis-nn cond-mat.stat-mech physics.comp-ph stat.ML

    The committee machine: Computational to statistical gaps in learning a two-layers neural network

    Authors: Benjamin Aubin, Antoine Maillard, Jean Barbier, Florent Krzakala, Nicolas Macris, Lenka Zdeborová

    Abstract: Heuristic tools from statistical physics have been used in the past to locate the phase transitions and compute the optimal learning and generalization errors in the teacher-student scenario in multi-layer neural networks. In this contribution, we provide a rigorous justification of these approaches for a two-layers neural network model called the committee machine. We also introduce a version of… ▽ More

    Submitted 29 February, 2024; v1 submitted 14 June, 2018; originally announced June 2018.

    Comments: 18 pages + supplementary material, 3 figures. (v2: update to match the published version ; v3: clarification of the caption of Fig. 3)

    Journal ref: J. Stat. Mech. (2019) 124023. & NeurIPS 2018