Skip to main content

Showing 1–6 of 6 results for author: Chehab, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.14040  [pdf, other

    stat.ML cs.LG

    A Practical Diffusion Path for Sampling

    Authors: Omar Chehab, Anna Korba

    Abstract: Diffusion models are state-of-the-art methods in generative modeling when samples from a target probability distribution are available, and can be efficiently sampled, using score matching to estimate score vectors guiding a Langevin process. However, in the setting where samples from the target are not available, e.g. when this target's density is known up to a normalization constant, the score e… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  2. arXiv:2310.03902  [pdf, other

    stat.ML cs.LG

    Provable benefits of annealing for estimating normalizing constants: Importance Sampling, Noise-Contrastive Estimation, and beyond

    Authors: Omar Chehab, Aapo Hyvarinen, Andrej Risteski

    Abstract: Recent research has developed several Monte Carlo methods for estimating the normalization constant (partition function) based on the idea of annealing. This means sampling successively from a path of distributions that interpolate between a tractable "proposal" distribution and the unnormalized "target" distribution. Prominent estimators in this family include annealed importance sampling and ann… ▽ More

    Submitted 9 October, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

  3. arXiv:2301.09696  [pdf, other

    stat.ML cs.LG

    Optimizing the Noise in Self-Supervised Learning: from Importance Sampling to Noise-Contrastive Estimation

    Authors: Omar Chehab, Alexandre Gramfort, Aapo Hyvarinen

    Abstract: Self-supervised learning is an increasingly popular approach to unsupervised learning, achieving state-of-the-art results. A prevalent approach consists in contrasting data points and noise points within a classification task: this requires a good noise distribution which is notoriously hard to specify. While a comprehensive theory is missing, it is widely assumed that the optimal noise distributi… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

    Comments: arXiv admin note: text overlap with arXiv:2203.01110

  4. arXiv:2203.01110  [pdf, other

    stat.ML cs.LG

    The Optimal Noise in Noise-Contrastive Learning Is Not What You Think

    Authors: Omar Chehab, Alexandre Gramfort, Aapo Hyvarinen

    Abstract: Learning a parametric model of a data distribution is a well-known statistical problem that has seen renewed interest as it is brought to scale in deep learning. Framing the problem as a self-supervised task, where data samples are discriminated from noise samples, is at the core of state-of-the-art methods, beginning with Noise-Contrastive Estimation (NCE). Yet, such contrastive learning requires… ▽ More

    Submitted 26 July, 2022; v1 submitted 2 March, 2022; originally announced March 2022.

  5. arXiv:2103.02339  [pdf, other

    q-bio.NC cs.LG cs.NE

    Deep Recurrent Encoder: A scalable end-to-end network to model brain signals

    Authors: Omar Chehab, Alexandre Defossez, Jean-Christophe Loiseau, Alexandre Gramfort, Jean-Remi King

    Abstract: Understanding how the brain responds to sensory inputs is challenging: brain recordings are partial, noisy, and high dimensional; they vary across sessions and subjects and they capture highly nonlinear dynamics. These challenges have led the community to develop a variety of preprocessing and analytical (almost exclusively linear) methods, each designed to tackle one of these issues. Instead, we… ▽ More

    Submitted 30 September, 2022; v1 submitted 3 March, 2021; originally announced March 2021.

  6. arXiv:2007.16104  [pdf, other

    stat.ML cs.LG eess.SP q-bio.NC q-bio.QM

    Uncovering the structure of clinical EEG signals with self-supervised learning

    Authors: Hubert Banville, Omar Chehab, Aapo Hyvärinen, Denis-Alexander Engemann, Alexandre Gramfort

    Abstract: Objective. Supervised learning paradigms are often limited by the amount of labeled data that is available. This phenomenon is particularly problematic in clinically-relevant data, such as electroencephalography (EEG), where labeling can be costly in terms of specialized expertise and human processing time. Consequently, deep learning architectures designed to learn on EEG data have yielded relati… ▽ More

    Submitted 31 July, 2020; originally announced July 2020.

    Comments: 32 pages, 9 figures