Skip to main content

Showing 1–14 of 14 results for author: Jolicoeur-Martineau, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.05630  [pdf, other

    cs.CV

    Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion

    Authors: Ge Ya Luo, Zhi Hao Luo, Anthony Gosselin, Alexia Jolicoeur-Martineau, Christopher Pal

    Abstract: With recent advances in video prediction, controllable video generation has been attracting more attention. Generating high fidelity videos according to simple and flexible conditioning is of particular interest. To this end, we propose a controllable video generation model using pixel level renderings of 2D or 3D bounding boxes as conditioning. In addition, we also create a bounding box predictor… ▽ More

    Submitted 21 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

  2. arXiv:2405.16287  [pdf, other

    cs.LG

    LoGAH: Predicting 774-Million-Parameter Transformers using Graph HyperNetworks with 1/100 Parameters

    Authors: Xinyu Zhou, Boris Knyazev, Alexia Jolicoeur-Martineau, Jie Fu

    Abstract: A good initialization of deep learning models is essential since it can help them converge better and faster. However, pretraining large models is unaffordable for many researchers, which makes a desired prediction for initial parameters more necessary nowadays. Graph HyperNetworks (GHNs), one approach to predicting model parameters, have recently shown strong performance in initializing large vis… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 16 pages

  3. arXiv:2309.09968  [pdf, other

    cs.LG

    Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees

    Authors: Alexia Jolicoeur-Martineau, Kilian Fatras, Tal Kachman

    Abstract: Tabular data is hard to acquire and is subject to missing values. This paper introduces a novel approach for generating and imputing mixed-type (continuous and categorical) tabular data utilizing score-based diffusion and conditional flow matching. In contrast to prior methods that rely on neural networks to learn the score function or the vector field, we adopt XGBoost, a widely used Gradient-Boo… ▽ More

    Submitted 19 February, 2024; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: Code: https://github.com/SamsungSAILMontreal/ForestDiffusion

  4. arXiv:2304.05907  [pdf, ps, other

    cs.LG cs.AI math.NA

    Diffusion models with location-scale noise

    Authors: Alexia Jolicoeur-Martineau, Kilian Fatras, Ke Li, Tal Kachman

    Abstract: Diffusion Models (DMs) are powerful generative models that add Gaussian noise to the data and learn to remove it. We wanted to determine which noise distribution (Gaussian or non-Gaussian) led to better generated data in DMs. Since DMs do not work by design with non-Gaussian noise, we built a framework that allows reversing a diffusion process with non-Gaussian location-scale noise. We use that fr… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

  5. arXiv:2304.03094  [pdf, other

    cs.LG cs.CV

    PopulAtion Parameter Averaging (PAPA)

    Authors: Alexia Jolicoeur-Martineau, Emy Gervais, Kilian Fatras, Yan Zhang, Simon Lacoste-Julien

    Abstract: Ensemble methods combine the predictions of multiple models to improve performance, but they require significantly higher computation costs at inference time. To avoid these costs, multiple neural networks can be combined into one by averaging their weights. However, this usually performs significantly worse than ensembling. Weight averaging is only beneficial when different enough to benefit from… ▽ More

    Submitted 6 May, 2024; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: Blog post: https://ajolicoeur.wordpress.com/papa/, Code: https://github.com/SamsungSAILMontreal/PAPA, TMLR journal publication: https://openreview.net/forum?id=cPDVjsOytS

  6. arXiv:2210.09505  [pdf, other

    cs.LG stat.ML

    CNT (Conditioning on Noisy Targets): A new Algorithm for Leveraging Top-Down Feedback

    Authors: Alexia Jolicoeur-Martineau, Alex Lamb, Vikas Verma, Aniket Didolkar

    Abstract: We propose a novel regularizer for supervised learning called Conditioning on Noisy Targets (CNT). This approach consists in conditioning the model on a noisy version of the target(s) (e.g., actions in imitation learning or labels in classification) at a random noise level (from small to large noise). At inference time, since we do not know the target, we run the network with only noise in place o… ▽ More

    Submitted 26 October, 2022; v1 submitted 17 October, 2022; originally announced October 2022.

  7. arXiv:2205.09853  [pdf, other

    cs.CV cs.AI cs.LG

    MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation

    Authors: Vikram Voleti, Alexia Jolicoeur-Martineau, Christopher Pal

    Abstract: Video prediction is a challenging task. The quality of video frames from current state-of-the-art (SOTA) generative models tends to be poor and generalization beyond the training data is difficult. Furthermore, existing prediction frameworks are typically not capable of simultaneously handling other video-related tasks such as unconditional generation or interpolation. In this work, we devise a ge… ▽ More

    Submitted 12 October, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: NeurIPS 2022 ; 10 pages, 4 figures, 7 tables

  8. arXiv:2105.14080  [pdf, other

    cs.LG cs.CV math.OC stat.ML

    Gotta Go Fast When Generating Data with Score-Based Models

    Authors: Alexia Jolicoeur-Martineau, Ke Li, Rémi Piché-Taillefer, Tal Kachman, Ioannis Mitliagkas

    Abstract: Score-based (denoising diffusion) generative models have recently gained a lot of success in generating realistic and diverse data. These approaches define a forward diffusion process for transforming data to noise and generate data by reversing it (thereby going from noise to data). Unfortunately, current score-based models generate data very slowly due to the sheer number of score network evalua… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

    Comments: Code is available on https://github.com/AlexiaJM/score_sde_fast_sampling

  9. arXiv:2009.05475  [pdf, other

    cs.LG cs.CV stat.ML

    Adversarial score matching and improved sampling for image generation

    Authors: Alexia Jolicoeur-Martineau, Rémi Piché-Taillefer, Rémi Tachet des Combes, Ioannis Mitliagkas

    Abstract: Denoising Score Matching with Annealed Langevin Sampling (DSM-ALS) has recently found success in generative modeling. The approach works by first training a neural network to estimate the score of a distribution, and then using Langevin dynamics to sample from the data distribution assumed by the score network. Despite the convincing visual quality of samples, this method appears to perform worse… ▽ More

    Submitted 10 October, 2020; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: Code at https://github.com/AlexiaJM/AdversarialConsistentScoreMatching

  10. arXiv:2007.04202  [pdf, other

    cs.LG cs.GT math.OC stat.ML

    Stochastic Hamiltonian Gradient Methods for Smooth Games

    Authors: Nicolas Loizou, Hugo Berard, Alexia Jolicoeur-Martineau, Pascal Vincent, Simon Lacoste-Julien, Ioannis Mitliagkas

    Abstract: The success of adversarial formulations in machine learning has brought renewed motivation for smooth games. In this work, we focus on the class of stochastic Hamiltonian methods and provide the first convergence guarantees for certain classes of stochastic smooth games. We propose a novel unbiased estimator for the stochastic Hamiltonian gradient descent (SHGD) and highlight its benefits. Using t… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

    Comments: ICML 2020 - Proceedings of the 37th International Conference on Machine Learning

  11. arXiv:1910.06922  [pdf, other

    cs.LG stat.ML

    Gradient penalty from a maximum margin perspective

    Authors: Alexia Jolicoeur-Martineau, Ioannis Mitliagkas

    Abstract: A popular heuristic for improved performance in Generative adversarial networks (GANs) is to use some form of gradient penalty on the discriminator. This gradient penalty was originally motivated by a Wasserstein distance formulation. However, the use of gradient penalty in other GAN formulations is not well motivated. We present a unifying framework of expected margin maximization and show that a… ▽ More

    Submitted 24 November, 2020; v1 submitted 15 October, 2019; originally announced October 2019.

    Comments: Code at https://github.com/AlexiaJM/MaximumMarginGANs

  12. arXiv:1901.02474  [pdf, other

    cs.LG

    On Relativistic $f$-Divergences

    Authors: Alexia Jolicoeur-Martineau

    Abstract: This paper provides a more rigorous look at Relativistic Generative Adversarial Networks (RGANs). We prove that the objective function of the discriminator is a statistical divergence for any concave function $f$ with minimal properties ($f(0)=0$, $f'(0) \neq 0$, $\sup_x f(x)>0$). We also devise a few variants of relativistic $f$-divergences. Wasserstein GAN was originally justified by the idea th… ▽ More

    Submitted 8 January, 2019; originally announced January 2019.

    Comments: Code is available on: https://github.com/AlexiaJM/relativistic-f-divergences

  13. arXiv:1809.02145  [pdf, other

    cs.LG cs.AI stat.ML

    GANs beyond divergence minimization

    Authors: Alexia Jolicoeur-Martineau

    Abstract: Generative adversarial networks (GANs) can be interpreted as an adversarial game between two players, a discriminator D and a generator G, in which D learns to classify real from fake data and G learns to generate realistic data by "fooling" D into thinking that fake data is actually real data. Currently, a dominating view is that G actually learns by minimizing a divergence given that the general… ▽ More

    Submitted 6 September, 2018; originally announced September 2018.

    Comments: Associated repository: https://github.com/AlexiaJM/GANsBeyondDivergenceMin

  14. arXiv:1807.00734  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    The relativistic discriminator: a key element missing from standard GAN

    Authors: Alexia Jolicoeur-Martineau

    Abstract: In standard generative adversarial network (SGAN), the discriminator estimates the probability that the input data is real. The generator is trained to increase the probability that fake data is real. We argue that it should also simultaneously decrease the probability that real data is real because 1) this would account for a priori knowledge that half of the data in the mini-batch is fake, 2) th… ▽ More

    Submitted 10 September, 2018; v1 submitted 2 July, 2018; originally announced July 2018.

    Comments: https://github.com/AlexiaJM/RelativisticGAN