Showing 1–50 of 72 results for author: Vincent, P

Searching in archive cs.

  1. arXiv:2401.06132  [pdf, ps, other]

    physics.flu-dyn cs.PF math.NA

    Cache Blocking for Flux Reconstruction: Extension to Navier-Stokes Equations and Anti-aliasing

    Authors: Semih Akkurt, Freddie Witherden, Peter Vincent

    Abstract: In this article, cache blocking is implemented for the Navier-Stokes equations with anti-aliasing support on mixed grids in PyFR for CPUs. In particular, cache blocking is used as an alternative to kernel fusion to eliminate unnecessary data movements between kernels at the main memory level. Specifically, kernels that exchange data are grouped together, and these groups are then executed on small…

    Submitted 6 November, 2023; originally announced January 2024.

  2. arXiv:2311.15930  [pdf, other]

    cs.CL cs.AI

    WorldSense: A Synthetic Benchmark for Grounded Reasoning in Large Language Models

    Authors: Youssef Benchekroun, Megi Dervishi, Mark Ibrahim, Jean-Baptiste Gaya, Xavier Martinet, Grégoire Mialon, Thomas Scialom, Emmanuel Dupoux, Dieuwke Hupkes, Pascal Vincent

    Abstract: We propose WorldSense, a benchmark designed to assess the extent to which LLMs are consistently able to sustain tacit world models, by testing how they draw simple inferences from descriptions of simple arrangements of entities. WorldSense is a synthetic benchmark with three problem types, each with their own trivial control, which explicitly avoids bias by decorrelating the abstract structure of…

    Submitted 27 November, 2023; originally announced November 2023.

  3. arXiv:2311.08815  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Self-Supervised Disentanglement by Leveraging Structure in Data Augmentations

    Authors: Cian Eastwood, Julius von Kügelgen, Linus Ericsson, Diane Bouchacourt, Pascal Vincent, Bernhard Schölkopf, Mark Ibrahim

    Abstract: Self-supervised representation learning often uses data augmentations to induce some invariance to "style" attributes of the data. However, with downstream tasks generally unknown at training time, it is difficult to deduce a priori which attributes of the data are indeed "style" and can be safely discarded. To address this, we introduce a more principled approach that seeks to disentangle style f…

    Submitted 15 November, 2023; originally announced November 2023.

  4. arXiv:2310.00166  [pdf, other]

    cs.AI cs.LG

    Motif: Intrinsic Motivation from Artificial Intelligence Feedback

    Authors: Martin Klissarov, Pierluca D'Oro, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff

    Abstract: Exploring rich environments and evaluating one's actions without prior knowledge is immensely challenging. In this paper, we propose Motif, a general method to interface such prior knowledge from a Large Language Model (LLM) with an agent. Motif is based on the idea of grounding LLMs for decision-making without requiring them to interact with the environment: it elicits preferences from an LLM ove…

    Submitted 29 September, 2023; originally announced October 2023.

    Comments: The first two authors equally contributed - order decided by coin flip

  5. arXiv:2309.16748  [pdf, other]

    cs.LG cs.AI stat.ML

    Discovering environments with XRM

    Authors: Mohammad Pezeshki, Diane Bouchacourt, Mark Ibrahim, Nicolas Ballas, Pascal Vincent, David Lopez-Paz

    Abstract: Successful out-of-distribution generalization requires environment annotations. Unfortunately, these are resource-intensive to obtain, and their relevance to model performance is limited by the expectations and perceptual biases of human annotators. Therefore, to enable robust AI systems across applications, we must develop algorithms to automatically discover environments inducing broad generaliz…

    Submitted 28 September, 2023; originally announced September 2023.

  6. arXiv:2308.03977  [pdf, other]

    cs.CV cs.LG

    PUG: Photorealistic and Semantically Controllable Synthetic Data for Representation Learning

    Authors: Florian Bordes, Shashank Shekhar, Mark Ibrahim, Diane Bouchacourt, Pascal Vincent, Ari S. Morcos

    Abstract: Synthetic image datasets offer unmatched advantages for designing and evaluating deep neural networks: they make it possible to (i) render as many data samples as needed, (ii) precisely control each scene and yield granular ground truth labels (and captions), (iii) precisely control distribution shifts between training and testing to isolate variables of interest for sound experimentation. Despite…

    Submitted 12 December, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

  7. arXiv:2308.00566  [pdf, other]

    cs.CV cs.AI cs.LG

    Stochastic positional embeddings improve masked image modeling

    Authors: Amir Bar, Florian Bordes, Assaf Shocher, Mahmoud Assran, Pascal Vincent, Nicolas Ballas, Trevor Darrell, Amir Globerson, Yann LeCun

    Abstract: Masked Image Modeling (MIM) is a promising self-supervised learning approach that enables learning from unlabeled images. Despite its recent success, learning good representations through MIM remains challenging because it requires predicting the right semantic content in accurate locations. For example, given an incomplete picture of a dog, we can guess that there is a tail, but we cannot determi…

    Submitted 27 February, 2024; v1 submitted 31 July, 2023; originally announced August 2023.

    Comments: Code and models available at https://github.com/amirbar/StoP

  8. arXiv:2306.16334  [pdf, other]

    cs.LG cs.AI

    On the Identifiability of Quantized Factors

    Authors: Vitória Barin-Pacela, Kartik Ahuja, Simon Lacoste-Julien, Pascal Vincent

    Abstract: Disentanglement aims to recover meaningful latent ground-truth factors from the observed distribution solely, and is formalized through the theory of identifiability. The identifiability of independent latent factors is proven to be impossible in the unsupervised i.i.d. setting under a general nonlinear map from factors to observations. In this work, however, we demonstrate that it is possible to…

    Submitted 12 March, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

    Comments: Appears in: 3rd Conference on Causal Learning and Reasoning (CLeaR 2024). 39 pages

  9. arXiv:2304.13850  [pdf, other]

    cs.CV cs.CR cs.LG

    Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning

    Authors: Casey Meehan, Florian Bordes, Pascal Vincent, Kamalika Chaudhuri, Chuan Guo

    Abstract: Self-supervised learning (SSL) algorithms can produce useful image representations by learning to associate different parts of natural images with one another. However, when taken to the extreme, SSL models can unintentionally memorize specific parts in individual training samples rather than learning semantically meaningful associations. In this work, we perform a systematic study of the unintended…

    Submitted 12 December, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

  10. arXiv:2304.13089  [pdf, other]

    cs.LG cs.CV eess.IV

    Objectives Matter: Understanding the Impact of Self-Supervised Objectives on Vision Transformer Representations

    Authors: Shashank Shekhar, Florian Bordes, Pascal Vincent, Ari Morcos

    Abstract: Joint-embedding based learning (e.g., SimCLR, MoCo, DINO) and reconstruction-based learning (e.g., BEiT, SimMIM, MAE) are the two leading paradigms for self-supervised learning of vision transformers, but they differ substantially in their transfer performance. Here, we aim to explain these differences by analyzing the impact of these objectives on the structure and transferability of the learned…

    Submitted 25 April, 2023; originally announced April 2023.

  11. arXiv:2304.05369  [pdf, other]

    cs.LG

    A surprisingly simple technique to control the pretraining bias for better transfer: Expand or Narrow your representation

    Authors: Florian Bordes, Samuel Lavoie, Randall Balestriero, Nicolas Ballas, Pascal Vincent

    Abstract: Self-Supervised Learning (SSL) models rely on a pretext task to learn representations. Because this pretext task differs from the downstream tasks used to evaluate the performance of these models, there is an inherent misalignment or pretraining bias. A commonly used trick in SSL, shown to make deep networks more robust to such bias, is the addition of a small projector (usually a 2 or 3 layer mul…

    Submitted 11 April, 2023; originally announced April 2023.

  12. arXiv:2303.09677  [pdf, other]

    cs.CV

    Instance-Conditioned GAN Data Augmentation for Representation Learning

    Authors: Pietro Astolfi, Arantxa Casanova, Jakob Verbeek, Pascal Vincent, Adriana Romero-Soriano, Michal Drozdzal

    Abstract: Data augmentation has become a crucial component to train state-of-the-art visual representation models. However, handcrafting combinations of transformations that lead to improved performances is a laborious task, which can result in visually unrealistic samples. To overcome these limitations, recent works have explored the use of generative models as learnable data augmentation tools, showing pr…

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: TMLR reviews at https://openreview.net/forum?id=1n7q9mxG3T&referrer=%5BTMLR%5D(%2Fgroup%3Fid%3DTMLR)

  13. arXiv:2303.01986  [pdf, other]

    cs.LG

    Towards Democratizing Joint-Embedding Self-Supervised Learning

    Authors: Florian Bordes, Randall Balestriero, Pascal Vincent

    Abstract: Joint Embedding Self-Supervised Learning (JE-SSL) has seen rapid developments in recent years, due to its promise to effectively leverage large unlabeled data. The development of JE-SSL methods was driven primarily by the search for ever increasing downstream classification accuracies, using huge computational resources, and typically built upon insights and intuitions inherited from a close paren…

    Submitted 3 March, 2023; originally announced March 2023.

  14. arXiv:2301.08243  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    Self-Supervised Learning from Images with a Joint-Embedding Predictive Architecture

    Authors: Mahmoud Assran, Quentin Duval, Ishan Misra, Piotr Bojanowski, Pascal Vincent, Michael Rabbat, Yann LeCun, Nicolas Ballas

    Abstract: This paper demonstrates an approach for learning highly semantic image representations without relying on hand-crafted data-augmentations. We introduce the Image-based Joint-Embedding Predictive Architecture (I-JEPA), a non-generative approach for self-supervised learning from images. The idea behind I-JEPA is simple: from a single context block, predict the representations of various target block…

    Submitted 13 April, 2023; v1 submitted 19 January, 2023; originally announced January 2023.

    Comments: 2023 IEEE/CVF International Conference on Computer Vision

  15. arXiv:2211.01866  [pdf, other]

    cs.CV cs.LG

    ImageNet-X: Understanding Model Mistakes with Factor of Variation Annotations

    Authors: Badr Youbi Idrissi, Diane Bouchacourt, Randall Balestriero, Ivan Evtimov, Caner Hazirbas, Nicolas Ballas, Pascal Vincent, Michal Drozdzal, David Lopez-Paz, Mark Ibrahim

    Abstract: Deep learning vision systems are widely deployed across applications where reliability is critical. However, even today's best models can fail to recognize an object when its pose, lighting, or background varies. While existing benchmarks surface examples challenging for models, they do not explain why such mistakes arise. To address this need, we introduce ImageNet-X, a set of sixteen human annot…

    Submitted 3 November, 2022; originally announced November 2022.

  16. arXiv:2210.07347  [pdf, other]

    cs.LG stat.ML

    Disentanglement of Correlated Factors via Hausdorff Factorized Support

    Authors: Karsten Roth, Mark Ibrahim, Zeynep Akata, Pascal Vincent, Diane Bouchacourt

    Abstract: A grand goal in deep learning research is to learn representations capable of generalizing across distribution shifts. Disentanglement is one promising direction aimed at aligning a model's representation with the underlying factors generating the data (e.g. color or background). Existing disentanglement methods, however, rely on an often unrealistic assumption: that factors are statistically inde…

    Submitted 25 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to ICLR 2023

  17. arXiv:2210.07277  [pdf, other]

    cs.LG cs.AI cs.CV

    The Hidden Uniform Cluster Prior in Self-Supervised Learning

    Authors: Mahmoud Assran, Randall Balestriero, Quentin Duval, Florian Bordes, Ishan Misra, Piotr Bojanowski, Pascal Vincent, Michael Rabbat, Nicolas Ballas

    Abstract: A successful paradigm in representation learning is to perform self-supervised pretraining using tasks based on mini-batch statistics (e.g., SimCLR, VICReg, SwAV, MSN). We show that in the formulation of all these methods is an overlooked prior to learn features that enable uniform clustering of the data. While this prior has led to remarkably semantic representations when pretraining on class-bal…

    Submitted 13 October, 2022; originally announced October 2022.

  18. arXiv:2206.13378  [pdf, other]

    cs.LG

    Guillotine Regularization: Why removing layers is needed to improve generalization in Self-Supervised Learning

    Authors: Florian Bordes, Randall Balestriero, Quentin Garrido, Adrien Bardes, Pascal Vincent

    Abstract: One unexpected technique that emerged in recent years consists in training a Deep Network (DN) with a Self-Supervised Learning (SSL) method, and using this network on downstream tasks but with its last few projector layers entirely removed. This trick of throwing away the projector is actually critical for SSL methods to display competitive performances on ImageNet for which more than 30 percentag…

    Submitted 9 June, 2023; v1 submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted at TMLR 2023

  19. arXiv:2204.07141  [pdf, other]

    cs.LG cs.AI cs.CV eess.IV

    Masked Siamese Networks for Label-Efficient Learning

    Authors: Mahmoud Assran, Mathilde Caron, Ishan Misra, Piotr Bojanowski, Florian Bordes, Pascal Vincent, Armand Joulin, Michael Rabbat, Nicolas Ballas

    Abstract: We propose Masked Siamese Networks (MSN), a self-supervised learning framework for learning image representations. Our approach matches the representation of an image view containing randomly masked patches to the representation of the original unmasked image. This self-supervised pre-training strategy is particularly scalable when applied to Vision Transformers since only the unmasked patches are…

    Submitted 14 April, 2022; originally announced April 2022.

  20. arXiv:2112.09164  [pdf, other]

    cs.LG cs.AI

    High Fidelity Visualization of What Your Self-Supervised Representation Knows About

    Authors: Florian Bordes, Randall Balestriero, Pascal Vincent

    Abstract: Discovering what is learned by neural networks remains a challenge. In self-supervised learning, classification is the most common task used to evaluate how good a representation is. However, relying only on such a downstream task can limit our understanding of what information is retained in the representation of a given input. In this work, we showcase the use of a Representation Conditional Diffu…

    Submitted 16 August, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    Comments: Accepted at TMLR 2022

  21. arXiv:2110.09348  [pdf, other]

    cs.CV cs.AI cs.LG

    Understanding Dimensional Collapse in Contrastive Self-supervised Learning

    Authors: Li Jing, Pascal Vincent, Yann LeCun, Yuandong Tian

    Abstract: Self-supervised visual representation learning aims to learn useful representations without relying on human annotations. The joint embedding approach is based on maximizing the agreement between embedding vectors from different views of the same image. Various methods have been proposed to solve the collapsing problem where all embedding vectors collapse to a trivial constant solution. Among these metho…

    Submitted 23 April, 2022; v1 submitted 18 October, 2021; originally announced October 2021.

    Comments: In Proceedings of the 10th International Conference on Learning Representations (ICLR) 2022

    Journal ref: ICLR 2022

  22. arXiv:2103.03098  [pdf, other]

    cs.LG stat.ML

    Accounting for Variance in Machine Learning Benchmarks

    Authors: Xavier Bouthillier, Pierre Delaunay, Mirko Bronzi, Assya Trofimov, Brennan Nichyporuk, Justin Szeto, Naz Sepah, Edward Raff, Kanika Madan, Vikram Voleti, Samira Ebrahimi Kahou, Vincent Michalski, Dmitriy Serdyuk, Tal Arbel, Chris Pal, Gaël Varoquaux, Pascal Vincent

    Abstract: Strong empirical evidence that one machine-learning algorithm A outperforms another one B ideally calls for multiple trials optimizing the learning pipeline over sources of variation such as data sampling, data augmentation, parameter initialization, and hyperparameter choices. This is prohibitively expensive, and corners are cut to reach conclusions. We model the whole benchmarking process, reve…

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: Submitted to MLSys2021

  23. arXiv:2103.02014  [pdf, other]

    cs.LG cs.CR cs.DS

    Online Adversarial Attacks

    Authors: Andjela Mladenovic, Avishek Joey Bose, Hugo Berard, William L. Hamilton, Simon Lacoste-Julien, Pascal Vincent, Gauthier Gidel

    Abstract: Adversarial attacks expose important vulnerabilities of deep learning models, yet little attention has been paid to settings where data arrives as a stream. In this paper, we formalize the online adversarial attack problem, emphasizing two key elements found in real-world use-cases: attackers must operate under partial knowledge of the target model, and the decisions made by the attacker are irrev…

    Submitted 22 March, 2022; v1 submitted 2 March, 2021; originally announced March 2021.

    Comments: ICLR 2022

  24. arXiv:2010.12870  [pdf, ps, other]

    cs.LG stat.ML

    Efficient Learning in Non-Stationary Linear Markov Decision Processes

    Authors: Ahmed Touati, Pascal Vincent

    Abstract: We study episodic reinforcement learning in non-stationary linear (a.k.a. low-rank) Markov Decision Processes (MDPs), i.e., both the reward and transition kernel are linear with respect to a given feature map and are allowed to evolve either slowly or abruptly over time. For this problem setting, we propose OPT-WLSVI, an optimistic model-free algorithm based on weighted least squares value iteration…

    Submitted 27 December, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

  25. arXiv:2010.04425  [pdf, other]

    eess.IV cs.CV

    WHO 2016 subtyping and automated segmentation of glioma using multi-task deep learning

    Authors: Sebastian R. van der Voort, Fatih Incekara, Maarten M. J. Wijnenga, Georgios Kapsas, Renske Gahrmann, Joost W. Schouten, Rishi Nandoe Tewarie, Geert J. Lycklama, Philip C. De Witt Hamer, Roelant S. Eijgelaar, Pim J. French, Hendrikus J. Dubbink, Arnaud J. P. E. Vincent, Wiro J. Niessen, Martin J. van den Bent, Marion Smits, Stefan Klein

    Abstract: Accurate characterization of glioma is crucial for clinical decision making. A delineation of the tumor is also desirable in the initial decision stages but is a time-consuming task. Leveraging the latest GPU capabilities, we developed a single multi-task convolutional neural network that uses the full 3D, structural, pre-operative MRI scans to predict the IDH mutation status, the 1p/19q co-de…

    Submitted 9 October, 2020; originally announced October 2020.

  26. arXiv:2008.00938  [pdf, other]

    cs.LG stat.ML

    Implicit Regularization via Neural Feature Alignment

    Authors: Aristide Baratin, Thomas George, César Laurent, R Devon Hjelm, Guillaume Lajoie, Pascal Vincent, Simon Lacoste-Julien

    Abstract: We approach the problem of implicit regularization in deep learning from a geometrical viewpoint. We highlight a regularization effect induced by a dynamical alignment of the neural tangent features introduced by Jacot et al., along a small number of task-relevant directions. This can be interpreted as a combined mechanism of feature selection and compression. By extrapolating a new analysis of Rad…

    Submitted 16 March, 2021; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: AISTATS 2021

  27. arXiv:2007.04202  [pdf, other]

    cs.LG cs.GT math.OC stat.ML

    Stochastic Hamiltonian Gradient Methods for Smooth Games

    Authors: Nicolas Loizou, Hugo Berard, Alexia Jolicoeur-Martineau, Pascal Vincent, Simon Lacoste-Julien, Ioannis Mitliagkas

    Abstract: The success of adversarial formulations in machine learning has brought renewed motivation for smooth games. In this work, we focus on the class of stochastic Hamiltonian methods and provide the first convergence guarantees for certain classes of stochastic smooth games. We propose a novel unbiased estimator for the stochastic Hamiltonian gradient descent (SHGD) and highlight its benefits. Using t…

    Submitted 8 July, 2020; originally announced July 2020.

    Comments: ICML 2020 - Proceedings of the 37th International Conference on Machine Learning

  28. arXiv:2007.03749  [pdf, ps, other]

    cs.LG stat.ML

    Sharp Analysis of Smoothed Bellman Error Embedding

    Authors: Ahmed Touati, Pascal Vincent

    Abstract: The Smoothed Bellman Error Embedding algorithm (Dai et al., 2018), known as SBEED, was proposed as a provably convergent reinforcement learning algorithm with general nonlinear function approximation. It has been successfully implemented with neural networks and achieved strong empirical results. In this work, we study the theoretical behavior of SBEED in batch-mode reinforcement learni…

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: Accepted at the ICML 2020 Workshop on Theoretical Foundations of Reinforcement Learning

  29. arXiv:2007.00720  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Adversarial Example Games

    Authors: Avishek Joey Bose, Gauthier Gidel, Hugo Berard, Andre Cianflone, Pascal Vincent, Simon Lacoste-Julien, William L. Hamilton

    Abstract: The existence of adversarial examples capable of fooling trained neural network classifiers calls for a much better understanding of possible attacks to guide the development of safeguards against them. This includes attack methods in the challenging non-interactive blackbox setting, where adversarial attacks are generated without any access, including queries, to the target model. Prior attacks i…

    Submitted 8 January, 2021; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: Appears in: Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

  30. arXiv:2006.12279  [pdf, other]

    cs.LG stat.ML

    Revisiting Loss Modelling for Unstructured Pruning

    Authors: César Laurent, Camille Ballas, Thomas George, Nicolas Ballas, Pascal Vincent

    Abstract: By removing parameters from deep neural networks, unstructured pruning methods aim at cutting down memory footprint and computational cost, while maintaining prediction accuracy. In order to tackle this otherwise intractable problem, many of these methods model the loss landscape using first or second order Taylor expansions to identify which parameters can be discarded. We revisit loss modelling…

    Submitted 22 June, 2020; originally announced June 2020.

  31. Do sequence-to-sequence VAEs learn global features of sentences?

    Authors: Tom Bosc, Pascal Vincent

    Abstract: Autoregressive language models are powerful and relatively easy to train. However, these models are usually trained without explicit conditioning labels and do not offer easy ways to control global aspects such as sentiment or topic during generation. Bowman et al. (2016) adapted the Variational Autoencoder (VAE) for natural language with the sequence-to-sequence architecture and claimed that the l…

    Submitted 28 March, 2021; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: Camera-ready version, EMNLP2020

  32. arXiv:2003.04108  [pdf, other]

    cs.LG stat.ML

    Stable Policy Optimization via Off-Policy Divergence Regularization

    Authors: Ahmed Touati, Amy Zhang, Joelle Pineau, Pascal Vincent

    Abstract: Trust Region Policy Optimization (TRPO) and Proximal Policy Optimization (PPO) are among the most successful policy gradient approaches in deep reinforcement learning (RL). While these methods achieve state-of-the-art performance across a wide range of challenging tasks, there is room for improvement in the stabilization of the policy learning and how the off-policy data are used. In this paper we…

    Submitted 19 June, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Journal ref: Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI), PMLR volume 124, 2020

  33. Application of Genetic Algorithm for More Efficient Multi-Layer Thickness Optimization in Solar Cells

    Authors: Premkumar Vincent, Gwenaelle Cunha Sergio, Jaewon Jang, In Man Kang, Jaehoon Park, Hyeok Kim, Minho Lee, Jin-Hyuk Bae

    Abstract: Thin-film solar cells are predominately designed similar to a stacked structure. Optimizing the layer thicknesses in this stack structure is crucial to extract the best efficiency of the solar cell. The commonplace method used in optimization simulations, such as for optimizing the optical spacer layers' thicknesses, is the parameter sweep. Our simulation study shows that the implementation of a m…

    Submitted 7 April, 2020; v1 submitted 14 September, 2019; originally announced September 2019.

    Comments: Published (https://www.mdpi.com/1996-1073/13/7/1726), Code (https://github.com/gcunhase/GeneticAlgorithm-SolarCells)

    Journal ref: Energies 13 (2020) 1-14

  34. arXiv:1908.00061  [pdf, other]

    cs.CV cs.LG

    An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation

    Authors: Vincent Michalski, Vikram Voleti, Samira Ebrahimi Kahou, Anthony Ortiz, Pascal Vincent, Chris Pal, Doina Precup

    Abstract: Batch normalization has been widely used to improve optimization in deep neural networks. While the uncertainty in batch statistics can act as a regularizer, using these dataset statistics specific to the training set impairs generalization in certain tasks. Recently, alternative methods for normalizing feature activations in neural networks have been proposed. Among them, group normalization has…

    Submitted 31 July, 2019; originally announced August 2019.

  35. arXiv:1906.04848  [pdf, other]

    cs.LG stat.ML

    A Closer Look at the Optimization Landscapes of Generative Adversarial Networks

    Authors: Hugo Berard, Gauthier Gidel, Amjad Almahairi, Pascal Vincent, Simon Lacoste-Julien

    Abstract: Generative adversarial networks have been very successful in generative modeling; however, they remain relatively challenging to train compared to standard deep neural networks. In this paper, we propose new visualization techniques for the optimization landscapes of GANs that enable us to study the game vector field resulting from the concatenation of the gradient of both players. Using these visu…

    Submitted 27 April, 2020; v1 submitted 11 June, 2019; originally announced June 2019.

  36. arXiv:1906.04282  [pdf, other]

    cs.LG stat.ML

    Stochastic Neural Network with Kronecker Flow

    Authors: Chin-Wei Huang, Ahmed Touati, Pascal Vincent, Gintare Karolina Dziugaite, Alexandre Lacoste, Aaron Courville

    Abstract: Recent advances in variational inference enable the modelling of highly structured joint distributions, but are limited in their capacity to scale to the high-dimensional setting of stochastic neural networks. This limitation motivates a need for scalable parameterizations of the noise generation process, in a manner that adequately captures the dependencies among the various parameters. In this w…

    Submitted 13 February, 2020; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS) 2020

  37. arXiv:1906.03704  [pdf, other]

    cs.LG stat.ML

    SVRG for Policy Evaluation with Fewer Gradient Evaluations

    Authors: Zilun Peng, Ahmed Touati, Pascal Vincent, Doina Precup

    Abstract: Stochastic variance-reduced gradient (SVRG) is an optimization method originally designed for tackling machine learning problems with a finite sum structure. SVRG was later shown to work for policy evaluation, a problem in reinforcement learning in which one aims to estimate the value function of a given policy. SVRG makes use of gradient estimates at two scales. At the slower scale, SVRG computes…

    Submitted 19 June, 2020; v1 submitted 9 June, 2019; originally announced June 2019.

    Comments: Short version of the paper is published in the proceedings of the 29th International Joint Conference on Artificial Intelligence and the 17th Pacific Rim International Conference on Artificial Intelligence (IJCAI-PRICAI2020)

  38. arXiv:1902.03051  [pdf, other]

    cs.CV

    Reducing Uncertainty in Undersampled MRI Reconstruction with Active Acquisition

    Authors: Zizhao Zhang, Adriana Romero, Matthew J. Muckley, Pascal Vincent, Lin Yang, Michal Drozdzal

    Abstract: The goal of MRI reconstruction is to restore a high fidelity image from partially observed measurements. This partial view naturally induces reconstruction uncertainty that can only be reduced by acquiring additional measurements. In this paper, we present a novel method for MRI reconstruction that, at inference time, dynamically selects the measurements to take and iteratively refines the predict…

    Submitted 8 February, 2019; originally announced February 2019.

  39. arXiv:1811.08839  [pdf, other]

    cs.CV cs.LG eess.SP physics.med-ph stat.ML

    fastMRI: An Open Dataset and Benchmarks for Accelerated MRI

    Authors: Jure Zbontar, Florian Knoll, Anuroop Sriram, Tullie Murrell, Zhengnan Huang, Matthew J. Muckley, Aaron Defazio, Ruben Stern, Patricia Johnson, Mary Bruno, Marc Parente, Krzysztof J. Geras, Joe Katsnelson, Hersh Chandarana, Zizhao Zhang, Michal Drozdzal, Adriana Romero, Michael Rabbat, Pascal Vincent, Nafissa Yakubova, James Pinkerton, Duo Wang, Erich Owens, C. Lawrence Zitnick, Michael P. Recht, et al. (2 additional authors not shown)

    Abstract: Accelerating Magnetic Resonance Imaging (MRI) by taking fewer measurements has the potential to reduce medical costs, minimize stress to patients and make MRI possible in applications where it is currently prohibitively slow or expensive. We introduce the fastMRI dataset, a large-scale collection of both raw MR measurements and clinical MR images, that can be used for training and evaluation of ma…

    Submitted 11 December, 2019; v1 submitted 21 November, 2018; originally announced November 2018.

    Comments: 35 pages, 10 figures

  40. The Long Road to Computational Location Privacy: A Survey

    Authors: Primault Vincent, Boutet Antoine, Ben Mokhtar Sonia, Brunie Lionel

    Abstract: The widespread adoption of continuously connected smartphones and tablets developed the usage of mobile applications, among which many use location to provide geolocated services. These services provide new prospects for users: getting directions to work in the morning, leaving a check-in at a restaurant at noon and checking next day's weather in the evening are possible right from any mobile devi…

    Submitted 8 October, 2018; originally announced October 2018.

    Comments: IEEE Communications Surveys & Tutorials

  41. arXiv:1806.03884  [pdf, other]

    cs.LG stat.ML

    Fast Approximate Natural Gradient Descent in a Kronecker-factored Eigenbasis

    Authors: Thomas George, César Laurent, Xavier Bouthillier, Nicolas Ballas, Pascal Vincent

    Abstract: Optimization algorithms that leverage gradient covariance information, such as variants of natural gradient descent (Amari, 1998), offer the prospect of yielding more effective descent directions. For models with many parameters, the covariance matrix they are based on becomes gigantic, making them inapplicable in their original form. This has motivated research into both simple diagonal approxima…

    Submitted 26 July, 2021; v1 submitted 11 June, 2018; originally announced June 2018.

    Journal ref: Advances in Neural Information Processing Systems 2018

  42. arXiv:1806.02315  [pdf, other]

    cs.LG stat.ML

    Randomized Value Functions via Multiplicative Normalizing Flows

    Authors: Ahmed Touati, Harsh Satija, Joshua Romoff, Joelle Pineau, Pascal Vincent

    Abstract: Randomized value functions offer a promising approach towards the challenge of efficient exploration in complex environments with high dimensional state and action spaces. Unlike traditional point estimate methods, randomized value functions maintain a posterior distribution over action-space values. This prevents the agent's behavior policy from prematurely exploiting early estimates and falling…

    Submitted 28 June, 2019; v1 submitted 6 June, 2018; originally announced June 2018.

    Journal ref: UAI 2019: Conference on Uncertainty in Artificial Intelligence 2019

  43. arXiv:1802.10551  [pdf, other]

    cs.LG math.OC stat.ML

    A Variational Inequality Perspective on Generative Adversarial Networks

    Authors: Gauthier Gidel, Hugo Berard, Gaëtan Vignoud, Pascal Vincent, Simon Lacoste-Julien

    Abstract: Generative adversarial networks (GANs) form a generative modeling approach known for producing appealing samples, but they are notably difficult to train. One common way to tackle this issue has been to propose new formulations of the GAN objective. Yet, surprisingly few studies have looked at optimization methods designed for this adversarial training. In this work, we cast GAN optimization probl…

    Submitted 28 August, 2020; v1 submitted 28 February, 2018; originally announced February 2018.

    Comments: Appears in: Proceedings of the Seventh International Conference on Learning Representations (ICLR 2019). Minor modifications with respect to the ICLR version (First paragraph of page 2 and section 3.3): New reference [Popov 1980] and discussion with regards to the novelty of extrapolation from the past. 38 pages

    ACM Class: I.2.6; G.1.6

  44. arXiv:1709.01591  [pdf, other]

    cs.CV

    Improving Landmark Localization with Semi-Supervised Learning

    Authors: Sina Honari, Pavlo Molchanov, Stephen Tyree, Pascal Vincent, Christopher Pal, Jan Kautz

    Abstract: We present two techniques to improve landmark localization in images from partially annotated datasets. Our primary goal is to leverage the common situation where precise landmark locations are only provided for a small data subset, but where class labels for classification or regression tasks related to the landmarks are more abundantly available. First, we propose the framework of sequential mul…

    Submitted 28 October, 2018; v1 submitted 5 September, 2017; originally announced September 2017.

    Comments: Published as a conference paper in CVPR 2018

  45. arXiv:1708.02511  [pdf, other]

    cs.LG stat.ML

    Parametric Adversarial Divergences are Good Losses for Generative Modeling

    Authors: Gabriel Huang, Hugo Berard, Ahmed Touati, Gauthier Gidel, Pascal Vincent, Simon Lacoste-Julien

    Abstract: Parametric adversarial divergences, which are a generalization of the losses used to train generative adversarial networks (GANs), have often been described as being approximations of their nonparametric counterparts, such as the Jensen-Shannon divergence, which can be derived under the so-called optimal discriminator assumption. In this position paper, we argue that despite being "non-optimal", p…

    Submitted 21 October, 2021; v1 submitted 8 August, 2017; originally announced August 2017.

  46. arXiv:1706.00286  [pdf, other]

    cs.LG cs.CL

    Learning to Compute Word Embeddings On the Fly

    Authors: Dzmitry Bahdanau, Tom Bosc, Stanisław Jastrzębski, Edward Grefenstette, Pascal Vincent, Yoshua Bengio

    Abstract: Words in natural language follow a Zipfian distribution whereby some words are frequent but most are rare. Learning representations for words in the "long tail" of this distribution requires enormous amounts of data. Representations of rare words trained directly on end tasks are usually poor, requiring us to pre-train embeddings on external data, or treat all rare words as out-of-vocabulary words…

    Submitted 7 March, 2018; v1 submitted 1 June, 2017; originally announced June 2017.

  47. arXiv:1705.09322  [pdf, other]

    cs.LG

    Convergent Tree Backup and Retrace with Function Approximation

    Authors: Ahmed Touati, Pierre-Luc Bacon, Doina Precup, Pascal Vincent

    Abstract: Off-policy learning is key to scaling up reinforcement learning as it allows to learn about a target policy from the experience generated by a different behavior policy. Unfortunately, it has been challenging to combine off-policy learning with function approximation and multi-step bootstrapping in a way that leads to both stable and efficient algorithms. In this work, we show that the \textsc{Tre…

    Submitted 22 October, 2018; v1 submitted 25 May, 2017; originally announced May 2017.

    Journal ref: ICML 2018, Proceedings of the 35th International Conference on Machine Learning, PMLR 80:4955-4964, 2018

  48. arXiv:1703.06975  [pdf, other]

    stat.ML cs.LG

    Learning to Generate Samples from Noise through Infusion Training

    Authors: Florian Bordes, Sina Honari, Pascal Vincent

    Abstract: In this work, we investigate a novel training procedure to learn a generative model as the transition operator of a Markov chain, such that, when applied repeatedly on an unstructured random noise sample, it will denoise it into a sample that matches the target distribution from the training set. The novel training procedure to learn this progressive denoising operation involves sampling from a sl…

    Submitted 20 March, 2017; originally announced March 2017.

    Comments: Published as a conference paper at ICLR 2017

  49. arXiv:1609.05866  [pdf, other]

    cs.LG cs.IR cs.NE stat.ML

    A Cheap Linear Attention Mechanism with Fast Lookups and Fixed-Size Representations

    Authors: Alexandre de Brébisson, Pascal Vincent

    Abstract: The softmax content-based attention mechanism has proven to be very beneficial in many applications of recurrent neural networks. Nevertheless it suffers from two major computational limitations. First, its computations for an attention lookup scale linearly in the size of the attended sequence. Second, it does not encode the sequence into a fixed-size representation but instead requires to memori…

    Submitted 19 September, 2016; originally announced September 2016.

  50. arXiv:1607.00844  [pdf, other]

    cs.MS

    Using the pyMIC Offload Module in PyFR

    Authors: Michael Klemm, Freddie Witherden, Peter Vincent

    Abstract: PyFR is an open-source high-order accurate computational fluid dynamics solver for unstructured grids. It is designed to efficiently solve the compressible Navier-Stokes equations on a range of hardware platforms, including GPUs and CPUs. In this paper we will describe how the Python Offload Infrastructure for the Intel Many Integrated Core Architecture (pyMIC) was used to enable PyFR to run with…

    Submitted 1 July, 2016; originally announced July 2016.

    Report number: euroscipy-proceedings2015-01