Skip to main content

Showing 1–3 of 3 results for author: Mialon, G

Searching in archive stat. Search in all archives.
.
  1. arXiv:2006.12065  [pdf, other

    cs.LG stat.ML

    A Trainable Optimal Transport Embedding for Feature Aggregation and its Relationship to Attention

    Authors: Grégoire Mialon, Dexiong Chen, Alexandre d'Aspremont, Julien Mairal

    Abstract: We address the problem of learning on sets of features, motivated by the need of performing pooling operations in long biological sequences of varying sizes, with long-range dependencies, and possibly few labeled data. To address this challenging task, we introduce a parametrized representation of fixed size, which embeds and then aggregates elements from a given input set according to the optimal… ▽ More

    Submitted 9 February, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: ICLR 2021

  2. arXiv:1912.02566  [pdf, other

    cs.LG stat.ML

    Screening Data Points in Empirical Risk Minimization via Ellipsoidal Regions and Safe Loss Functions

    Authors: Grégoire Mialon, Alexandre d'Aspremont, Julien Mairal

    Abstract: We design simple screening tests to automatically discard data samples in empirical risk minimization without losing optimization guarantees. We derive loss functions that produce dual objectives with a sparse solution. We also show how to regularize convex losses to ensure such a dual sparsity-inducing property, and propose a general method to design screening tests for classification or regressi… ▽ More

    Submitted 12 June, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: AISTATS 2020

  3. arXiv:1810.00363  [pdf, other

    stat.ML cs.LG

    A Kernel Perspective for Regularizing Deep Neural Networks

    Authors: Alberto Bietti, Grégoire Mialon, Dexiong Chen, Julien Mairal

    Abstract: We propose a new point of view for regularizing deep neural networks by using the norm of a reproducing kernel Hilbert space (RKHS). Even though this norm cannot be computed, it admits upper and lower approximations leading to various practical strategies. Specifically, this perspective (i) provides a common umbrella for many existing regularization principles, including spectral norm and gradient… ▽ More

    Submitted 13 May, 2019; v1 submitted 30 September, 2018; originally announced October 2018.

    Comments: ICML