Showing 1–50 of 82 results for author: Mairal, J

Searching in archive cs.

  1. arXiv:2403.20233

    stat.ML cs.LG

    Functional Bilevel Optimization for Machine Learning

    Authors: Ieva Petrulionyte, Julien Mairal, Michael Arbel

    Abstract: In this paper, we introduce a new functional point of view on bilevel optimization problems for machine learning, where the inner objective is minimized over a function space. These types of problems are most often solved by using methods developed in the parametric setting, where the inner objective is strongly convex with respect to the parameters of the prediction function. The functional point…

    Submitted 13 June, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

  2. arXiv:2402.11305

    cs.CV

    On Good Practices for Task-Specific Distillation of Large Pretrained Visual Models

    Authors: Juliette Marrie, Michael Arbel, Julien Mairal, Diane Larlus

    Abstract: Large pretrained visual models exhibit remarkable generalization across diverse recognition tasks. Yet, real-world applications often demand compact models tailored to specific problems. Variants of knowledge distillation have been devised for such a purpose, enabling task-specific compact models (the students) to learn from a generic large pretrained one (the teacher). In this paper, we show that…

    Submitted 7 May, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Journal ref: Published in Transactions on Machine Learning Research (TMLR), 2024

  3. arXiv:2401.12609

    cs.CV cs.LG eess.IV

    Fast Semi-supervised Unmixing using Non-convex Optimization

    Authors: Behnood Rasti, Alexandre Zouaoui, Julien Mairal, Jocelyn Chanussot

    Abstract: In this paper, we introduce a novel linear model tailored for semisupervised/library-based unmixing. Our model incorporates considerations for library mismatch while enabling the enforcement of the abundance sum-to-one constraint (ASC). Unlike conventional sparse unmixing methods, this model involves nonconvex optimization, presenting significant computational challenges. We demonstrate the effica…

    Submitted 23 January, 2024; originally announced January 2024.

  4. arXiv:2312.05190

    cs.CV

    Fine Dense Alignment of Image Bursts through Camera Pose and Depth Estimation

    Authors: Bruno Lecouat, Yann Dubois de Mont-Marin, Théo Bodrito, Julien Mairal, Jean Ponce

    Abstract: This paper introduces a novel approach to the fine alignment of images in a burst captured by a handheld camera. In contrast to traditional techniques that estimate two-dimensional transformations between frame pairs or rely on discrete correspondences, the proposed algorithm establishes dense correspondences by optimizing both the camera motion and surface depth and orientation at every pixel. Th…

    Submitted 8 December, 2023; originally announced December 2023.

  5. arXiv:2311.17846

    cs.CV

    Towards Real-World Focus Stacking with Deep Learning

    Authors: Alexandre Araujo, Jean Ponce, Julien Mairal

    Abstract: Focus stacking is widely used in micro, macro, and landscape photography to reconstruct all-in-focus images from multiple frames obtained with focus bracketing, that is, with shallow depth of field and different focus planes. Existing deep learning approaches to the underlying multi-focus image fusion problem have limited applicability to real-world imagery since they are designed for very short i…

    Submitted 29 November, 2023; originally announced November 2023.

  6. arXiv:2309.16588

    cs.CV

    Vision Transformers Need Registers

    Authors: Timothée Darcet, Maxime Oquab, Julien Mairal, Piotr Bojanowski

    Abstract: Transformers have recently emerged as a powerful tool for learning visual representations. In this paper, we identify and characterize artifacts in feature maps of both supervised and self-supervised ViT networks. The artifacts correspond to high-norm tokens appearing during inference primarily in low-informative background areas of images, that are repurposed for internal computations. We propose…

    Submitted 12 April, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

  7. arXiv:2308.09375

    eess.IV cs.CV cs.LG

    Image Processing and Machine Learning for Hyperspectral Unmixing: An Overview and the HySUPP Python Package

    Authors: Behnood Rasti, Alexandre Zouaoui, Julien Mairal, Jocelyn Chanussot

    Abstract: Spectral pixels are often a mixture of the pure spectra of the materials, called endmembers, due to the low spatial resolution of hyperspectral sensors, double scattering, and intimate mixtures of materials in the scenes. Unmixing estimates the fractional abundances of the endmembers within the pixel. Depending on the prior knowledge of endmembers, linear unmixing can be divided into three main g…

    Submitted 26 April, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: IEEE Transactions on Geoscience and Remote Sensing, 2024

  8. arXiv:2308.04771

    cs.CV cs.LG eess.IV

    SUnAA: Sparse Unmixing using Archetypal Analysis

    Authors: Behnood Rasti, Alexandre Zouaoui, Julien Mairal, Jocelyn Chanussot

    Abstract: This paper introduces a new sparse unmixing technique using archetypal analysis (SUnAA). First, we design a new model based on archetypal analysis. We assume that the endmembers of interest are a convex combination of endmembers provided by a spectral library and that the number of endmembers of interest is known. Then, we propose a minimization problem. Unlike most conventional sparse unmixing me…

    Submitted 9 August, 2023; originally announced August 2023.

    Journal ref: IEEE Geoscience and Remote Sensing Letters, 2023, 20, pp.1-5

  9. arXiv:2306.14932

    math.OC cs.LG

    GloptiNets: Scalable Non-Convex Optimization with Certificates

    Authors: Gaspard Beugnot, Julien Mairal, Alessandro Rudi

    Abstract: We present a novel approach to non-convex optimization with certificates, which handles smooth functions on the hypercube or on the torus. Unlike traditional methods that rely on algebraic properties, our algorithm exploits the regularity of the target function intrinsic in the decay of its Fourier spectrum. By defining a tractable family of models, we allow at the same time to obtain precise cert…

    Submitted 20 December, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Edit affiliations and acknowledgments

  10. arXiv:2306.12266

    astro-ph.IM astro-ph.EP cs.LG

    Combining multi-spectral data with statistical and deep-learning models for improved exoplanet detection in direct imaging at high contrast

    Authors: Olivier Flasseur, Théo Bodrito, Julien Mairal, Jean Ponce, Maud Langlois, Anne-Marie Lagrange

    Abstract: Exoplanet detection by direct imaging is a difficult task: the faint signals from the objects of interest are buried under a spatially structured nuisance component induced by the host star. The exoplanet signals can only be identified when combining several observations with dedicated detection algorithms. In contrast to most of existing methods, we propose to learn a model of the spatial, tempor…

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: accepted to EUSIPCO 2023

  11. arXiv:2306.09998

    cs.CV cs.LG

    SLACK: Stable Learning of Augmentations with Cold-start and KL regularization

    Authors: Juliette Marrie, Michael Arbel, Diane Larlus, Julien Mairal

    Abstract: Data augmentation is known to improve the generalization capabilities of neural networks, provided that the set of transformations is chosen with care, a selection often performed manually. Automatic data augmentation aims at automating this process. However, most recent approaches still rely on some prior information; they start from a small pool of manually-selected default transformations that…

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: Accepted to CVPR 2023

  12. arXiv:2306.07483

    cs.CV

    Semi-supervised learning made simple with self-supervised clustering

    Authors: Enrico Fini, Pietro Astolfi, Karteek Alahari, Xavier Alameda-Pineda, Julien Mairal, Moin Nabi, Elisa Ricci

    Abstract: Self-supervised learning models have been shown to learn rich visual representations without requiring human annotations. However, in many real-world scenarios, labels are partially available, motivating a recent line of work on semi-supervised methods inspired by self-supervised principles. In this paper, we propose a conceptually simple yet empirically powerful approach to turn clustering-based…

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: CVPR 2023 - Code available at https://github.com/pietroastolfi/suave-daino

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2023) 3187-3197

  13. arXiv:2304.10933

    cs.LG

    Self-Attention in Colors: Another Take on Encoding Graph Structure in Transformers

    Authors: Romain Menegaux, Emmanuel Jehanno, Margot Selosse, Julien Mairal

    Abstract: We introduce a novel self-attention mechanism, which we call CSA (Chromatic Self-Attention), which extends the notion of attention scores to attention _filters_, independently modulating the feature channels. We showcase CSA in a fully-attentional graph Transformer CGT (Chromatic Graph Transformer) which integrates both graph structural information and edge features, completely bypassing the need…

    Submitted 21 April, 2023; originally announced April 2023.

  14. arXiv:2304.07193

    cs.CV

    DINOv2: Learning Robust Visual Features without Supervision

    Authors: Maxime Oquab, Timothée Darcet, Théo Moutakanni, Huy Vo, Marc Szafraniec, Vasil Khalidov, Pierre Fernandez, Daniel Haziza, Francisco Massa, Alaaeldin El-Nouby, Mahmoud Assran, Nicolas Ballas, Wojciech Galuba, Russell Howes, Po-Yao Huang, Shang-Wen Li, Ishan Misra, Michael Rabbat, Vasu Sharma, Gabriel Synnaeve, Hu Xu, Hervé Jegou, Julien Mairal, Patrick Labatut, Armand Joulin, et al. (1 additional author not shown)

    Abstract: The recent breakthroughs in natural language processing for model pretraining on large quantities of data have opened the way for similar foundation models in computer vision. These models could greatly simplify the use of images in any system by producing all-purpose visual features, i.e., features that work across image distributions and tasks without finetuning. This work shows that existing pr…

    Submitted 2 February, 2024; v1 submitted 14 April, 2023; originally announced April 2023.

  15. arXiv:2302.12120

    cs.LG

    Sequential Counterfactual Risk Minimization

    Authors: Houssam Zenati, Eustache Diemert, Matthieu Martin, Julien Mairal, Pierre Gaillard

    Abstract: Counterfactual Risk Minimization (CRM) is a framework for dealing with the logged bandit feedback problem, where the goal is to improve a logging policy using offline data. In this paper, we explore the case where it is possible to deploy learned policies multiple times and acquire new data. We extend the CRM principle and its theory to this scenario, which we call "Sequential Counterfactual Risk…

    Submitted 25 May, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: To appear at ICML23

  16. arXiv:2211.09019

    cs.RO cs.AI cs.CV cs.LG

    Learning Reward Functions for Robotic Manipulation by Observing Humans

    Authors: Minttu Alakuijala, Gabriel Dulac-Arnold, Julien Mairal, Jean Ponce, Cordelia Schmid

    Abstract: Observing a human demonstrator manipulate objects provides a rich, scalable and inexpensive source of data for learning robotic policies. However, transferring skills from human videos to a robotic manipulator poses several challenges, not least a difference in action and observation spaces. In this work, we use unlabeled videos of humans solving a wide range of manipulation tasks to learn a task-…

    Submitted 7 March, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

  17. arXiv:2209.11002

    eess.IV cs.CV cs.LG

    Entropic Descent Archetypal Analysis for Blind Hyperspectral Unmixing

    Authors: Alexandre Zouaoui, Gedeon Muhawenayo, Behnood Rasti, Jocelyn Chanussot, Julien Mairal

    Abstract: In this paper, we introduce a new algorithm based on archetypal analysis for blind hyperspectral unmixing, assuming linear mixing of endmembers. Archetypal analysis is a natural formulation for this task. This method does not require the presence of pure pixels (i.e., pixels containing a single material) but instead represents endmembers as convex combinations of a few pixels present in the origin…

    Submitted 26 September, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

  18. arXiv:2207.14671

    cs.CV eess.IV

    High Dynamic Range and Super-Resolution from Raw Image Bursts

    Authors: Bruno Lecouat, Thomas Eboli, Jean Ponce, Julien Mairal

    Abstract: Photographs captured by smartphones and mid-range cameras have limited spatial resolution and dynamic range, with noisy response in underexposed regions and color artefacts in saturated areas. This paper introduces the first approach (to the best of our knowledge) to the reconstruction of high-resolution, high-dynamic range color images from raw photographic bursts captured by a handheld camera wi…

    Submitted 29 July, 2022; originally announced July 2022.

    Comments: Accepted to Siggraph 2022 Technical Papers program

  19. arXiv:2206.12117

    cs.CV cs.LG

    Self Supervised Learning for Few Shot Hyperspectral Image Classification

    Authors: Nassim Ait Ali Braham, Lichao Mou, Jocelyn Chanussot, Julien Mairal, Xiao Xiang Zhu

    Abstract: Deep learning has proven to be a very effective approach for Hyperspectral Image (HSI) classification. However, deep neural networks require large annotated datasets to generalize well. This limits the applicability of deep learning for HSI classification, where manually labelling thousands of pixels for every scene is impractical. In this paper, we propose to leverage Self Supervised Learning (SS…

    Submitted 24 June, 2022; originally announced June 2022.

    Comments: Accepted in IGARSS 2022

  20. arXiv:2202.13733

    stat.ML cs.LG math.OC

    On the Benefits of Large Learning Rates for Kernel Methods

    Authors: Gaspard Beugnot, Julien Mairal, Alessandro Rudi

    Abstract: This paper studies an intriguing phenomenon related to the good generalization performance of estimators obtained by using large learning rates within gradient descent algorithms. First observed in the deep learning literature, we show that a phenomenon can be precisely characterized in the context of kernel methods, even though the resulting optimization problem is convex. Specifically, we consid…

    Submitted 3 June, 2022; v1 submitted 28 February, 2022; originally announced February 2022.

    Comments: Accepted paper at Conference COLT 2022. To be published in Proceedings of Machine Learning Research (PMLR)

  21. arXiv:2202.13473

    cs.LG cs.CV

    The Spectral Bias of Polynomial Neural Networks

    Authors: Moulik Choraria, Leello Tadesse Dadi, Grigorios Chrysos, Julien Mairal, Volkan Cevher

    Abstract: Polynomial neural networks (PNNs) have been recently shown to be particularly effective at image generation and face recognition, where high-frequency information is critical. Previous studies have revealed that neural networks demonstrate a $\textit{spectral bias}$ towards low-frequency functions, which yields faster learning of low-frequency components during training. Inspired by such studies,…

    Submitted 27 February, 2022; originally announced February 2022.

    Comments: Accepted at the International Conference on Learning Representations (ICLR) 2022

  22. arXiv:2202.05638

    cs.LG

    Efficient Kernel UCB for Contextual Bandits

    Authors: Houssam Zenati, Alberto Bietti, Eustache Diemert, Julien Mairal, Matthieu Martin, Pierre Gaillard

    Abstract: In this paper, we tackle the computational efficiency of kernelized UCB algorithms in contextual bandits. While standard methods require a O(CT^3) complexity where T is the horizon and the constant C is related to optimizing the UCB rule, we propose an efficient contextual algorithm for large-scale problems. Specifically, our method relies on incremental Nystrom approximations of the joint kernel…

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: To appear at AISTATS2022

  23. arXiv:2112.04215

    cs.CV cs.LG

    Self-Supervised Models are Continual Learners

    Authors: Enrico Fini, Victor G. Turrisi da Costa, Xavier Alameda-Pineda, Elisa Ricci, Karteek Alahari, Julien Mairal

    Abstract: Self-supervised models have been shown to produce comparable or better visual representations than their supervised counterparts when trained offline on unlabeled data at scale. However, their efficacy is catastrophically reduced in a Continual Learning (CL) scenario where data is presented to the model sequentially. In this paper, we show that self-supervised loss functions can be seamlessly conv…

    Submitted 1 April, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

  24. arXiv:2111.14580

    math.OC cs.LG

    Amortized Implicit Differentiation for Stochastic Bilevel Optimization

    Authors: Michael Arbel, Julien Mairal

    Abstract: We study a class of algorithms for solving bilevel optimization problems in both stochastic and deterministic settings when the inner-level objective is strongly convex. Specifically, we consider algorithms based on inexact implicit differentiation and we exploit a warm-start strategy to amortize the estimation of the exact gradient. We then introduce a unified theoretical framework inspired by th…

    Submitted 11 July, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

  25. arXiv:2111.09708

    eess.IV cs.CV cs.LG

    A Trainable Spectral-Spatial Sparse Coding Model for Hyperspectral Image Restoration

    Authors: Théo Bodrito, Alexandre Zouaoui, Jocelyn Chanussot, Julien Mairal

    Abstract: Hyperspectral imaging offers new perspectives for diverse applications, ranging from the monitoring of the environment using airborne or satellite remote sensing, precision farming, food safety, planetary exploration, or astrophysics. Unfortunately, the spectral diversity of information comes at the expense of various sources of degradation, and the lack of accurate ground-truth "clean" hyperspec…

    Submitted 18 November, 2021; originally announced November 2021.

    Journal ref: 2021 Conference on Neural Information Processing Systems, Dec 2021, Sydney, Australia

  26. arXiv:2106.08855

    cs.LG stat.ML

    Beyond Tikhonov: Faster Learning with Self-Concordant Losses via Iterative Regularization

    Authors: Gaspard Beugnot, Julien Mairal, Alessandro Rudi

    Abstract: The theory of spectral filtering is a remarkable tool to understand the statistical properties of learning with kernels. For least squares, it allows to derive various regularization schemes that yield faster convergence rates of the excess risk than with Tikhonov regularization. This is typically achieved by leveraging classical assumptions called source and capacity conditions, which characteriz…

    Submitted 10 November, 2021; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: To be published in NeurIPS 2021

  27. arXiv:2106.08050

    cs.LG

    Residual Reinforcement Learning from Demonstrations

    Authors: Minttu Alakuijala, Gabriel Dulac-Arnold, Julien Mairal, Jean Ponce, Cordelia Schmid

    Abstract: Residual reinforcement learning (RL) has been proposed as a way to solve challenging robotic tasks by adapting control actions from a conventional feedback controller to maximize a reward signal. We extend the residual formulation to learn from visual inputs and sparse rewards using demonstrations. Learning from images, proprioceptive inputs and a sparse task-completion reward relaxes the requirem…

    Submitted 15 June, 2021; originally announced June 2021.

  28. arXiv:2106.05667

    cs.LG

    GraphiT: Encoding Graph Structure in Transformers

    Authors: Grégoire Mialon, Dexiong Chen, Margot Selosse, Julien Mairal

    Abstract: We show that viewing graphs as sets of node features and incorporating structural and positional information into a transformer architecture is able to outperform representations learned with classical graph neural networks (GNNs). Our model, GraphiT, encodes such information by (i) leveraging relative positional encoding strategies in self-attention scores based on positive definite kernels on gr…

    Submitted 10 June, 2021; originally announced June 2021.

  29. arXiv:2106.03839

    cs.CV

    NTIRE 2021 Challenge on Burst Super-Resolution: Methods and Results

    Authors: Goutam Bhat, Martin Danelljan, Radu Timofte, Kazutoshi Akita, Wooyeong Cho, Haoqiang Fan, Lanpeng Jia, Daeshik Kim, Bruno Lecouat, Youwei Li, Shuaicheng Liu, Ziluan Liu, Ziwei Luo, Takahiro Maeda, Julien Mairal, Christian Micheloni, Xuan Mo, Takeru Oba, Pavel Ostyakov, Jean Ponce, Sanghyeok Son, Jian Sun, Norimichi Ukita, Rao Muhammad Umer, Youliang Yan, et al. (3 additional authors not shown)

    Abstract: This paper reviews the NTIRE2021 challenge on burst super-resolution. Given a RAW noisy burst as input, the task in the challenge was to generate a clean RGB image with 4 times higher resolution. The challenge contained two tracks; Track 1 evaluating on synthetically generated data, and Track 2 using real-world bursts from mobile camera. In the final testing phase, 6 teams submitted results using…

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: NTIRE 2021 Burst Super-Resolution challenge report

  30. arXiv:2104.14294

    cs.CV

    Emerging Properties in Self-Supervised Vision Transformers

    Authors: Mathilde Caron, Hugo Touvron, Ishan Misra, Hervé Jégou, Julien Mairal, Piotr Bojanowski, Armand Joulin

    Abstract: In this paper, we question if self-supervised learning provides new properties to Vision Transformer (ViT) that stand out compared to convolutional networks (convnets). Beyond the fact that adapting self-supervised methods to this architecture works particularly well, we make the following observations: first, self-supervised ViT features contain explicit information about the semantic segmentatio…

    Submitted 24 May, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

    Comments: 21 pages

  31. arXiv:2104.06191

    cs.CV eess.IV

    Lucas-Kanade Reloaded: End-to-End Super-Resolution from Raw Image Bursts

    Authors: Bruno Lecouat, Jean Ponce, Julien Mairal

    Abstract: This presentation addresses the problem of reconstructing a high-resolution image from multiple lower-resolution snapshots captured from slightly different viewpoints in space and time. Key challenges for solving this problem include (i) aligning the input pictures with sub-pixel accuracy, (ii) handling raw (noisy) images for maximal faithfulness to native camera data, and (iii) designing/learning…

    Submitted 23 August, 2021; v1 submitted 13 April, 2021; originally announced April 2021.

    Journal ref: ICCV 2021

  32. arXiv:2006.14859

    cs.CV

    A Flexible Framework for Designing Trainable Priors with Adaptive Smoothing and Game Encoding

    Authors: Bruno Lecouat, Jean Ponce, Julien Mairal

    Abstract: We introduce a general framework for designing and training neural network layers whose forward passes can be interpreted as solving non-smooth convex optimization problems, and whose architectures are derived from an optimization algorithm. We focus on convex games, solved by local agents represented by the nodes of a graph and interacting through regularization functions. This approach is appeal…

    Submitted 9 November, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

    Comments: NeurIPS 2020

  33. arXiv:2006.12065

    cs.LG stat.ML

    A Trainable Optimal Transport Embedding for Feature Aggregation and its Relationship to Attention

    Authors: Grégoire Mialon, Dexiong Chen, Alexandre d'Aspremont, Julien Mairal

    Abstract: We address the problem of learning on sets of features, motivated by the need of performing pooling operations in long biological sequences of varying sizes, with long-range dependencies, and possibly few labeled data. To address this challenging task, we introduce a parametrized representation of fixed size, which embeds and then aggregates elements from a given input set according to the optimal…

    Submitted 9 February, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: ICLR 2021

  34. arXiv:2006.09882

    cs.CV

    Unsupervised Learning of Visual Features by Contrasting Cluster Assignments

    Authors: Mathilde Caron, Ishan Misra, Julien Mairal, Priya Goyal, Piotr Bojanowski, Armand Joulin

    Abstract: Unsupervised image representations have significantly reduced the gap with supervised pretraining, notably with the recent achievements of contrastive learning methods. These contrastive methods typically work online and rely on a large number of explicit pairwise feature comparisons, which is computationally challenging. In this paper, we propose an online algorithm, SwAV, that takes advantage of…

    Submitted 8 January, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: NeurIPS 2020

  35. arXiv:2004.11722

    stat.ML cs.LG

    Counterfactual Learning of Stochastic Policies with Continuous Actions: from Models to Offline Evaluation

    Authors: Houssam Zenati, Alberto Bietti, Matthieu Martin, Eustache Diemert, Pierre Gaillard, Julien Mairal

    Abstract: Counterfactual reasoning from logged data has become increasingly important for many applications such as web advertising or healthcare. In this paper, we address the problem of learning stochastic policies with continuous actions from the viewpoint of counterfactual risk minimization (CRM). While the CRM framework is appealing and well studied for discrete actions, the continuous action case rais…

    Submitted 14 December, 2022; v1 submitted 22 April, 2020; originally announced April 2020.

  36. arXiv:2003.09338

    cs.CV

    Selecting Relevant Features from a Multi-domain Representation for Few-shot Classification

    Authors: Nikita Dvornik, Cordelia Schmid, Julien Mairal

    Abstract: Popular approaches for few-shot classification consist of first learning a generic data representation based on a large annotated dataset, before adapting the representation to new classes given only a few labeled samples. In this work, we propose a new strategy based on feature selection, which is both simpler and more effective than previous feature adaptation approaches. First, we obtain a mult…

    Submitted 20 July, 2020; v1 submitted 20 March, 2020; originally announced March 2020.

    Comments: ECCV'20

  37. arXiv:2003.05189

    stat.ML cs.LG

    Convolutional Kernel Networks for Graph-Structured Data

    Authors: Dexiong Chen, Laurent Jacob, Julien Mairal

    Abstract: We introduce a family of multilayer graph kernels and establish new links between graph convolutional neural networks and kernel methods. Our approach generalizes convolutional kernel networks to graph-structured data, by representing graphs as a sequence of kernel feature maps, where each node carries information about local graph substructures. On the one hand, the kernel point of view offers an…

    Submitted 29 June, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

    Report number: hal-02151135

    Journal ref: International Conference on Machine Learning (ICML), Jul 2020

  38. arXiv:2001.03554

    cs.CV cs.LG cs.NE

    Pruning Convolutional Neural Networks with Self-Supervision

    Authors: Mathilde Caron, Ari Morcos, Piotr Bojanowski, Julien Mairal, Armand Joulin

    Abstract: Convolutional neural networks trained without supervision come close to matching performance with supervised pre-training, but sometimes at the cost of an even higher number of parameters. Extracting subnetworks from these large unsupervised convnets with preserved performance is of particular interest to make them less computationally intensive. Typical pruning methods operate during training on…

    Submitted 10 January, 2020; originally announced January 2020.

  39. arXiv:1912.08165

    stat.ML cs.LG

    Cyanure: An Open-Source Toolbox for Empirical Risk Minimization for Python, C++, and soon more

    Authors: Julien Mairal

    Abstract: Cyanure is an open-source C++ software package with a Python interface. The goal of Cyanure is to provide state-of-the-art solvers for learning linear models, based on stochastic variance-reduced stochastic optimization with acceleration mechanisms. Cyanure can handle a large variety of loss functions (logistic, square, squared hinge, multinomial logistic) and regularization functions (l_2, l_1, e…

    Submitted 20 December, 2019; v1 submitted 17 December, 2019; originally announced December 2019.

    Comments: http://julien.mairal.org/cyanure/welcome.html

  40. arXiv:1912.02566

    cs.LG stat.ML

    Screening Data Points in Empirical Risk Minimization via Ellipsoidal Regions and Safe Loss Functions

    Authors: Grégoire Mialon, Alexandre d'Aspremont, Julien Mairal

    Abstract: We design simple screening tests to automatically discard data samples in empirical risk minimization without losing optimization guarantees. We derive loss functions that produce dual objectives with a sparse solution. We also show how to regularize convex losses to ensure such a dual sparsity-inducing property, and propose a general method to design screening tests for classification or regressi…

    Submitted 12 June, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: AISTATS 2020

  41. arXiv:1912.02456

    cs.CV

    Fully Trainable and Interpretable Non-Local Sparse Models for Image Restoration

    Authors: Bruno Lecouat, Jean Ponce, Julien Mairal

    Abstract: Non-local self-similarity and sparsity principles have proven to be powerful priors for natural image modeling. We propose a novel differentiable relaxation of joint sparsity that exploits both principles and leads to a general framework for image restoration which is (1) trainable end to end, (2) fully interpretable, and (3) much more compact than competing deep learning architectures. We apply t…

    Submitted 20 August, 2020; v1 submitted 5 December, 2019; originally announced December 2019.

    Comments: ECCV 2020

  42. arXiv:1906.03200

    stat.ML cs.LG

    Recurrent Kernel Networks

    Authors: Dexiong Chen, Laurent Jacob, Julien Mairal

    Abstract: Substring kernels are classical tools for representing biological sequences or text. However, when large amounts of annotated data are available, models that allow end-to-end training such as neural networks are often preferred. Links between recurrent neural networks (RNNs) and substring kernels have recently been drawn, by formally showing that RNNs with specific activation functions were points…

    Submitted 17 October, 2019; v1 submitted 7 June, 2019; originally announced June 2019.

    Report number: hal-02151135

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS), Dec 2019, Vancouver, Canada

  43. arXiv:1906.01164  [pdf, other]

    math.OC cs.LG stat.ML

    A Generic Acceleration Framework for Stochastic Composite Optimization

    Authors: Andrei Kulunchakov, Julien Mairal

    Abstract: In this paper, we introduce various mechanisms to obtain accelerated first-order stochastic optimization algorithms when the objective function is convex or strongly convex. Specifically, we extend the Catalyst approach originally designed for deterministic objectives to the stochastic setting. Given an optimization method with mild convergence guarantees for strongly convex problems, the challeng…

    Submitted 9 October, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Journal ref: Advances in Neural Information Processing Systems (NeurIPS), Dec 2019, Vancouver, Canada

  44. arXiv:1905.12173  [pdf, other]

    stat.ML cs.LG

    On the Inductive Bias of Neural Tangent Kernels

    Authors: Alberto Bietti, Julien Mairal

    Abstract: State-of-the-art neural networks are heavily over-parameterized, making the optimization algorithm a crucial ingredient for learning predictive models with good generalization properties. A recent line of work has shown that in a certain over-parameterized regime, the learning dynamics of gradient descent are governed by a certain kernel obtained at initialization, called the neural tangent kernel…

    Submitted 31 October, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: NeurIPS 2019

  45. arXiv:1905.02374  [pdf, other]

    stat.ML cs.LG math.OC

    Estimate Sequences for Variance-Reduced Stochastic Composite Optimization

    Authors: Andrei Kulunchakov, Julien Mairal

    Abstract: In this paper, we propose a unified view of gradient-based algorithms for stochastic convex composite optimization by extending the concept of estimate sequence introduced by Nesterov. This point of view covers the stochastic gradient descent method, variants of the approaches SAGA, SVRG, and has several advantages: (i) we provide a generic proof of convergence for the aforementioned methods; (ii)…

    Submitted 7 May, 2019; originally announced May 2019.

    Comments: Short version of preprint arXiv:1901.08788

    Journal ref: International Conference on Machine Learning (ICML), Jun 2019, Long Beach, United States

  46. arXiv:1905.01278  [pdf, other]

    cs.CV

    Unsupervised Pre-Training of Image Features on Non-Curated Data

    Authors: Mathilde Caron, Piotr Bojanowski, Julien Mairal, Armand Joulin

    Abstract: Pre-training general-purpose visual features with convolutional neural networks without relying on annotations is a challenging and important task. Most recent efforts in unsupervised feature learning have focused on either small or highly curated datasets like ImageNet, whereas using uncurated raw datasets was found to decrease the feature quality when evaluated on a transfer task. Our goal is to…

    Submitted 13 August, 2019; v1 submitted 3 May, 2019; originally announced May 2019.

    Comments: Accepted at ICCV 2019 (Oral)

  47. arXiv:1903.11341  [pdf, other]

    cs.CV cs.AI

    Diversity with Cooperation: Ensemble Methods for Few-Shot Classification

    Authors: Nikita Dvornik, Cordelia Schmid, Julien Mairal

    Abstract: Few-shot classification consists of learning a predictive model that is able to effectively adapt to a new class, given only a few annotated samples. To solve this challenging problem, meta-learning has become a popular paradigm that advocates the ability to "learn to adapt". Recent works have shown, however, that simple learning strategies without meta-learning could be competitive. In this paper…

    Submitted 30 August, 2019; v1 submitted 27 March, 2019; originally announced March 2019.

    Comments: Added experiments for different network architectures across different input image resolutions

  48. arXiv:1901.08788  [pdf, other]

    stat.ML cs.LG math.OC

    Estimate Sequences for Stochastic Composite Optimization: Variance Reduction, Acceleration, and Robustness to Noise

    Authors: Andrei Kulunchakov, Julien Mairal

    Abstract: In this paper, we propose a unified view of gradient-based algorithms for stochastic convex composite optimization by extending the concept of estimate sequence introduced by Nesterov. More precisely, we interpret a large class of stochastic optimization methods as procedures that iteratively minimize a surrogate of the objective, which covers the stochastic gradient descent method and variants of…

    Submitted 4 September, 2020; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: Journal of Machine Learning Research, Microtome Publishing, In press

  49. arXiv:1810.00363  [pdf, other]

    stat.ML cs.LG

    A Kernel Perspective for Regularizing Deep Neural Networks

    Authors: Alberto Bietti, Grégoire Mialon, Dexiong Chen, Julien Mairal

    Abstract: We propose a new point of view for regularizing deep neural networks by using the norm of a reproducing kernel Hilbert space (RKHS). Even though this norm cannot be computed, it admits upper and lower approximations leading to various practical strategies. Specifically, this perspective (i) provides a common umbrella for many existing regularization principles, including spectral norm and gradient…

    Submitted 13 May, 2019; v1 submitted 30 September, 2018; originally announced October 2018.

    Comments: ICML

  50. arXiv:1809.06035  [pdf, other]

    stat.ML cs.CV cs.LG q-bio.QM

    Extracting representations of cognition across neuroimaging studies improves brain decoding

    Authors: Arthur Mensch, Julien Mairal, Bertrand Thirion, Gaël Varoquaux

    Abstract: Cognitive brain imaging is accumulating datasets about the neural substrate of many different mental processes. Yet, most studies are based on few subjects and have low statistical power. Analyzing data across studies could bring more statistical power; yet the current brain-imaging analytic framework cannot be used at scale as it requires casting all cognitive tasks in a unified theoretical frame…

    Submitted 19 May, 2021; v1 submitted 17 September, 2018; originally announced September 2018.

    Journal ref: PLoS Computational Biology, Public Library of Science, 2021