Skip to main content

Showing 51–91 of 91 results for author: Mairal, J

.
  1. arXiv:1901.08788  [pdf, other

    stat.ML cs.LG math.OC

    Estimate Sequences for Stochastic Composite Optimization: Variance Reduction, Acceleration, and Robustness to Noise

    Authors: Andrei Kulunchakov, Julien Mairal

    Abstract: In this paper, we propose a unified view of gradient-based algorithms for stochastic convex composite optimization by extending the concept of estimate sequence introduced by Nesterov. More precisely, we interpret a large class of stochastic optimization methods as procedures that iteratively minimize a surrogate of the objective, which covers the stochastic gradient descent method and variants of… ▽ More

    Submitted 4 September, 2020; v1 submitted 25 January, 2019; originally announced January 2019.

    Comments: Journal of Machine Learning Research, Microtome Publishing, In press

  2. arXiv:1810.00363  [pdf, other

    stat.ML cs.LG

    A Kernel Perspective for Regularizing Deep Neural Networks

    Authors: Alberto Bietti, Grégoire Mialon, Dexiong Chen, Julien Mairal

    Abstract: We propose a new point of view for regularizing deep neural networks by using the norm of a reproducing kernel Hilbert space (RKHS). Even though this norm cannot be computed, it admits upper and lower approximations leading to various practical strategies. Specifically, this perspective (i) provides a common umbrella for many existing regularization principles, including spectral norm and gradient… ▽ More

    Submitted 13 May, 2019; v1 submitted 30 September, 2018; originally announced October 2018.

    Comments: ICML

  3. arXiv:1809.06035  [pdf, other

    stat.ML cs.CV cs.LG q-bio.QM

    Extracting representations of cognition across neuroimaging studies improves brain decoding

    Authors: Arthur Mensch, Julien Mairal, Bertrand Thirion, Gaël Varoquaux

    Abstract: Cognitive brain imaging is accumulating datasets about the neural substrate of many different mental processes. Yet, most studies are based on few subjects and have low statistical power. Analyzing data across studies could bring more statistical power; yet the current brain-imaging analytic framework cannot be used at scale as it requires casting all cognitive tasks in a unified theoretical frame… ▽ More

    Submitted 19 May, 2021; v1 submitted 17 September, 2018; originally announced September 2018.

    Journal ref: PLoS Computational Biology, Public Library of Science, 2021

  4. arXiv:1809.02492  [pdf, other

    cs.CV

    On the Importance of Visual Context for Data Augmentation in Scene Understanding

    Authors: Nikita Dvornik, Julien Mairal, Cordelia Schmid

    Abstract: Performing data augmentation for learning deep neural networks is known to be important for training visual recognition systems. By artificially increasing the number of training examples, it helps reducing overfitting and improves generalization. While simple image transformations can already improve predictive performance in most vision tasks, larger gains can be obtained by leveraging task-spec… ▽ More

    Submitted 19 September, 2019; v1 submitted 6 September, 2018; originally announced September 2018.

    Comments: Updated the experimental section. arXiv admin note: substantial text overlap with arXiv:1807.07428

  5. arXiv:1807.07428  [pdf, other

    cs.CV

    Modeling Visual Context is Key to Augmenting Object Detection Datasets

    Authors: Nikita Dvornik, Julien Mairal, Cordelia Schmid

    Abstract: Performing data augmentation for learning deep neural networks is well known to be important for training visual recognition systems. By artificially increasing the number of training examples, it helps reducing overfitting and improves generalization. For object detection, classical approaches for data augmentation consist of generating images obtained by basic geometrical transformations and col… ▽ More

    Submitted 19 July, 2018; originally announced July 2018.

    Journal ref: ECCV2018, Sep 2018, Munich, Germany. 2018

  6. arXiv:1805.11155  [pdf, other

    stat.ML cs.CV cs.LG

    Unsupervised Learning of Artistic Styles with Archetypal Style Analysis

    Authors: Daan Wynen, Cordelia Schmid, Julien Mairal

    Abstract: In this paper, we introduce an unsupervised learning approach to automatically discover, summarize, and manipulate artistic styles from large collections of paintings. Our method is based on archetypal analysis, which is an unsupervised learning technique akin to sparse coding with a geometric interpretation. When applied to deep image representations from a collection of artworks, it learns a dic… ▽ More

    Submitted 2 October, 2018; v1 submitted 28 May, 2018; originally announced May 2018.

    Comments: Accepted at NIPS 2018, Montréal, Canada

  7. arXiv:1712.05654  [pdf, other

    stat.ML math.OC

    Catalyst Acceleration for First-order Convex Optimization: from Theory to Practice

    Authors: Hongzhou Lin, Julien Mairal, Zaid Harchaoui

    Abstract: We introduce a generic scheme for accelerating gradient-based optimization methods in the sense of Nesterov. The approach, called Catalyst, builds upon the inexact accelerated proximal point algorithm for minimizing a convex objective function, and consists of approximately solving a sequence of well-chosen auxiliary problems, leading to faster convergence. One of the keys to achieve acceleration… ▽ More

    Submitted 19 June, 2018; v1 submitted 15 December, 2017; originally announced December 2017.

    Comments: link to publisher website: http://jmlr.org/papers/volume18/17-748/17-748.pdf

    Journal ref: Journal of Machine Learning Research (JMLR), 18(212):1--54, 2018

  8. arXiv:1710.11438  [pdf, other

    stat.ML cs.LG q-bio.NC

    Learning Neural Representations of Human Cognition across Many fMRI Studies

    Authors: Arthur Mensch, Julien Mairal, Danilo Bzdok, Bertrand Thirion, Gaël Varoquaux

    Abstract: Cognitive neuroscience is enjoying rapid increase in extensive public brain-imaging datasets. It opens the door to large-scale statistical models. Finding a unified perspective for all available data calls for scalable and automated solutions to an old challenge: how to aggregate heterogeneous information on brain function into a universal cognitive system that relates mental operations/cognitive… ▽ More

    Submitted 10 November, 2017; v1 submitted 31 October, 2017; originally announced October 2017.

    Comments: Advances in Neural Information Processing Systems, Dec 2017, Long Beach, United States. 2017

    Journal ref: Advances in Neural Information Processing Systems, 2017

  9. arXiv:1708.02813  [pdf, other

    cs.CV

    BlitzNet: A Real-Time Deep Network for Scene Understanding

    Authors: Nikita Dvornik, Konstantin Shmelkov, Julien Mairal, Cordelia Schmid

    Abstract: Real-time scene understanding has become crucial in many applications such as autonomous driving. In this paper, we propose a deep architecture, called BlitzNet, that jointly performs object detection and semantic segmentation in one forward pass, allowing real-time computations. Besides the computational gain of having a single network to perform several tasks, we show that object detection and s… ▽ More

    Submitted 9 August, 2017; originally announced August 2017.

  10. arXiv:1706.03078  [pdf, other

    stat.ML cs.LG

    Group Invariance, Stability to Deformations, and Complexity of Deep Convolutional Representations

    Authors: Alberto Bietti, Julien Mairal

    Abstract: The success of deep convolutional architectures is often attributed in part to their ability to learn multiscale and invariant representations of natural signals. However, a precise study of these properties and how they affect learning guarantees is still missing. In this paper, we consider deep convolutional representations of signals; we study their invariance to translations and to more genera… ▽ More

    Submitted 10 October, 2018; v1 submitted 9 June, 2017; originally announced June 2017.

    Journal ref: Journal of Machine Learning Research 20 (2019) 1-49

  11. arXiv:1703.10993  [pdf, other

    stat.ML math.OC

    Catalyst Acceleration for Gradient-Based Non-Convex Optimization

    Authors: Courtney Paquette, Hongzhou Lin, Dmitriy Drusvyatskiy, Julien Mairal, Zaid Harchaoui

    Abstract: We introduce a generic scheme to solve nonconvex optimization problems using gradient-based algorithms originally designed for minimizing convex functions. Even though these methods may originally require convexity to operate, the proposed approach allows one to use them on weakly convex objectives, which covers a large class of non-convex functions typically appearing in machine learning and sign… ▽ More

    Submitted 31 December, 2018; v1 submitted 31 March, 2017; originally announced March 2017.

  12. arXiv:1701.05363  [pdf, other

    stat.ML cs.LG math.OC q-bio.NC

    Stochastic Subsampling for Factorizing Huge Matrices

    Authors: Arthur Mensch, Julien Mairal, Bertrand Thirion, Gael Varoquaux

    Abstract: We present a matrix-factorization algorithm that scales to input matrices with both huge number of rows and columns. Learned factors may be sparse or dense and/or non-negative, which makes our algorithm suitable for dictionary learning, sparse component analysis, and non-negative matrix factorization. Our algorithm streams matrix columns while subsampling them to iteratively learn the matrix facto… ▽ More

    Submitted 30 October, 2017; v1 submitted 19 January, 2017; originally announced January 2017.

    Comments: IEEE Transactions on Signal Processing, Institute of Electrical and Electronics Engineers, A Paraître

    Journal ref: IEEE Transactions on Signal Processing, 2018, 66 (1), pp 113-128

  13. arXiv:1611.10041  [pdf, other

    math.OC cs.LG stat.ML

    Subsampled online matrix factorization with convergence guarantees

    Authors: Arthur Mensch, Julien Mairal, Gaël Varoquaux, Bertrand Thirion

    Abstract: We present a matrix factorization algorithm that scales to input matrices that are large in both dimensions (i.e., that contains morethan 1TB of data). The algorithm streams the matrix columns while subsampling them, resulting in low complexity per iteration andreasonable memory footprint. In contrast to previous online matrix factorization methods, our approach relies on low-dimensional statistic… ▽ More

    Submitted 30 November, 2016; originally announced November 2016.

    Journal ref: 9th NIPS Workshop on Optimization for Machine Learning, Dec 2016, Barcelone, Spain

  14. arXiv:1610.00970  [pdf, other

    stat.ML cs.LG math.OC

    Stochastic Optimization with Variance Reduction for Infinite Datasets with Finite-Sum Structure

    Authors: Alberto Bietti, Julien Mairal

    Abstract: Stochastic optimization algorithms with variance reduction have proven successful for minimizing large finite sums of functions. Unfortunately, these techniques are unable to deal with stochastic perturbations of input data, induced for example by data augmentation. In such cases, the objective is no longer a finite sum, and the main candidate for optimization is the stochastic gradient descent me… ▽ More

    Submitted 15 November, 2017; v1 submitted 4 October, 2016; originally announced October 2016.

    Comments: Advances in Neural Information Processing Systems (NIPS), Dec 2017, Long Beach, CA, United States

  15. arXiv:1610.00960  [pdf, other

    stat.ML math.OC

    An Inexact Variable Metric Proximal Point Algorithm for Generic Quasi-Newton Acceleration

    Authors: Hongzhou Lin, Julien Mairal, Zaid Harchaoui

    Abstract: We propose an inexact variable-metric proximal point algorithm to accelerate gradient-based optimization algorithms. The proposed scheme, called QNing can be notably applied to incremental first-order methods such as the stochastic variance-reduced gradient descent algorithm (SVRG) and other randomized incremental optimization algorithms. QNing is also compatible with composite objectives, meaning… ▽ More

    Submitted 29 January, 2019; v1 submitted 4 October, 2016; originally announced October 2016.

    Comments: to appear in SIAM Journal on Optimization

  16. arXiv:1605.06265  [pdf, other

    stat.ML cs.CV cs.LG

    End-to-End Kernel Learning with Supervised Convolutional Kernel Networks

    Authors: Julien Mairal

    Abstract: In this paper, we introduce a new image representation based on a multilayer kernel machine. Unlike traditional kernel methods where data representation is decoupled from the prediction task, we learn how to shape the kernel with supervision. We proceed by first proposing improvements of the recently-introduced convolutional kernel networks (CKNs) in the context of unsupervised learning; then, we… ▽ More

    Submitted 25 October, 2016; v1 submitted 20 May, 2016; originally announced May 2016.

    Comments: to appear in Advances in Neural Information Processing Systems (NIPS)

  17. arXiv:1605.00937  [pdf, other

    stat.ML cs.LG q-bio.QM

    Dictionary Learning for Massive Matrix Factorization

    Authors: Arthur Mensch, Julien Mairal, Bertrand Thirion, Gaël Varoquaux

    Abstract: Sparse matrix factorization is a popular tool to obtain interpretable data decompositions, which are also effective to perform data completion or denoising. Its applicability to large datasets has been addressed with online and randomized methods, that reduce the complexity in one of the matrix dimension, but not in both of them. In this paper, we tackle very large matrices in both dimensions. We… ▽ More

    Submitted 26 May, 2016; v1 submitted 3 May, 2016; originally announced May 2016.

    Journal ref: Proceedings of the International Conference on Machine Learning, 2016, pp 1737-1746

  18. arXiv:1603.00438  [pdf, other

    cs.CV

    Convolutional Patch Representations for Image Retrieval: an Unsupervised Approach

    Authors: Mattis Paulin, Julien Mairal, Matthijs Douze, Zaid Harchaoui, Florent Perronnin, Cordelia Schmid

    Abstract: Convolutional neural networks (CNNs) have recently received a lot of attention due to their ability to model local stationary structures in natural images in a multi-scale fashion, when learning all model parameters with supervision. While excellent performance was achieved for image classification when large amounts of labeled visual data are available, their success for un-supervised tasks such… ▽ More

    Submitted 1 March, 2016; originally announced March 2016.

  19. arXiv:1602.02263  [pdf, other

    math.OC cs.IT cs.LG stat.ML

    DOLPHIn - Dictionary Learning for Phase Retrieval

    Authors: Andreas M. Tillmann, Yonina C. Eldar, Julien Mairal

    Abstract: We propose a new algorithm to learn a dictionary for reconstructing and sparsely encoding signals from measurements without phase. Specifically, we consider the task of estimating a two-dimensional image from squared-magnitude measurements of a complex-valued linear transformation of the original image. Several recent phase retrieval algorithms exploit underlying sparsity of the unknown signal in… ▽ More

    Submitted 3 August, 2016; v1 submitted 6 February, 2016; originally announced February 2016.

  20. arXiv:1506.02186  [pdf, ps, other

    math.OC

    A Universal Catalyst for First-Order Optimization

    Authors: Hongzhou Lin, Julien Mairal, Zaid Harchaoui

    Abstract: We introduce a generic scheme for accelerating first-order optimization methods in the sense of Nesterov, which builds upon a new analysis of the accelerated proximal point algorithm. Our approach consists of minimizing a convex objective by approximately solving a sequence of well-chosen auxiliary problems, leading to faster convergence. This strategy applies to a large class of algorithms, inclu… ▽ More

    Submitted 25 October, 2015; v1 submitted 6 June, 2015; originally announced June 2015.

    Comments: to appear in Advances in Neural Information Processing Systems (NIPS)

  21. arXiv:1411.3230  [pdf, other

    cs.CV

    Sparse Modeling for Image and Vision Processing

    Authors: Julien Mairal, Francis Bach, Jean Ponce

    Abstract: In recent years, a large amount of multi-disciplinary research has been conducted on sparse models and their applications. In statistics and machine learning, the sparsity principle is used to perform model selection---that is, automatically selecting a simple model among a large collection of them. In signal processing, sparse coding consists of representing data with linear combinations of a few… ▽ More

    Submitted 6 December, 2014; v1 submitted 12 November, 2014; originally announced November 2014.

    Comments: 205 pages, to appear in Foundations and Trends in Computer Graphics and Vision

  22. arXiv:1406.3332  [pdf, ps, other

    cs.CV cs.LG stat.ML

    Convolutional Kernel Networks

    Authors: Julien Mairal, Piotr Koniusz, Zaid Harchaoui, Cordelia Schmid

    Abstract: An important goal in visual recognition is to devise image representations that are invariant to particular transformations. In this paper, we address this goal with a new type of convolutional neural network (CNN) whose invariance is encoded by a reproducing kernel. Unlike traditional approaches where neural networks are learned either to represent data or for solving a classification task, our n… ▽ More

    Submitted 14 November, 2014; v1 submitted 12 June, 2014; originally announced June 2014.

    Comments: appears in Advances in Neural Information Processing Systems (NIPS), Dec 2014, Montreal, Canada, http://nips.cc

  23. arXiv:1405.6472  [pdf, other

    cs.CV cs.LG stat.ML

    Fast and Robust Archetypal Analysis for Representation Learning

    Authors: Yuansi Chen, Julien Mairal, Zaid Harchaoui

    Abstract: We revisit a pioneer unsupervised learning technique called archetypal analysis, which is related to successful data analysis methods such as sparse coding and non-negative matrix factorization. Since it was proposed, archetypal analysis did not gain a lot of popularity even though it produces more interpretable models than other alternatives. Because no efficient implementation has ever been made… ▽ More

    Submitted 26 May, 2014; originally announced May 2014.

    Journal ref: CVPR 2014 - IEEE Conference on Computer Vision \& Pattern Recognition (2014)

  24. arXiv:1403.1024  [pdf, other

    cs.CV cs.LG

    On learning to localize objects with minimal supervision

    Authors: Hyun Oh Song, Ross Girshick, Stefanie Jegelka, Julien Mairal, Zaid Harchaoui, Trevor Darrell

    Abstract: Learning to localize objects with minimal supervision is an important problem in computer vision, since large fully annotated datasets are extremely costly to obtain. In this paper, we propose a new method that achieves this goal with only image-level labels of whether the objects are present or not. Our approach combines a discriminative submodular cover problem for automatically discovering a se… ▽ More

    Submitted 15 May, 2014; v1 submitted 5 March, 2014; originally announced March 2014.

  25. arXiv:1402.4419  [pdf, ps, other

    math.OC cs.LG stat.ML

    Incremental Majorization-Minimization Optimization with Application to Large-Scale Machine Learning

    Authors: Julien Mairal

    Abstract: Majorization-minimization algorithms consist of successively minimizing a sequence of upper bounds of the objective function. These upper bounds are tight at the current estimate, and each iteration monotonically drives the objective function downhill. Such a simple principle is widely applicable and has been very popular in various scientific fields, especially in signal processing and statistics… ▽ More

    Submitted 1 February, 2015; v1 submitted 18 February, 2014; originally announced February 2014.

    Comments: to appear in SIAM Journal on Optimization; final author's version

  26. arXiv:1306.4650  [pdf, ps, other

    stat.ML cs.LG math.OC

    Stochastic Majorization-Minimization Algorithms for Large-Scale Optimization

    Authors: Julien Mairal

    Abstract: Majorization-minimization algorithms consist of iteratively minimizing a majorizing surrogate of an objective function. Because of its simplicity and its wide applicability, this principle has been very popular in statistics and in signal processing. In this paper, we intend to make this principle scalable. We introduce a stochastic majorization-minimization scheme which is able to deal with large… ▽ More

    Submitted 10 September, 2013; v1 submitted 19 June, 2013; originally announced June 2013.

    Comments: accepted for publication for Neural Information Processing Systems (NIPS) 2013. This is the 9-pages version followed by 16 pages of appendices. The title has changed compared to the first technical report

  27. arXiv:1305.3120  [pdf, ps, other

    stat.ML cs.LG math.OC

    Optimization with First-Order Surrogate Functions

    Authors: Julien Mairal

    Abstract: In this paper, we study optimization methods consisting of iteratively minimizing surrogates of an objective function. By proposing several algorithmic variants and simple convergence analyses, we make two main contributions. First, we provide a unified viewpoint for several first-order optimization techniques such as accelerated proximal gradient, block coordinate descent, or Frank-Wolfe algorith… ▽ More

    Submitted 14 May, 2013; originally announced May 2013.

    Comments: to appear in the proceedings of ICML 2013; the arxiv paper contains the 9 pages main text followed by 26 pages of supplemental material. International Conference on Machine Learning (ICML 2013) (2013)

  28. arXiv:1205.0079  [pdf, ps, other

    stat.ML cs.LG math.OC

    Complexity Analysis of the Lasso Regularization Path

    Authors: Julien Mairal, Bin Yu

    Abstract: The regularization path of the Lasso can be shown to be piecewise linear, making it possible to "follow" and explicitly compute the entire path. We analyze in this paper this popular strategy, and prove that its worst case complexity is exponential in the number of variables. We then oppose this pessimistic result to an (optimistic) approximate analysis: We show that an approximate path with at mo… ▽ More

    Submitted 19 May, 2012; v1 submitted 30 April, 2012; originally announced May 2012.

    Comments: To appear in the proceedings of 29th International Conference on Machine Learning (ICML 2012)

  29. arXiv:1204.4539  [pdf, ps, other

    stat.ML cs.LG math.OC

    Supervised Feature Selection in Graphs with Path Coding Penalties and Network Flows

    Authors: Julien Mairal, Bin Yu

    Abstract: We consider supervised learning problems where the features are embedded in a graph, such as gene expressions in a gene network. In this context, it is of much interest to automatically select a subgraph with few connected components; by exploiting prior knowledge, one can indeed improve the prediction performance or obtain results that are easier to interpret. Regularization or penalty functions… ▽ More

    Submitted 29 August, 2013; v1 submitted 20 April, 2012; originally announced April 2012.

    Comments: 37 pages; to appear in the Journal of Machine Learning Research (JMLR)

    Journal ref: Journal of Machine Learning Research 14(Aug) (2013) 2449-2485

  30. Learning Hierarchical and Topographic Dictionaries with Structured Sparsity

    Authors: Julien Mairal, Rodolphe Jenatton, Guillaume Obozinski, Francis Bach

    Abstract: Recent work in signal processing and statistics have focused on defining new regularization functions, which not only induce sparsity of the solution, but also take into account the structure of the problem. We present in this paper a class of convex penalties introduced in the machine learning community, which take the form of a sum of l_2 and l_infinity-norms over groups of variables. They exten… ▽ More

    Submitted 20 October, 2011; originally announced October 2011.

    Journal ref: SPIE Wavelets and Sparsity XIV 81381P (2011)

  31. arXiv:1110.2855  [pdf, other

    cs.LG cs.CV stat.ML

    Sparse Image Representation with Epitomes

    Authors: Louise Benoît, Julien Mairal, Francis Bach, Jean Ponce

    Abstract: Sparse coding, which is the decomposition of a vector using only a few basis elements, is widely used in machine learning and image processing. The basis set, also called dictionary, is learned to adapt to specific data. This approach has proven to be very effective in many image processing tasks. Traditionally, the dictionary is an unstructured "flat" set of atoms. In this paper, we study structu… ▽ More

    Submitted 13 October, 2011; originally announced October 2011.

    Comments: Computer Vision and Pattern Recognition, Colorado Springs : United States (2011)

    Journal ref: Computer Vision and Pattern Recognition, Colorado Springs : États-Unis (2011)

  32. arXiv:1110.0957  [pdf, ps, other

    cs.LG cs.CV

    Dictionary Learning for Deblurring and Digital Zoom

    Authors: Florent Couzinie-Devy, Julien Mairal, Francis Bach, Jean Ponce

    Abstract: This paper proposes a novel approach to image deblurring and digital zooming using sparse local models of image appearance. These models, where small image patches are represented as linear combinations of a few elements drawn from some large set (dictionary) of candidates, have proven well adapted to several image restoration tasks. A key to their success has been to learn dictionaries adapted to… ▽ More

    Submitted 5 October, 2011; originally announced October 2011.

  33. arXiv:1109.2397  [pdf, ps, other

    cs.LG stat.ML

    Structured sparsity through convex optimization

    Authors: Francis Bach, Rodolphe Jenatton, Julien Mairal, Guillaume Obozinski

    Abstract: Sparse estimation methods are aimed at using or obtaining parsimonious representations of data or models. While naturally cast as a combinatorial optimization problem, variable or feature selection admits a convex relaxation through the regularization by the $\ell_1$-norm. In this paper, we consider situations where we are not only interested in sparsity, but where some structural prior knowledge… ▽ More

    Submitted 20 April, 2012; v1 submitted 12 September, 2011; originally announced September 2011.

    Comments: Statistical Science (2012) To appear

  34. arXiv:1108.0775  [pdf, ps, other

    cs.LG math.OC stat.ML

    Optimization with Sparsity-Inducing Penalties

    Authors: Francis Bach, Rodolphe Jenatton, Julien Mairal, Guillaume Obozinski

    Abstract: Sparse estimation methods are aimed at using or obtaining parsimonious representations of data or models. They were first dedicated to linear variable selection but numerous extensions have now emerged such as structured sparsity or kernel selection. It turns out that many of the related estimation problems can be cast as convex optimization problems by regularizing the empirical risk with appropr… ▽ More

    Submitted 22 November, 2011; v1 submitted 3 August, 2011; originally announced August 2011.

  35. arXiv:1104.1872  [pdf, ps, other

    math.OC cs.LG stat.ML

    Convex and Network Flow Optimization for Structured Sparsity

    Authors: Julien Mairal, Rodolphe Jenatton, Guillaume Obozinski, Francis Bach

    Abstract: We consider a class of learning problems regularized by a structured sparsity-inducing norm defined as the sum of l_2- or l_infinity-norms over groups of variables. Whereas much effort has been put in develo** fast optimization techniques when the groups are disjoint or embedded in a hierarchy, we address here the case of general overlap** groups. To this end, we present two different strategi… ▽ More

    Submitted 16 September, 2011; v1 submitted 11 April, 2011; originally announced April 2011.

    Comments: to appear in the Journal of Machine Learning Research (JMLR)

    Journal ref: Journal of Machine Learning Research 12 (2011) 2681?2720

  36. Task-Driven Dictionary Learning

    Authors: Julien Mairal, Francis Bach, Jean Ponce

    Abstract: Modeling data with linear combinations of a few elements from a learned dictionary has been the focus of much recent research in machine learning, neuroscience and signal processing. For signals such as natural images that admit such sparse representations, it is now well established that these models are well suited to restoration tasks. In this context, learning the dictionary amounts to solving… ▽ More

    Submitted 9 September, 2013; v1 submitted 27 September, 2010; originally announced September 2010.

    Comments: final draft post-refereeing

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence 34, 4 (2012) 30

  37. arXiv:1009.2139  [pdf, ps, other

    stat.ML

    Proximal Methods for Hierarchical Sparse Coding

    Authors: Rodolphe Jenatton, Julien Mairal, Guillaume Obozinski, Francis Bach

    Abstract: Sparse coding consists in representing signals as sparse linear combinations of atoms selected from a dictionary. We consider an extension of this framework where the atoms are further assumed to be embedded in a tree. This is achieved using a recently introduced tree-structured sparse regularization norm, which has proven useful in several applications. This norm leads to regularized problems tha… ▽ More

    Submitted 5 July, 2011; v1 submitted 11 September, 2010; originally announced September 2010.

    Journal ref: Journal of Machine Learning Research, 12 (2011) 2297-2334

  38. arXiv:1008.5209  [pdf, ps, other

    cs.LG stat.ML

    Network Flow Algorithms for Structured Sparsity

    Authors: Julien Mairal, Rodolphe Jenatton, Guillaume Obozinski, Francis Bach

    Abstract: We consider a class of learning problems that involve a structured sparsity-inducing norm defined as the sum of $\ell_\infty$-norms over groups of variables. Whereas a lot of effort has been put in develo** fast optimization methods when the groups are disjoint or embedded in a specific hierarchical structure, we address here the case of general overlap** groups. To this end, we show that the… ▽ More

    Submitted 30 August, 2010; originally announced August 2010.

    Comments: accepted for publication in Adv. Neural Information Processing Systems, 2010

    Report number: RR-7372

  39. arXiv:0908.0050  [pdf, ps, other

    stat.ML cs.LG math.OC

    Online Learning for Matrix Factorization and Sparse Coding

    Authors: Julien Mairal, Francis Bach, Jean Ponce, Guillermo Sapiro

    Abstract: Sparse coding--that is, modelling data vectors as sparse linear combinations of basis elements--is widely used in machine learning, neuroscience, signal processing, and statistics. This paper focuses on the large-scale matrix factorization problem that consists of learning the basis set, adapting it to specific data. Variations of this problem include dictionary learning in signal processing, no… ▽ More

    Submitted 11 February, 2010; v1 submitted 1 August, 2009; originally announced August 2009.

    Comments: revised version

    Journal ref: Journal of Machine Learning Research 11 (2010) 19--60

  40. arXiv:0812.1869  [pdf, ps, other

    cs.LG

    Convex Sparse Matrix Factorizations

    Authors: Francis Bach, Julien Mairal, Jean Ponce

    Abstract: We present a convex formulation of dictionary learning for sparse signal decomposition. Convexity is obtained by replacing the usual explicit upper bound on the dictionary size by a convex rank-reducing term similar to the trace norm. In particular, our formulation introduces an explicit trade-off between size and sparsity of the decomposition of rectangular matrices. Using a large set of synthe… ▽ More

    Submitted 10 December, 2008; originally announced December 2008.

  41. arXiv:0809.3083  [pdf, ps, other

    cs.CV

    Supervised Dictionary Learning

    Authors: Julien Mairal, Francis Bach, Jean Ponce, Guillermo Sapiro, Andrew Zisserman

    Abstract: It is now well established that sparse signal models are well suited to restoration tasks and can effectively be learned from audio, image, and video data. Recent research has been aimed at learning discriminative sparse models instead of purely reconstructive ones. This paper proposes a new step in that direction, with a novel sparse representation for signals belonging to different classes in… ▽ More

    Submitted 18 September, 2008; originally announced September 2008.

    Report number: RR-6652