Skip to main content

Showing 1–19 of 19 results for author: Riccietti, E

.
  1. arXiv:2405.15006  [pdf, other

    cs.LG stat.ML

    Path-metrics, pruning, and generalization

    Authors: Antoine Gonon, Nicolas Brisebarre, Elisa Riccietti, Rémi Gribonval

    Abstract: Analyzing the behavior of ReLU neural networks often hinges on understanding the relationships between their parameters and the functions they implement. This paper proves a new bound on function distances in terms of the so-called path-metrics of the parameters. Since this bound is intrinsically invariant with respect to the rescaling symmetries of the networks, it sharpens previously known bound… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2403.13385  [pdf, other

    math.OC astro-ph.IM

    A multilevel framework for accelerating uSARA in radio-interferometric imaging

    Authors: Guillaume Lauga, Audrey Repetti, Elisa Riccietti, Nelly Pustelnik, Paulo Gonçalves, Yves Wiaux

    Abstract: This paper presents a multilevel algorithm specifically designed for radio-interferometric imaging in astronomy. The proposed algorithm is used to solve the uSARA (unconstrained Sparsity Averaging Reweighting Analysis) formulation of this image restoration problem. Multilevel algorithms rely on a hierarchy of approximations of the objective function to accelerate its optimization. In contrast to t… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  3. arXiv:2310.01225  [pdf, other

    stat.ML cs.LG math.ST

    A path-norm toolkit for modern networks: consequences, promises and challenges

    Authors: Antoine Gonon, Nicolas Brisebarre, Elisa Riccietti, Rémi Gribonval

    Abstract: This work introduces the first toolkit around path-norms that fully encompasses general DAG ReLU networks with biases, skip connections and any operation based on the extraction of order statistics: max pooling, GroupSort etc. This toolkit notably allows us to establish generalization bounds for modern neural networks that are not only the most widely applicable path-norm based ones, but also reco… ▽ More

    Submitted 13 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

  4. arXiv:2307.00820  [pdf, other

    math.NA

    Butterfly factorization by algorithmic identification of rank-one blocks

    Authors: Léon Zheng, Gilles Puy, Elisa Riccietti, Patrick Pérez, Rémi Gribonval

    Abstract: Many matrices associated with fast transforms posess a certain low-rank property characterized by the existence of several block partitionings of the matrix, where each block is of low rank. Provided that these partitionings are known, there exist algorithms, called butterfly factorization algorithms, that approximate the matrix into a product of sparse factors, thus enabling a rapid evaluation of… ▽ More

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: in French language. XXIX{è}me Colloque Francophone de Traitement du Signal et des Images, Aug 2023, Grenoble, France

  5. arXiv:2306.02666  [pdf, other

    cs.NE math.AG math.FA math.OC

    Does a sparse ReLU network training problem always admit an optimum?

    Authors: Quoc-Tung Le, Elisa Riccietti, Rémi Gribonval

    Abstract: Given a training set, a loss function, and a neural network architecture, it is often taken for granted that optimal network parameters exist, and a common practice is to apply available optimization algorithms to search for them. In this work, we show that the existence of an optimal solution is not always guaranteed, especially in the context of {\em sparse} ReLU neural networks. In particular,… ▽ More

    Submitted 5 December, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023 - Thirty-seventh Conference on Neural Information Processing Systems, Dec 2023, New Orleans (Lousiane), United States

  6. arXiv:2305.14477  [pdf, other

    cs.LG math.OC

    A Block-Coordinate Approach of Multi-level Optimization with an Application to Physics-Informed Neural Networks

    Authors: Serge Gratton, Valentin Mercier, Elisa Riccietti, Philippe L. Toint

    Abstract: Multi-level methods are widely used for the solution of large-scale problems, because of their computational advantages and exploitation of the complementarity between the involved sub-problems. After a re-interpretation of multi-level methods from a block-coordinate point of view, we propose a multi-level algorithm for the solution of nonlinear optimization problems and analyze its evaluation com… ▽ More

    Submitted 25 May, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  7. arXiv:2304.13329  [pdf, other

    math.OC

    IML FISTA: A Multilevel Framework for Inexact and Inertial Forward-Backward. Application to Image Restoration

    Authors: Guillaume Lauga, Elisa Riccietti, Nelly Pustelnik, Paulo Gonçalves

    Abstract: This paper presents a multilevel framework for inertial and inexact proximal algorithms, that encompasses multilevel versions of classical algorithms such as forward-backward and FISTA. The methods are supported by strong theoretical guarantees: we prove both the rate of convergence and the convergence of the iterates to a minimum in the convex case, an important result for ill-posed problems. We… ▽ More

    Submitted 2 April, 2024; v1 submitted 26 April, 2023; originally announced April 2023.

  8. arXiv:2210.15940  [pdf, other

    math.OC

    Multilevel fista for image restoration

    Authors: Guillaume Lauga, Elisa Riccietti, Nelly Pustelnik, Paulo Gonçalves

    Abstract: This paper presents a multilevel FISTA algorithm, based on the use of the Moreau envelope to build the correction brought by the coarse models, which is easy to compute when the explicit form of the proximal operator of the considered functions is known. This approach is supported by strong theoretical guarantees: we prove both the rate of convergence and the convergence of the iterates to a minim… ▽ More

    Submitted 28 October, 2022; originally announced October 2022.

  9. arXiv:2208.00789  [pdf, other

    cs.CV cs.AI

    Self-supervised learning with rotation-invariant kernels

    Authors: Léon Zheng, Gilles Puy, Elisa Riccietti, Patrick Pérez, Rémi Gribonval

    Abstract: We introduce a regularization loss based on kernel mean embeddings with rotation-invariant kernels on the hypersphere (also known as dot-product kernels) for self-supervised learning of image representations. Besides being fully competitive with the state of the art, our method significantly reduces time and memory complexity for self-supervised training, making it implementable for very large emb… ▽ More

    Submitted 8 March, 2023; v1 submitted 28 July, 2022; originally announced August 2022.

    Journal ref: The Eleventh International Conference on Learning Representations, May 2023, Kigali, Rwanda

  10. arXiv:2205.11874  [pdf, other

    cs.IT cs.NE

    Approximation speed of quantized vs. unquantized ReLU neural networks and beyond

    Authors: Antoine Gonon, Nicolas Brisebarre, Rémi Gribonval, Elisa Riccietti

    Abstract: We deal with two complementary questions about approximation properties of ReLU networks. First, we study how the uniform quantization of ReLU networks with real-valued weights impacts their approximation properties. We establish an upper-bound on the minimal number of bits per coordinate needed for uniformly quantized ReLU networks to keep the same polynomial asymptotic approximation speeds as un… ▽ More

    Submitted 7 October, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

  11. arXiv:2112.00386  [pdf, other

    cs.CC

    Spurious Valleys, NP-hardness, and Tractability of Sparse Matrix Factorization With Fixed Support

    Authors: Quoc-Tung Le, Elisa Riccietti, Rémi Gribonval

    Abstract: The problem of approximating a dense matrix by a product of sparse factors is a fundamental problem for many signal processing and machine learning tasks. It can be decomposed into two subproblems: finding the position of the non-zero coefficients in the sparse factors, and determining their values. While the first step is usually seen as the most challenging one due to its combinatorial nature, t… ▽ More

    Submitted 22 November, 2022; v1 submitted 1 December, 2021; originally announced December 2021.

  12. arXiv:2110.01235  [pdf, other

    cs.LG

    Identifiability in Two-Layer Sparse Matrix Factorization

    Authors: Léon Zheng, Elisa Riccietti, Rémi Gribonval

    Abstract: Sparse matrix factorization is the problem of approximating a matrix $\mathbf{Z}$ by a product of $J$ sparse factors $\mathbf{X}^{(J)} \mathbf{X}^{(J-1)} \ldots \mathbf{X}^{(1)}$. This paper focuses on identifiability issues that appear in this problem, in view of better understanding under which sparsity constraints the problem is well-posed. We give conditions under which the problem of factori… ▽ More

    Submitted 17 November, 2021; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2110.01230

  13. arXiv:2110.01230  [pdf, other

    cs.LG

    Efficient Identification of Butterfly Sparse Matrix Factorizations

    Authors: Léon Zheng, Elisa Riccietti, Rémi Gribonval

    Abstract: Fast transforms correspond to factorizations of the form $\mathbf{Z} = \mathbf{X}^{(1)} \ldots \mathbf{X}^{(J)}$, where each factor $ \mathbf{X}^{(\ell)}$ is sparse and possibly structured. This paper investigates essential uniqueness of such factorizations, i.e., uniqueness up to unavoidable scaling ambiguities. Our main contribution is to prove that any $N \times N$ matrix having the so-called… ▽ More

    Submitted 7 November, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

    Journal ref: SIAM Journal on Mathematics of Data Science, Society for Industrial and Applied Mathematics, In press

  14. arXiv:1912.13427  [pdf, ps, other

    math.OC

    An inexact non stationary Tikhonov procedure for large-scale nonlinear ill-posed problems

    Authors: Stefania Bellavia, Marco Donatelli, Elisa Riccietti

    Abstract: In this work we consider the stable numerical solution of large-scale ill-posed nonlinear least squares problems with nonzero residual. We propose a non-stationary Tikhonov method with inexact step computation, specially designed for large-scale problems. At each iteration the method requires the solution of an elliptical trust-region subproblem to compute the step. This task is carried out employ… ▽ More

    Submitted 1 January, 2020; v1 submitted 31 December, 2019; originally announced December 2019.

  15. arXiv:1911.00026  [pdf, ps, other

    math.NA

    On the iterative solution of systems of the form $A^T A x=A^Tb+c$

    Authors: Henri Calandra, Serge Gratton, Elisa Riccietti, Xavier Vasseur

    Abstract: Given a full column rank matrix $A \in \mathbb{R}^{m\times n}$ ($m\geq n$), we consider a special class of linear systems of the form $A^\top Ax=A^\top b+c$ with $x, c \in \mathbb{R}^{n}$ and $b \in \mathbb{R}^{m}$. The occurrence of $c$ in the right-hand side of the equation prevents the direct application of standard methods for least squares problems. Hence, we investigate alternative solution… ▽ More

    Submitted 31 October, 2019; originally announced November 2019.

  16. arXiv:1909.08099  [pdf, ps, other

    math.OC

    Worst-case Complexity Bounds of Directional Direct-search Methods for Multiobjective Optimization

    Authors: A. L. Custódio, Y. Diouane, R. Garmanjani, E. Riccietti

    Abstract: Direct Multisearch is a well-established class of algorithms, suited for multiobjective derivative-free optimization. In this work, we analyze the worst-case complexity of this class of methods in its most general formulation for unconstrained optimization. Considering nonconvex smooth functions, we show that to drive a given criticality measure below a specific positive threshold, Direct Multisea… ▽ More

    Submitted 3 November, 2020; v1 submitted 17 September, 2019; originally announced September 2019.

  17. arXiv:1904.04692  [pdf, ps, other

    math.NA

    On high-order multilevel optimization strategies

    Authors: Henri Calandra, Serge Gratton, Elisa Riccietti, Xavier Vasseur

    Abstract: We propose a new family of multilevel methods for unconstrained minimization. The resulting strategies are multilevel extensions of high-order optimization methods based on q-order Taylor models (with q >= 1) that have been recently proposed in the literature. The use of high-order models, while decreasing the worst-case complexity bound, makes these methods computationally more expensive. Hence,… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

  18. arXiv:1904.04685  [pdf, other

    math.NA cs.LG math.OC

    On the approximation of the solution of partial differential equations by artificial neural networks trained by a multilevel Levenberg-Marquardt method

    Authors: Henri Calandra, Serge Gratton, Elisa Riccietti, Xavier Vasseur

    Abstract: This paper is concerned with the approximation of the solution of partial differential equations by means of artificial neural networks. Here a feedforward neural network is used to approximate the solution of the partial differential equation. The learning problem is formulated as a least squares problem, choosing the residual of the partial differential equation as a loss function, whereas a mul… ▽ More

    Submitted 9 April, 2019; originally announced April 2019.

  19. arXiv:1504.03442  [pdf, ps, other

    math.NA

    On an adaptive regularization for ill-posed nonlinear systems and its trust-region implementation

    Authors: Stefania Bellavia, Benedetta Morini, Elisa Riccietti

    Abstract: In this paper we address the stable numerical solution of nonlinear ill-posed systems by a trust-region method. We show that an appropriate choice of the trust-region radius gives rise to a procedure that has the potential to approach a solution of the unperturbed system. This regularizing property is shown theoretically and validated numerically.

    Submitted 14 April, 2015; originally announced April 2015.

    Comments: arXiv admin note: text overlap with arXiv:1410.2780