Showing 1–28 of 28 results for author: Bianchi, P

  1. arXiv:2406.11929

    cs.LG math.PR

    Long-time asymptotics of noisy SVGD outside the population limit

    Authors: Victor Priser, Pascal Bianchi, Adil Salim

    Abstract: Stein Variational Gradient Descent (SVGD) is a widely used sampling algorithm that has been successfully applied in several areas of Machine Learning. SVGD operates by iteratively moving a set of interacting particles (which represent the samples) to approximate the target distribution. Despite recent studies on the complexity of SVGD and its variants, their long-time asymptotic behavior (i.e., a…

    Submitted 21 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  2. arXiv:2403.17472

    math.PR

    Long run convergence of discrete-time interacting particle systems of the McKean-Vlasov type

    Authors: Pascal Bianchi, Walid Hachem, Victor Priser

    Abstract: We consider a discrete-time system of n coupled random vectors, a.k.a. interacting particles. The dynamics involve a vanishing step size, some random centered perturbations, and a mean vector field which induces the coupling between the particles. We study the doubly asymptotic regime where both the number of iterations and the number n of particles tend to infinity, without any constraint on…

    Submitted 3 April, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  3. arXiv:2112.05482

    math.OC math.DS math.PR

    A closed-measure approach to stochastic approximation

    Authors: Pascal Bianchi, Rodolfo Rios-Zertuche

    Abstract: This paper introduces a new method to tackle the issue of the almost sure convergence of stochastic approximation algorithms defined from a differential inclusion. Under the assumption of slowly decaying step-sizes, we establish that the set of essential accumulation points of the iterates belongs to the Birkhoff center associated with the differential inclusion. Unlike previous works, our results…

    Submitted 2 December, 2023; v1 submitted 10 December, 2021; originally announced December 2021.

    Comments: 20 pages

    MSC Class: 37A50 (Primary); 65K10; 37B35 (Secondary)

    ACM Class: G.1.6

  4. arXiv:2108.02072

    math.OC stat.ML

    Stochastic Subgradient Descent Escapes Active Strict Saddles on Weakly Convex Functions

    Authors: Pascal Bianchi, Walid Hachem, Sholom Schechtman

    Abstract: In non-smooth stochastic optimization, we establish the non-convergence of the stochastic subgradient descent (SGD) to the critical points recently called active strict saddles by Davis and Drusvyatskiy. Such points lie on a manifold $M$ where the function $f$ has a direction of second-order negative curvature. Off this manifold, the norm of the Clarke subdifferential of $f$ is lower-bounded. We r…

    Submitted 25 July, 2023; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: Accepted for publication in Mathematics of Operations Research

    MSC Class: 65K10; 62L20 (Primary); 49J52; 32B20 (secondary)

  5. arXiv:2106.07472

    cs.LG math.OC stat.ML

    Analysis of a Target-Based Actor-Critic Algorithm with Linear Function Approximation

    Authors: Anas Barakat, Pascal Bianchi, Julien Lehmann

    Abstract: Actor-critic methods integrating target networks have exhibited a stupendous empirical success in deep reinforcement learning. However, a theoretical understanding of the use of target networks in actor-critic methods is largely missing in the literature. In this paper, we reduce this gap between theory and practice by proposing the first theoretical analysis of an online target-based actor-critic…

    Submitted 22 February, 2022; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: 50 pages

    Journal ref: AISTATS 2022

  6. arXiv:2012.04002

    math.OC math.PR stat.ML

    Stochastic optimization with momentum: convergence, fluctuations, and traps avoidance

    Authors: A. Barakat, P. Bianchi, W. Hachem, Sh. Schechtman

    Abstract: In this paper, a general stochastic optimization procedure is studied, unifying several variants of the stochastic gradient descent such as, among others, the stochastic heavy ball method, the Stochastic Nesterov Accelerated Gradient algorithm (S-NAG), and the widely used Adam algorithm. The algorithm is seen as a noisy Euler discretization of a non-autonomous ordinary differential equation, recen…

    Submitted 10 July, 2021; v1 submitted 7 December, 2020; originally announced December 2020.

    Comments: Accepted for publication in Electronic Journal of Statistics. 49 pages

    MSC Class: 62L20; 34A12; 60F99

  7. arXiv:2005.08513

    math.NA math.OC

    Convergence of constant step stochastic gradient descent for non-smooth non-convex functions

    Authors: Pascal Bianchi, Walid Hachem, Sholom Schechtman

    Abstract: This paper studies the asymptotic behavior of the constant step Stochastic Gradient Descent for the minimization of an unknown function F, defined as the expectation of a non convex, non smooth, locally Lipschitz random function. As the gradient may not exist, it is replaced by a certain operator: a reasonable choice is to use an element of the Clarke subdifferential of the random function; an ot…

    Submitted 12 April, 2022; v1 submitted 18 May, 2020; originally announced May 2020.

    Journal ref: Set-Valued and Variational Analysis, Springer, 2022

  8. arXiv:1911.07596

    math.OC cs.LG stat.ML

    Convergence Analysis of a Momentum Algorithm with Adaptive Step Size for Non Convex Optimization

    Authors: Anas Barakat, Pascal Bianchi

    Abstract: Although ADAM is a very popular algorithm for optimizing the weights of neural networks, it has been recently shown that it can diverge even in simple convex optimization examples. Several variants of ADAM have been proposed to circumvent this convergence issue. In this work, we study the ADAM algorithm for smooth nonconvex optimization under a boundedness assumption on the adaptive learning rate.…

    Submitted 24 September, 2020; v1 submitted 18 November, 2019; originally announced November 2019.

    Comments: 28 pages, 1 figure, published in ACML2020

  9. arXiv:1901.08170

    math.OC stat.ML

    A Fully Stochastic Primal-Dual Algorithm

    Authors: Pascal Bianchi, Walid Hachem, Adil Salim

    Abstract: A new stochastic primal-dual algorithm for solving a composite optimization problem is proposed. It is assumed that all the functions/operators that enter the optimization problem are given as statistical expectations. These expectations are unknown but revealed across time through i.i.d. realizations. The proposed algorithm is proven to converge to a saddle point of the Lagrangian function. In t…

    Submitted 22 June, 2020; v1 submitted 23 January, 2019; originally announced January 2019.

  10. arXiv:1810.02263

    stat.ML cs.LG math.CA math.DS math.OC

    Convergence and Dynamical Behavior of the ADAM Algorithm for Non-Convex Stochastic Optimization

    Authors: Anas Barakat, Pascal Bianchi

    Abstract: Adam is a popular variant of stochastic gradient descent for finding a local minimizer of a function. In the constant stepsize regime, assuming that the objective function is differentiable and non-convex, we establish the convergence in the long run of the iterates to a stationary point under a stability condition. The key ingredient is the introduction of a continuous-time version of Adam, under…

    Submitted 13 May, 2020; v1 submitted 4 October, 2018; originally announced October 2018.

    Comments: 30 pages

  11. arXiv:1804.00934

    math.OC stat.ML

    A Constant Step Stochastic Douglas-Rachford Algorithm with Application to Non Separable Regularizations

    Authors: Adil Salim, Pascal Bianchi, Walid Hachem

    Abstract: The Douglas-Rachford algorithm is an algorithm that converges to a minimizer of a sum of two convex functions. The algorithm consists in fixed point iterations involving computations of the proximity operators of the two functions separately. The paper investigates a stochastic version of the algorithm where both functions are random and the step size is constant. We establish that the iterates of…

    Submitted 3 April, 2018; originally announced April 2018.

  12. arXiv:1712.07027

    math.OC cs.LG stat.ML

    Snake: a Stochastic Proximal Gradient Algorithm for Regularized Problems over Large Graphs

    Authors: Adil Salim, Pascal Bianchi, Walid Hachem

    Abstract: A regularized optimization problem over a large unstructured graph is studied, where the regularization term is tied to the graph geometry. Typical regularization examples include the total variation and the Laplacian regularizations over the graph. When applying the proximal gradient algorithm to solve this problem, there exist quite affordable methods to implement the proximity operator (backwar…

    Submitted 19 December, 2017; originally announced December 2017.

  13. arXiv:1705.06603

    math.OC

    Distributed Deblurring of Large Images of Wide Field-Of-View

    Authors: Rahul Mourya, André Ferrari, Rémi Flamary, Pascal Bianchi, Cédric Richard

    Abstract: Image deblurring is an economic way to reduce certain degradations (blur and noise) in acquired images. Thus, it has become an essential tool in high resolution imaging in many applications, e.g., astronomy, microscopy or computational photography. In applications such as astronomy and satellite imaging, the size of acquired images can be extremely large (up to gigapixels) covering wide field-of-view…

    Submitted 17 May, 2017; originally announced May 2017.

    Comments: 16 pages, 10 figures, submitted to IEEE Trans. on Image Processing

  14. arXiv:1702.04144

    math.OC

    A constant step Forward-Backward algorithm involving random maximal monotone operators

    Authors: Pascal Bianchi, Walid Hachem, Adil Salim

    Abstract: A stochastic Forward-Backward algorithm with a constant step is studied. At each time step, this algorithm involves an independent copy of a couple of random maximal monotone operators. Defining a mean operator as a selection integral, the differential inclusion built from the sum of the two mean operators is considered. As a first result, it is shown that the interpolated process obtained from th…

    Submitted 4 April, 2018; v1 submitted 14 February, 2017; originally announced February 2017.

  15. arXiv:1612.03831

    math.PR

    Constant Step Stochastic Approximations Involving Differential Inclusions: Stability, Long-Run Convergence and Applications

    Authors: Pascal Bianchi, Walid Hachem, Adil Salim

    Abstract: We consider a Markov chain $(x_n)$ whose kernel is indexed by a scaling parameter $γ>0$, referred to as the step size. The aim is to analyze the behavior of the Markov chain in the doubly asymptotic regime where $n\to\infty$ then $γ\to 0$. First, under mild assumptions on the so-called drift of the Markov chain, we show that the interpolated process converges narrowly to the solutions of a Differen…

    Submitted 14 December, 2017; v1 submitted 12 December, 2016; originally announced December 2016.

  16. A Coordinate Descent Primal-Dual Algorithm with Large Step Size and Possibly Non Separable Functions

    Authors: Olivier Fercoq, Pascal Bianchi

    Abstract: This paper introduces a coordinate descent version of the Vũ-Condat algorithm. By coordinate descent, we mean that only a subset of the coordinates of the primal and dual iterates is updated at each iteration, the other coordinates being maintained to their past value. Our method allows us to solve optimization problems with a combination of differentiable functions, constraints as well as non-sep…

    Submitted 2 February, 2018; v1 submitted 19 August, 2015; originally announced August 2015.

    Comments: 32 pages

  17. arXiv:1508.02845

    math.OC math.DS

    Dynamical behavior of a stochastic forward-backward algorithm using random monotone operators

    Authors: Pascal Bianchi, Walid Hachem

    Abstract: The purpose of this paper is to study the dynamical behavior of the sequence produced by a forward-backward algorithm involving two random maximal monotone operators and a sequence of decreasing step sizes. Defining a mean monotone operator as an Aumann integral, and assuming that the sum of the two mean operators is maximal (sufficient maximality conditions are provided), it is shown that with pr…

    Submitted 4 July, 2016; v1 submitted 12 August, 2015; originally announced August 2015.

    MSC Class: 47H05; 47N10; 62L20; 34A60

  18. arXiv:1504.05400

    math.OC math.NA

    Ergodic convergence of a stochastic proximal point algorithm

    Authors: Pascal Bianchi

    Abstract: The purpose of this paper is to establish the almost sure weak ergodic convergence of a sequence of iterates $(x_n)$ given by $x_{n+1} = (I+λ_n A(ξ_{n+1},\,.\,))^{-1}(x_n)$ where $(A(s,\,.\,):s\in E)$ is a collection of maximal monotone operators on a separable Hilbert space, $(ξ_n)$ is an independent identically distributed sequence of random variables on $E$ and $(λ_n)$ is a positive sequence in…

    Submitted 25 July, 2016; v1 submitted 21 April, 2015; originally announced April 2015.

    Comments: 26 pages

  19. arXiv:1410.6956

    cs.MA eess.SY math.NA

    Success and Failure of Adaptation-Diffusion Algorithms for Consensus in Multi-Agent Networks

    Authors: Gemma Morral, Pascal Bianchi, Gersende Fort

    Abstract: This paper investigates the problem of distributed stochastic approximation in multi-agent systems. The algorithm under study consists of two steps: a local stochastic approximation step and a diffusion step which drives the network to a consensus. The diffusion step uses row-stochastic matrices to weight the network exchanges. As opposed to previous works, exchange matrices are not supposed to be…

    Submitted 25 October, 2014; originally announced October 2014.

    Comments: 13 pages, 4 figures

  20. arXiv:1407.0898

    math.OC cs.DC eess.SY math.NA

    A Coordinate Descent Primal-Dual Algorithm and Application to Distributed Asynchronous Optimization

    Authors: Pascal Bianchi, Walid Hachem, Franck Iutzeler

    Abstract: Based on the idea of randomized coordinate descent of $α$-averaged operators, a randomized primal-dual optimization algorithm is introduced, where a random subset of coordinates is updated at each iteration. The algorithm builds upon a variant of a recent (deterministic) algorithm proposed by Vũ and Condat that includes the well known ADMM as a particular case. The obtained algorithm is used to so…

    Submitted 30 September, 2015; v1 submitted 3 July, 2014; originally announced July 2014.

    Comments: 10 pages

  21. arXiv:1312.1085

    cs.DC math.OC

    Explicit Convergence Rate of a Distributed Alternating Direction Method of Multipliers

    Authors: Franck Iutzeler, Pascal Bianchi, Philippe Ciblat, Walid Hachem

    Abstract: Consider a set of N agents seeking to solve distributively the minimization problem $\inf_{x} \sum_{n = 1}^N f_n(x)$ where the convex functions $f_n$ are local to the agents. The popular Alternating Direction Method of Multipliers has the potential to handle distributed optimization problems of this kind. We provide a general reformulation of the problem and obtain a class of distributed algorithm…

    Submitted 28 December, 2014; v1 submitted 4 December, 2013; originally announced December 2013.

    Comments: 13 pages

  22. arXiv:1309.7264

    math.OC

    Robust Consensus in Distributed Networks using Total Variation

    Authors: Walid Ben-Ameur, Pascal Bianchi, Jérémie Jakubowicz

    Abstract: Consider a connected network of agents endowed with local cost functions representing private objectives. Agents seek to find an agreement on some minimizer of the aggregate cost, by means of repeated communications between neighbors. Consensus on the average over the network, usually addressed by gossip algorithms, is a special instance of this problem, corresponding to quadratic private objectiv…

    Submitted 27 September, 2013; originally announced September 2013.

  23. arXiv:1303.2837

    cs.DC math.OC

    Asynchronous Distributed Optimization using a Randomized Alternating Direction Method of Multipliers

    Authors: Franck Iutzeler, Pascal Bianchi, Philippe Ciblat, Walid Hachem

    Abstract: Consider a set of networked agents endowed with private cost functions and seeking to find a consensus on the minimizer of the aggregate cost. A new class of random asynchronous distributed optimization methods is introduced. The methods generalize the standard Alternating Direction Method of Multipliers (ADMM) to an asynchronous setting where isolated components of the network are activated in an…

    Submitted 12 March, 2013; originally announced March 2013.

    Comments: 6 pages

  24. arXiv:1203.1505

    math.OC cs.DC eess.SY

    Performance of a Distributed Stochastic Approximation Algorithm

    Authors: Pascal Bianchi, Gersende Fort, Walid Hachem

    Abstract: In this paper, a distributed stochastic approximation algorithm is studied. Applications of such algorithms include decentralized estimation, optimization, control or computing. The algorithm consists in two steps: a local step, where each node in a network updates a local estimate using a stochastic approximation algorithm with decreasing step size, and a gossip step, where a node computes a loca…

    Submitted 2 December, 2013; v1 submitted 7 March, 2012; originally announced March 2012.

    Comments: IEEE Transactions on Information Theory 2013

  25. arXiv:1107.2526

    math.OC cs.DC eess.SY

    Convergence of a Multi-Agent Projected Stochastic Gradient Algorithm for Non-Convex Optimization

    Authors: Pascal Bianchi, Jérémie Jakubowicz

    Abstract: We introduce a new framework for the convergence analysis of a class of distributed constrained non-convex optimization algorithms in multi-agent systems. The aim is to search for local minimizers of a non-convex objective function which is supposed to be a sum of local utility functions of the agents. The algorithm under study consists of two steps: a local stochastic gradient descent at each age…

    Submitted 2 December, 2013; v1 submitted 13 July, 2011; originally announced July 2011.

    Comments: IEEE Transactions on Automatic Control 2013

  26. arXiv:1004.5529

    cs.IT math.PR math.ST

    High-Rate Vector Quantization for the Neyman-Pearson Detection of Correlated Processes

    Authors: Joffrey Villard, Pascal Bianchi

    Abstract: This paper investigates the effect of quantization on the performance of the Neyman-Pearson test. It is assumed that a sensing unit observes samples of a correlated stationary ergodic multivariate process. Each sample is passed through an N-point quantizer and transmitted to a decision device which performs a binary hypothesis test. For any false alarm level, it is shown that the miss probability…

    Submitted 4 May, 2011; v1 submitted 30 April, 2010; originally announced April 2010.

    Comments: 47 pages, 7 figures, 1 table. To appear in the IEEE Transactions on Information Theory

  27. arXiv:0910.0827

    math.PR cs.IT math.ST

    Performance of Statistical Tests for Single Source Detection using Random Matrix Theory

    Authors: Pascal Bianchi, Merouane Debbah, Mylène Maïda, Jamal Najim

    Abstract: This paper introduces a unified framework for the detection of a source with a sensor array in the context where the noise variance and the channel between the source and the sensors are unknown at the receiver. The Generalized Maximum Likelihood Test is studied and yields the analysis of the ratio between the maximum eigenvalue of the sampled covariance matrix and its normalized trace. Using rece…

    Submitted 31 May, 2010; v1 submitted 5 October, 2009; originally announced October 2009.

    Comments: 45 p. improved presentation; more proofs provided

  28. arXiv:0811.0979

    math.PR math-ph

    Asymptotic Independence in the Spectrum of the Gaussian Unitary Ensemble

    Authors: P. Bianchi, M. Debbah, J. Najim

    Abstract: Consider an $n \times n$ matrix from the Gaussian Unitary Ensemble (GUE). Given a finite collection of bounded disjoint real Borel sets $(Δ_{i,n},\ 1\leq i\leq p)$, properly rescaled, and eventually included in any neighbourhood of the support of Wigner's semi-circle law, we prove that the related counting measures $({\mathcal N}_n(Δ_{i,n}), 1\leq i\leq p)$, where ${\mathcal N}_n(Δ)$ represents t…

    Submitted 6 November, 2008; originally announced November 2008.

    Comments: 15 pages

    MSC Class: 15A52