Skip to main content

Showing 1–33 of 33 results for author: Dvinskikh, D

.
  1. Gradient-free algorithm for saddle point problems under overparametrization

    Authors: Ekaterina Statkevich, Sofiya Bondar, Darina Dvinskikh, Alexander Gasnikov, Aleksandr Lobanov

    Abstract: This paper focuses on solving a stochastic saddle point problem (SPP) under an overparameterized regime for the case, when the gradient computation is impractical. As an intermediate step, we generalize Same-sample Stochastic Extra-gradient algorithm (Gorbunov et al., 2022) to a biased oracle and estimate novel convergence rates. As the result of the paper we introduce an algorithm, which uses gra… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Journal ref: Chaos, Solitons & Fractals Chaos, Solitons & Fractals Volume 185 August 2024 115048

  2. arXiv:2311.16743  [pdf, ps, other

    math.OC

    About some works of Boris Polyak on convergence of gradient methods and their development

    Authors: Seydamet Ablaev, Aleksandr Beznosikov, Alexander Gasnikov, Darina Dvinskikh, Aleksandr Lobanov, Sergei Puchinin, Fedor Stonyakin

    Abstract: The paper presents a review of the state-of-the-art of subgradient and accelerated methods of convex optimization, including in the presence of disturbances and access to various information about the objective function (function value, gradient, stochastic gradient, higher derivatives). For nonconvex problems, the Polak-Lojasiewicz condition is considered and a review of the main results is given… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: in Russian language

  3. arXiv:2311.06953  [pdf, other

    math.OC

    Bregman Proximal Method for Efficient Communications under Similarity

    Authors: Aleksandr Beznosikov, Darina Dvinskikh, Andrei Semenov, Alexander Gasnikov

    Abstract: We propose a novel distributed method for monotone variational inequalities and convex-concave saddle point problems arising in various machine learning applications such as game theory and adversarial training. By exploiting \textit{similarity} our algorithm overcomes communication bottleneck which is a major issue in distributed optimization. The proposed algorithm enjoys optimal communication c… ▽ More

    Submitted 21 June, 2024; v1 submitted 12 November, 2023; originally announced November 2023.

    Comments: 16 pages

  4. arXiv:2310.18763  [pdf, other

    math.OC

    Accelerated Zeroth-order Method for Non-Smooth Stochastic Convex Optimization Problem with Infinite Variance

    Authors: Nikita Kornilov, Ohad Shamir, Aleksandr Lobanov, Darina Dvinskikh, Alexander Gasnikov, Innokentiy Shibaev, Eduard Gorbunov, Samuel Horváth

    Abstract: In this paper, we consider non-smooth stochastic convex optimization with two function evaluations per round under infinite noise variance. In the classical setting when noise has finite variance, an optimal algorithm, built upon the batched accelerated gradient method, was proposed in (Gasnikov et. al., 2022). This optimality is defined in terms of iteration and oracle complexity, as well as the… ▽ More

    Submitted 28 October, 2023; originally announced October 2023.

  5. arXiv:2304.02442  [pdf, ps, other

    math.OC

    Gradient-Free Methods for Non-Smooth Convex Stochastic Optimization with Heavy-Tailed Noise on Convex Compact

    Authors: Nikita Kornilov, Alexander Gasnikov, Pavel Dvurechensky, Darina Dvinskikh

    Abstract: We present two easy-to-implement gradient-free/zeroth-order methods to optimize a stochastic non-smooth function accessible only via a black-box. The methods are built upon efficient first-order methods in the heavy-tailed case, i.e., when the gradient noise has infinite variance but bounded $(1+κ)$-th moment for some $κ\in(0,1]$. The first algorithm is based on the stochastic mirror descent with… ▽ More

    Submitted 24 August, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

  6. arXiv:2211.13566  [pdf, ps, other

    math.OC

    Randomized gradient-free methods in convex optimization

    Authors: Alexander Gasnikov, Darina Dvinskikh, Pavel Dvurechensky, Eduard Gorbunov, Aleksander Beznosikov, Aleksandr Lobanov

    Abstract: This review presents modern gradient-free methods to solve convex optimization problems. By gradient-free methods, we mean those that use only (noisy) realizations of the objective value. We are motivated by various applications where gradient information is prohibitively expensive or even unavailable. We mainly focus on three criteria: oracle complexity, iteration complexity, and the maximum perm… ▽ More

    Submitted 12 February, 2024; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: Survey paper; 9 pages

  7. arXiv:2211.10783  [pdf, other

    math.OC

    Gradient-Free Federated Learning Methods with $l_1$ and $l_2$-Randomization for Non-Smooth Convex Stochastic Optimization Problems

    Authors: Aleksandr Lobanov, Belal Alashqar, Darina Dvinskikh, Alexander Gasnikov

    Abstract: This paper studies non-smooth problems of convex stochastic optimization. Using the smoothing technique based on the replacement of the function value at the considered point by the averaged function value over a ball (in $l_1$-norm or $l_2$-norm) of small radius with the center in this point, the original problem is reduced to a smooth problem (whose Lipschitz constant of the gradient is inversel… ▽ More

    Submitted 21 May, 2023; v1 submitted 19 November, 2022; originally announced November 2022.

    Comments: In Russian language. Redesigned version for publication in the journal Computational Mathematics and Mathematical Physics

  8. arXiv:2210.11368  [pdf, ps, other

    math.OC

    Numerical Methods for Large-Scale Optimal Transport

    Authors: Nazarii Tupitsa, Pavel Dvurechensky, Darina Dvinskikh, Alexander Gasnikov

    Abstract: The optimal transport (OT) problem is a classical optimization problem having the form of linear programming. Machine learning applications put forward new computational challenges in its solution. In particular, the OT problem defines a distance between real-world objects such as images, videos, texts, etc., modeled as probability distributions. In this case, the large dimension of the correspond… ▽ More

    Submitted 24 October, 2022; v1 submitted 20 October, 2022; originally announced October 2022.

    Comments: An Encyclopedia article

  9. arXiv:2202.06114  [pdf, other

    math.OC

    Gradient-Free Optimization for Non-Smooth Saddle Point Problems under Adversarial Noise

    Authors: Darina Dvinskikh, Vladislav Tominin, Yaroslav Tominin, Alexander Gasnikov

    Abstract: We consider non-smooth saddle point optimization problems. To solve these problems, we propose a zeroth-order method under bounded or Lipschitz continuous noise, possible adversarial. In contrast to the state-of-the-art algorithms, our algorithm is optimal in terms of both criteria: oracle calls complexity and the maximum value of admissible noise. The proposed method is simple and easy to impleme… ▽ More

    Submitted 25 March, 2023; v1 submitted 12 February, 2022; originally announced February 2022.

  10. arXiv:2202.01805  [pdf, ps, other

    math.OC

    On the relations of stochastic convex optimization problems with empirical risk minimization problems on $p$-norm balls

    Authors: Darina Dvinskikh, Vitali Pirau, Alexander Gasnikov

    Abstract: In this paper, we consider convex stochastic optimization problems arising in machine learning applications (e.g., risk minimization) and mathematical statistics (e.g., maximum likelihood estimation). There are two main approaches to solve such kinds of problems, namely the Stochastic Approximation approach (online approach) and the Sample Average Approximation approach, also known as the Monte Ca… ▽ More

    Submitted 2 March, 2022; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: 14 pages, in Russian

  11. Decentralized Personalized Federated Learning: Lower Bounds and Optimal Algorithm for All Personalization Modes

    Authors: Abdurakhmon Sadiev, Ekaterina Borodich, Aleksandr Beznosikov, Darina Dvinskikh, Saveliy Chezhegov, Rachael Tappenden, Martin Takáč, Alexander Gasnikov

    Abstract: This paper considers the problem of decentralized, personalized federated learning. For centralized personalized federated learning, a penalty that measures the deviation from the local model and its average, is often added to the objective function. However, in a decentralized setting this penalty is expensive in terms of communication costs, so here, a different penalty - one that is built to re… ▽ More

    Submitted 23 August, 2022; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: New in v3: more detailed proofs, more experiments. 40 pages, 6 algorithms, 10 figures, 2 tables, 5 theorems

  12. arXiv:2105.01587  [pdf, other

    math.OC

    Decentralized Algorithms for Wasserstein Barycenters

    Authors: Darina Dvinskikh

    Abstract: In this thesis, we consider the Wasserstein barycenter problem of discrete probability measures from computational and statistical sides. The statistical focus is estimating the sample size of measures necessary to calculate an approximation for Fréchet mean (barycenter) of a probability distribution with a given precision. For empirical risk minimization approaches, the question of the regulariza… ▽ More

    Submitted 25 October, 2021; v1 submitted 4 May, 2021; originally announced May 2021.

    Comments: 103 pages, Masters thesis. arXiv admin note: text overlap with arXiv:2001.07697

  13. Decentralized Distributed Optimization for Saddle Point Problems

    Authors: Alexander Rogozin, Aleksandr Beznosikov, Darina Dvinskikh, Dmitry Kovalev, Pavel Dvurechensky, Alexander Gasnikov

    Abstract: We consider distributed convex-concave saddle point problems over arbitrary connected undirected networks and propose a decentralized distributed algorithm for their solution. The local functions distributed across the nodes are assumed to have global and local groups of variables. For the proposed algorithm we prove non-asymptotic convergence rate estimates with explicit dependence on the network… ▽ More

    Submitted 9 April, 2024; v1 submitted 15 February, 2021; originally announced February 2021.

  14. Recent theoretical advances in decentralized distributed convex optimization

    Authors: Eduard Gorbunov, Alexander Rogozin, Aleksandr Beznosikov, Darina Dvinskikh, Alexander Gasnikov

    Abstract: In the last few years, the theory of decentralized distributed convex optimization has made significant progress. The lower bounds on communications rounds and oracle calls have appeared, as well as methods that reach both of these bounds. In this paper, we focus on how these results can be explained based on optimal algorithms for the non-distributed setup. In particular, we provide our recent re… ▽ More

    Submitted 29 November, 2021; v1 submitted 26 November, 2020; originally announced November 2020.

    Comments: 46 pages; a survey paper

  15. arXiv:2010.09585  [pdf, ps, other

    math.OC

    Parallel and Distributed algorithms for ML problems

    Authors: Darina Dvinskikh, Alexander Gasnikov, Alexander Rogozin, Alexander Beznosikov

    Abstract: In this paper we make a survey of modern parallel and distributed approaches to solve sum-type convex minimization problems come from ML applications.

    Submitted 25 April, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: in Russian

  16. arXiv:2010.04677  [pdf, other

    math.OC

    Improved Complexity Bounds in Wasserstein Barycenter Problem

    Authors: Darina Dvinskikh, Daniil Tiapkin

    Abstract: In this paper, we focus on computational aspects of the Wasserstein barycenter problem. We propose two algorithms to compute Wasserstein barycenters of $m$ discrete measures of size $n$ with accuracy $\e$. The first algorithm, based on mirror prox with a specific norm, meets the complexity of celebrated accelerated iterative Bregman projections (IBP), namely $\widetilde O(mn^2\sqrt n/\e)$, however… ▽ More

    Submitted 24 February, 2021; v1 submitted 9 October, 2020; originally announced October 2020.

    Comments: 23 pages

  17. Accelerated meta-algorithm for convex optimization

    Authors: Alexander Gasnikov, Darina Dvinskikh, Pavel Dvurechensky, Dmitry Kamzolov, Vladislav Matykhin, Dmitry Pasechnyk, Nazarii Tupitsa, Alexei Chernov

    Abstract: We propose an accelerated meta-algorithm, which allows to obtain accelerated methods for convex unconstrained minimization in different settings. As an application of the general scheme we propose nearly optimal methods for minimizing smooth functions with Lipschitz derivatives of an arbitrary order, as well as for smooth minimax optimization problems. The proposed meta-algorithm is more general t… ▽ More

    Submitted 4 November, 2020; v1 submitted 18 April, 2020; originally announced April 2020.

    Comments: 25 pages, in Russian

  18. arXiv:2004.04490   

    math.OC

    Accelerated and nonaccelerated stochastic gradient descent with inexact model

    Authors: Darina Dvinskikh, Alexander Tyurin, Alexander Gasnikov, Sergey Omelchenko

    Abstract: In this paper, we propose a new way to obtain optimal convergence rates for smooth stochastic (strong) convex optimization tasks. Our approach is based on results for optimization tasks where gradients have nonrandom noise. In contrast to previously known results, we extend our idea to the inexact model conception.

    Submitted 15 April, 2020; v1 submitted 9 April, 2020; originally announced April 2020.

    Comments: Withdrawn as this should not have been a new article. Please instead see arXiv:2001.03443

  19. arXiv:2002.02706  [pdf, other

    math.OC

    Oracle Complexity Separation in Convex Optimization

    Authors: Anastasiya Ivanova, Evgeniya Vorontsova, Dmitry Pasechnyuk, Alexander Gasnikov, Pavel Dvurechensky, Darina Dvinskikh, Alexander Tyurin

    Abstract: Many convex optimization problems have structured objective function written as a sum of functions with different types of oracles (full gradient, coordinate derivative, stochastic gradient) and different evaluation complexity of these oracles. In the strongly convex case these functions also have different condition numbers, which eventually define the iteration complexity of first-order methods… ▽ More

    Submitted 11 March, 2022; v1 submitted 7 February, 2020; originally announced February 2020.

  20. arXiv:2001.09013  [pdf, other

    math.OC

    Inexact Relative Smoothness and Strong Convexity for Optimization and Variational Inequalities by Inexact Model

    Authors: Fedor Stonyakin, Alexander Tyurin, Alexander Gasnikov, Pavel Dvurechensky, Artem Agafonov, Darina Dvinskikh, Mohammad Alkousa, Dmitry Pasechnyuk, Sergei Artamonov, Victorya Piskunova

    Abstract: In this paper, we propose a general algorithmic framework for first-order methods in optimization in a broad sense, including minimization problems, saddle-point problems, and variational inequalities. This framework allows obtaining many known methods as a special case, the list including accelerated gradient method, composite optimization methods, level-set methods, Bregman proximal methods. The… ▽ More

    Submitted 19 December, 2021; v1 submitted 23 January, 2020; originally announced January 2020.

    Comments: arXiv admin note: text overlap with arXiv:1902.00990. To appear in Optimization Methods and Software, https://doi.org/10.1080/10556788.2021.1924714

  21. arXiv:2001.07697  [pdf, other

    math.OC cs.LG stat.ML

    Stochastic Approximation versus Sample Average Approximation for population Wasserstein barycenters

    Authors: Darina Dvinskikh

    Abstract: In the machine learning and optimization community, there are two main approaches for the convex risk minimization problem, namely, the Stochastic Approximation (SA) and the Sample Average Approximation (SAA). In terms of oracle complexity (required number of stochastic gradient evaluations), both approaches are considered equivalent on average (up to a logarithmic factor). The total complexity de… ▽ More

    Submitted 25 October, 2021; v1 submitted 21 January, 2020; originally announced January 2020.

    Comments: 33 pages

  22. arXiv:2001.03443  [pdf, ps, other

    math.OC

    Accelerated and nonaccelerated stochastic gradient descent with model conception

    Authors: Darina Dvinskikh, Alexander Tyurin, Alexander Gasnikov, Sergey Omelchenko

    Abstract: In this paper, we describe a new way to get convergence rates for optimal methods in smooth (strongly) convex optimization tasks. Our approach is based on results for tasks where gradients have nonrandom small noises. Unlike previous results, we obtain convergence rates with model conception.

    Submitted 13 July, 2020; v1 submitted 10 January, 2020; originally announced January 2020.

    Comments: in Russian

  23. arXiv:1912.11632  [pdf, ps, other

    math.OC

    Accelerated gradient sliding and variance reduction

    Authors: Darina Dvinskikh, Sergey Omelchenko, Alexander Tyurin, Alexander Gasnikov

    Abstract: We consider sum-type strongly convex optimization problem (first term) with smooth convex not proximal friendly composite (second term). We show that the complexity of this problem can be split into optimal number of incremental oracle calls for the first (sum-type) term and optimal number of oracle calls for the second (composite) term. Here under `optimal number' we mean estimate that correspond… ▽ More

    Submitted 11 March, 2020; v1 submitted 25 December, 2019; originally announced December 2019.

    Comments: in Russian

  24. arXiv:1911.08380  [pdf, other

    math.OC

    Adaptive Gradient Descent for Convex and Non-Convex Stochastic Optimization

    Authors: Darina Dvinskikh, Aleksandr Ogaltsov, Alexander Gasnikov, Pavel Dvurechensky, Alexander Tyurin, Vladimir Spokoiny

    Abstract: In this paper we propose several adaptive gradient methods for stochastic optimization. Unlike AdaGrad-type of methods, our algorithms are based on Armijo-type line search and they simultaneously adapt to the unknown Lipschitz constant of the gradient and variance of the stochastic approximation for the gradient. We consider an accelerated and non-accelerated gradient descent for convex problems a… ▽ More

    Submitted 12 June, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: 18 pages

  25. arXiv:1911.07363  [pdf, ps, other

    math.OC

    Optimal Decentralized Distributed Algorithms for Stochastic Convex Optimization

    Authors: Eduard Gorbunov, Darina Dvinskikh, Alexander Gasnikov

    Abstract: We consider stochastic convex optimization problems with affine constraints and develop several methods using either primal or dual approach to solve it. In the primal case, we use a special penalization technique to make the initial problem more convenient for using optimization methods. We propose algorithms to solve it based on Similar Triangles Method with Inexact Proximal Step for the convex… ▽ More

    Submitted 11 November, 2020; v1 submitted 17 November, 2019; originally announced November 2019.

    Comments: The content of this version is the same as in the version from February 16, 2020. The changes are only in the restructuring of the paper

  26. arXiv:1906.03620  [pdf, ps, other

    math.OC

    Accelerated methods for composite non-bilinear saddle point problem

    Authors: Mohammad Alkousa, Darina Dvinskikh, Fedor Stonyakin, Alexander Gasnikov, Dmitry Kovalev

    Abstract: Based on G. Lan's accelerated gradient sliding and general relation between the smoothness and strong convexity parameters of function under Legendre transformation we show that under rather general conditions the best known bounds for bilinear convex-concave smooth composite saddle point problem keep true for or non-bilinear convex-concave smooth composite saddle point problem. Moreover, we descr… ▽ More

    Submitted 1 January, 2020; v1 submitted 9 June, 2019; originally announced June 2019.

    Comments: 28 pages, in Russian

  27. arXiv:1904.09015  [pdf, ps, other

    math.OC

    Decentralized and Parallel Primal and Dual Accelerated Methods for Stochastic Convex Programming Problems

    Authors: Darina Dvinskikh, Alexander Gasnikov

    Abstract: We introduce primal and dual stochastic gradient oracle methods for decentralized convex optimization problems. Both for primal and dual oracles, the proposed methods are optimal in terms of the number of communication steps. However, for all classes of the objective, the optimality in terms of the number of oracle calls per node takes place only up to a logarithmic factor and the notion of smooth… ▽ More

    Submitted 10 February, 2021; v1 submitted 18 April, 2019; originally announced April 2019.

    Comments: 36 pages

  28. arXiv:1903.09844  [pdf, ps, other

    math.OC

    On Primal-Dual Approach for Distributed Stochastic Convex Optimization over Networks

    Authors: Darina Dvinskikh, Eduard Gorbunov, Alexander Gasnikov, Pavel Dvurechensky, Cesar A. Uribe

    Abstract: We introduce a primal-dual stochastic gradient oracle method for distributed convex optimization problems over networks. We show that the proposed method is optimal in terms of communication steps. Additionally, we propose a new analysis method for the rate of convergence in terms of duality gap and probability of large deviations. This analysis is based on a new technique that allows to bound the… ▽ More

    Submitted 26 November, 2019; v1 submitted 23 March, 2019; originally announced March 2019.

  29. arXiv:1902.09001  [pdf, other

    math.OC

    Gradient Methods for Problems with Inexact Model of the Objective

    Authors: Fedor Stonyakin, Darina Dvinskikh, Pavel Dvurechensky, Alexey Kroshnin, Olesya Kuznetsova, Artem Agafonov, Alexander Gasnikov, Alexander Tyurin, César A. Uribe, Dmitry Pasechnyuk, Sergei Artamonov

    Abstract: We consider optimization methods for convex minimization problems under inexact information on the objective function. We introduce inexact model of the objective, which as a particular cases includes $(δ,L)$ inexact oracle and relative smoothness condition. We analyze gradient method which uses this inexact model and obtain convergence rates for convex and strongly convex problems. To show potent… ▽ More

    Submitted 23 March, 2019; v1 submitted 24 February, 2019; originally announced February 2019.

    MSC Class: 90C25; 90C30; 90C06; 90C90; 68Q25; 65K05; 65Y20; 68W40 ACM Class: G.1.6

  30. arXiv:1902.00990  [pdf, ps, other

    math.OC

    Inexact Model: A Framework for Optimization and Variational Inequalities

    Authors: Fedor Stonyakin, Alexander Gasnikov, Alexander Tyurin, Dmitry Pasechnyuk, Artem Agafonov, Pavel Dvurechensky, Darina Dvinskikh, Alexey Kroshnin, Victorya Piskunova

    Abstract: In this paper we propose a general algorithmic framework for first-order methods in optimization in a broad sense, including minimization problems, saddle-point problems and variational inequalities. This framework allows to obtain many known methods as a special case, the list including accelerated gradient method, composite optimization methods, level-set methods, proximal methods. The idea of t… ▽ More

    Submitted 5 January, 2020; v1 submitted 3 February, 2019; originally announced February 2019.

    Comments: 41 pages

  31. arXiv:1901.08686  [pdf, ps, other

    math.OC cs.DS

    On the Complexity of Approximating Wasserstein Barycenter

    Authors: Alexey Kroshnin, Darina Dvinskikh, Pavel Dvurechensky, Alexander Gasnikov, Nazarii Tupitsa, Cesar Uribe

    Abstract: We study the complexity of approximating Wassertein barycenter of $m$ discrete measures, or histograms of size $n$ by contrasting two alternative approaches, both using entropic regularization. The first approach is based on the Iterative Bregman Projections (IBP) algorithm for which our novel analysis gives a complexity bound proportional to $\frac{mn^2}{\varepsilon^2}$ to approximate the origina… ▽ More

    Submitted 20 February, 2020; v1 submitted 24 January, 2019; originally announced January 2019.

    Comments: Corrected misprints. Added a reference to accelerated Iterative Bregman Projections introduced in arXiv:1906.03622

    MSC Class: 90C25; 90C30; 90C06; 90C90

    Journal ref: ICML 2019, in PMLR 97:3530-3540. http://proceedings.mlr.press/v97/kroshnin19a.html

  32. arXiv:1806.03915  [pdf, other

    math.OC cs.DC

    Decentralize and Randomize: Faster Algorithm for Wasserstein Barycenters

    Authors: Pavel Dvurechensky, Darina Dvinskikh, Alexander Gasnikov, César A. Uribe, Angelia Nedić

    Abstract: We study the decentralized distributed computation of discrete approximations for the regularized Wasserstein barycenter of a finite set of continuous probability measures distributedly stored over a network. We assume there is a network of agents/machines/computers, and each agent holds a private continuous probability measure and seeks to compute the barycenter of all the measures in the network… ▽ More

    Submitted 19 February, 2020; v1 submitted 11 June, 2018; originally announced June 2018.

    MSC Class: 90C25; 90C30; 90C06; 90C90; 68Q25; 65K05; 65Y20; 68W40 ACM Class: G.1.6

  33. arXiv:1803.02933  [pdf, other

    math.OC cs.DC cs.MA stat.ML

    Distributed Computation of Wasserstein Barycenters over Networks

    Authors: César A. Uribe, Darina Dvinskikh, Pavel Dvurechensky, Alexander Gasnikov, Angelia Nedić

    Abstract: We propose a new \cu{class-optimal} algorithm for the distributed computation of Wasserstein Barycenters over networks. Assuming that each node in a graph has a probability distribution, we prove that every node can reach the barycenter of all distributions held in the network by using local interactions compliant with the topology of the graph. We provide an estimate for the minimum number of com… ▽ More

    Submitted 20 September, 2018; v1 submitted 7 March, 2018; originally announced March 2018.