Skip to main content

Showing 1–27 of 27 results for author: Samsonov, S

.
  1. arXiv:2406.13655  [pdf, other

    cs.LG cs.AI

    Improving GFlowNets with Monte Carlo Tree Search

    Authors: Nikita Morozov, Daniil Tiapkin, Sergey Samsonov, Alexey Naumov, Dmitry Vetrov

    Abstract: Generative Flow Networks (GFlowNets) treat sampling from distributions over compositional discrete spaces as a sequential decision-making problem, training a stochastic policy to construct objects step by step. Recent studies have revealed strong connections between GFlowNets and entropy-regularized reinforcement learning. Building on these insights, we propose to enhance planning capabilities of… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: ICML 2024 SPIGM Workshop

  2. arXiv:2405.16644  [pdf, other

    stat.ML cs.LG math.OC math.PR math.ST

    Gaussian Approximation and Multiplier Bootstrap for Polyak-Ruppert Averaged Linear Stochastic Approximation with Applications to TD Learning

    Authors: Sergey Samsonov, Eric Moulines, Qi-Man Shao, Zhuo-Song Zhang, Alexey Naumov

    Abstract: In this paper, we obtain the Berry-Esseen bound for multivariate normal approximation for the Polyak-Ruppert averaged iterates of the linear stochastic approximation (LSA) algorithm with decreasing step size. Our findings reveal that the fastest rate of normal approximation is achieved when setting the most aggressive step size $α_{k} \asymp k^{-1/2}$. Moreover, we prove the non-asymptotic validit… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    MSC Class: 60F05; 62L20; 62E20

  3. arXiv:2405.00017  [pdf, other

    cs.DC cs.LG stat.ML

    Queuing dynamics of asynchronous Federated Learning

    Authors: Louis Leconte, Matthieu Jonckheere, Sergey Samsonov, Eric Moulines

    Abstract: We study asynchronous federated learning mechanisms with nodes having potentially different computational speeds. In such an environment, each node is allowed to work on models with potential delays and contribute to updates to the central server at its own pace. Existing analyses of such algorithms typically depend on intractable quantities such as the maximum node delay and do not consider the u… ▽ More

    Submitted 12 February, 2024; originally announced May 2024.

  4. arXiv:2402.04114  [pdf, other

    stat.ML cs.LG math.OC

    SCAFFLSA: Taming Heterogeneity in Federated Linear Stochastic Approximation and TD Learning

    Authors: Paul Mangold, Sergey Samsonov, Safwan Labbi, Ilya Levin, Reda Alami, Alexey Naumov, Eric Moulines

    Abstract: In this paper, we analyze the sample and communication complexity of the federated linear stochastic approximation (FedLSA) algorithm. We explicitly quantify the effects of local training with agent heterogeneity. We show that the communication complexity of FedLSA scales polynomially with the inverse of the desired accuracy $ε$. To overcome this, we propose SCAFFLSA a new variant of FedLSA that u… ▽ More

    Submitted 27 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: now with linear speed-up!

  5. arXiv:2310.14286  [pdf, ps, other

    stat.ML cs.LG math.OC

    Improved High-Probability Bounds for the Temporal Difference Learning Algorithm via Exponential Stability

    Authors: Sergey Samsonov, Daniil Tiapkin, Alexey Naumov, Eric Moulines

    Abstract: In this paper we consider the problem of obtaining sharp bounds for the performance of temporal difference (TD) methods with linear function approximation for policy evaluation in discounted Markov decision processes. We show that a simple algorithm with a universal and instance-independent step size together with Polyak-Ruppert tail averaging is sufficient to obtain near-optimal variance and bias… ▽ More

    Submitted 15 June, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

    Comments: Accepted to COLT-2024

    MSC Class: 62L20; 60J20

  6. arXiv:2305.15938  [pdf, ps, other

    math.OC cs.LG stat.ML

    First Order Methods with Markovian Noise: from Acceleration to Variational Inequalities

    Authors: Aleksandr Beznosikov, Sergey Samsonov, Marina Sheshukova, Alexander Gasnikov, Alexey Naumov, Eric Moulines

    Abstract: This paper delves into stochastic optimization problems that involve Markovian noise. We present a unified approach for the theoretical analysis of first-order gradient methods for stochastic optimization and variational inequalities. Our approach covers scenarios for both non-convex and strongly convex minimization problems. To achieve an optimal (linear) dependence on the mixing time of the unde… ▽ More

    Submitted 30 March, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Appears in: Advances in Neural Information Processing Systems 36 (NeurIPS 2023). 41 pages, 3 algorithms, 2 tables

    Journal ref: https://proceedings.neurips.cc/paper_files/paper/2023/hash/8c3e38ce55a0fa44bc325bc6fdb7f4e5-Abstract-Conference.html

  7. arXiv:2304.01111  [pdf, ps, other

    math.ST cs.LG math.PR stat.ME stat.ML

    Theoretical guarantees for neural control variates in MCMC

    Authors: Denis Belomestny, Artur Goldman, Alexey Naumov, Sergey Samsonov

    Abstract: In this paper, we propose a variance reduction approach for Markov chains based on additive control variates and the minimization of an appropriate estimate for the asymptotic variance. We focus on the particular case when control variates are represented as deep neural networks. We derive the optimal convergence rate of the asymptotic variance under various ergodicity assumptions on the underlyin… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    MSC Class: 65C40; 62-08

  8. arXiv:2303.05838  [pdf, ps, other

    math.PR math.ST stat.ML

    Rosenthal-type inequalities for linear statistics of Markov chains

    Authors: Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov, Marina Sheshukova

    Abstract: In this paper, we establish novel deviation bounds for additive functionals of geometrically ergodic Markov chains similar to Rosenthal and Bernstein inequalities for sums of independent random variables. We pay special attention to the dependence of our bounds on the mixing time of the corresponding chain. More precisely, we establish explicit bounds that are linked to the constants from the mart… ▽ More

    Submitted 28 June, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    MSC Class: 60E15; 60J20; 65C40

  9. arXiv:2210.01606  [pdf, other

    physics.plasm-ph physics.comp-ph

    Opacity of relativistically underdense plasmas for extremely intense laser pulses

    Authors: M. A. Serebryakov, A. S. Samsonov, E. N. Nerush, I. Yu. Kostyukov

    Abstract: It is generally believed that relativistically underdense plasma is transparent for intense laser radiation. However, particle-in-cell simulations reveal abnormal laser field absorption above the intensity threshold about~$3 \times 10^{24}~\mathrm{W}\,\mathrm{cm}^{-2}$ for the wavelength of $1~μ\mathrm{m}$. Above the threshold, the further increase of the laser intensity doesn't lead to the increa… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: 8 pages, 3 figures

  10. arXiv:2208.00673  [pdf, other

    physics.plasm-ph

    High-order corrections to the radiation-free dynamics of an electron in the strongly radiation-dominated regime

    Authors: A. S. Samsonov, E. N. Nerush, I. Yu. Kostyukov

    Abstract: A system of reduced equations is proposed for the electron motion in the strongly-radiation dominated regime for an arbitrary electromagnetic field configuration. The developed approach is used to analyze various scenarios of an electron dynamics in the strongly-radiation dominated regime: motion in rotating electric and magnetic fields, longitudinal acceleration in a plane wave and in a plasma wa… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

  11. arXiv:2207.06364  [pdf, other

    stat.ML cs.LG stat.CO

    BR-SNIS: Bias Reduced Self-Normalized Importance Sampling

    Authors: Gabriel Cardoso, Sergey Samsonov, Achille Thin, Eric Moulines, Jimmy Olsson

    Abstract: Importance Sampling (IS) is a method for approximating expectations under a target distribution using independent samples from a proposal distribution and the associated importance weights. In many applications, the target distribution is known only up to a normalization constant, in which case self-normalized IS (SNIS) can be used. While the use of self-normalization can have a positive effect on… ▽ More

    Submitted 13 September, 2022; v1 submitted 13 July, 2022; originally announced July 2022.

  12. arXiv:2207.04475  [pdf, ps, other

    stat.ML cs.LG math.PR math.ST

    Finite-time High-probability Bounds for Polyak-Ruppert Averaged Iterates of Linear Stochastic Approximation

    Authors: Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov

    Abstract: This paper provides a finite-time analysis of linear stochastic approximation (LSA) algorithms with fixed step size, a core method in statistics and machine learning. LSA is used to compute approximate solutions of a $d$-dimensional linear system $\bar{\mathbf{A}} θ= \bar{\mathbf{b}}$ for which $(\bar{\mathbf{A}}, \bar{\mathbf{b}})$ can only be estimated by (asymptotically) unbiased observations… ▽ More

    Submitted 29 March, 2023; v1 submitted 10 July, 2022; originally announced July 2022.

    MSC Class: 62L20; 60J20

  13. arXiv:2206.09527  [pdf, other

    math.NA math.ST stat.ML

    Simultaneous approximation of a smooth function and its derivatives by deep neural networks with piecewise-polynomial activations

    Authors: Denis Belomestny, Alexey Naumov, Nikita Puchkin, Sergey Samsonov

    Abstract: This paper investigates the approximation properties of deep neural networks with piecewise-polynomial activation functions. We derive the required depth, width, and sparsity of a deep neural network to approximate any Hölder smooth function up to a given approximation error in Hölder norms in such a way that all weights of this neural network are bounded by $1$. The latter feature is essential to… ▽ More

    Submitted 2 December, 2022; v1 submitted 19 June, 2022; originally announced June 2022.

    Comments: 28 pages

    MSC Class: 41A25; 41A15; 41A28; 68T07

  14. arXiv:2205.07704  [pdf, other

    stat.ML cs.LG

    From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses

    Authors: Daniil Tiapkin, Denis Belomestny, Eric Moulines, Alexey Naumov, Sergey Samsonov, Yunhao Tang, Michal Valko, Pierre Menard

    Abstract: We propose the Bayes-UCBVI algorithm for reinforcement learning in tabular, stage-dependent, episodic Markov decision process: a natural extension of the Bayes-UCB algorithm by Kaufmann et al. (2012) for multi-armed bandits. Our method uses the quantile of a Q-value function posterior as upper confidence bound on the optimal Q-value function. For Bayes-UCBVI, we prove a regret bound of order… ▽ More

    Submitted 22 June, 2022; v1 submitted 16 May, 2022; originally announced May 2022.

  15. arXiv:2111.02702  [pdf, other

    stat.ML cs.LG

    Local-Global MCMC kernels: the best of both worlds

    Authors: Sergey Samsonov, Evgeny Lagutin, Marylou Gabrié, Alain Durmus, Alexey Naumov, Eric Moulines

    Abstract: Recent works leveraging learning to enhance sampling have shown promising results, in particular by designing effective non-local moves and global proposals. However, learning accuracy is inevitably limited in regions where little data is available such as in the tails of distributions as well as in high-dimensional problems. In the present paper we study an Explore-Exploit Markov chain Monte Carl… ▽ More

    Submitted 4 October, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: arXiv admin note: text overlap with arXiv:1111.5421 by other authors

  16. arXiv:2109.00331  [pdf, ps, other

    math.PR

    Probability and moment inequalities for additive functionals of geometrically ergodic Markov chains

    Authors: Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov

    Abstract: In this paper, we establish moment and Bernstein-type inequalities for additive functionals of geometrically ergodic Markov chains. These inequalities extend the corresponding inequalities for independent random variables. Our conditions cover Markov chains converging geometrically to the stationary distribution either in $V$-norms or in weighted Wasserstein distances. Our inequalities apply to un… ▽ More

    Submitted 15 June, 2023; v1 submitted 1 September, 2021; originally announced September 2021.

    MSC Class: 60E15; 60J20; 65C40

  17. arXiv:2107.04787  [pdf, other

    physics.acc-ph physics.plasm-ph

    Beamstrahlung-enhanced disruption in beam-beam interaction

    Authors: A. S. Samsonov, E. N. Nerush, I. Yu. Kostyukov, M. Filipovic, C. Baumann, A. Pukhov

    Abstract: The radiation reaction (beamstrahlung) effect on particle dynamics during interaction of oppositely charged beams is studied. It is shown that the beam focusing can be strongly enhanced due to beamstrahlung. An approximate analytical solution of the motion equation including the radiation reaction force is derived. The disruption parameter is calculated for classical and quantum regime of beamstra… ▽ More

    Submitted 10 July, 2021; originally announced July 2021.

  18. arXiv:2106.01257  [pdf, ps, other

    stat.ML cs.LG math.PR math.ST

    Tight High Probability Bounds for Linear Stochastic Approximation with Fixed Stepsize

    Authors: Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov, Kevin Scaman, Hoi-To Wai

    Abstract: This paper provides a non-asymptotic analysis of linear stochastic approximation (LSA) algorithms with fixed stepsize. This family of methods arises in many machine learning tasks and is used to obtain approximate solutions of a linear system $\bar{A}θ= \bar{b}$ for which $\bar{A}$ and $\bar{b}$ can only be accessed through random estimates $\{({\bf A}_n, {\bf b}_n): n \in \mathbb{N}^*\}$. Our ana… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: 21 pages

  19. arXiv:2105.02135  [pdf, other

    cs.LG math.OC

    UVIP: Model-Free Approach to Evaluate Reinforcement Learning Algorithms

    Authors: D. Belomestny, I. Levin, E. Moulines, A. Naumov, S. Samsonov, V. Zorina

    Abstract: Policy evaluation is an important instrument for the comparison of different algorithms in Reinforcement Learning (RL). Yet even a precise knowledge of the value function $V^π$ corresponding to a policy $π$ does not provide reliable information on how far is the policy $π$ from the optimal one. We present a novel model-free upper value iteration procedure $({\sf UVIP})$ that allows us to estimate… ▽ More

    Submitted 3 June, 2021; v1 submitted 5 May, 2021; originally announced May 2021.

  20. arXiv:2102.00199  [pdf, ps, other

    math.ST stat.ML

    Rates of convergence for density estimation with generative adversarial networks

    Authors: Nikita Puchkin, Sergey Samsonov, Denis Belomestny, Eric Moulines, Alexey Naumov

    Abstract: In this work we undertake a thorough study of the non-asymptotic properties of the vanilla generative adversarial networks (GANs). We prove an oracle inequality for the Jensen-Shannon (JS) divergence between the underlying density $\mathsf{p}^*$ and the GAN estimate with a significantly better statistical error term compared to the previously known results. The advantage of our bound becomes clear… ▽ More

    Submitted 25 January, 2024; v1 submitted 30 January, 2021; originally announced February 2021.

    Comments: To appear in Journal of Machine Learning Research

  21. arXiv:2102.00185  [pdf, ps, other

    stat.ML cs.LG math.PR math.ST

    On the Stability of Random Matrix Product with Markovian Noise: Application to Linear Stochastic Approximation and TD Learning

    Authors: Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov, Hoi-To Wai

    Abstract: This paper studies the exponential stability of random matrix products driven by a general (possibly unbounded) state space Markov chain. It is a cornerstone in the analysis of stochastic algorithms in machine learning (e.g. for parameter tracking in online learning or reinforcement learning). The existing results impose strong conditions such as uniform boundedness of the matrix-valued functions… ▽ More

    Submitted 30 January, 2021; originally announced February 2021.

  22. arXiv:2010.14116  [pdf, other

    physics.plasm-ph

    Hydrodynamical model of QED cascade expansion in an extremely strong laser pulse

    Authors: A. S. Samsonov, I. Yu. Kostyukov, E. N. Nerush

    Abstract: Development of the self-sustained quantum-electrodynamical (QED) cascade in a single strong laser pulse is studied analytically and numerically. The hydrodynamical approach is used to construct the analytical model of the cascade evolution, which includes the key features of the cascade observed in 3D QED particle-in-cell (QED-PIC) simulations such as the magnetic field predominance in the cascade… ▽ More

    Submitted 27 October, 2020; originally announced October 2020.

  23. arXiv:2008.06858  [pdf, other

    math.ST stat.CO

    Variance reduction for dependent sequences with applications to Stochastic Gradient MCMC

    Authors: D. Belomestny, L. Iosipoi, E. Moulines, A. Naumov, S. Samsonov

    Abstract: In this paper we propose a novel and practical variance reduction approach for additive functionals of dependent sequences. Our approach combines the use of control variates with the minimisation of an empirical variance estimate. We analyse finite sample properties of the proposed method and derive finite-time bounds of the excess asymptotic variance to zero. We apply our methodology to Stochasti… ▽ More

    Submitted 16 August, 2020; originally announced August 2020.

    MSC Class: 60J20; 65C40; 65C60

  24. arXiv:1910.03643  [pdf, other

    math.ST cs.LG math.PR stat.CO stat.ML

    Variance reduction for Markov chains with application to MCMC

    Authors: D. Belomestny, L. Iosipoi, E. Moulines, A. Naumov, S. Samsonov

    Abstract: In this paper we propose a novel variance reduction approach for additive functionals of Markov chains based on minimization of an estimate for the asymptotic variance of these functionals over suitable classes of control variates. A distinctive feature of the proposed approach is its ability to significantly reduce the overall finite sample variance. This feature is theoretically demonstrated by… ▽ More

    Submitted 15 February, 2020; v1 submitted 8 October, 2019; originally announced October 2019.

  25. arXiv:1903.07373  [pdf, other

    stat.CO stat.ML

    Variance reduction for additive functional of Markov chains via martingale representations

    Authors: D. Belomestny, E. Moulines, S. Samsonov

    Abstract: In this paper we propose an efficient variance reduction approach for additive functionals of Markov chains relying on a novel discrete time martingale representation. Our approach is fully non-asymptotic and does not require the knowledge of the stationary distribution (and even any type of ergodicity) or specific structure of the underlying density. By rigorously analyzing the convergence proper… ▽ More

    Submitted 21 December, 2021; v1 submitted 18 March, 2019; originally announced March 2019.

    MSC Class: 60G40

  26. arXiv:1809.06115  [pdf, other

    physics.plasm-ph

    Laser-driven vacuum breakdown waves

    Authors: A. S. Samsonov, E. N. Nerush, I. Yu. Kostyukov

    Abstract: It is demonstrated by three-dimensional quantum electrodynamics --- particle-in-cell (QED-PIC) simulations that vacuum breakdown wave in the form of QED cascade front can propagate in an extremely intense plane electromagnetic wave. The result disproves the statement that the self-sustained cascading is not possible in a plane wave configuration. In the simulations the cascade initiates during las… ▽ More

    Submitted 27 June, 2019; v1 submitted 17 September, 2018; originally announced September 2018.

    Comments: 12 pages, 8 figures; many changes in comparison with v1

  27. Asymptotic electron motion in strong radiation-dominated regime

    Authors: A. S. Samsonov, E. N. Nerush, I. Yu. Kostyukov

    Abstract: We study electron motion in electromagnetic (EM) fields in the radiation-dominated regime. It is shown that the electron trajectories become close to some asymptotic trajectories in the strong field limit. The description of the electron dynamics by this asymptotic trajectories significantly differs from the ponderomotive description that is barely applicable in the radiation-dominated regime. The… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

    Comments: 14 pages, 6 figures, sent to Phys. Rev. A

    Journal ref: Phys. Rev. A 98, 053858 (2018)