Skip to main content

Showing 1–5 of 5 results for author: Tzen, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.09532  [pdf, ps, other

    math.OC cs.LG

    Variational Principles for Mirror Descent and Mirror Langevin Dynamics

    Authors: Belinda Tzen, Anant Raj, Maxim Raginsky, Francis Bach

    Abstract: Mirror descent, introduced by Nemirovski and Yudin in the 1970s, is a primal-dual convex optimization method that can be tailored to the geometry of the optimization problem at hand through the choice of a strongly convex potential function. It arises as a basic primitive in a variety of applications, including large-scale optimization, machine learning, and control. This paper proposes a variatio… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: 6 pages

  2. arXiv:2002.01987  [pdf, ps, other

    cs.LG math.OC math.PR stat.ML

    Function approximation by neural nets in the mean-field regime: Entropic regularization and controlled McKean-Vlasov dynamics

    Authors: Belinda Tzen, Maxim Raginsky

    Abstract: We consider the problem of function approximation by two-layer neural nets with random weights that are "nearly Gaussian" in the sense of Kullback-Leibler divergence. Our setting is the mean-field limit, where the finite population of neurons in the hidden layer is replaced by a continuous ensemble. We show that the problem can be phrased as global minimization of a free energy functional on the s… ▽ More

    Submitted 22 June, 2024; v1 submitted 5 February, 2020; originally announced February 2020.

    Comments: 30 pages; note the change of title

  3. arXiv:1905.09883  [pdf, other

    cs.LG stat.ML

    Neural Stochastic Differential Equations: Deep Latent Gaussian Models in the Diffusion Limit

    Authors: Belinda Tzen, Maxim Raginsky

    Abstract: In deep latent Gaussian models, the latent variable is generated by a time-inhomogeneous Markov chain, where at each time step we pass the current state through a parametric nonlinear map, such as a feedforward neural net, and add a small independent Gaussian perturbation. This work considers the diffusion limit of such models, where the number of layers tends to infinity, while the step size and… ▽ More

    Submitted 27 October, 2019; v1 submitted 23 May, 2019; originally announced May 2019.

  4. arXiv:1903.01608  [pdf, ps, other

    math.PR cs.LG math.OC stat.ML

    Theoretical guarantees for sampling and inference in generative models with latent diffusions

    Authors: Belinda Tzen, Maxim Raginsky

    Abstract: We introduce and study a class of probabilistic generative models, where the latent object is a finite-dimensional diffusion process on a finite time interval and the observed variable is drawn conditionally on the terminal point of the diffusion. We make the following contributions: We provide a unified viewpoint on both sampling and variational inference in such generative models through the l… ▽ More

    Submitted 31 May, 2019; v1 submitted 4 March, 2019; originally announced March 2019.

    Comments: To appear in COLT 2019

  5. arXiv:1802.06439  [pdf, ps, other

    cs.LG math.OC math.PR stat.ML

    Local Optimality and Generalization Guarantees for the Langevin Algorithm via Empirical Metastability

    Authors: Belinda Tzen, Tengyuan Liang, Maxim Raginsky

    Abstract: We study the detailed path-wise behavior of the discrete-time Langevin algorithm for non-convex Empirical Risk Minimization (ERM) through the lens of metastability, adopting some techniques from Berglund and Gentz (2003. For a particular local optimum of the empirical risk, with an arbitrary initialization, we show that, with high probability, at least one of the following two events will occur:… ▽ More

    Submitted 5 June, 2018; v1 submitted 18 February, 2018; originally announced February 2018.

    Comments: 19 pages

    Journal ref: Proceedings of the 31st Conference on Learning Theory 75 (2018) 857-875