Skip to main content

Showing 1–10 of 10 results for author: Zygalakis, K C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.02002  [pdf, ps, other

    math.OC cs.LG math.NA

    A Variational Perspective on High-Resolution ODEs

    Authors: Hoomaan Maskan, Konstantinos C. Zygalakis, Alp Yurtsever

    Abstract: We consider unconstrained minimization of smooth convex functions. We propose a novel variational perspective using forced Euler-Lagrange equation that allows for studying high-resolution ODEs. Through this, we obtain a faster convergence rate for gradient norm minimization using Nesterov's accelerated gradient method. Additionally, we show that Nesterov's method can be interpreted as a rate-match… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: 37th Annual Conference on Neural Information Processing Systems (NeurIPS 2023)

  2. arXiv:2308.09460  [pdf, other

    stat.CO cs.CV math.NA stat.ML

    Accelerated Bayesian imaging by relaxed proximal-point Langevin sampling

    Authors: Teresa Klatzer, Paul Dobson, Yoann Altmann, Marcelo Pereyra, Jesús María Sanz-Serna, Konstantinos C. Zygalakis

    Abstract: This paper presents a new accelerated proximal Markov chain Monte Carlo methodology to perform Bayesian inference in imaging inverse problems with an underlying convex geometry. The proposed strategy takes the form of a stochastic relaxed proximal-point iteration that admits two complementary interpretations. For models that are smooth or regularised by Moreau-Yosida smoothing, the algorithm is eq… ▽ More

    Submitted 12 January, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

    Comments: 34 pages, 13 figures

    MSC Class: 65C40; 68U10; 62F15; 65C60; 65J22; 68W25

  3. Gaussian processes for Bayesian inverse problems associated with linear partial differential equations

    Authors: Tianming Bai, Aretha L. Teckentrup, Konstantinos C. Zygalakis

    Abstract: This work is concerned with the use of Gaussian surrogate models for Bayesian inverse problems associated with linear partial differential equations. A particular focus is on the regime where only a small amount of training data is available. In this regime the type of Gaussian prior used is of critical importance with respect to how well the surrogate model will perform in terms of Bayesian inver… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

  4. arXiv:2111.05715  [pdf, ps, other

    cs.SI math.DS math.NA

    A Hierarchy of Network Models Giving Bistability Under Triadic Closure

    Authors: Stefano Di Giovacchino, Desmond J. Higham, Konstantinos C. Zygalakis

    Abstract: Triadic closure describes the tendency for new friendships to form between individuals who already have friends in common. It has been argued heuristically that the triadic closure effect can lead to bistability in the formation of large-scale social interaction networks. Here, depending on the initial state and the transient dynamics, the system may evolve towards either of two long-time states.… ▽ More

    Submitted 10 November, 2021; originally announced November 2021.

    Comments: 20 pages, 9 figures

    MSC Class: 60J20; 60J74; 68R10

  5. arXiv:2104.12384  [pdf, ps, other

    stat.ML cs.LG math.NA math.PR

    Wasserstein distance estimates for the distributions of numerical approximations to ergodic stochastic differential equations

    Authors: J. M. Sanz-Serna, Konstantinos C. Zygalakis

    Abstract: We present a framework that allows for the non-asymptotic study of the $2$-Wasserstein distance between the invariant distribution of an ergodic stochastic differential equation and the distribution of its numerical approximation in the strongly log-concave case. This allows us to study in a unified way a number of different integrators proposed in the literature for the overdamped and underdamped… ▽ More

    Submitted 24 September, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

    Comments: 29 pages, 2 figures

    MSC Class: 65C40; 60H10; 60H35

  6. arXiv:2103.10182  [pdf, other

    stat.ME cs.CV eess.IV stat.ML

    Bayesian Imaging With Data-Driven Priors Encoded by Neural Networks: Theory, Methods, and Algorithms

    Authors: Matthew Holden, Marcelo Pereyra, Konstantinos C. Zygalakis

    Abstract: This paper proposes a new methodology for performing Bayesian inference in imaging inverse problems where the prior knowledge is available in the form of training data. Following the manifold hypothesis and adopting a generative modelling approach, we construct a data-driven prior that is supported on a sub-manifold of the ambient space, which we can learn from the training data by using a variati… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

  7. arXiv:2009.11262  [pdf, other

    cs.CV math.OC

    A Linear Transportation $\mathrm{L}^p$ Distance for Pattern Recognition

    Authors: Oliver M. Crook, Mihai Cucuringu, Tim Hurst, Carola-Bibiane Schönlieb, Matthew Thorpe, Konstantinos C. Zygalakis

    Abstract: The transportation $\mathrm{L}^p$ distance, denoted $\mathrm{TL}^p$, has been proposed as a generalisation of Wasserstein $\mathrm{W}^p$ distances motivated by the property that it can be applied directly to colour or multi-channelled images, as well as multivariate time-series without normalisation or mass constraints. These distances, as with $\mathrm{W}^p$, are powerful tools in modelling data… ▽ More

    Submitted 23 September, 2020; originally announced September 2020.

  8. arXiv:2009.00673  [pdf, other

    math.NA cs.LG math.OC

    The connections between Lyapunov functions for some optimization algorithms and differential equations

    Authors: J. M. Sanz-Serna, Konstantinos C. Zygalakis

    Abstract: In this manuscript, we study the properties of a family of second-order differential equations with dam**, its discretizations and their connections with accelerated optimization algorithms for $m$-strongly convex and $L$-smooth functions. In particular, using the Linear Matrix Inequality LMI framework developed by \emph{Fazlyab et. al. $(2018)$}, we derive analytically a (discrete) Lyapunov fun… ▽ More

    Submitted 11 January, 2021; v1 submitted 1 September, 2020; originally announced September 2020.

    Comments: 21 pages, 1 figure

    MSC Class: 65L06; 65L20; 90C25; 93C15

  9. arXiv:1911.05035   

    cs.LG math.NA math.OC stat.ML

    Constructing Gradient Controllable Recurrent Neural Networks Using Hamiltonian Dynamics

    Authors: Konstantin Rusch, John W. Pearson, Konstantinos C. Zygalakis

    Abstract: Recurrent neural networks (RNNs) have gained a great deal of attention in solving sequential learning problems. The learning of long-term dependencies, however, remains challenging due to the problem of a vanishing or exploding hidden states gradient. By exploring further the recently established connections between RNNs and dynamical systems we propose a novel RNN architecture, which we call a Ha… ▽ More

    Submitted 16 March, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

    Comments: Reasons: 1. theoretical result of bounding the gradient dynamics is highly important when tackling the exploding gradient problem. However, we only proved the boundedness in one dimension and cannot generalize to the higher dimensional case, as the Hamiltonian argument is not valid in the general higher dimensional case. 2. The only medium strong performance on the widely used sMNIST problem

  10. arXiv:1703.08816  [pdf, other

    cs.LG stat.ML

    Uncertainty quantification in graph-based classification of high dimensional data

    Authors: Andrea L. Bertozzi, Xiyang Luo, Andrew M. Stuart, Konstantinos C. Zygalakis

    Abstract: Classification of high dimensional data finds wide-ranging applications. In many of these applications equip** the resulting classification with a measure of uncertainty may be as important as the classification itself. In this paper we introduce, develop algorithms for, and investigate the properties of, a variety of Bayesian models for the task of binary classification; via the posterior distr… ▽ More

    Submitted 8 February, 2018; v1 submitted 26 March, 2017; originally announced March 2017.

    Comments: 33 pages, 14 figures