Skip to main content

Showing 1–26 of 26 results for author: Spiliopoulos, K

Searching in archive stat. Search in all archives.
.
  1. arXiv:2403.07124  [pdf, other

    stat.ME cs.SI

    Stochastic gradient descent-based inference for dynamic network models with attractors

    Authors: Hancong Pan, Xiao**g Zhu, Cantay Caliskan, Dino P. Christenson, Konstantinos Spiliopoulos, Dylan Walker, Eric D. Kolaczyk

    Abstract: In Coevolving Latent Space Networks with Attractors (CLSNA) models, nodes in a latent space represent social actors, and edges indicate their dynamic interactions. Attractors are added at the latent level to capture the notion of attractive and repulsive forces between nodes, borrowing from dynamical systems theory. However, CLSNA reliance on MCMC estimation makes scaling difficult, and the requir… ▽ More

    Submitted 20 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  2. arXiv:2308.14555  [pdf, other

    cs.LG math.PR stat.ML

    Kernel Limit of Recurrent Neural Networks Trained on Ergodic Data Sequences

    Authors: Samuel Chun-Hei Lam, Justin Sirignano, Konstantinos Spiliopoulos

    Abstract: Mathematical methods are developed to characterize the asymptotics of recurrent neural networks (RNN) as the number of hidden units, data samples in the sequence, hidden state updates, and training steps simultaneously grow to infinity. In the case of an RNN with a simplified weight matrix, we prove the convergence of the RNN to the solution of an infinite-dimensional ODE coupled with the fixed po… ▽ More

    Submitted 15 May, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Major revision for lemma 7.1

    MSC Class: 68T07 (Primary); 68T05; 60J20 (Secondary)

  3. arXiv:2302.07227  [pdf, other

    stat.ME math.PR stat.ML

    Transport map unadjusted Langevin algorithms: learning and discretizing perturbed samplers

    Authors: Benjamin J. Zhang, Youssef M. Marzouk, Konstantinos Spiliopoulos

    Abstract: Langevin dynamics are widely used in sampling high-dimensional, non-Gaussian distributions whose densities are known up to a normalizing constant. In particular, there is strong interest in unadjusted Langevin algorithms (ULA), which directly discretize Langevin dynamics to estimate expectations over the target distribution. We study the use of transport maps that approximately normalize a target… ▽ More

    Submitted 28 September, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: 28 pages, 12 figures

    MSC Class: 62D99; 60H35

  4. arXiv:2209.01018  [pdf, other

    cs.LG math.PR stat.AP stat.ML

    Normalization effects on deep neural networks

    Authors: Jiahui Yu, Konstantinos Spiliopoulos

    Abstract: We study the effect of normalization on the layers of deep neural networks of feed-forward type. A given layer $i$ with $N_{i}$ hidden units is allowed to be normalized by $1/N_{i}^{γ_{i}}$ with $γ_{i}\in[1/2,1]$ and we study the effect of the choice of the $γ_{i}$ on the statistical behavior of the neural network's output (such as variance) as well as on the test accuracy on the MNIST data set. W… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: arXiv admin note: text overlap with arXiv:2011.10487

    MSC Class: 60F05; 68T01; 60G99

  5. arXiv:2206.00646  [pdf, other

    math.PR math.OC stat.ME

    Importance sampling for stochastic reaction-diffusion equations in the moderate deviation regime

    Authors: Ioannis Gasteratos, Michael Salins, Konstantinos Spiliopoulos

    Abstract: We develop a provably efficient importance sampling scheme that estimates exit probabilities of solutions to small-noise stochastic reaction-diffusion equations from scaled neighborhoods of a stable equilibrium. The moderate deviation scaling allows for a local approximation of the nonlinear dynamics by their linearized version. In addition, we identify a finite-dimensional subspace where exits ta… ▽ More

    Submitted 22 October, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: Version to appear in Stochastics and Partial Differential Equations: Analysis and Computations. 46 pages

    MSC Class: 65C05; 60G99; 60F9

  6. arXiv:2109.13129  [pdf, other

    stat.AP stat.ME

    Disentangling positive and negative partisanship in social media interactions using a coevolving latent space network with attractors model

    Authors: Xiao**g Zhu, Cantay Caliskan, Dino P. Christenson, Konstantinos Spiliopoulos, Dylan Walker, Eric D. Kolaczyk

    Abstract: We develop a broadly applicable class of coevolving latent space network with attractors (CLSNA) models, where nodes represent individual social actors assumed to lie in an unknown latent space, edges represent the presence of a specified interaction between actors, and attractors are added in the latent level to capture the notion of attractive and repulsive forces. We apply the CLSNA models to u… ▽ More

    Submitted 13 August, 2022; v1 submitted 27 September, 2021; originally announced September 2021.

    Comments: revised version

  7. arXiv:2108.08247  [pdf, other

    stat.ME math.PR stat.AP stat.CO stat.ML

    Geometry-informed irreversible perturbations for accelerated convergence of Langevin dynamics

    Authors: Benjamin J. Zhang, Youssef M. Marzouk, Konstantinos Spiliopoulos

    Abstract: We introduce a novel geometry-informed irreversible perturbation that accelerates convergence of the Langevin algorithm for Bayesian computation. It is well documented that there exist perturbations to the Langevin dynamics that preserve its invariant measure while accelerating its convergence. Irreversible perturbations and reversible perturbations (such as Riemannian manifold Langevin dynamics (… ▽ More

    Submitted 1 September, 2022; v1 submitted 18 August, 2021; originally announced August 2021.

  8. arXiv:2011.10487  [pdf, other

    stat.ML cs.LG math.PR

    Normalization effects on shallow neural networks and related asymptotic expansions

    Authors: Jiahui Yu, Konstantinos Spiliopoulos

    Abstract: We consider shallow (single hidden layer) neural networks and characterize their performance when trained with stochastic gradient descent as the number of hidden units $N$ and gradient descent steps grow to infinity. In particular, we investigate the effect of different scaling schemes, which lead to different normalizations of the neural network, on the network's statistical output, closing the… ▽ More

    Submitted 1 June, 2022; v1 submitted 20 November, 2020; originally announced November 2020.

    Comments: Added link to code on GitHub: https://github.com/kspiliopoulos/NormalizationEffectsNeuralNetworks

    MSC Class: 60F05; 68T01; 60G99

    Journal ref: AIMS Journal on Foundations of Data Science, June 2021, Vol. 3, Issue 2, pp. 151-200

  9. arXiv:1911.07304  [pdf, ps, other

    cs.LG math.PR stat.ML

    Asymptotics of Reinforcement Learning with Neural Networks

    Authors: Justin Sirignano, Konstantinos Spiliopoulos

    Abstract: We prove that a single-layer neural network trained with the Q-learning algorithm converges in distribution to a random ordinary differential equation as the size of the model and the number of training steps become large. Analysis of the limit differential equation shows that it has a unique stationary solution which is the solution of the Bellman equation, thus giving the optimal control for the… ▽ More

    Submitted 2 April, 2021; v1 submitted 13 November, 2019; originally announced November 2019.

    Comments: arXiv admin note: text overlap with arXiv:1907.04108

  10. arXiv:1907.04108  [pdf, ps, other

    math.PR cs.LG stat.ML

    Scaling Limit of Neural Networks with the Xavier Initialization and Convergence to a Global Minimum

    Authors: Justin Sirignano, Konstantinos Spiliopoulos

    Abstract: We analyze single-layer neural networks with the Xavier initialization in the asymptotic regime of large numbers of hidden units and large numbers of stochastic gradient descent training steps. The evolution of the neural network during training can be viewed as a stochastic system and, using techniques from stochastic analysis, we prove the neural network converges in distribution to a random ODE… ▽ More

    Submitted 12 April, 2022; v1 submitted 9 July, 2019; originally announced July 2019.

    Comments: The results of this technical note have been extended and generalized in arXiv:1911.07304. In the present note the full details for the proof of the special case studied here are presented

  11. arXiv:1903.04440  [pdf, other

    math.PR stat.ML

    Mean Field Analysis of Deep Neural Networks

    Authors: Justin Sirignano, Konstantinos Spiliopoulos

    Abstract: We analyze multi-layer neural networks in the asymptotic regime of simultaneously (A) large network sizes and (B) large numbers of stochastic gradient descent training iterations. We rigorously establish the limiting behavior of the multi-layer neural network output. The limit procedure is valid for any number of hidden layers and it naturally also describes the limiting behavior of the training l… ▽ More

    Submitted 2 April, 2021; v1 submitted 11 March, 2019; originally announced March 2019.

  12. arXiv:1812.02127  [pdf, other

    stat.ME math.PR math.ST stat.AP stat.ML

    Information geometry for approximate Bayesian computation

    Authors: Konstantinos Spiliopoulos

    Abstract: The goal of this paper is to explore the basic Approximate Bayesian Computation (ABC) algorithm via the lens of information theory. ABC is a widely used algorithm in cases where the likelihood of the data is hard to work with or intractable, but one can simulate from it. We use relative entropy ideas to analyze the behavior of the algorithm as a function of the threshold parameter and of the size… ▽ More

    Submitted 12 August, 2019; v1 submitted 5 December, 2018; originally announced December 2018.

  13. arXiv:1808.09372  [pdf, ps, other

    math.PR math.ST stat.ML

    Mean Field Analysis of Neural Networks: A Central Limit Theorem

    Authors: Justin Sirignano, Konstantinos Spiliopoulos

    Abstract: We rigorously prove a central limit theorem for neural network models with a single hidden layer. The central limit theorem is proven in the asymptotic regime of simultaneously (A) large numbers of hidden units and (B) large numbers of stochastic gradient descent training iterations. Our result describes the neural network's fluctuations around its mean-field limit. The fluctuations have a Gaussia… ▽ More

    Submitted 3 June, 2019; v1 submitted 28 August, 2018; originally announced August 2018.

    MSC Class: 60F05; 60G57; 62M45

  14. arXiv:1805.10229  [pdf, ps, other

    math.PR math.OC stat.ME

    Importance sampling for slow-fast diffusions based on moderate deviations

    Authors: Matthew R. Morse, Konstantinos Spiliopoulos

    Abstract: We consider systems of slow--fast diffusions with small noise in the slow component. We construct provably logarithmic asymptotically optimal importance schemes for the estimation of rare events based on the moderate deviations principle. Using the subsolution approach we construct schemes and identify conditions under which the schemes will be asymptotically optimal. Moderate deviations--based im… ▽ More

    Submitted 6 January, 2020; v1 submitted 25 May, 2018; originally announced May 2018.

  15. arXiv:1710.04273  [pdf, ps, other

    math.PR math.ST q-fin.CP stat.ML

    Stochastic Gradient Descent in Continuous Time: A Central Limit Theorem

    Authors: Justin Sirignano, Konstantinos Spiliopoulos

    Abstract: Stochastic gradient descent in continuous time (SGDCT) provides a computationally efficient method for the statistical learning of continuous-time models, which are widely used in science, engineering, and finance. The SGDCT algorithm follows a (noisy) descent direction along a continuous stream of data. The parameter updates occur in continuous time and satisfy a stochastic differential equation.… ▽ More

    Submitted 17 June, 2019; v1 submitted 11 October, 2017; originally announced October 2017.

  16. arXiv:1709.02223  [pdf, other

    math.PR math.ST stat.AP stat.ME

    Discrete-Time Statistical Inference for Multiscale Diffusions

    Authors: Siragan Gailus, Konstantinos Spiliopoulos

    Abstract: We study statistical inference for small-noise-perturbed multiscale dynamical systems under the assumption that we observe a single time series from the slow process only. We construct estimators for both averaging and homogenization regimes, based on an appropriate misspecified model motivated by a second-order stochastic Taylor expansion of the slow process with respect to a function of the time… ▽ More

    Submitted 11 September, 2018; v1 submitted 7 September, 2017; originally announced September 2017.

  17. arXiv:1708.07469  [pdf, other

    q-fin.MF math.NA q-fin.CP stat.ML

    DGM: A deep learning algorithm for solving partial differential equations

    Authors: Justin Sirignano, Konstantinos Spiliopoulos

    Abstract: High-dimensional PDEs have been a longstanding computational challenge. We propose to solve high-dimensional PDEs by approximating the solution with a deep neural network which is trained to satisfy the differential operator, initial condition, and boundary conditions. Our algorithm is meshfree, which is key since meshes become infeasible in higher dimensions. Instead of forming a mesh, the neural… ▽ More

    Submitted 5 September, 2018; v1 submitted 24 August, 2017; originally announced August 2017.

    Comments: Deep learning, machine learning, partial differential equations

  18. arXiv:1707.08868  [pdf, other

    math.PR math.OC stat.ME

    Importance sampling for metastable and multiscale dynamical systems

    Authors: Konstantinos Spiliopoulos

    Abstract: In this article, we address the issues that come up in the design of importance sampling schemes for rare events associated to stochastic dynamical systems. We focus on the issue of metastability and on the effect of multiple scales. We discuss why seemingly reasonable schemes that follow large deviations optimal paths may perform poorly in practice, even though they are asymptotically optimal. Pr… ▽ More

    Submitted 27 July, 2017; originally announced July 2017.

    Comments: Will appear as a chapter in Springer book

  19. arXiv:1702.01777  [pdf, other

    stat.ME math.PR math.ST

    Optimal Scaling of the MALA algorithm with Irreversible Proposals for Gaussian targets

    Authors: Michela Ottobre, Natesh S. Pillai, Konstantinos Spiliopoulos

    Abstract: It is well known in many settings that reversible Langevin diffusions in confining potentials converge to equilibrium exponentially fast. Adding irreversible perturbations to the drift of a Langevin diffusion that maintain the same invariant measure accelerates its convergence to stationarity. Many existing works thus advocate the use of such non-reversible dynamics for sampling. When implementing… ▽ More

    Submitted 1 July, 2019; v1 submitted 6 February, 2017; originally announced February 2017.

  20. arXiv:1611.05545  [pdf, other

    math.PR math.OC math.ST stat.ML

    Stochastic Gradient Descent in Continuous Time

    Authors: Justin Sirignano, Konstantinos Spiliopoulos

    Abstract: Stochastic gradient descent in continuous time (SGDCT) provides a computationally efficient method for the statistical learning of continuous-time models, which are widely used in science, engineering, and finance. The SGDCT algorithm follows a (noisy) descent direction along a continuous stream of data. SGDCT performs an online parameter update in continuous time, with the parameter updates… ▽ More

    Submitted 29 October, 2017; v1 submitted 16 November, 2016; originally announced November 2016.

  21. arXiv:1609.04365  [pdf, ps, other

    math.PR math.OC stat.ME

    Rare event simulation via importance sampling for linear SPDE's

    Authors: Michael Salins, Konstantinos Spiliopoulos

    Abstract: The goal of this paper is to develop provably efficient importance sampling Monte Carlo methods for the estimation of rare events within the class of linear stochastic partial differential equations (SPDEs). We find that if a spectral gap of appropriate size exists, then one can identify a lower dimensional manifold where the rare event takes place. This allows one to build importance sampling cha… ▽ More

    Submitted 4 May, 2017; v1 submitted 14 September, 2016; originally announced September 2016.

    MSC Class: 65C05; 60G99; 60F9

  22. arXiv:1607.06158  [pdf, other

    math.PR math.ST q-fin.ST stat.ME

    Dimension Reduction in Statistical Estimation of Partially Observed Multiscale Processes

    Authors: Andrew Papanicolaou, Konstantinos Spiliopoulos

    Abstract: We consider partially observed multiscale diffusion models that are specified up to an unknown vector parameter. We establish for a very general class of test functions that the filter of the original model converges to a filter of reduced dimension. Then, this result is used to justify statistical estimation for the unknown parameters of interest based on the model of reduced dimension but using… ▽ More

    Submitted 26 November, 2017; v1 submitted 20 July, 2016; originally announced July 2016.

    Comments: SIAM Journal of Uncertainty Quantification, 2017

    MSC Class: 93E10; 93E11; 93C70; 62M07; 62M86

  23. arXiv:1606.09539  [pdf, ps, other

    math.NA math.PR stat.ME

    Analysis of multiscale integrators for multiple attractors and irreversible Langevin samplers

    Authors: Jianfeng Lu, Konstantinos Spiliopoulos

    Abstract: We study multiscale integrator numerical schemes for a class of stiff stochastic differential equations (SDEs). We consider multiscale SDEs with potentially multiple attractors that behave as diffusions on graphs as the stiffness parameter goes to its limit. Classical numerical discretization schemes, such as the Euler-Maruyama scheme, become unstable as the stiffness parameter converges to its li… ▽ More

    Submitted 9 October, 2018; v1 submitted 30 June, 2016; originally announced June 2016.

  24. arXiv:1601.08118  [pdf, ps, other

    math.PR math-ph stat.ME

    Improving the convergence of reversible samplers

    Authors: Luc Rey-Bellet, Konstantinos Spiliopoulos

    Abstract: In Monte-Carlo methods the Markov processes used to sample a given target distribution usually satisfy detailed balance, i.e. they are time-reversible. However, relatively recent results have demonstrated that appropriate reversible and irreversible perturbations can accelerate convergence to equilibrium. In this paper we present some general design principles which apply to general Markov process… ▽ More

    Submitted 9 June, 2016; v1 submitted 29 January, 2016; originally announced January 2016.

    Comments: Final version will appear in the Journal of Statistical Physics

  25. arXiv:1508.02651  [pdf, other

    stat.ME math.ST stat.CO

    Sequential Monte Carlo for fractional Stochastic Volatility Models

    Authors: Alexandra Chronopoulou, Konstantinos Spiliopoulos

    Abstract: In this paper we consider a fractional stochastic volatility model, that is a model in which the volatility may exhibit a long-range dependent or a rough/antipersistent behavior. We propose a dynamic sequential Monte Carlo methodology that is applicable to both long memory and antipersistent processes in order to estimate the volatility as well as the unknown parameters of the model. We establish… ▽ More

    Submitted 25 February, 2017; v1 submitted 11 August, 2015; originally announced August 2015.

  26. arXiv:1410.0386  [pdf, ps, other

    math.PR stat.ME

    Rare event simulation for multiscale diffusions in random environments

    Authors: Konstantinos Spiliopoulos

    Abstract: We consider systems of stochastic differential equations with multiple scales and small noise and assume that the coefficients of the equations are ergodic and stationary random fields. Our goal is to construct provably-efficient importance sampling Monte Carlo methods that allow efficient computation of rare event probabilities or expectations of functionals that can be associated with rare event… ▽ More

    Submitted 28 September, 2015; v1 submitted 1 October, 2014; originally announced October 2014.

    Comments: Final version, paper to appear in SIAM Journal Multiscale Modelling and Simulation

    MSC Class: 60F10; 60F05; 60G60