Showing 1–2 of 2 results for author: Rontsis, N

Search v0.5.6 released 2020-02-24

arXiv:2008.03273 [pdf, other]

cs.LG eess.SY stat.ML

SafePILCO: a software tool for safe and data-efficient policy synthesis

Authors: Kyriakos Polymenakos, Nikitas Rontsis, Alessandro Abate, Stephen Roberts

Abstract: SafePILCO is a software tool for safe and data-efficient policy search with reinforcement learning. It extends the known PILCO algorithm, originally written in MATLAB, to support safe learning. We provide a Python implementation and leverage existing libraries that allow the codebase to remain short and modular, which is appropriate for wider use by the verification, reinforcement learning, and co… ▽ More SafePILCO is a software tool for safe and data-efficient policy search with reinforcement learning. It extends the known PILCO algorithm, originally written in MATLAB, to support safe learning. We provide a Python implementation and leverage existing libraries that allow the codebase to remain short and modular, which is appropriate for wider use by the verification, reinforcement learning, and control communities. △ Less

Submitted 7 August, 2020; originally announced August 2020.

Comments: Shorter Version published as a software tool demonstration at QEST 2020
arXiv:1910.05295 [pdf, other]

math.OC cs.DS

Optimal Approximation of Doubly Stochastic Matrices

Authors: Nikitas Rontsis, Paul J. Goulart

Abstract: We consider the least-squares approximation of a matrix C in the set of doubly stochastic matrices with the same sparsity pattern as C. Our approach is based on applying the well-known Alternating Direction Method of Multipliers (ADMM) to a reformulation of the original problem. Our resulting algorithm requires an initial Cholesky factorization of a positive definite matrix that has the same spars… ▽ More We consider the least-squares approximation of a matrix C in the set of doubly stochastic matrices with the same sparsity pattern as C. Our approach is based on applying the well-known Alternating Direction Method of Multipliers (ADMM) to a reformulation of the original problem. Our resulting algorithm requires an initial Cholesky factorization of a positive definite matrix that has the same sparsity pattern as C + I followed by simple iterations whose complexity is linear in the number of nonzeros in C, thus ensuring excellent scalability and speed. We demonstrate the advantages of our approach in a series of experiments on problems with up to 82 million nonzeros; these include normalizing large scale matrices arising from the 3D structure of the human genome, clustering applications, and the SuiteSparse matrix library. Overall, our experiments illustrate the outstanding scalability of our algorithm; matrices with millions of nonzeros can be approximated in a few seconds on modest desktop computing hardware. △ Less

Submitted 11 October, 2019; originally announced October 2019.

Search v0.5.6 released 2020-02-24