Skip to main content

Showing 1–32 of 32 results for author: Sootla, A

.
  1. arXiv:2209.04698  [pdf, other

    cs.LG

    Structured Q-learning For Antibody Design

    Authors: Alexander I. Cowen-Rivers, Philip John Gorinski, Aivar Sootla, Asif Khan, Liu Furui, Jun Wang, Jan Peters, Haitham Bou Ammar

    Abstract: Optimizing combinatorial structures is core to many real-world problems, such as those encountered in life sciences. For example, one of the crucial steps involved in antibody design is to find an arrangement of amino acids in a protein sequence that improves its binding with a pathogen. Combinatorial optimization of antibodies is difficult due to extremely large search spaces and non-linear objec… ▽ More

    Submitted 13 September, 2022; v1 submitted 10 September, 2022; originally announced September 2022.

  2. arXiv:2206.02675  [pdf, other

    cs.LG cs.AI

    Effects of Safety State Augmentation on Safe Exploration

    Authors: Aivar Sootla, Alexander I. Cowen-Rivers, Jun Wang, Haitham Bou Ammar

    Abstract: Safe exploration is a challenging and important problem in model-free reinforcement learning (RL). Often the safety cost is sparse and unknown, which unavoidably leads to constraint violations -- a phenomenon ideally to be avoided in safety-critical applications. We tackle this problem by augmenting the state-space with a safety state, which is nonnegative if and only if the constraint is satisfie… ▽ More

    Submitted 12 October, 2022; v1 submitted 6 June, 2022; originally announced June 2022.

    Comments: Published in Neurips 2022

  3. arXiv:2205.15953  [pdf, other

    cs.LG

    Timing is Everything: Learning to Act Selectively with Costly Actions and Budgetary Constraints

    Authors: David Mguni, Aivar Sootla, Juliusz Ziomek, Oliver Slumbers, Zipeng Dai, Kun Shao, Jun Wang

    Abstract: Many real-world settings involve costs for performing actions; transaction costs in financial systems and fuel costs being common examples. In these settings, performing actions at each time step quickly accumulates costs leading to vastly suboptimal outcomes. Additionally, repeatedly acting produces wear and tear and ultimately, damage. Determining \textit{when to act} is crucial for achieving su… ▽ More

    Submitted 4 June, 2023; v1 submitted 31 May, 2022; originally announced May 2022.

  4. arXiv:2205.15064  [pdf, other

    cs.LG

    SEREN: Knowing When to Explore and When to Exploit

    Authors: Changmin Yu, David Mguni, Dong Li, Aivar Sootla, Jun Wang, Neil Burgess

    Abstract: Efficient reinforcement learning (RL) involves a trade-off between "exploitative" actions that maximise expected reward and "explorative'" ones that sample unvisited states. To encourage exploration, recent approaches proposed adding stochasticity to actions, separating exploration and exploitation phases, or equating reduction in uncertainty with reward. However, these techniques do not necessari… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: text overlap with arXiv:2112.02618, arXiv:2103.09159, arXiv:2110.14468

  5. arXiv:2202.06558  [pdf, other

    cs.LG cs.AI

    Saute RL: Almost Surely Safe Reinforcement Learning Using State Augmentation

    Authors: Aivar Sootla, Alexander I. Cowen-Rivers, Taher Jafferjee, Ziyan Wang, David Mguni, Jun Wang, Haitham Bou-Ammar

    Abstract: Satisfying safety constraints almost surely (or with probability one) can be critical for the deployment of Reinforcement Learning (RL) in real-life applications. For example, plane landing and take-off should ideally occur with probability one. We address the problem by introducing Safety Augmented (Saute) Markov Decision Processes (MDPs), where the safety constraints are eliminated by augmenting… ▽ More

    Submitted 22 June, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: ICML 2022

  6. arXiv:2202.06557  [pdf, other

    cs.LG cs.AI

    Reinforcement Learning in Presence of Discrete Markovian Context Evolution

    Authors: Hang Ren, Aivar Sootla, Taher Jafferjee, Junxiao Shen, Jun Wang, Haitham Bou-Ammar

    Abstract: We consider a context-dependent Reinforcement Learning (RL) setting, which is characterized by: a) an unknown finite number of not directly observable contexts; b) abrupt (discontinuous) context changes occurring during an episode; and c) Markovian context evolution. We argue that this challenging case is often met in applications and we tackle it using a Bayesian approach and variational inferenc… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

    Comments: Accepted to ICLR 2022

  7. arXiv:2110.14468  [pdf, other

    cs.LG

    DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention

    Authors: David Mguni, Usman Islam, Yaqi Sun, Xiuling Zhang, Joel Jennings, Aivar Sootla, Changmin Yu, Ziyan Wang, Jun Wang, Yaodong Yang

    Abstract: Reinforcement learning (RL) involves performing exploratory actions in an unknown system. This can place a learning agent in dangerous and potentially catastrophic system states. Current approaches for tackling safe learning in RL simultaneously trade-off safe exploration and task fulfillment. In this paper, we introduce a new generation of RL solvers that learn to minimise safety violations while… ▽ More

    Submitted 1 March, 2023; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: arXiv admin note: text overlap with arXiv:2103.09159

  8. arXiv:2107.02474  [pdf, other

    stat.ML cs.LG

    Viscos Flows: Variational Schur Conditional Sampling With Normalizing Flows

    Authors: Vincent Moens, Aivar Sootla, Haitham Bou Ammar, Jun Wang

    Abstract: We present a method for conditional sampling for pre-trained normalizing flows when only part of an observation is available. We derive a lower bound to the conditioning variable log-probability using Schur complement properties in the spirit of Gaussian conditional sampling. Our derivation relies on partitioning flow's domain in such a way that the flow restrictions to subdomains remain bijective… ▽ More

    Submitted 15 October, 2021; v1 submitted 6 July, 2021; originally announced July 2021.

  9. Diagnosing and Preventing Instabilities in Recurrent Video Processing

    Authors: Thomas Tanay, Aivar Sootla, Matteo Maggioni, Puneet K. Dokania, Philip Torr, Ales Leonardis, Gregory Slabaugh

    Abstract: Recurrent models are a popular choice for video enhancement tasks such as video denoising or super-resolution. In this work, we focus on their stability as dynamical systems and show that they tend to fail catastrophically at inference time on long video sequences. To address this issue, we (1) introduce a diagnostic tool which produces input sequences optimized to trigger instabilities and that c… ▽ More

    Submitted 11 March, 2023; v1 submitted 10 October, 2020; originally announced October 2020.

    Journal ref: in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 45, no. 2, pp. 1594-1605, 1 Feb. 2023

  10. arXiv:2006.09436  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    SAMBA: Safe Model-Based & Active Reinforcement Learning

    Authors: Alexander I. Cowen-Rivers, Daniel Palenicek, Vincent Moens, Mohammed Abdullah, Aivar Sootla, Jun Wang, Haitham Ammar

    Abstract: In this paper, we propose SAMBA, a novel framework for safe reinforcement learning that combines aspects from probabilistic modelling, information theory, and statistics. Our method builds upon PILCO to enable active exploration using novel(semi-)metrics for out-of-sample Gaussian process evaluation optimised through a multi-objective problem that supports conditional-value-at-risk constraints. We… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

  11. On the Existence of Block-Diagonal Solutions to Lyapunov and $\mathcal{H}_{\infty}$ Riccati Inequalities

    Authors: Aivar Sootla, Yang Zheng, Antonis Papachristodoulou

    Abstract: In this paper, we describe sufficient conditions when block-diagonal solutions to Lyapunov and $\mathcal{H}_{\infty}$ Riccati inequalities exist. In order to derive our results, we define a new type of comparison systems, which are positive and are computed using the state-space matrices of the original (possibly nonpositive) systems. Computing the comparison system involves only the calculation o… ▽ More

    Submitted 6 October, 2019; originally announced October 2019.

    Comments: This is an extended technical report. The main results have been accepted for publication as a technical note in the IEEE Transactions on Automatic Control

  12. Block Factor-width-two Matrices and Their Applications to Semidefinite and Sum-of-squares Optimization

    Authors: Yang Zheng, Aivar Sootla, Antonis Papachristodoulou

    Abstract: Semidefinite and sum-of-squares (SOS) optimization are fundamental computational tools in many areas, including linear and nonlinear systems theory. However, the scale of problems that can be addressed reliably and efficiently is still limited. In this paper, we introduce a new notion of block factor-width-two matrices and build a new hierarchy of inner and outer approximations of the cone of posi… ▽ More

    Submitted 8 February, 2022; v1 submitted 24 September, 2019; originally announced September 2019.

    Comments: Accepted for publication as a regular paper at IEEE TAC. Code is available through https://github.com/zhengy09/SDPfw

  13. arXiv:1903.04938  [pdf, ps, other

    math.OC eess.SY

    Block Factor-Width-Two Matrices in Semidefinite Programming

    Authors: Aivar Sootla, Yang Zheng, Antonis Papachristodoulou

    Abstract: In this paper, we introduce a set of block factor-width-two matrices, which is a generalisation of factor-width-two matrices and is a subset of positive semidefinite matrices. The set of block factor-width-two matrices is a proper cone and we compute a closed-form expression for its dual cone. We use these cones to build hierarchies of inner and outer approximations of the cone of positive semidef… ▽ More

    Submitted 12 March, 2019; originally announced March 2019.

    Comments: To appear in European Control Conference, 2019

  14. arXiv:1803.05996  [pdf, ps, other

    math.OC math.DS

    Scalable analysis of linear networked systems via chordal decomposition

    Authors: Yang Zheng, Maryam Kamgarpour, Aivar Sootla, Antonis Papachristodoulou

    Abstract: This paper introduces a chordal decomposition approach for scalable analysis of linear networked systems, including stability, $\mathcal{H}_2$ and $\mathcal{H}_{\infty}$ performance. Our main strategy is to exploit any sparsity within these analysis problems and use chordal decomposition. We first show that Grone's and Agler's theorems can be generalized to block matrices with any partition. This… ▽ More

    Submitted 15 March, 2018; originally announced March 2018.

    Comments: 6 pages; to appear at ECC2018

    MSC Class: 93D05; 93D25; 93C05

  15. arXiv:1709.06809  [pdf, ps, other

    eess.SY math.OC

    Block-Diagonal Solutions to Lyapunov Inequalities and Generalisations of Diagonal Dominance

    Authors: Aivar Sootla, Yang Zheng, Antonis Papachristodoulou

    Abstract: Diagonally dominant matrices have many applications in systems and control theory. Linear dynamical systems with scaled diagonally dominant drift matrices, which include stable positive systems, allow for scalable stability analysis. For example, it is known that Lyapunov inequalities for this class of systems admit diagonal solutions. In this paper, we present an extension of scaled diagonally do… ▽ More

    Submitted 20 September, 2017; originally announced September 2017.

    Comments: 6 pages, to appear in Proceedings of the Conference on Decision and Control 2017

  16. Distributed Design for Decentralized Control using Chordal Decomposition and ADMM

    Authors: Yang Zheng, Maryam Kamgarpour, Aivar Sootla, Antonis Papachristodoulou

    Abstract: We propose a distributed design method for decentralized control by exploiting the underlying sparsity properties of the problem. Our method is based on chordal decomposition of sparse block matrices and the alternating direction method of multipliers (ADMM). We first apply a classical parameterization technique to restrict the optimal decentralized control into a convex problem that inherits the… ▽ More

    Submitted 3 August, 2019; v1 submitted 3 September, 2017; originally announced September 2017.

    Comments: 11 pages, 8 figures. Accepted for publication in the IEEE Transactions on Control of Network Systems

    Journal ref: IEEE Transactions on Control of Network Systems (Volume: 7, Issue: 2, June 2020)

  17. arXiv:1708.00232  [pdf, other

    eess.SY math.OC

    Pulse-Based Control Using Koopman Operator Under Parametric Uncertainty

    Authors: Aivar Sootla, Damien Ernst

    Abstract: In applications, such as biomedicine and systems/synthetic biology, technical limitations in actuation complicate implementation of time-varying control signals. In order to alleviate some of these limitations, it may be desirable to derive simple control policies, such as step functions with fixed magnitude and length (or temporal pulses). In this technical note, we further develop a recently pro… ▽ More

    Submitted 1 August, 2017; originally announced August 2017.

  18. An Optimal Control Formulation of Pulse-Based Control Using Koopman Operator

    Authors: Aivar Sootla, Alexandre Mauroy, Damien Ernst

    Abstract: In many applications, and in systems/synthetic biology, in particular, it is desirable to compute control policies that force the trajectory of a bistable system from one equilibrium (the initial point) to another equilibrium (the target point), or in other words to solve the switching problem. It was recently shown that, for monotone bistable systems, this problem admits easy-to-implement open-lo… ▽ More

    Submitted 28 June, 2018; v1 submitted 26 July, 2017; originally announced July 2017.

    Comments: corrected typos

    Journal ref: Automatica Volume 91, May 2018, Pages 217-224

  19. arXiv:1705.02853  [pdf, other

    eess.SY math.DS math.OC

    Geometric Properties of Isostables and Basins of Attraction of Monotone Systems

    Authors: Aivar Sootla, Alexandre Mauroy

    Abstract: In this paper, we study geometric properties of basins of attraction of monotone systems. Our results are based on a combination of monotone systems theory and spectral operator theory. We exploit the framework of the Koopman operator, which provides a linear infinite-dimensional description of nonlinear dynamical systems and spectral operator-theoretic notions such as eigenvalues and eigenfunctio… ▽ More

    Submitted 8 May, 2017; originally announced May 2017.

    Comments: 12 pages, to appear in IEEE Transaction on Automatic Control

  20. arXiv:1605.06252  [pdf, other

    eess.SY math.DS math.OC

    Sha** Pulses to Control Bistable Monotone Systems Using Koopman Operator

    Authors: Aivar Sootla, Alexandre Mauroy, Jorge Goncalves

    Abstract: In this paper, we further develop a recently proposed control method to switch a bistable system between its steady states using temporal pulses. The motivation for using pulses comes from biomedical and biological applications (e.g. synthetic biology), where it is generally difficult to build feedback control systems due to technical limitations in sensing and actuation. The original framework wa… ▽ More

    Submitted 20 May, 2016; originally announced May 2016.

    Comments: 7 pages

  21. arXiv:1603.07686  [pdf, other

    math.OC eess.SY

    On Existence of Solutions to Structured Lyapunov Inequalities

    Authors: Aviar Sootla, James Anderson

    Abstract: In this paper, we derive sufficient conditions on drift matrices under which block-diagonal solutions to Lyapunov inequalities exist. The motivation for the problem comes from a recently proposed basis pursuit algorithm. In particular, this algorithm can provide approximate solutions to optimisation programmes with constraints involving Lyapunov inequalities using linear or second order cone progr… ▽ More

    Submitted 24 March, 2016; originally announced March 2016.

    Comments: To appear in the Proceedings of the 2016 American Control Conference

  22. arXiv:1510.05784  [pdf, other

    math.OC eess.SY q-bio.QM

    Structured Projection-Based Model Reduction with Application to Stochastic Biochemical Networks

    Authors: Aivar Sootla, James Anderson

    Abstract: The Chemical Master Equation (CME) is well known to provide the highest resolution models of a biochemical reaction network. Unfortunately, even simulating the CME can be a challenging task. For this reason more simple approximations to the CME have been proposed. In this work we focus on one such model, the Linear Noise Approximation. Specifically, we consider implications of a recently proposed… ▽ More

    Submitted 20 October, 2015; originally announced October 2015.

    Comments: 13 pages; 7 figures; submitted to IEEE Transaction on Automatic Control

  23. arXiv:1510.01153  [pdf, other

    math.OC eess.SY math.DS

    Properties of Isostables and Basins of Attraction of Monotone Systems

    Authors: Aivar Sootla, Alexandre Mauroy

    Abstract: In this paper, we investigate geometric properties of monotone systems by studying their isostables and basins of attraction. Isostables are boundaries of specific forward-invariant sets defined by the so-called Koopman operator, which provides a linear infinite-dimensional description of a nonlinear system. First, we study the spectral properties of the Koopman operator and the associated semigro… ▽ More

    Submitted 22 March, 2016; v1 submitted 5 October, 2015; originally announced October 2015.

    Comments: 8 pages, 3 figures, contains material to appear in Proceedings of American Control Conference 2016

  24. arXiv:1510.01149  [pdf, other

    math.DS

    Operator-Theoretic Characterization of Eventually Monotone Systems

    Authors: Aivar Sootla, Alexandre Mauroy

    Abstract: Monotone systems are dynamical systems whose solutions preserve a partial order in the initial condition for all positive times. It stands to reason that some systems may preserve a partial order only after some initial transient. These systems are usually called eventually monotone. While monotone systems have a characterization in terms of their vector fields (i.e. Kamke-Muller condition), event… ▽ More

    Submitted 26 July, 2017; v1 submitted 5 October, 2015; originally announced October 2015.

    Comments: 13 pages

  25. arXiv:1509.08392  [pdf, ps, other

    math.OC eess.SY

    Properties of Eventually Positive Linear Input-Output Systems

    Authors: Aivar Sootla

    Abstract: In this paper, we consider the systems with trajectories originating in the nonnegative orthant becoming nonnegative after some finite time transient. First we consider dynamical systems (i.e., fully observable systems with no inputs), which we call eventually positive. We compute forward-invariant cones and Lyapunov functions for these systems. We then extend the notion of eventually positive sys… ▽ More

    Submitted 19 May, 2024; v1 submitted 28 September, 2015; originally announced September 2015.

  26. arXiv:1503.02557  [pdf, ps, other

    math.OC eess.SY

    On Monotonicity and Propagation of Order Properties

    Authors: Aivar Sootla

    Abstract: In this paper, a link between monotonicity of deterministic dynamical systems and propagation of order by Markov processes is established. The order propagation has received considerable attention in the literature, however, this notion is still not fully understood. The main contribution of this paper is a study of the order propagation in the deterministic setting, which potentially can provide… ▽ More

    Submitted 9 March, 2015; originally announced March 2015.

    Comments: Part of the paper is to appear in American Control Conference 2015

  27. arXiv:1409.6150  [pdf, other

    math.OC eess.SY q-bio.QM

    Sha** Pulses to Control Bistable Biological Systems

    Authors: Aivar Sootla, Diego Oyarzun, David Angeli, Guy-Bart Stan

    Abstract: In this paper we study how to shape temporal pulses to switch a bistable system between its stable steady states. Our motivation for pulse-based control comes from applications in synthetic biology, where it is generally difficult to implement real-time feedback control systems due to technical limitations in sensors and actuators. We show that for monotone bistable systems, the estimation of the… ▽ More

    Submitted 2 October, 2015; v1 submitted 22 September, 2014; originally announced September 2014.

    Comments: 14 pages, contains material from the paper in Proc Amer Control Conf 2015, (pp. 3138-3143) and "Sha** pulses to control bistable systems analysis, computation and counterexamples", which is due to appear in Automatica

  28. arXiv:1403.7429  [pdf, other

    math.OC cs.DC cs.LG eess.SY

    Distributed Reconstruction of Nonlinear Networks: An ADMM Approach

    Authors: Wei Pan, Aivar Sootla, Guy-Bart Stan

    Abstract: In this paper, we present a distributed algorithm for the reconstruction of large-scale nonlinear networks. In particular, we focus on the identification from time-series data of the nonlinear functional forms and associated parameters of large-scale nonlinear networks. Recently, a nonlinear network reconstruction problem was formulated as a nonconvex optimisation problem based on the combination… ▽ More

    Submitted 28 March, 2014; originally announced March 2014.

    Comments: To appear in the Preprints of 19th IFAC World Congress 2014

  29. arXiv:1403.5971  [pdf, other

    math.OC eess.SY q-bio.QM

    On Projection-Based Model Reduction of Biochemical Networks-- Part II: The Stochastic Case

    Authors: Aivar Sootla, James Anderson

    Abstract: In this paper, we consider the problem of model order reduction of stochastic biochemical networks. In particular, we reduce the order of (the number of equations in) the Linear Noise Approximation of the Chemical Master Equation, which is often used to describe biochemical networks. In contrast to other biochemical network reduction methods, the presented one is projection-based. Projection-based… ▽ More

    Submitted 24 March, 2014; originally announced March 2014.

    Comments: Submitted to the 53rd CDC

  30. arXiv:1403.3579  [pdf, other

    math.OC eess.SY

    On Projection-Based Model Reduction of Biochemical Networks-- Part I: The Deterministic Case

    Authors: Aivar Sootla, James Anderson

    Abstract: This paper addresses the problem of model reduction for dynamical system models that describe biochemical reaction networks. Inherent in such models are properties such as stability, positivity and network structure. Ideally these properties should be preserved by model reduction procedures, although traditional projection based approaches struggle to do this. We propose a projection based model r… ▽ More

    Submitted 14 March, 2014; originally announced March 2014.

    Comments: Submitted to 53rd IEEE CDC

  31. arXiv:1303.3183  [pdf, ps, other

    eess.SY cs.CE cs.LG q-bio.MN

    Toggling a Genetic Switch Using Reinforcement Learning

    Authors: Aivar Sootla, Natalja Strelkowa, Damien Ernst, Mauricio Barahona, Guy-Bart Stan

    Abstract: In this paper, we consider the problem of optimal exogenous control of gene regulatory networks. Our approach consists in adapting an established reinforcement learning algorithm called the fitted Q iteration. This algorithm infers the control law directly from the measurements of the system's response to external control inputs without the use of a mathematical model of the system. The measuremen… ▽ More

    Submitted 25 February, 2015; v1 submitted 12 March, 2013; originally announced March 2013.

    Comments: 12 pages, presented at the 9th French Meeting on Planning, Decision Making and Learning, Liège (Belgium), May 12-13, 2014

  32. arXiv:1303.2987  [pdf, other

    eess.SY math.OC

    On Periodic Reference Tracking Using Batch-Mode Reinforcement Learning with Application to Gene Regulatory Network Control

    Authors: Aivar Sootla, Natalja Strelkowa, Damien Ernst, Mauricio Barahona, Guy-Bart Stan

    Abstract: In this paper, we consider the periodic reference tracking problem in the framework of batch-mode reinforcement learning, which studies methods for solving optimal control problems from the sole knowledge of a set of trajectories. In particular, we extend an existing batch-mode reinforcement learning algorithm, known as Fitted Q Iteration, to the periodic reference tracking problem. The presented… ▽ More

    Submitted 12 March, 2013; originally announced March 2013.