Skip to main content

Showing 1–50 of 151 results for author: Abate, A

.
  1. arXiv:2406.15753  [pdf, other

    cs.LG cs.AI stat.ML

    The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

    Authors: Lukas Fluri, Leon Lang, Alessandro Abate, Patrick Forré, David Krueger, Joar Skalse

    Abstract: In reinforcement learning, specifying reward functions that capture the intended task can be very challenging. Reward learning aims to address this issue by learning the reward function. However, a learned reward model may have a low error on the training distribution, and yet subsequently produce a policy with large regret. We say that such a reward model has an error-regret mismatch. The main so… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: 58 pages, 1 figure

  2. arXiv:2406.10023  [pdf, other

    cs.LG cs.CL stat.ML

    Deep Bayesian Active Learning for Preference Modeling in Large Language Models

    Authors: Luckeciano C. Melo, Panagiotis Tigas, Alessandro Abate, Yarin Gal

    Abstract: Leveraging human preferences for steering the behavior of Large Language Models (LLMs) has demonstrated notable success in recent years. Nonetheless, data selection and labeling are still a bottleneck for these systems, particularly at large scale. Hence, selecting the most informative points for acquiring human feedback may considerably reduce the cost of preference labeling and unleash the furth… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2405.17304  [pdf, ps, other

    cs.LO eess.SY

    Stochastic Omega-Regular Verification and Control with Supermartingales

    Authors: Alessandro Abate, Mirco Giacobbe, Diptarko Roy

    Abstract: We present for the first time a supermartingale certificate for $ω$-regular specifications. We leverage the Robbins & Siegmund convergence theorem to characterize supermartingale certificates for the almost-sure acceptance of Streett conditions on general stochastic processes, which we call Streett supermartingales. This enables effective verification and control of discrete-time stochastic dynami… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: The conference version of this manuscript appeared at CAV'24

  4. arXiv:2405.15723  [pdf, other

    cs.LO cs.LG

    Bisimulation Learning

    Authors: Alessandro Abate, Mirco Giacobbe, Yannik Schnitzer

    Abstract: We introduce a data-driven approach to computing finite bisimulations for state transition systems with very large, possibly infinite state space. Our novel technique computes stutter-insensitive bisimulations of deterministic systems, which we characterize as the problem of learning a state classifier together with a ranking function for each class. Our procedure learns a candidate state classifi… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  5. arXiv:2405.08353  [pdf, other

    eess.SY

    Data-driven memory-dependent abstractions of dynamical systems via a Cantor-Kantorovich metric

    Authors: Adrien Banse, Licio Romao, Alessandro Abate, Raphaël M. Jungers

    Abstract: Abstractions of dynamical systems enable their verification and the design of feedback controllers using simpler, usually discrete, models. In this paper, we propose a data-driven abstraction mechanism based on a novel metric between Markov models. Our approach is based purely on observing output labels of the underlying dynamics, thus opening the road for a fully data-driven approach to construct… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Submitted to IEEE Transactions on Automatic Control

  6. arXiv:2405.08232  [pdf, other

    cs.CE

    Distributionally Robust Aggregation of Electric Vehicle Flexibility

    Authors: Karan Mukhi, Chengrui Qu, Pengcheng You, Alessandro Abate

    Abstract: We address the problem of characterising the aggregate flexibility in populations of electric vehicles (EVs) with uncertain charging requirements. Building on previous results that provide exact characterisations of the aggregate flexibility in populations with known charging requirements, in this paper we extend the aggregation methods so that charging requirements are uncertain, but sampled from… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 6 pages, conference

  7. arXiv:2405.06624  [pdf, other

    cs.AI

    Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems

    Authors: David "davidad" Dalrymple, Joar Skalse, Yoshua Bengio, Stuart Russell, Max Tegmark, Sanjit Seshia, Steve Omohundro, Christian Szegedy, Ben Goldhaber, Nora Ammann, Alessandro Abate, Joe Halpern, Clark Barrett, Ding Zhao, Tan Zhi-Xuan, Jeannette Wing, Joshua Tenenbaum

    Abstract: Ensuring that AI systems reliably and robustly avoid harmful or dangerous behaviours is a crucial challenge, especially for AI systems with a high degree of autonomy and general intelligence, or systems used in safety-critical contexts. In this paper, we will introduce and define a family of approaches to AI safety, which we will refer to as guaranteed safe (GS) AI. The core feature of these appro… ▽ More

    Submitted 17 May, 2024; v1 submitted 10 May, 2024; originally announced May 2024.

  8. arXiv:2404.18813  [pdf, other

    eess.SY cs.LG cs.LO

    Safe Reach Set Computation via Neural Barrier Certificates

    Authors: Alessandro Abate, Sergiy Bogomolov, Alec Edwards, Kostiantyn Potomkin, Sadegh Soudjani, Paolo Zuliani

    Abstract: We present a novel technique for online safety verification of autonomous systems, which performs reachability analysis efficiently for both bounded and unbounded horizons by employing neural barrier certificates. Our approach uses barrier certificates given by parameterized neural networks that depend on a given initial set, unsafe sets, and time horizon. Such networks are trained efficiently off… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: IFAC Conference on Analysis and Design of Hybrid Systems

  9. arXiv:2404.08344  [pdf, other

    eess.SY

    Data-driven Interval MDP for Robust Control Synthesis

    Authors: Rudi Coppola, Andrea Peruffo, Licio Romao, Alessandro Abate, Manuel Mazo Jr

    Abstract: The abstraction of dynamical systems is a powerful tool that enables the design of feedback controllers using a correct-by-design framework. We investigate a novel scheme to obtain data-driven abstractions of discrete-time stochastic processes in terms of richer discrete stochastic models, whose actions lead to nondeterministic transitions over the space of probability measures. The data-driven co… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: 6 pages

  10. arXiv:2404.03314  [pdf, other

    cs.GT eess.SY

    Learning to Bid in Forward Electricity Markets Using a No-Regret Algorithm

    Authors: Arega Getaneh Abate, Dorsa Majdi, Jalal Kazempour, Maryam Kamgarpour

    Abstract: It is a common practice in the current literature of electricity markets to use game-theoretic approaches for strategic price bidding. However, they generally rely on the assumption that the strategic bidders have prior knowledge of rival bids, either perfectly or with some uncertainty. This is not necessarily a realistic assumption. This paper takes a different approach by relaxing such an assump… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

  11. arXiv:2404.01726  [pdf, other

    eess.SY

    A Stability-Based Abstraction Framework for Reach-Avoid Control of Stochastic Dynamical Systems with Unknown Noise Distributions

    Authors: Thom Badings, Licio Romao, Alessandro Abate, Nils Jansen

    Abstract: Finite-state abstractions are widely studied for the automated synthesis of correct-by-construction controllers for stochastic dynamical systems. However, existing abstraction methods often lead to prohibitively large finite-state models. To address this issue, we propose a novel abstraction scheme for stochastic linear systems that exploits the system's stability to obtain significantly smaller a… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted for presentation at ECC 2024

  12. arXiv:2403.15398  [pdf

    cs.CY

    An International and Multidisciplinary Teaching Experience with Real Industrial Team Project Development

    Authors: Martin Mellado, Eduardo Vendrell, Filomena Ferrucci, Andrea Abate, Detlef Zuhlke, Bernard Riera

    Abstract: This paper presents the design, objectives, experiences, and results of an international cooperation project funded by the European Commission in the context of the Erasmus Intensive Programme (IP, for short) designed to improve students' curricula. An IP is a short programme of study (minimum 2 weeks) that brings together university students and staff from at least three countries in order to enc… ▽ More

    Submitted 17 February, 2024; originally announced March 2024.

    Comments: 21 pages

  13. arXiv:2403.11982  [pdf

    cond-mat.mtrl-sci physics.app-ph

    Enhanced Electron Extraction in Co-Doped TiO2 Quantified by Drift-Diffusion Simulation for Stable CsPbI3 Solar Cells

    Authors: Thomas W. Gries, Davide Regaldo, Hans Koebler, Titan Noor Hartono Putri, Gennaro V. Sannino, Emilio Gutierrez Partida, Roberto Felix, Elif Huesam, Ahmed Saleh, Regan G. Wilks, Zafar Iqbal, Zahra Loghman Nia, Florian Ruske, Martin Stolterfoht, Dieter Neher, Marcus Baer, Stefan A. Weber, Paola Delli Veneri, Philip Schulz, Jean-Baptiste Puel, Jean-Paul Kleider, Qiong Wang, Eva Unger, Artem Musiienko, Antonio Abate

    Abstract: Solar cells based on inorganic perovskite CsPbI3 are promising candidates to resolve the challenge of operational stability in the field of perovskite photovoltaics. For stable operation, however, it is crucial to thoroughly understand the extractive and recombinative processes occurring at the interfaces of perovskite and the charge-selective layers. In this study, we focus on the electronic prop… ▽ More

    Submitted 24 April, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 20 pages, 2 figures

  14. arXiv:2403.06854  [pdf, other

    cs.LG

    Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification

    Authors: Joar Skalse, Alessandro Abate

    Abstract: Inverse reinforcement learning (IRL) aims to infer an agent's preferences (represented as a reward function $R$) from their behaviour (represented as a policy $π$). To do this, we need a behavioural model of how $π$ relates to $R$. In the current literature, the most common behavioural models are optimality, Boltzmann-rationality, and causal entropy maximisation. However, the true relationship bet… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  15. arXiv:2401.15838  [pdf, other

    stat.ML cs.LG cs.MA math.OC stat.CO

    Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers

    Authors: Alexandros E. Tzikas, Licio Romao, Mert Pilanci, Alessandro Abate, Mykel J. Kochenderfer

    Abstract: Many machine learning applications require operating on a spatially distributed dataset. Despite technological advances, privacy considerations and communication constraints may prevent gathering the entire dataset in a central unit. In this paper, we propose a distributed sampling scheme based on the alternating direction method of multipliers, which is commonly used in the optimization literatur… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

  16. arXiv:2401.14811  [pdf, ps, other

    cs.AI cs.LG

    On the Limitations of Markovian Rewards to Express Multi-Objective, Risk-Sensitive, and Modal Tasks

    Authors: Joar Skalse, Alessandro Abate

    Abstract: In this paper, we study the expressivity of scalar, Markovian reward functions in Reinforcement Learning (RL), and identify several limitations to what they can express. Specifically, we look at three classes of RL tasks; multi-objective RL, risk-sensitive RL, and modal RL. For each class, we derive necessary and sufficient conditions that describe when a problem in this class can be expressed usi… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Journal ref: Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence, PMLR 216:1974-1984, 2023

  17. arXiv:2401.10240  [pdf, ps, other

    math.OC

    Policy Evaluation in Distributional LQR (Extended Version)

    Authors: Zifan Wang, Yulong Gao, Siyi Wang, Michael M. Zavlanos, Alessandro Abate, Karl H. Johansson

    Abstract: Distributional reinforcement learning (DRL) enhances the understanding of the effects of the randomness in the environment by letting agents learn the distribution of a random return, rather than its expected value as in standard reinforcement learning. Meanwhile, a challenge in DRL is that the policy evaluation typically relies on the representation of the return distribution, which needs to be c… ▽ More

    Submitted 23 March, 2024; v1 submitted 28 November, 2023; originally announced January 2024.

    Comments: This draft is an extension to our previous work https://proceedings.mlr.press/v211/wang23c/wang23c.pdf. arXiv admin note: text overlap with arXiv:2303.13657

  18. arXiv:2312.11314  [pdf, other

    cs.LG cs.LO eess.SY

    Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis

    Authors: Rohan Mitta, Hosein Hasanbeig, Jun Wang, Daniel Kroening, Yiannis Kantaros, Alessandro Abate

    Abstract: This paper addresses the problem of maintaining safety during training in Reinforcement Learning (RL), such that the safety constraint violations are bounded at any point during learning. In a variety of RL applications the safety of the agent is particularly important, e.g. autonomous platforms or robots that work in proximity of humans. As enforcing safety during training might severely limit th… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  19. arXiv:2312.08190  [pdf, other

    math.OC math.DS

    Stability Analysis of Switched Linear Systems with Neural Lyapunov Functions

    Authors: Virginie Debauche, Alec Edwards, Raphael M. Jungers, Alessandro Abate

    Abstract: Neural-based, data-driven analysis and control of dynamical systems have been recently investigated and have shown great promise, e.g. for safety verification or stability analysis. Indeed, not only do neural networks allow for an entirely model-free, data-driven approach, but also for handling arbitrary complex functions via their power of representation (as opposed to, e.g. algebraic optimizatio… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 13 pages, 8 figures

  20. arXiv:2312.06344  [pdf, other

    eess.SY cs.LO

    Learning Robust Policies for Uncertain Parametric Markov Decision Processes

    Authors: Luke Rickard, Alessandro Abate, Kostas Margellos

    Abstract: Synthesising verifiably correct controllers for dynamical systems is crucial for safety-critical problems. To achieve this, it is important to account for uncertainty in a robust manner, while at the same time it is often of interest to avoid being overly conservative with the view of achieving a better cost. We propose a method for verifiably safe policy synthesis for a class of finite state mode… ▽ More

    Submitted 15 May, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

    Comments: 10 pages, accepted for oral presentation at L4DC

  21. arXiv:2311.09793  [pdf, other

    eess.SY cs.LG cs.LO

    Fossil 2.0: Formal Certificate Synthesis for the Verification and Control of Dynamical Models

    Authors: Alec Edwards, Andrea Peruffo, Alessandro Abate

    Abstract: This paper presents Fossil 2.0, a new major release of a software tool for the synthesis of certificates (e.g., Lyapunov and barrier functions) for dynamical systems modelled as ordinary differential and difference equations. Fossil 2.0 is much improved from its original release, including new interfaces, a significantly expanded certificate portfolio, controller synthesis and enhanced extensibili… ▽ More

    Submitted 16 April, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: HSCC 2024 Tool Paper

  22. arXiv:2311.09786  [pdf, other

    eess.SY cs.AI cs.LO

    Correct-by-Construction Control for Stochastic and Uncertain Dynamical Models via Formal Abstractions

    Authors: Thom Badings, Nils Jansen, Licio Romao, Alessandro Abate

    Abstract: Automated synthesis of correct-by-construction controllers for autonomous systems is crucial for their deployment in safety-critical scenarios. Such autonomous systems are naturally modeled as stochastic dynamical models. The general problem is to compute a controller that provably satisfies a given task, represented as a probabilistic temporal logic specification. However, factors such as stochas… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: In Proceedings FMAS 2023, arXiv:2311.08987. arXiv admin note: text overlap with arXiv:2301.01526

    Journal ref: EPTCS 395, 2023, pp. 144-152

  23. arXiv:2310.01951  [pdf, other

    cs.LG cs.AI

    Probabilistic Reach-Avoid for Bayesian Neural Networks

    Authors: Matthew Wicker, Luca Laurenti, Andrea Patane, Nicola Paoletti, Alessandro Abate, Marta Kwiatkowska

    Abstract: Model-based reinforcement learning seeks to simultaneously learn the dynamics of an unknown stochastic environment and synthesise an optimal policy for acting in it. Ensuring the safety and robustness of sequential decisions made through a policy in such an environment is a key challenge for policies intended for safety-critical scenarios. In this work, we investigate two complementary problems: f… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 47 pages, 10 figures. arXiv admin note: text overlap with arXiv:2105.10134

  24. arXiv:2309.15257  [pdf, other

    cs.LG cs.AI

    STARC: A General Framework For Quantifying Differences Between Reward Functions

    Authors: Joar Skalse, Lucy Farnik, Sumeet Ramesh Motwani, Erik Jenner, Adam Gleave, Alessandro Abate

    Abstract: In order to solve a task using reinforcement learning, it is necessary to first formalise the goal of that task as a reward function. However, for many real-world tasks, it is very difficult to manually specify a reward function that never incentivises undesirable behaviour. As a result, it is increasingly popular to use \emph{reward learning algorithms}, which attempt to \emph{learn} a reward fun… ▽ More

    Submitted 11 March, 2024; v1 submitted 26 September, 2023; originally announced September 2023.

  25. arXiv:2309.06090  [pdf, other

    eess.SY cs.LG cs.LO

    A General Verification Framework for Dynamical and Control Models via Certificate Synthesis

    Authors: Alec Edwards, Andrea Peruffo, Alessandro Abate

    Abstract: An emerging branch of control theory specialises in certificate learning, concerning the specification of a desired (possibly complex) system behaviour for an autonomous or control model, which is then analytically verified by means of a function-based proof. However, the synthesis of controllers abiding by these complex requirements is in general a non-trivial task and may elude the most expert c… ▽ More

    Submitted 1 July, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

  26. arXiv:2309.05481  [pdf

    cond-mat.mtrl-sci physics.app-ph physics.chem-ph

    Photovoltaic potential of tin perovskites revealed through layer-by-layer investigation of optoelectronic and charge transport properties

    Authors: Mahmoud H. Aldamasy, Artem Musiienko, Marin Rusu, Davide Regaldo, Shengnan Zho, Hannes Hampel, Chiara Frasca, Zafar Iqbal, Thomas W. Gries, Guixiang Li, Ece Aktas, Giuseppe Nasti, Meng Li, Jorge Pascual, Noor Titan Putri Hartono, Qiong Wang, Thomas Unold, Antonio Abate

    Abstract: Tin perovskites are the most promising environmentally friendly alternative to lead perovskites. Among tin perovskites, FASnI3 (CH4N2SnI3) shows optimum band gap, and easy processability. However, the performance of FASnI3 based solar cells is incomparable to lead perovskites for several reasons, including energy band mismatch between the perovskite absorber film and the charge transporting layers… ▽ More

    Submitted 7 October, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: 22 pages, 5 figures

  27. arXiv:2308.10587  [pdf, other

    cs.FL

    Formal Analysis and Verification of Max-Plus Linear Systems

    Authors: Muhammad Syifa'ul Mufid, Andrea Micheli, Alessandro Abate, Alessandro Cimatti

    Abstract: Max-Plus Linear (MPL) systems are an algebraic formalism with practical applications in transportation networks, manufacturing and biological systems. In this paper, we investigate the problem of automatically analyzing the properties of MPL, taking into account both structural properties such as transient and cyclicity, and the open problem of user-defined temporal properties. We propose Time-Dif… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 28 pages (including appendixes)

  28. arXiv:2307.15546  [pdf, other

    cs.LO cs.LG eess.SY

    On the Trade-off Between Efficiency and Precision of Neural Abstraction

    Authors: Alec Edwards, Mirco Giacobbe, Alessandro Abate

    Abstract: Neural abstractions have been recently introduced as formal approximations of complex, nonlinear dynamical models. They comprise a neural ODE and a certified upper bound on the error between the abstract neural network and the concrete dynamical model. So far neural abstractions have exclusively been obtained as neural networks consisting entirely of $ReLU$ activation functions, resulting in neura… ▽ More

    Submitted 2 October, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: Appeared at QEST 2023. Added codebase link; corrected Eq. 11

  29. arXiv:2307.13174  [pdf

    cond-mat.mtrl-sci physics.chem-ph

    Interface Modification for Energy Levels Alignment and Charge Extraction in CsPbI$_3$ Perovskite Solar Cells

    Authors: Zafar Iqbal, Fengshuo Zu, Artem Musiienko, Emilio Gutierrez Partida, Hans Kobler, Thomas W. Gries, Gennaro V. Sannino, Laura Canil, Norbert Koch, Martin Stolterfoht, Dieter Neher, Michele Pavone, Ana Belen Munoz-Garcia, Antonio Abate, Qiong Wang

    Abstract: In perovskite solar cells (PSCs) energy levels alignment and charge extraction at the interfaces are the essential factors directly affecting the device performance. In this work, we present a modified interface between all-inorganic CsPbI$_3$ perovskite and its hole selective contact (Spiro-OMeTAD), realized by a dipole molecule trioctylphosphine oxide (TOPO), to align the energy levels. On a pas… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 20 pages, 4 Figures

    Journal ref: ACS Energy Lett. 2023, 8, 4304

  30. arXiv:2307.05059  [pdf, ps, other

    cs.GT cs.AI cs.MA

    On Imperfect Recall in Multi-Agent Influence Diagrams

    Authors: James Fox, Matt MacDermott, Lewis Hammond, Paul Harrenstein, Alessandro Abate, Michael Wooldridge

    Abstract: Multi-agent influence diagrams (MAIDs) are a popular game-theoretic model based on Bayesian networks. In some settings, MAIDs offer significant advantages over extensive-form game representations. Previous work on MAIDs has assumed that agents employ behavioural policies, which set independent conditional probability distributions over actions for each of their decisions. In settings with imperfec… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

    Comments: In Proceedings TARK 2023, arXiv:2307.04005

    Journal ref: EPTCS 379, 2023, pp. 201-220

  31. arXiv:2306.16824  [pdf, other

    eess.SY

    An Exact Characterisation of Flexibility in Populations of Electric Vehicles

    Authors: Karan Mukhi, Alessandro Abate

    Abstract: Increasing penetrations of electric vehicles (EVs) presents a large source of flexibility, which can be used to assist balancing the power grid. The flexibility of an individual EV can be quantified as a convex polytope and the flexibility of a population of EVs is the Minkowski sum of these polytopes. In general computing the exact Minkowski sum is intractable. However, exploiting symmetry in a r… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

  32. arXiv:2306.02766  [pdf, other

    cs.MA cs.AI cs.LG cs.SI eess.SY

    Networked Communication for Decentralised Agents in Mean-Field Games

    Authors: Patrick Benjamin, Alessandro Abate

    Abstract: We introduce networked communication to the mean-field game framework, in particular to oracle-free settings where $N$ decentralised agents learn along a single, non-episodic run of the empirical system. We prove that our architecture, with only a few reasonable assumptions about network structure, has sample guarantees bounded between those of the centralised- and independent-learning cases. We d… ▽ More

    Submitted 28 June, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

  33. arXiv:2304.05770  [pdf, other

    eess.SY

    Abstracting Linear Stochastic Systems via Knowledge Filtering

    Authors: Maico Hendrikus Wilhelmus Engelaar, Licio Romao, Yulong Gao, Mircea Lazar, Alessandro Abate, Sofie Haesaert

    Abstract: In this paper, we propose a new model reduction technique for linear stochastic systems that builds upon knowledge filtering and utilizes optimal Kalman filtering techniques. This new technique will reduce the dimension of the noise disturbance and will allow any controller designed for the reduced model to be refined into a controller for the original stochastic system, while preserving any speci… ▽ More

    Submitted 15 September, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Comments: 7 pages, 3 figures, accepted for CDC 2023

  34. arXiv:2304.05105  [pdf, other

    math.OC eess.SY

    Learning-based Rigid Tube Model Predictive Control

    Authors: Yulong Gao, Shuhao Yan, Jian Zhou, Mark Cannon, Alessandro Abate, Karl H. Johansson

    Abstract: This paper is concerned with model predictive control (MPC) of discrete-time linear systems subject to bounded additive disturbance and mixed constraints on the state and input, whereas the true disturbance set is unknown. Unlike most existing work on robust MPC, we propose an algorithm incorporating online learning that builds on prior knowledge of the disturbance, i.e., a known but conservative… ▽ More

    Submitted 21 May, 2024; v1 submitted 11 April, 2023; originally announced April 2023.

    Comments: 8 pages

  35. arXiv:2304.04505  [pdf, other

    eess.SY

    Inner approximations of stochastic programs for data-driven stochastic barrier function design

    Authors: Frederik Baymler Mathiesen, Licio Romao, Simeon C. Calvert, Alessandro Abate, Luca Laurenti

    Abstract: This paper proposes a new framework to compute finite-horizon safety guarantees for discrete-time piece-wise affine systems with stochastic noise of unknown distributions. The approach is based on a novel approach to synthesise a stochastic barrier function (SBF) from noisy data and rely on the scenario optimization theory. In particular, we show that the stochastic program to synthesize a SBF can… ▽ More

    Submitted 10 September, 2023; v1 submitted 10 April, 2023; originally announced April 2023.

  36. arXiv:2304.00644  [pdf, other

    eess.SY

    Distributionally Robust Optimal and Safe Control of Stochastic Systems via Kernel Conditional Mean Embedding

    Authors: Licio Romao, Ashish R. Hota, Alessandro Abate

    Abstract: We present a novel distributionally robust framework for dynamic programming that uses kernel methods to design feedback control policies. Specifically, we leverage kernel mean embedding to map the transition probabilities governing the state evolution into an associated repreducing kernel Hilbert space. Our key idea lies in combining conditional mean embedding with the maximum mean discrepancy di… ▽ More

    Submitted 18 December, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

  37. arXiv:2303.17618  [pdf, other

    cs.LG eess.SY

    Data-driven abstractions via adaptive refinements and a Kantorovich metric [extended version]

    Authors: Adrien Banse, Licio Romao, Alessandro Abate, Raphaël M. Jungers

    Abstract: We introduce an adaptive refinement procedure for smart, and scalable abstraction of dynamical systems. Our technique relies on partitioning the state space depending on the observation of future outputs. However, this knowledge is dynamically constructed in an adaptive, asymmetric way. In order to learn the optimal structure, we define a Kantorovich-inspired metric between Markov chains, and we u… ▽ More

    Submitted 30 October, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: This paper is an extended version of a CDC2023 submission

  38. arXiv:2303.13657  [pdf, other

    math.OC cs.LG

    Policy Evaluation in Distributional LQR

    Authors: Zifan Wang, Yulong Gao, Siyi Wang, Michael M. Zavlanos, Alessandro Abate, Karl H. Johansson

    Abstract: Distributional reinforcement learning (DRL) enhances the understanding of the effects of the randomness in the environment by letting agents learn the distribution of a random return, rather than its expected value as in standard RL. At the same time, a main challenge in DRL is that policy evaluation in DRL typically relies on the representation of the return distribution, which needs to be carefu… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: 12pages

  39. arXiv:2302.13888  [pdf, other

    cs.GT cs.CC

    k-Prize Weighted Voting Games

    Authors: Wei-Chen Lee, David Hyland, Alessandro Abate, Edith Elkind, Jiarui Gan, Julian Gutierrez, Paul Harrenstein, Michael Wooldridge

    Abstract: We introduce a natural variant of weighted voting games, which we refer to as k-Prize Weighted Voting Games. Such games consist of n players with weights, and k prizes, of possibly differing values. The players form coalitions, and the i-th largest coalition (by the sum of weights of its members) wins the i-th largest prize, which is then shared among its members. We present four solution concepts… ▽ More

    Submitted 2 March, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted to AAMAS 2023

  40. arXiv:2301.11683  [pdf, other

    cs.LO cs.LG eess.SY

    Neural Abstractions

    Authors: Alessandro Abate, Alec Edwards, Mirco Giacobbe

    Abstract: We present a novel method for the safety verification of nonlinear dynamical models that uses neural networks to represent abstractions of their dynamics. Neural networks have extensively been used before as approximators; in this work, we make a step further and use them for the first time as abstractions. For a given dynamical model, our method synthesises a neural network that overapproximates… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: NeurIPS 2022

  41. Quantitative Verification with Neural Networks

    Authors: Alessandro Abate, Alec Edwards, Mirco Giacobbe, Hashan Punchihewa, Diptarko Roy

    Abstract: We present a data-driven approach to the quantitative verification of probabilistic programs and stochastic dynamical models. Our approach leverages neural networks to compute tight and sound bounds for the probability that a stochastic process hits a target condition within finite time. This problem subsumes a variety of quantitative verification questions, from the reachability and safety analys… ▽ More

    Submitted 11 March, 2024; v1 submitted 15 January, 2023; originally announced January 2023.

    Comments: The conference version of this manuscript appeared at CONCUR 2023

    ACM Class: F.3.1; D.2.4

  42. Reasoning about Causality in Games

    Authors: Lewis Hammond, James Fox, Tom Everitt, Ryan Carey, Alessandro Abate, Michael Wooldridge

    Abstract: Causal reasoning and game-theoretic reasoning are fundamental topics in artificial intelligence, among many other disciplines: this paper is concerned with their intersection. Despite their importance, a formal framework that supports both these forms of reasoning has, until now, been lacking. We offer a solution in the form of (structural) causal games, which can be seen as extending Pearl's caus… ▽ More

    Submitted 17 April, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

    Comments: Published in Artificial Intelligence (2023)

  43. Robust Control for Dynamical Systems With Non-Gaussian Noise via Formal Abstractions

    Authors: Thom Badings, Licio Romao, Alessandro Abate, David Parker, Hasan A. Poonawala, Marielle Stoelinga, Nils Jansen

    Abstract: Controllers for dynamical systems that operate in safety-critical settings must account for stochastic disturbances. Such disturbances are often modeled as process noise in a dynamical system, and common assumptions are that the underlying distributions are known and/or Gaussian. In practice, however, these assumptions may be unrealistic and can lead to poor approximations of the true noise distri… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: To appear in the Journal of Artificial Intelligence Research (JAIR). arXiv admin note: text overlap with arXiv:2110.12662

    Journal ref: Journal of Artificial Intelligence Research (JAIR) 76 (2023) 341-391

  44. Lexicographic Multi-Objective Reinforcement Learning

    Authors: Joar Skalse, Lewis Hammond, Charlie Griffin, Alessandro Abate

    Abstract: In this work we introduce reinforcement learning techniques for solving lexicographic multi-objective problems. These are problems that involve multiple reward signals, and where the goal is to learn a policy that maximises the first reward signal, and subject to this constraint also maximises the second reward signal, and so on. We present a family of both action-value and policy gradient algorit… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

    Journal ref: IJCAI 2022; Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence. Main Track, Pages 3430-3436

  45. arXiv:2212.03201  [pdf, ps, other

    cs.LG

    Misspecification in Inverse Reinforcement Learning

    Authors: Joar Skalse, Alessandro Abate

    Abstract: The aim of Inverse Reinforcement Learning (IRL) is to infer a reward function $R$ from a policy $π$. To do this, we need a model of how $π$ relates to $R$. In the current literature, the most common models are optimality, Boltzmann rationality, and causal entropy maximisation. One of the primary motivations behind IRL is to infer human preferences from human behaviour. However, the true relationsh… ▽ More

    Submitted 24 March, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, 2023

  46. arXiv:2212.01926  [pdf, other

    eess.SY

    Data-driven memory-dependent abstractions of dynamical systems

    Authors: Adrien Banse, Licio Romao, Alessandro Abate, Raphaël M. Jungers

    Abstract: We propose a sample-based, sequential method to abstract a (potentially black-box) dynamical system with a sequence of memory-dependent Markov chains of increasing size. We show that this approximation allows to alleviating a correlation bias that has been observed in sample-based abstractions. We further propose a methodology to detect on the fly the memory length resulting in an abstraction with… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

  47. arXiv:2212.00679  [pdf, other

    eess.SY cs.AI cs.RO

    Formal Controller Synthesis for Markov Jump Linear Systems with Uncertain Dynamics

    Authors: Luke Rickard, Thom Badings, Licio Romao, Alessandro Abate

    Abstract: Automated synthesis of provably correct controllers for cyber-physical systems is crucial for deployment in safety-critical scenarios. However, hybrid features and stochastic or unknown behaviours make this problem challenging. We propose a method for synthesising controllers for Markov jump linear systems (MJLSs), a class of discrete-time models for cyber-physical systems, so that they certifiabl… ▽ More

    Submitted 4 August, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Comments: 15 pages, accepted to QEST

  48. arXiv:2210.05989  [pdf, other

    eess.SY cs.AI cs.RO

    Probabilities Are Not Enough: Formal Controller Synthesis for Stochastic Dynamical Models with Epistemic Uncertainty

    Authors: Thom Badings, Licio Romao, Alessandro Abate, Nils Jansen

    Abstract: Capturing uncertainty in models of complex dynamical systems is crucial to designing safe controllers. Stochastic noise causes aleatoric uncertainty, whereas imprecise knowledge of model parameters leads to epistemic uncertainty. Several approaches use formal abstractions to synthesize policies that satisfy temporal specifications related to safety and reachability. However, the underlying models… ▽ More

    Submitted 7 December, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted at AAAI 2023

  49. arXiv:2209.15320  [pdf, other

    cs.LG cs.AI cs.RO

    Bounded Robustness in Reinforcement Learning via Lexicographic Objectives

    Authors: Daniel Jarne Ornia, Licio Romao, Lewis Hammond, Manuel Mazo Jr., Alessandro Abate

    Abstract: Policy robustness in Reinforcement Learning may not be desirable at any cost: the alterations caused by robustness requirements from otherwise optimal policies should be explainable, quantifiable and formally verifiable. In this work we study how policies can be maximally robust to arbitrary observational noise by analysing how they are altered by this noise through a stochastic linear operator in… ▽ More

    Submitted 11 December, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

  50. arXiv:2209.10341  [pdf, other

    cs.LG cs.AI cs.LO

    LCRL: Certified Policy Synthesis via Logically-Constrained Reinforcement Learning

    Authors: Hosein Hasanbeig, Daniel Kroening, Alessandro Abate

    Abstract: LCRL is a software tool that implements model-free Reinforcement Learning (RL) algorithms over unknown Markov Decision Processes (MDPs), synthesising policies that satisfy a given linear temporal specification with maximal probability. LCRL leverages partially deterministic finite-state machines known as Limit Deterministic Buchi Automata (LDBA) to express a given linear temporal specification. A… ▽ More

    Submitted 21 September, 2022; originally announced September 2022.

    Comments: Evaluated and Accepted by the 19th International Conference on Quantitative Evaluation of Systems 2022