-
Globally Stable Neural Imitation Policies
Authors:
Amin Abyaneh,
Mariana Sosa Guzmán,
Hsiu-Chin Lin
Abstract:
Imitation learning presents an effective approach to alleviate the resource-intensive and time-consuming nature of policy learning from scratch in the solution space. Even though the resulting policy can mimic expert demonstrations reliably, it often lacks predictability in unexplored regions of the state-space, giving rise to significant safety concerns in the face of perturbations. To address th…
▽ More
Imitation learning presents an effective approach to alleviate the resource-intensive and time-consuming nature of policy learning from scratch in the solution space. Even though the resulting policy can mimic expert demonstrations reliably, it often lacks predictability in unexplored regions of the state-space, giving rise to significant safety concerns in the face of perturbations. To address these challenges, we introduce the Stable Neural Dynamical System (SNDS), an imitation learning regime which produces a policy with formal stability guarantees. We deploy a neural policy architecture that facilitates the representation of stability based on Lyapunov theorem, and jointly train the policy and its corresponding Lyapunov candidate to ensure global stability. We validate our approach by conducting extensive experiments in simulation and successfully deploying the trained policies on a real-world manipulator arm. The experimental results demonstrate that our method overcomes the instability, accuracy, and computational intensity problems associated with previous imitation learning methods, making our method a promising solution for stable policy learning in complex planning scenarios.
△ Less
Submitted 6 March, 2024;
originally announced March 2024.
-
Learning Lyapunov-Stable Polynomial Dynamical Systems Through Imitation
Authors:
Amin Abyaneh,
Hsiu-Chin Lin
Abstract:
Imitation learning is a paradigm to address complex motion planning problems by learning a policy to imitate an expert's behavior. However, relying solely on the expert's data might lead to unsafe actions when the robot deviates from the demonstrated trajectories. Stability guarantees have previously been provided utilizing nonlinear dynamical systems, acting as high-level motion planners, in conj…
▽ More
Imitation learning is a paradigm to address complex motion planning problems by learning a policy to imitate an expert's behavior. However, relying solely on the expert's data might lead to unsafe actions when the robot deviates from the demonstrated trajectories. Stability guarantees have previously been provided utilizing nonlinear dynamical systems, acting as high-level motion planners, in conjunction with the Lyapunov stability theorem. Yet, these methods are prone to inaccurate policies, high computational cost, sample inefficiency, or quasi stability when replicating complex and highly nonlinear trajectories. To mitigate this problem, we present an approach for learning a globally stable nonlinear dynamical system as a motion planning policy. We model the nonlinear dynamical system as a parametric polynomial and learn the polynomial's coefficients jointly with a Lyapunov candidate. To showcase its success, we compare our method against the state of the art in simulation and conduct real-world experiments with the Kinova Gen3 Lite manipulator arm. Our experiments demonstrate the sample efficiency and reproduction accuracy of our method for various expert trajectories, while remaining stable in the face of perturbations.
△ Less
Submitted 14 February, 2024; v1 submitted 31 October, 2023;
originally announced October 2023.
-
Federated Causal Discovery From Interventions
Authors:
Amin Abyaneh,
Nino Scherrer,
Patrick Schwab,
Stefan Bauer,
Bernhard Schölkopf,
Arash Mehrjou
Abstract:
Causal discovery serves a pivotal role in mitigating model uncertainty through recovering the underlying causal mechanisms among variables. In many practical domains, such as healthcare, access to the data gathered by individual entities is limited, primarily for privacy and regulatory constraints. However, the majority of existing causal discovery methods require the data to be available in a cen…
▽ More
Causal discovery serves a pivotal role in mitigating model uncertainty through recovering the underlying causal mechanisms among variables. In many practical domains, such as healthcare, access to the data gathered by individual entities is limited, primarily for privacy and regulatory constraints. However, the majority of existing causal discovery methods require the data to be available in a centralized location. In response, researchers have introduced federated causal discovery. While previous federated methods consider distributed observational data, the integration of interventional data remains largely unexplored. We propose FedCDI, a federated framework for inferring causal structures from distributed data containing interventional samples. In line with the federated learning framework, FedCDI improves privacy by exchanging belief updates rather than raw samples. Additionally, it introduces a novel intervention-aware method for aggregating individual updates. We analyze scenarios with shared or disjoint intervened covariates, and mitigate the adverse effects of interventional data heterogeneity. The performance and scalability of FedCDI is rigorously tested across a variety of synthetic and real-world graphs.
△ Less
Submitted 11 February, 2024; v1 submitted 7 November, 2022;
originally announced November 2022.
-
Pyfectious: An individual-level simulator to discover optimal containment polices for epidemic diseases
Authors:
Arash Mehrjou,
Ashkan Soleymani,
Amin Abyaneh,
Samir Bhatt,
Bernhard Schölkopf,
Stefan Bauer
Abstract:
Simulating the spread of infectious diseases in human communities is critical for predicting the trajectory of an epidemic and verifying various policies to control the devastating impacts of the outbreak. Many existing simulators are based on compartment models that divide people into a few subsets and simulate the dynamics among those subsets using hypothesized differential equations. However, t…
▽ More
Simulating the spread of infectious diseases in human communities is critical for predicting the trajectory of an epidemic and verifying various policies to control the devastating impacts of the outbreak. Many existing simulators are based on compartment models that divide people into a few subsets and simulate the dynamics among those subsets using hypothesized differential equations. However, these models lack the requisite granularity to study the effect of intelligent policies that influence every individual in a particular way. In this work, we introduce a simulator software capable of modeling a population structure and controlling the disease's propagation at an individualistic level. In order to estimate the confidence of the conclusions drawn from the simulator, we employ a comprehensive probabilistic approach where the entire population is constructed as a hierarchical random variable. This approach makes the inferred conclusions more robust against sampling artifacts and gives confidence bounds for decisions based on the simulation results. To showcase potential applications, the simulator parameters are set based on the formal statistics of the COVID-19 pandemic, and the outcome of a wide range of control measures is investigated. Furthermore, the simulator is used as the environment of a reinforcement learning problem to find the optimal policies to control the pandemic. The obtained experimental results indicate the simulator's adaptability and capacity in making sound predictions and a successful policy derivation example based on real-world data. As an exemplary application, our results show that the proposed policy discovery method can lead to control measures that produce significantly fewer infected individuals in the population and protect the health system against saturation.
△ Less
Submitted 20 April, 2021; v1 submitted 24 March, 2021;
originally announced March 2021.
-
Automatic Parking in Smart Cities
Authors:
Arezou Abyaneh,
Vanessa Fakhoury,
Nizar Zorba
Abstract:
The objective behind this project is to maximize the efficiency of land space, to decrease the driver stress and frustration, along with a considerable reduction in air pollution. Our contribution is in the form of an automatic parking system that is controlled by cellular phones. The structure is a hexagon shape that uses conveyor belts, to transport the vehicles from the entrance into the parkin…
▽ More
The objective behind this project is to maximize the efficiency of land space, to decrease the driver stress and frustration, along with a considerable reduction in air pollution. Our contribution is in the form of an automatic parking system that is controlled by cellular phones. The structure is a hexagon shape that uses conveyor belts, to transport the vehicles from the entrance into the parking spaces over an elevating platform. The entrance gate includes length-measuring sensors to determine whether the approaching vehicle is eligible to enter. Our system is controlled through a microcontroller, and using cellular communications to connect to the customer. The project can be applied to different locations and is capable of capacity extensions.
△ Less
Submitted 17 June, 2020;
originally announced July 2020.
-
Deep Neural Networks Meet CSI-Based Authentication
Authors:
Amirhossein Yazdani Abyaneh,
Ali Hosein Gharari Foumani,
Vahid Pourahmadi
Abstract:
The first step of a secure communication is authenticating legible users and detecting the malicious ones. In the last recent years, some promising schemes proposed using wireless medium network's features, in particular, channel state information (CSI) as a means for authentication. These schemes mainly compare user's previous CSI with the new received CSI to determine if the user is in fact what…
▽ More
The first step of a secure communication is authenticating legible users and detecting the malicious ones. In the last recent years, some promising schemes proposed using wireless medium network's features, in particular, channel state information (CSI) as a means for authentication. These schemes mainly compare user's previous CSI with the new received CSI to determine if the user is in fact what it is claiming to be. Despite high accuracy, these approaches lack the stability in authentication when the users rotate in their positions. This is due to a significant change in CSI when a user rotates which mislead the authenticator when it compares the new CSI with the previous ones. Our approach presents a way of extracting features from raw CSI measurements which are stable towards rotation. We extract these features by the means of a deep neural network. We also present a scenario in which users can be {efficiently} authenticated while they are at certain locations in an environment (even if they rotate); and, they will be rejected if they change their location. Also, experimental results are presented to show the performance of the proposed scheme.
△ Less
Submitted 26 November, 2018;
originally announced December 2018.