-
A Layered Architecture for Active Perception: Image Classification using Deep Reinforcement Learning
Authors:
Hossein K. Mousavi,
Guangyi Liu,
Weihang Yuan,
Martin Takáč,
Héctor Muñoz-Avila,
Nader Motee
Abstract:
We propose a planning and perception mechanism for a robot (agent), that can only observe the underlying environment partially, in order to solve an image classification problem. A three-layer architecture is suggested that consists of a meta-layer that decides the intermediate goals, an action-layer that selects local actions as the agent navigates towards a goal, and a classification-layer that…
▽ More
We propose a planning and perception mechanism for a robot (agent), that can only observe the underlying environment partially, in order to solve an image classification problem. A three-layer architecture is suggested that consists of a meta-layer that decides the intermediate goals, an action-layer that selects local actions as the agent navigates towards a goal, and a classification-layer that evaluates the reward and makes a prediction. We design and implement these layers using deep reinforcement learning. A generalized policy gradient algorithm is utilized to learn the parameters of these layers to maximize the expected reward. Our proposed methodology is tested on the MNIST dataset of handwritten digits, which provides us with a level of explainability while interpreting the agent's intermediate goals and course of action.
△ Less
Submitted 20 September, 2019;
originally announced September 2019.
-
Explicit Characterization of Performance of a Class of Networked Linear Control Systems
Authors:
Hossein K. Mousavi,
Nader Motee
Abstract:
We show that the steady-state variance as a performance measure for a class of networked linear control systems is expressible as the summation of a rational function over the Laplacian eigenvalues of the network graph. Moreover, we characterize the role of connectivity thresholds for the feedback (and observer) gain design of these networks. We use our framework to derive bounds and scaling laws…
▽ More
We show that the steady-state variance as a performance measure for a class of networked linear control systems is expressible as the summation of a rational function over the Laplacian eigenvalues of the network graph. Moreover, we characterize the role of connectivity thresholds for the feedback (and observer) gain design of these networks. We use our framework to derive bounds and scaling laws for the performance of the dynamical network. Our approach generalizes and unifies the previous results on the performance measure of these networks for the case of arbitrary nodal dynamics. We bring extensions of our methodology for the case of decentralized observer-based output feedback as well as a class of composite networks. Numerous examples support our theoretical contributions.
△ Less
Submitted 4 August, 2019;
originally announced August 2019.
-
Multi-Agent Image Classification via Reinforcement Learning
Authors:
Hossein K. Mousavi,
Mohammadreza Nazari,
Martin Takáč,
Nader Motee
Abstract:
We investigate a classification problem using multiple mobile agents capable of collecting (partial) pose-dependent observations of an unknown environment. The objective is to classify an image over a finite time horizon. We propose a network architecture on how agents should form a local belief, take local actions, and extract relevant features from their raw partial observations. Agents are allo…
▽ More
We investigate a classification problem using multiple mobile agents capable of collecting (partial) pose-dependent observations of an unknown environment. The objective is to classify an image over a finite time horizon. We propose a network architecture on how agents should form a local belief, take local actions, and extract relevant features from their raw partial observations. Agents are allowed to exchange information with their neighboring agents to update their own beliefs. It is shown how reinforcement learning techniques can be utilized to achieve decentralized implementation of the classification problem by running a decentralized consensus protocol. Our experimental results on the MNIST handwritten digit dataset demonstrates the effectiveness of our proposed framework.
△ Less
Submitted 6 August, 2019; v1 submitted 12 May, 2019;
originally announced May 2019.
-
Estimation with Fast Landmark Selection in Robot Visual Navigation
Authors:
Hossein K. Mousavi,
Nader Motee
Abstract:
We consider the visual feature selection to improve the estimation quality required for the accurate navigation of a robot. We build upon a key property that asserts: contributions of trackable features (landmarks) appear linearly in the information matrix of the corresponding estimation problem. We utilize standard models for motion and vision system using a camera to formulate the feature select…
▽ More
We consider the visual feature selection to improve the estimation quality required for the accurate navigation of a robot. We build upon a key property that asserts: contributions of trackable features (landmarks) appear linearly in the information matrix of the corresponding estimation problem. We utilize standard models for motion and vision system using a camera to formulate the feature selection problem over moving finite time horizons. A scalable randomized sampling algorithm is proposed to select more informative features (and ignore the rest) to achieve a superior position estimation quality. We provide probabilistic performance guarantees for our method. The time-complexity of our feature selection algorithm is linear in the number of candidate features, which is practically plausible and outperforms existing greedy methods that scale quadratically with the number of candidates features. Our numerical simulations confirm that not only the execution time of our proposed method is comparably less than that of the greedy method, but also the resulting estimation quality is very close to the greedy method.
△ Less
Submitted 3 February, 2019;
originally announced February 2019.
-
Sparse Sensing, Communication, and Actuation via Self-Triggered Control Algorithms
Authors:
MirSaleh Bahavarnia,
Hossein K. Mousavi,
Nader Motee
Abstract:
We propose a self-triggered control algorithm to reduce onboard processor usage, communication bandwidth, and energy consumption across a linear time-invariant networked control system. We formulate an optimal control problem by penalizing the l0-measures of the feedback gain and the vector of control inputs and maximizing the dwell time between the consecutive triggering times. It is shown that t…
▽ More
We propose a self-triggered control algorithm to reduce onboard processor usage, communication bandwidth, and energy consumption across a linear time-invariant networked control system. We formulate an optimal control problem by penalizing the l0-measures of the feedback gain and the vector of control inputs and maximizing the dwell time between the consecutive triggering times. It is shown that the corresponding l1-relaxation of the optimal control problem is feasible and results in a stabilizing feedback control law with guaranteed performance bounds, while providing a sparse schedule for collecting samples from sensors, communication with other subsystems, and activating the input actuators.
△ Less
Submitted 21 December, 2018;
originally announced December 2018.
-
Space-Time Sampling for Network Observability
Authors:
Hossein K. Mousavi,
Qiyu Sun,
Nader Motee
Abstract:
Designing sparse sampling strategies is one of the important components in having resilient estimation and control in networked systems as they make network design problems more cost-effective due to their reduced sampling requirements and less fragile to where and when samples are collected. It is shown that under what conditions taking coarse samples from a network will contain the same amount o…
▽ More
Designing sparse sampling strategies is one of the important components in having resilient estimation and control in networked systems as they make network design problems more cost-effective due to their reduced sampling requirements and less fragile to where and when samples are collected. It is shown that under what conditions taking coarse samples from a network will contain the same amount of information as a more finer set of samples. Our goal is to estimate initial condition of linear time-invariant networks using a set of noisy measurements. The observability condition is reformulated as the frame condition, where one can easily trace location and time stamps of each sample. We compare estimation quality of various sampling strategies using estimation measures, which depend on spectrum of the corresponding frame operators. Using properties of the minimal polynomial of the state matrix, deterministic and randomized methods are suggested to construct observability frames. Intrinsic tradeoffs assert that collecting samples from fewer subsystems dictates taking more samples (in average) per subsystem. Three scalable algorithms are developed to generate sparse space-time sampling strategies with explicit error bounds.
△ Less
Submitted 18 July, 2019; v1 submitted 3 November, 2018;
originally announced November 2018.
-
Resilient Sparse Controller Design with Guaranteed Disturbance Attenuation
Authors:
MirSaleh Bahavarnia,
Hossein K. Mousavi
Abstract:
We design resilient sparse state-feedback controllers for a linear time-invariant (LTI) control system while attaining a pre-specified guarantee on ${\mathcal{H}}_\infty$ performance measure. We leverage a technique from non-fragile control theory to identify a region of resilient state-feedback controllers. Afterward, we explore the region to identify a sparse controller. To this end, we use two…
▽ More
We design resilient sparse state-feedback controllers for a linear time-invariant (LTI) control system while attaining a pre-specified guarantee on ${\mathcal{H}}_\infty$ performance measure. We leverage a technique from non-fragile control theory to identify a region of resilient state-feedback controllers. Afterward, we explore the region to identify a sparse controller. To this end, we use two different techniques: the greedy method of sparsification, as well as the re-weighted $\ell_1$ norm minimization. Our approach highlights a tradeoff between the sparsity of the feedback gain, performance measure, and fragility of the design. To best of our knowledge, this work is the first framework providing performance guarantees for sparse feedback gain design.
△ Less
Submitted 26 September, 2019; v1 submitted 11 October, 2018;
originally announced October 2018.
-
Koopman Performance Analysis of Nonlinear Consensus Networks
Authors:
Hossein K. Mousavi,
Christoforos Somarakis,
Qiyu Sun,
Nader Motee
Abstract:
Spectral decomposition of dynamical systems is a popular methodology to investigate the fundamental qualitative and quantitative properties of these systems and their solutions. In this chapter, we consider a class of nonlinear cooperative protocols, which consist of multiple agents that are coupled together via an undirected state-dependent graph. We develop a representation of the system solutio…
▽ More
Spectral decomposition of dynamical systems is a popular methodology to investigate the fundamental qualitative and quantitative properties of these systems and their solutions. In this chapter, we consider a class of nonlinear cooperative protocols, which consist of multiple agents that are coupled together via an undirected state-dependent graph. We develop a representation of the system solution by decomposing the nonlinear system utilizing ideas from the Koopman operator theory and its spectral analysis. We use recent results on the extensions of the well-known Hartman theorem for hyperbolic systems to establish a connection between the original nonlinear dynamics and the linearized dynamics in terms of Koopman spectral properties. The expected value of the output energy of the nonlinear protocol, which is related to the notions of coherence and robustness in dynamical networks, is evaluated and characterized in terms of Koopman eigenvalues, eigenfunctions, and modes. Spectral representation of the performance measure enables us to develop algorithmic methods to assess the performance of this class of nonlinear dynamical networks as a function of their graph topology. Finally, we propose a scalable computational method for approximation of the components of the Koopman mode decomposition, which is necessary to evaluate the systemic performance measure of the nonlinear dynamic network.
△ Less
Submitted 19 April, 2019; v1 submitted 11 July, 2018;
originally announced July 2018.