arXiv:1909.09705

A Layered Architecture for Active Perception: Image Classification using Deep Reinforcement Learning

Authors: Hossein K. Mousavi, Guangyi Liu, Weihang Yuan, Martin Takáč, Héctor Muñoz-Avila, Nader Motee

Abstract: We propose a planning and perception mechanism for a robot (agent) that can only partially observe the underlying environment, in order to solve an image classification problem. We suggest a three-layer architecture that consists of a meta-layer that decides the intermediate goals, an action-layer that selects local actions as the agent navigates towards a goal, and a classification-layer that evaluates the reward and makes a prediction. We design and implement these layers using deep reinforcement learning. A generalized policy gradient algorithm is used to learn the parameters of these layers so as to maximize the expected reward. Our proposed methodology is tested on the MNIST dataset of handwritten digits, which provides us with a level of explainability while interpreting the agent's intermediate goals and course of action.

Submitted 20 September, 2019; originally announced September 2019.

Comments: Submitted to ICRA-2020

arXiv:1908.01421

Explicit Characterization of Performance of a Class of Networked Linear Control Systems

Authors: Hossein K. Mousavi, Nader Motee

Abstract: We show that the steady-state variance, as a performance measure for a class of networked linear control systems, is expressible as the summation of a rational function over the Laplacian eigenvalues of the network graph. Moreover, we characterize the role of connectivity thresholds in the feedback (and observer) gain design of these networks. We use our framework to derive bounds and scaling laws for the performance of the dynamical network. Our approach generalizes and unifies previous results on the performance measure of these networks to the case of arbitrary nodal dynamics. We extend our methodology to the case of decentralized observer-based output feedback as well as to a class of composite networks. Numerous examples support our theoretical contributions.

Submitted 4 August, 2019; originally announced August 2019.

Comments: detailed version of a paper of the same name to be submitted to IEEE TCNS
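
As a minimal illustration of the eigenvalue formula summarized in the abstract of arXiv:1908.01421 above, the sketch below treats only the simplest special case, which is an assumption made here and not the paper's general class: first-order consensus dynamics $\dot{x} = -Lx + w$ with unit-intensity white noise $w$ on a connected graph, with the deviation from the network average as output. For that case the steady-state variance is known to be $\sum_{i \ge 2} 1/(2\lambda_i)$ over the nonzero Laplacian eigenvalues $\lambda_i$. The path graph and all function names are hypothetical choices for illustration.

```python
import numpy as np

def path_laplacian(n):
    """Laplacian of an undirected path graph on n nodes (illustrative choice)."""
    adjacency = np.zeros((n, n))
    for i in range(n - 1):
        adjacency[i, i + 1] = adjacency[i + 1, i] = 1.0
    return np.diag(adjacency.sum(axis=1)) - adjacency

def steady_state_variance(laplacian):
    """Sum of 1/(2*lambda_i) over the nonzero eigenvalues of a connected graph."""
    eigenvalues = np.linalg.eigvalsh(laplacian)  # ascending; eigenvalues[0] is ~0
    return float(np.sum(1.0 / (2.0 * eigenvalues[1:])))

def disagreement_covariance(laplacian):
    """Steady-state covariance of the deviation from average, built spectrally."""
    eigenvalues, eigenvectors = np.linalg.eigh(laplacian)
    modes = eigenvectors[:, 1:]  # drop the consensus (all-ones) direction
    return modes @ np.diag(1.0 / (2.0 * eigenvalues[1:])) @ modes.T

n = 5
L = path_laplacian(n)
P = disagreement_covariance(L)
M = np.eye(n) - np.ones((n, n)) / n  # projection onto the disagreement subspace

# P should solve the Lyapunov equation -L P - P L + M = 0, and trace(P)
# should match the eigenvalue sum.
lyapunov_residual = np.abs(-L @ P - P @ L + M).max()
print(steady_state_variance(L), np.trace(P), lyapunov_residual)
```

The Lyapunov check is included so that the spectral formula is verified against the standard covariance equation rather than taken on faith. Richer nodal dynamics, as considered in the paper, would replace $1/(2\lambda)$ with a different rational function of $\lambda$.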

arXiv:1905.04835

Multi-Agent Image Classification via Reinforcement Learning

Authors: Hossein K. Mousavi, Mohammadreza Nazari, Martin Takáč, Nader Motee

Abstract: We investigate a classification problem using multiple mobile agents capable of collecting (partial) pose-dependent observations of an unknown environment. The objective is to classify an image over a finite time horizon. We propose a network architecture that specifies how agents should form a local belief, take local actions, and extract relevant features from their raw partial observations. Agents are allowed to exchange information with their neighboring agents to update their own beliefs. We show how reinforcement learning techniques can be used to achieve a decentralized implementation of the classification problem by running a decentralized consensus protocol. Our experimental results on the MNIST handwritten digit dataset demonstrate the effectiveness of our proposed framework.

Submitted 6 August, 2019; v1 submitted 12 May, 2019; originally announced May 2019.

Comments: Preprint of the paper to be published in IROS'19 proceedings
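
The abstract of arXiv:1905.04835 above mentions agents updating beliefs through a decentralized consensus protocol. The sketch below shows a generic average-consensus iteration with Metropolis weights; it is not the paper's learned architecture or its specific protocol. The ring topology, the belief values, and the function names are all made up for illustration.

```python
import numpy as np

def metropolis_weights(adjacency):
    """Doubly stochastic mixing matrix built from an undirected adjacency matrix."""
    n = adjacency.shape[0]
    degrees = adjacency.sum(axis=1)
    W = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            if i != j and adjacency[i, j] > 0:
                W[i, j] = 1.0 / (1.0 + max(degrees[i], degrees[j]))
        W[i, i] = 1.0 - W[i].sum()  # self-weight makes each row sum to one
    return W

def run_consensus(beliefs, W, steps):
    """Each agent repeatedly averages its own belief with its neighbors' beliefs."""
    for _ in range(steps):
        beliefs = W @ beliefs
    return beliefs

# Four agents on a ring, each holding a belief over three classes (made-up numbers).
ring = np.array([[0, 1, 0, 1],
                 [1, 0, 1, 0],
                 [0, 1, 0, 1],
                 [1, 0, 1, 0]], dtype=float)
initial_beliefs = np.array([[0.7, 0.2, 0.1],
                            [0.1, 0.8, 0.1],
                            [0.3, 0.3, 0.4],
                            [0.5, 0.4, 0.1]])
W = metropolis_weights(ring)
final_beliefs = run_consensus(initial_beliefs, W, steps=100)
print(final_beliefs)
```

Because the mixing matrix is doubly stochastic and the graph is connected, every row converges to the average of the initial beliefs while only neighbor-to-neighbor exchanges are used.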

arXiv:1902.01026

Estimation with Fast Landmark Selection in Robot Visual Navigation

Authors: Hossein K. Mousavi, Nader Motee

Abstract: We consider visual feature selection to improve the estimation quality required for the accurate navigation of a robot. We build upon a key property: the contributions of trackable features (landmarks) appear linearly in the information matrix of the corresponding estimation problem. We use standard models for the motion and the camera-based vision system to formulate the feature selection problem over moving finite time horizons. A scalable randomized sampling algorithm is proposed to select the more informative features (and ignore the rest) to achieve superior position estimation quality. We provide probabilistic performance guarantees for our method. The time complexity of our feature selection algorithm is linear in the number of candidate features, which is practical and outperforms existing greedy methods that scale quadratically with the number of candidate features. Our numerical simulations confirm that the execution time of our proposed method is lower than that of the greedy method, and that the resulting estimation quality is very close to that of the greedy method.

Submitted 3 February, 2019; originally announced February 2019.

arXiv:1812.08964

Sparse Sensing, Communication, and Actuation via Self-Triggered Control Algorithms

Authors: MirSaleh Bahavarnia, Hossein K. Mousavi, Nader Motee

Abstract: We propose a self-triggered control algorithm to reduce onboard processor usage, communication bandwidth, and energy consumption across a linear time-invariant networked control system. We formulate an optimal control problem by penalizing the $\ell_0$-measures of the feedback gain and the vector of control inputs and maximizing the dwell time between consecutive triggering times. We show that the corresponding $\ell_1$-relaxation of the optimal control problem is feasible and results in a stabilizing feedback control law with guaranteed performance bounds, while providing a sparse schedule for collecting samples from sensors, communicating with other subsystems, and activating the input actuators.

Submitted 21 December, 2018; originally announced December 2018.

Comments: Submitted to Automatica

arXiv:1811.01303

Space-Time Sampling for Network Observability

Authors: Hossein K. Mousavi, Qiyu Sun, Nader Motee

Abstract: Designing sparse sampling strategies is an important component of resilient estimation and control in networked systems: such strategies make network design problems more cost-effective through reduced sampling requirements and less fragile to where and when samples are collected. We show under what conditions coarse samples taken from a network contain the same amount of information as a finer set of samples. Our goal is to estimate the initial condition of linear time-invariant networks using a set of noisy measurements. The observability condition is reformulated as a frame condition, in which one can easily trace the location and time stamp of each sample. We compare the estimation quality of various sampling strategies using estimation measures, which depend on the spectrum of the corresponding frame operators. Using properties of the minimal polynomial of the state matrix, deterministic and randomized methods are suggested to construct observability frames. Intrinsic tradeoffs assert that collecting samples from fewer subsystems dictates taking more samples (on average) per subsystem. Three scalable algorithms are developed to generate sparse space-time sampling strategies with explicit error bounds.

Submitted 18 July, 2019; v1 submitted 3 November, 2018; originally announced November 2018.

Comments: Submitted to IEEE TAC (Revised Version)
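
To make the space-time sampling idea in the abstract of arXiv:1811.01303 above concrete, here is a small sketch under simplifying assumptions that are not taken from the paper: a discrete-time model $x_{k+1} = A x_k$, noise-free measurements, and a 4-node path graph. Each sample of node $i$ at step $k$ contributes the row $e_i^\top A^k$, and the initial state is recoverable exactly when the stacked rows span the whole space (full column rank), which is the finite-dimensional form of a frame condition. Function names and numbers are hypothetical.

```python
import numpy as np

def sampling_matrix(A, samples):
    """Stack one row e_i^T A^k per space-time sample (node i, time step k)."""
    n = A.shape[0]
    rows = [np.linalg.matrix_power(A, k)[i, :] for i, k in samples]
    return np.array(rows).reshape(len(samples), n)

def recover_initial_state(A, samples, measurements):
    """Least-squares estimate of x_0 from the sampled measurements."""
    S = sampling_matrix(A, samples)
    estimate, *_ = np.linalg.lstsq(S, measurements, rcond=None)
    return estimate

# Illustrative network: x_{k+1} = A x_k with A = I - 0.2 * L on a 4-node path graph.
L = np.array([[1, -1, 0, 0],
              [-1, 2, -1, 0],
              [0, -1, 2, -1],
              [0, 0, -1, 1]], dtype=float)
A = np.eye(4) - 0.2 * L

dense_in_time = [(0, k) for k in range(4)]  # one end node, four time steps
too_sparse = [(0, 0), (1, 0)]               # two nodes, a single time step

x0 = np.array([1.0, -2.0, 0.5, 3.0])
y = sampling_matrix(A, dense_in_time) @ x0  # noise-free measurements
x0_hat = recover_initial_state(A, dense_in_time, y)

print(np.linalg.matrix_rank(sampling_matrix(A, dense_in_time)))
print(np.linalg.matrix_rank(sampling_matrix(A, too_sparse)))
print(x0_hat)
```

The first sampling set observes a single subsystem but compensates with more samples in time, which mirrors the tradeoff stated in the abstract; the second set has too few samples to span the state space.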

arXiv:1810.05284

Resilient Sparse Controller Design with Guaranteed Disturbance Attenuation

Authors: MirSaleh Bahavarnia, Hossein K. Mousavi

Abstract: We design resilient sparse state-feedback controllers for a linear time-invariant (LTI) control system while attaining a pre-specified guarantee on the $\mathcal{H}_\infty$ performance measure. We leverage a technique from non-fragile control theory to identify a region of resilient state-feedback controllers. Afterward, we explore the region to identify a sparse controller. To this end, we use two different techniques: the greedy method of sparsification and re-weighted $\ell_1$ norm minimization. Our approach highlights a tradeoff between the sparsity of the feedback gain, the performance measure, and the fragility of the design. To the best of our knowledge, this work is the first framework providing performance guarantees for sparse feedback gain design.

Submitted 26 September, 2019; v1 submitted 11 October, 2018; originally announced October 2018.

Comments: Submitted to ACC'20

arXiv:1807.04237

Koopman Performance Analysis of Nonlinear Consensus Networks

Authors: Hossein K. Mousavi, Christoforos Somarakis, Qiyu Sun, Nader Motee

Abstract: Spectral decomposition of dynamical systems is a popular methodology to investigate the fundamental qualitative and quantitative properties of these systems and their solutions. In this chapter, we consider a class of nonlinear cooperative protocols, which consist of multiple agents that are coupled together via an undirected state-dependent graph. We develop a representation of the system solution by decomposing the nonlinear system using ideas from Koopman operator theory and its spectral analysis. We use recent results on extensions of the well-known Hartman theorem for hyperbolic systems to establish a connection between the original nonlinear dynamics and the linearized dynamics in terms of Koopman spectral properties. The expected value of the output energy of the nonlinear protocol, which is related to the notions of coherence and robustness in dynamical networks, is evaluated and characterized in terms of Koopman eigenvalues, eigenfunctions, and modes. The spectral representation of the performance measure enables us to develop algorithmic methods to assess the performance of this class of nonlinear dynamical networks as a function of their graph topology. Finally, we propose a scalable computational method for approximating the components of the Koopman mode decomposition, which is necessary to evaluate the systemic performance measure of the nonlinear dynamic network.

Submitted 19 April, 2019; v1 submitted 11 July, 2018; originally announced July 2018.

Comments: Submitted as a chapter to the book "Introduction to Koopman Operator Theory", Revised Version

Showing 1–8 of 8 results for author: Mousavi, H K