Search | arXiv e-print repository

Analysis of Reinforcement Learning Schemes for Trajectory Optimization of an Aerial Radio Unit

Authors: Hossein Mohammadi, Vuk Marojevic, Bodong Shang

Abstract: This paper introduces the deployment of unmanned aerial vehicles (UAVs) as lightweight wireless access points that leverage the fixed infrastructure in the context of the emerging open radio access network (O-RAN). More precisely, we propose an aerial radio unit that dynamically serves an under served area and connects to the distributed unit via a wireless fronthaul between the UAV and the closes… ▽ More This paper introduces the deployment of unmanned aerial vehicles (UAVs) as lightweight wireless access points that leverage the fixed infrastructure in the context of the emerging open radio access network (O-RAN). More precisely, we propose an aerial radio unit that dynamically serves an under served area and connects to the distributed unit via a wireless fronthaul between the UAV and the closest tower. In this paper we analyze the UAV trajectory in terms of artificial intelligence (AI) when it serves both UEs and central units (CUs) at the same time in multi input multi output (MIMO) fading channel. We first demonstrate the nonconvexity of the problem of maximizing the overall network throughput based on UAV location, and then we use two different machine learning approaches to solve it. We first assume that the environment is a gridworld and then let the UAV explore the environment by flying from point A to point B, using both the offline Q-learning and the online SARSA algorithm and the achieved path-loss as the reward. With the intention of maximizing the average payoff, the trajectory in the second scenario is described as a Markov decision process (MDP). According to simulations, MDP produces better results in a smaller setting and in less time. In contrast, SARSA performs better in larger environments at the expense of a longer flight duration. △ Less

Submitted 18 November, 2022; originally announced November 2022.

Comments: 6 pages, 7 figures, 1 table

arXiv:2209.11920 [pdf, other]

Tradeoffs between convergence rate and noise amplification for momentum-based accelerated optimization algorithms

Authors: Hesameddin Mohammadi, Meisam Razaviyayn, Mihailo R. Jovanović

Abstract: We study momentum-based first-order optimization algorithms in which the iterations utilize information from the two previous steps and are subject to an additive white noise. This setup uses noise to account for uncertainty in either gradient evaluation or iteration updates, and it includes Polyak's heavy-ball and Nesterov's accelerated methods as special cases. For strongly convex quadratic prob… ▽ More We study momentum-based first-order optimization algorithms in which the iterations utilize information from the two previous steps and are subject to an additive white noise. This setup uses noise to account for uncertainty in either gradient evaluation or iteration updates, and it includes Polyak's heavy-ball and Nesterov's accelerated methods as special cases. For strongly convex quadratic problems, we use the steady-state variance of the error in the optimization variable to quantify noise amplification and identify fundamental stochastic performance tradeoffs. Our approach utilizes the Jury stability criterion to provide a novel geometric characterization of conditions for linear convergence, and it reveals the relation between the noise amplification and convergence rate as well as their dependence on the condition number and the constant algorithmic parameters. This geometric insight leads to simple alternative proofs of standard convergence results and allows us to establish ``uncertainty principle'' of strongly convex optimization: for the two-step momentum method with linear convergence rate, the lower bound on the product between the settling time and noise amplification scales quadratically with the condition number. Our analysis also identifies a key difference between the gradient and iterate noise models: while the amplification of gradient noise can be made arbitrarily small by sufficiently decelerating the algorithm, the best achievable variance for the iterate noise model increases linearly with the settling time in the decelerating regime. Finally, we introduce two parameterized families of algorithms that strike a balance between noise amplification and settling time while preserving order-wise Pareto optimality for both noise models. △ Less

Submitted 19 June, 2024; v1 submitted 24 September, 2022; originally announced September 2022.

Comments: 23 pages; 7 figures

arXiv:2208.04156 [pdf]

AI-based Optimal scheduling of Renewable AC Microgrids with bidirectional LSTM-Based Wind Power Forecasting

Authors: Hossein Mohammadi, Shiva Jokar, Mojtaba Mohammadi, Abdollah Kavousifard, Morteza Dabbaghjamanesh

Abstract: In terms of the operation of microgrids, optimal scheduling is a vital issue that must be taken into account. In this regard, this paper proposes an effective framework for optimal scheduling of renewable microgrids considering energy storage devices, wind turbines, micro turbines. Due to the nonlinearity and complexity of operation problems in microgrids, it is vital to use an accurate and robust… ▽ More In terms of the operation of microgrids, optimal scheduling is a vital issue that must be taken into account. In this regard, this paper proposes an effective framework for optimal scheduling of renewable microgrids considering energy storage devices, wind turbines, micro turbines. Due to the nonlinearity and complexity of operation problems in microgrids, it is vital to use an accurate and robust optimization technique to efficiently solve this problem. To this end, in the proposed framework, the teacher learning-based optimization is utilized to efficiently solve the scheduling problem in the system. Moreover, a deep learning model based on bidirectional long short-term memory is proposed to address the short-term wind power forecasting problem. The feasibility and performance of the proposed framework as well as the effect of wind power forecasting on the operation efficiency are examined using IEEE 33-bus test system. Also, the Australian Wool north wind site data is utilized as a real-world dataset to evaluate the performance of the forecasting model. Results show the effective and efficient performance of the proposed framework in the optimal scheduling of microgrids. △ Less

Submitted 8 August, 2022; v1 submitted 8 July, 2022; originally announced August 2022.

Comments: The name of one of the authors was not included in the first version by mistake. This issue is solved in this version

arXiv:2202.00764 [pdf, other]

Self Interference Management in In-Band Full-Duplex Systems

Authors: Hossein Mohammadi, Maryam Sabbaghian, Vuk Marojevic

Abstract: The evolution of wireless systems has led to a continuous increase in the demand for radio frequency spectrum. To address this issue, a technology that has received a lot of attention is In-Band Full-Duplex (IBFD). The interest in IBFD systems stems from its capability to simultaneously transmit and receive data in the same frequency. Cancelling the self interference (SI) from the transmitter to t… ▽ More The evolution of wireless systems has led to a continuous increase in the demand for radio frequency spectrum. To address this issue, a technology that has received a lot of attention is In-Band Full-Duplex (IBFD). The interest in IBFD systems stems from its capability to simultaneously transmit and receive data in the same frequency. Cancelling the self interference (SI) from the transmitter to the collocated receiver plays a pivotal role in the performance of the system. There are two types of SI cancellation (SIC) approaches, passive and active. In this research, the focus is on active cancellation and, in particular, SIC in the digital domain. Among the direct and backscattered SI, the former has been studied for a long time; therefore, the backscatter is considered in this research and two SIC approaches are analyzed. The first achieves SIC through beamforming. This requires knowing the angle of the received SI to put the beam null-space in this direction. The second method removes SI by employing an Artificial Neural Networks (ANNs). Using an ANN, there is no need to know the direction of the SI. The neural network is trained with pilots which results in the network being able to separate the desired signal from the SI at the receiver. Bayesian Neural Networks show the importance of the weights and assign a parameter that facilitates ignoring the less significant ones. Through comparative simulations we demonstrate that the ANN-based SIC achieves equivalent bit error rate performance as two beamforming methods. △ Less

Submitted 1 February, 2022; originally announced February 2022.

Comments: 5 pages, 9 figures

arXiv:2201.09911 [pdf, other]

AI-Driven Demodulators for Nonlinear Receivers in Shared Spectrum with High-Power Blockers

Authors: Hossein Mohammadi, Walaa AlQwider, Talha Faizur Rahman, Vuk Marojevic

Abstract: Research has shown that communications systems and receivers suffer from high power adjacent channel signals, called blockers, that drive the radio frequency (RF) front end into nonlinear operation. Since simple systems, such as the Internet of Things (IoT), will coexist with sophisticated communications transceivers, radars and other spectrum consumers, these need to be protected employing a simp… ▽ More Research has shown that communications systems and receivers suffer from high power adjacent channel signals, called blockers, that drive the radio frequency (RF) front end into nonlinear operation. Since simple systems, such as the Internet of Things (IoT), will coexist with sophisticated communications transceivers, radars and other spectrum consumers, these need to be protected employing a simple, yet adaptive solution to RF nonlinearity. This paper therefore proposes a flexible data driven approach that uses a simple artificial neural network (ANN) to aid in the removal of the third order intermodulation distortion (IMD) as part of the demodulation process. We introduce and numerically evaluate two artificial intelligence (AI)-enhanced receivers-ANN as the IMD canceler and ANN as the demodulator. Our results show that a simple ANN structure can significantly improve the bit error rate (BER) performance of nonlinear receivers with strong blockers and that the ANN architecture and configuration depends mainly on the RF front end characteristics, such as the third order intercept point (IP3). We therefore recommend that receivers have hardware tags and ways to monitor those over time so that the AI and software radio processing stack can be effectively customized and automatically updated to deal with changing operating conditions. △ Less

Submitted 24 January, 2022; originally announced January 2022.

Comments: 6 pages, 6 figures

arXiv:2103.08017 [pdf, other]

Transient growth of accelerated optimization algorithms

Authors: Hesameddin Mohammadi, Samantha Samuelson, Mihailo R. Jovanović

Abstract: Optimization algorithms are increasingly being used in applications with limited time budgets. In many real-time and embedded scenarios, only a few iterations can be performed and traditional convergence metrics cannot be used to evaluate performance in these non-asymptotic regimes. In this paper, we examine the transient behavior of accelerated first-order optimization algorithms. For convex quad… ▽ More Optimization algorithms are increasingly being used in applications with limited time budgets. In many real-time and embedded scenarios, only a few iterations can be performed and traditional convergence metrics cannot be used to evaluate performance in these non-asymptotic regimes. In this paper, we examine the transient behavior of accelerated first-order optimization algorithms. For convex quadratic problems, we employ tools from linear systems theory to show that transient growth arises from the presence of non-normal dynamics. We identify the existence of modes that yield an algebraic growth in early iterations and quantify the transient excursion from the optimal solution caused by these modes. For strongly convex smooth optimization problems, we utilize the theory of integral quadratic constraints (IQCs) to establish an upper bound on the magnitude of the transient response of Nesterov's accelerated algorithm. We show that both the Euclidean distance between the optimization variable and the global minimizer and the rise time to the transient peak are proportional to the square root of the condition number of the problem. Finally, for problems with large condition numbers, we demonstrate tightness of the bounds that we derive up to constant factors. △ Less

Submitted 23 December, 2021; v1 submitted 14 March, 2021; originally announced March 2021.

Comments: 12 pages, 2 figures

arXiv:1912.11899 [pdf, other]

Convergence and sample complexity of gradient methods for the model-free linear quadratic regulator problem

Authors: Hesameddin Mohammadi, Armin Zare, Mahdi Soltanolkotabi, Mihailo R. Jovanović

Abstract: Model-free reinforcement learning attempts to find an optimal control action for an unknown dynamical system by directly searching over the parameter space of controllers. The convergence behavior and statistical properties of these approaches are often poorly understood because of the nonconvex nature of the underlying optimization problems and the lack of exact gradient computation. In this pape… ▽ More Model-free reinforcement learning attempts to find an optimal control action for an unknown dynamical system by directly searching over the parameter space of controllers. The convergence behavior and statistical properties of these approaches are often poorly understood because of the nonconvex nature of the underlying optimization problems and the lack of exact gradient computation. In this paper, we take a step towards demystifying the performance and efficiency of such methods by focusing on the standard infinite-horizon linear quadratic regulator problem for continuous-time systems with unknown state-space parameters. We establish exponential stability for the ordinary differential equation (ODE) that governs the gradient-flow dynamics over the set of stabilizing feedback gains and show that a similar result holds for the gradient descent method that arises from the forward Euler discretization of the corresponding ODE. We also provide theoretical bounds on the convergence rate and sample complexity of the random search method with two-point gradient estimates. We prove that the required simulation time for achieving $ε$-accuracy in the model-free setup and the total number of function evaluations both scale as $\log \, (1/ε)$. △ Less

Submitted 15 March, 2021; v1 submitted 26 December, 2019; originally announced December 2019.

Comments: 39 pages, 4 figures

arXiv:1905.11011 [pdf, other]

Robustness of accelerated first-order algorithms for strongly convex optimization problems

Authors: Hesameddin Mohammadi, Meisam Razaviyayn, Mihailo R. Jovanović

Abstract: We study the robustness of accelerated first-order algorithms to stochastic uncertainties in gradient evaluation. Specifically, for unconstrained, smooth, strongly convex optimization problems, we examine the mean-squared error in the optimization variable when the iterates are perturbed by additive white noise. This type of uncertainty may arise in situations where an approximation of the gradien… ▽ More We study the robustness of accelerated first-order algorithms to stochastic uncertainties in gradient evaluation. Specifically, for unconstrained, smooth, strongly convex optimization problems, we examine the mean-squared error in the optimization variable when the iterates are perturbed by additive white noise. This type of uncertainty may arise in situations where an approximation of the gradient is sought through measurements of a real system or in a distributed computation over a network. Even though the underlying dynamics of first-order algorithms for this class of problems are nonlinear, we establish upper bounds on the mean-squared deviation from the optimal solution that are tight up to constant factors. Our analysis quantifies fundamental trade-offs between noise amplification and convergence rates obtained via any acceleration scheme similar to Nesterov's or heavy-ball methods. To gain additional analytical insight, for strongly convex quadratic problems, we explicitly evaluate the steady-state variance of the optimization variable in terms of the eigenvalues of the Hessian of the objective function. We demonstrate that the entire spectrum of the Hessian, rather than just the extreme eigenvalues, influence robustness of noisy algorithms. We specialize this result to the problem of distributed averaging over undirected networks and examine the role of network size and topology on the robustness of noisy accelerated algorithms. △ Less

Submitted 20 February, 2020; v1 submitted 27 May, 2019; originally announced May 2019.

Comments: 45 pages, 6 figures

arXiv:1808.05294 [pdf, other]

Investigation of Using Disentangled and Interpretable Representations for One-shot Cross-lingual Voice Conversion

Authors: Seyed Hamidreza Mohammadi, Taehwan Kim

Abstract: We study the problem of cross-lingual voice conversion in non-parallel speech corpora and one-shot learning setting. Most prior work require either parallel speech corpora or enough amount of training data from a target speaker. However, we convert an arbitrary sentences of an arbitrary source speaker to target speaker's given only one target speaker training utterance. To achieve this, we formula… ▽ More We study the problem of cross-lingual voice conversion in non-parallel speech corpora and one-shot learning setting. Most prior work require either parallel speech corpora or enough amount of training data from a target speaker. However, we convert an arbitrary sentences of an arbitrary source speaker to target speaker's given only one target speaker training utterance. To achieve this, we formulate the problem as learning disentangled speaker-specific and context-specific representations and follow the idea of [1] which uses Factorized Hierarchical Variational Autoencoder (FHVAE). After training FHVAE on multi-speaker training data, given arbitrary source and target speakers' utterance, we estimate those latent representations and then reconstruct the desired utterance of converted voice to that of target speaker. We investigate the effectiveness of the approach by conducting voice conversion experiments with varying size of training utterances and it was able to achieve reasonable performance with even just one training utterance. We also examine the speech representation and show that World vocoder outperforms Short-time Fourier Transform (STFT) used in [1]. Finally, in the subjective tests, for one language and cross-lingual voice conversion, our approach achieved significantly better or comparable results compared to VAE-STFT and GMM baselines in speech quality and similarity. △ Less

Submitted 15 August, 2018; originally announced August 2018.

Comments: Proceedings of Interspeech 2018

arXiv:1807.01739 [pdf, other]

doi 10.1109/TAC.2019.2948268

Proximal algorithms for large-scale statistical modeling and sensor/actuator selection

Authors: Armin Zare, Hesameddin Mohammadi, Neil K. Dhingra, Tryphon T. Georgiou, Mihailo R. Jovanović

Abstract: Several problems in modeling and control of stochastically-driven dynamical systems can be cast as regularized semi-definite programs. We examine two such representative problems and show that they can be formulated in a similar manner. The first, in statistical modeling, seeks to reconcile observed statistics by suitably and minimally perturbing prior dynamics. The second seeks to optimally selec… ▽ More Several problems in modeling and control of stochastically-driven dynamical systems can be cast as regularized semi-definite programs. We examine two such representative problems and show that they can be formulated in a similar manner. The first, in statistical modeling, seeks to reconcile observed statistics by suitably and minimally perturbing prior dynamics. The second seeks to optimally select a subset of available sensors and actuators for control purposes. To address modeling and control of large-scale systems we develop a unified algorithmic framework using proximal methods. Our customized algorithms exploit problem structure and allow handling statistical modeling, as well as sensor and actuator selection, for substantially larger scales than what is amenable to current general-purpose solvers. We establish linear convergence of the proximal gradient algorithm, draw contrast between the proposed proximal algorithms and alternating direction method of multipliers, and provide examples that illustrate the merits and effectiveness of our framework. △ Less

Submitted 26 December, 2019; v1 submitted 4 July, 2018; originally announced July 2018.

Comments: To appear in IEEE Trans. Automat. Control

arXiv:1712.00522 [pdf, ps, other]

State Estimation For An Agonistic-Antagonistic Muscle System

Authors: Thang Nguyen, Holly Warner, Hung La, Hanieh Mohammadi, Dan Simon, Hanz Richter

Abstract: Research on assistive technology, rehabilitation, and prosthesis requires the understanding of human machine interaction, in which human muscular properties play a pivotal role. This paper studies a nonlinear agonistic-antagonistic muscle system based on the Hill muscle model. To investigate the characteristics of the muscle model, the problem of estimating the state variables and activation signa… ▽ More Research on assistive technology, rehabilitation, and prosthesis requires the understanding of human machine interaction, in which human muscular properties play a pivotal role. This paper studies a nonlinear agonistic-antagonistic muscle system based on the Hill muscle model. To investigate the characteristics of the muscle model, the problem of estimating the state variables and activation signals of the dual muscle system is considered. In this work, parameter uncertainty and unknown inputs are taken into account for the estimation problem. Three observers are presented: a high gain observer, a sliding mode observer, and an adaptive sliding mode observer. Theoretical analysis shows the convergence of the three observers. To facilitate numerical simulations, a backstep** controller is employed to drive the muscle system to track a desired trajectory. Numerical simulations reveal that the three observers are comparable and provide reliable estimates in noise free and noisy cases. The proposed schemes may serve as frameworks for estimation of complex multi-muscle systems, which could lead to intelligent exercise machines for adaptive training and rehabilitation, and adaptive prosthetics and exoskeletons. △ Less

Submitted 1 December, 2017; originally announced December 2017.

arXiv:1710.00342 [pdf, other]

Beam Switching Techniques for Millimeter Wave Vehicle to Infrastructure Communications

Authors: Hamed Mohammadi, Reza Mohammadkhani

Abstract: Beam alignment for millimeter wave (mm Wave) vehicular communications is challenging due to the high mobility of vehicles. Recent studies have proposed some beam switching techniques at Road Side Unit (RSU) for vehicle to infrastructure (V2I) communications, employing initial position and speed information of vehicles, that are sent through Dedicated Short Range Communications (DSRC) to the RSU. H… ▽ More Beam alignment for millimeter wave (mm Wave) vehicular communications is challenging due to the high mobility of vehicles. Recent studies have proposed some beam switching techniques at Road Side Unit (RSU) for vehicle to infrastructure (V2I) communications, employing initial position and speed information of vehicles, that are sent through Dedicated Short Range Communications (DSRC) to the RSU. However, inaccuracies of the provided information lead to beam misalignment. Some beam design parameters are suggested in the literature to combat this effect. But how these parameters should be tuned? Here, we evaluate the effect of all these parameters, and propose a beam design efficiency metric to perform beam alignment in the presence of the estimation errors, and to improve the performance by choosing the right design parameters. △ Less

Submitted 1 October, 2017; originally announced October 2017.

Comments: 6 pages, 7 figures, accepted to be published in iccke 2017

Showing 1–12 of 12 results for author: Mohammadi, H