-
Analysis of Reinforcement Learning Schemes for Trajectory Optimization of an Aerial Radio Unit
Authors:
Hossein Mohammadi,
Vuk Marojevic,
Bodong Shang
Abstract:
This paper introduces the deployment of unmanned aerial vehicles (UAVs) as lightweight wireless access points that leverage the fixed infrastructure in the context of the emerging open radio access network (O-RAN). More precisely, we propose an aerial radio unit that dynamically serves an under served area and connects to the distributed unit via a wireless fronthaul between the UAV and the closes…
▽ More
This paper introduces the deployment of unmanned aerial vehicles (UAVs) as lightweight wireless access points that leverage the fixed infrastructure in the context of the emerging open radio access network (O-RAN). More precisely, we propose an aerial radio unit that dynamically serves an under served area and connects to the distributed unit via a wireless fronthaul between the UAV and the closest tower. In this paper we analyze the UAV trajectory in terms of artificial intelligence (AI) when it serves both UEs and central units (CUs) at the same time in multi input multi output (MIMO) fading channel. We first demonstrate the nonconvexity of the problem of maximizing the overall network throughput based on UAV location, and then we use two different machine learning approaches to solve it. We first assume that the environment is a gridworld and then let the UAV explore the environment by flying from point A to point B, using both the offline Q-learning and the online SARSA algorithm and the achieved path-loss as the reward. With the intention of maximizing the average payoff, the trajectory in the second scenario is described as a Markov decision process (MDP). According to simulations, MDP produces better results in a smaller setting and in less time. In contrast, SARSA performs better in larger environments at the expense of a longer flight duration.
△ Less
Submitted 18 November, 2022;
originally announced November 2022.
-
Tradeoffs between convergence rate and noise amplification for momentum-based accelerated optimization algorithms
Authors:
Hesameddin Mohammadi,
Meisam Razaviyayn,
Mihailo R. Jovanović
Abstract:
We study momentum-based first-order optimization algorithms in which the iterations utilize information from the two previous steps and are subject to an additive white noise. This setup uses noise to account for uncertainty in either gradient evaluation or iteration updates, and it includes Polyak's heavy-ball and Nesterov's accelerated methods as special cases. For strongly convex quadratic prob…
▽ More
We study momentum-based first-order optimization algorithms in which the iterations utilize information from the two previous steps and are subject to an additive white noise. This setup uses noise to account for uncertainty in either gradient evaluation or iteration updates, and it includes Polyak's heavy-ball and Nesterov's accelerated methods as special cases. For strongly convex quadratic problems, we use the steady-state variance of the error in the optimization variable to quantify noise amplification and identify fundamental stochastic performance tradeoffs. Our approach utilizes the Jury stability criterion to provide a novel geometric characterization of conditions for linear convergence, and it reveals the relation between the noise amplification and convergence rate as well as their dependence on the condition number and the constant algorithmic parameters. This geometric insight leads to simple alternative proofs of standard convergence results and allows us to establish ``uncertainty principle'' of strongly convex optimization: for the two-step momentum method with linear convergence rate, the lower bound on the product between the settling time and noise amplification scales quadratically with the condition number. Our analysis also identifies a key difference between the gradient and iterate noise models: while the amplification of gradient noise can be made arbitrarily small by sufficiently decelerating the algorithm, the best achievable variance for the iterate noise model increases linearly with the settling time in the decelerating regime. Finally, we introduce two parameterized families of algorithms that strike a balance between noise amplification and settling time while preserving order-wise Pareto optimality for both noise models.
△ Less
Submitted 19 June, 2024; v1 submitted 24 September, 2022;
originally announced September 2022.
-
AI-based Optimal scheduling of Renewable AC Microgrids with bidirectional LSTM-Based Wind Power Forecasting
Authors:
Hossein Mohammadi,
Shiva Jokar,
Mojtaba Mohammadi,
Abdollah Kavousifard,
Morteza Dabbaghjamanesh
Abstract:
In terms of the operation of microgrids, optimal scheduling is a vital issue that must be taken into account. In this regard, this paper proposes an effective framework for optimal scheduling of renewable microgrids considering energy storage devices, wind turbines, micro turbines. Due to the nonlinearity and complexity of operation problems in microgrids, it is vital to use an accurate and robust…
▽ More
In terms of the operation of microgrids, optimal scheduling is a vital issue that must be taken into account. In this regard, this paper proposes an effective framework for optimal scheduling of renewable microgrids considering energy storage devices, wind turbines, micro turbines. Due to the nonlinearity and complexity of operation problems in microgrids, it is vital to use an accurate and robust optimization technique to efficiently solve this problem. To this end, in the proposed framework, the teacher learning-based optimization is utilized to efficiently solve the scheduling problem in the system. Moreover, a deep learning model based on bidirectional long short-term memory is proposed to address the short-term wind power forecasting problem. The feasibility and performance of the proposed framework as well as the effect of wind power forecasting on the operation efficiency are examined using IEEE 33-bus test system. Also, the Australian Wool north wind site data is utilized as a real-world dataset to evaluate the performance of the forecasting model. Results show the effective and efficient performance of the proposed framework in the optimal scheduling of microgrids.
△ Less
Submitted 8 August, 2022; v1 submitted 8 July, 2022;
originally announced August 2022.
-
Self Interference Management in In-Band Full-Duplex Systems
Authors:
Hossein Mohammadi,
Maryam Sabbaghian,
Vuk Marojevic
Abstract:
The evolution of wireless systems has led to a continuous increase in the demand for radio frequency spectrum. To address this issue, a technology that has received a lot of attention is In-Band Full-Duplex (IBFD). The interest in IBFD systems stems from its capability to simultaneously transmit and receive data in the same frequency. Cancelling the self interference (SI) from the transmitter to t…
▽ More
The evolution of wireless systems has led to a continuous increase in the demand for radio frequency spectrum. To address this issue, a technology that has received a lot of attention is In-Band Full-Duplex (IBFD). The interest in IBFD systems stems from its capability to simultaneously transmit and receive data in the same frequency. Cancelling the self interference (SI) from the transmitter to the collocated receiver plays a pivotal role in the performance of the system. There are two types of SI cancellation (SIC) approaches, passive and active. In this research, the focus is on active cancellation and, in particular, SIC in the digital domain. Among the direct and backscattered SI, the former has been studied for a long time; therefore, the backscatter is considered in this research and two SIC approaches are analyzed. The first achieves SIC through beamforming. This requires knowing the angle of the received SI to put the beam null-space in this direction. The second method removes SI by employing an Artificial Neural Networks (ANNs). Using an ANN, there is no need to know the direction of the SI. The neural network is trained with pilots which results in the network being able to separate the desired signal from the SI at the receiver. Bayesian Neural Networks show the importance of the weights and assign a parameter that facilitates ignoring the less significant ones. Through comparative simulations we demonstrate that the ANN-based SIC achieves equivalent bit error rate performance as two beamforming methods.
△ Less
Submitted 1 February, 2022;
originally announced February 2022.
-
AI-Driven Demodulators for Nonlinear Receivers in Shared Spectrum with High-Power Blockers
Authors:
Hossein Mohammadi,
Walaa AlQwider,
Talha Faizur Rahman,
Vuk Marojevic
Abstract:
Research has shown that communications systems and receivers suffer from high power adjacent channel signals, called blockers, that drive the radio frequency (RF) front end into nonlinear operation. Since simple systems, such as the Internet of Things (IoT), will coexist with sophisticated communications transceivers, radars and other spectrum consumers, these need to be protected employing a simp…
▽ More
Research has shown that communications systems and receivers suffer from high power adjacent channel signals, called blockers, that drive the radio frequency (RF) front end into nonlinear operation. Since simple systems, such as the Internet of Things (IoT), will coexist with sophisticated communications transceivers, radars and other spectrum consumers, these need to be protected employing a simple, yet adaptive solution to RF nonlinearity. This paper therefore proposes a flexible data driven approach that uses a simple artificial neural network (ANN) to aid in the removal of the third order intermodulation distortion (IMD) as part of the demodulation process. We introduce and numerically evaluate two artificial intelligence (AI)-enhanced receivers-ANN as the IMD canceler and ANN as the demodulator. Our results show that a simple ANN structure can significantly improve the bit error rate (BER) performance of nonlinear receivers with strong blockers and that the ANN architecture and configuration depends mainly on the RF front end characteristics, such as the third order intercept point (IP3). We therefore recommend that receivers have hardware tags and ways to monitor those over time so that the AI and software radio processing stack can be effectively customized and automatically updated to deal with changing operating conditions.
△ Less
Submitted 24 January, 2022;
originally announced January 2022.
-
Transient growth of accelerated optimization algorithms
Authors:
Hesameddin Mohammadi,
Samantha Samuelson,
Mihailo R. Jovanović
Abstract:
Optimization algorithms are increasingly being used in applications with limited time budgets. In many real-time and embedded scenarios, only a few iterations can be performed and traditional convergence metrics cannot be used to evaluate performance in these non-asymptotic regimes. In this paper, we examine the transient behavior of accelerated first-order optimization algorithms. For convex quad…
▽ More
Optimization algorithms are increasingly being used in applications with limited time budgets. In many real-time and embedded scenarios, only a few iterations can be performed and traditional convergence metrics cannot be used to evaluate performance in these non-asymptotic regimes. In this paper, we examine the transient behavior of accelerated first-order optimization algorithms. For convex quadratic problems, we employ tools from linear systems theory to show that transient growth arises from the presence of non-normal dynamics. We identify the existence of modes that yield an algebraic growth in early iterations and quantify the transient excursion from the optimal solution caused by these modes. For strongly convex smooth optimization problems, we utilize the theory of integral quadratic constraints (IQCs) to establish an upper bound on the magnitude of the transient response of Nesterov's accelerated algorithm. We show that both the Euclidean distance between the optimization variable and the global minimizer and the rise time to the transient peak are proportional to the square root of the condition number of the problem. Finally, for problems with large condition numbers, we demonstrate tightness of the bounds that we derive up to constant factors.
△ Less
Submitted 23 December, 2021; v1 submitted 14 March, 2021;
originally announced March 2021.
-
Convergence and sample complexity of gradient methods for the model-free linear quadratic regulator problem
Authors:
Hesameddin Mohammadi,
Armin Zare,
Mahdi Soltanolkotabi,
Mihailo R. Jovanović
Abstract:
Model-free reinforcement learning attempts to find an optimal control action for an unknown dynamical system by directly searching over the parameter space of controllers. The convergence behavior and statistical properties of these approaches are often poorly understood because of the nonconvex nature of the underlying optimization problems and the lack of exact gradient computation. In this pape…
▽ More
Model-free reinforcement learning attempts to find an optimal control action for an unknown dynamical system by directly searching over the parameter space of controllers. The convergence behavior and statistical properties of these approaches are often poorly understood because of the nonconvex nature of the underlying optimization problems and the lack of exact gradient computation. In this paper, we take a step towards demystifying the performance and efficiency of such methods by focusing on the standard infinite-horizon linear quadratic regulator problem for continuous-time systems with unknown state-space parameters. We establish exponential stability for the ordinary differential equation (ODE) that governs the gradient-flow dynamics over the set of stabilizing feedback gains and show that a similar result holds for the gradient descent method that arises from the forward Euler discretization of the corresponding ODE. We also provide theoretical bounds on the convergence rate and sample complexity of the random search method with two-point gradient estimates. We prove that the required simulation time for achieving $ε$-accuracy in the model-free setup and the total number of function evaluations both scale as $\log \, (1/ε)$.
△ Less
Submitted 15 March, 2021; v1 submitted 26 December, 2019;
originally announced December 2019.
-
Robustness of accelerated first-order algorithms for strongly convex optimization problems
Authors:
Hesameddin Mohammadi,
Meisam Razaviyayn,
Mihailo R. Jovanović
Abstract:
We study the robustness of accelerated first-order algorithms to stochastic uncertainties in gradient evaluation. Specifically, for unconstrained, smooth, strongly convex optimization problems, we examine the mean-squared error in the optimization variable when the iterates are perturbed by additive white noise. This type of uncertainty may arise in situations where an approximation of the gradien…
▽ More
We study the robustness of accelerated first-order algorithms to stochastic uncertainties in gradient evaluation. Specifically, for unconstrained, smooth, strongly convex optimization problems, we examine the mean-squared error in the optimization variable when the iterates are perturbed by additive white noise. This type of uncertainty may arise in situations where an approximation of the gradient is sought through measurements of a real system or in a distributed computation over a network. Even though the underlying dynamics of first-order algorithms for this class of problems are nonlinear, we establish upper bounds on the mean-squared deviation from the optimal solution that are tight up to constant factors. Our analysis quantifies fundamental trade-offs between noise amplification and convergence rates obtained via any acceleration scheme similar to Nesterov's or heavy-ball methods. To gain additional analytical insight, for strongly convex quadratic problems, we explicitly evaluate the steady-state variance of the optimization variable in terms of the eigenvalues of the Hessian of the objective function. We demonstrate that the entire spectrum of the Hessian, rather than just the extreme eigenvalues, influence robustness of noisy algorithms. We specialize this result to the problem of distributed averaging over undirected networks and examine the role of network size and topology on the robustness of noisy accelerated algorithms.
△ Less
Submitted 20 February, 2020; v1 submitted 27 May, 2019;
originally announced May 2019.
-
Investigation of Using Disentangled and Interpretable Representations for One-shot Cross-lingual Voice Conversion
Authors:
Seyed Hamidreza Mohammadi,
Taehwan Kim
Abstract:
We study the problem of cross-lingual voice conversion in non-parallel speech corpora and one-shot learning setting. Most prior work require either parallel speech corpora or enough amount of training data from a target speaker. However, we convert an arbitrary sentences of an arbitrary source speaker to target speaker's given only one target speaker training utterance. To achieve this, we formula…
▽ More
We study the problem of cross-lingual voice conversion in non-parallel speech corpora and one-shot learning setting. Most prior work require either parallel speech corpora or enough amount of training data from a target speaker. However, we convert an arbitrary sentences of an arbitrary source speaker to target speaker's given only one target speaker training utterance. To achieve this, we formulate the problem as learning disentangled speaker-specific and context-specific representations and follow the idea of [1] which uses Factorized Hierarchical Variational Autoencoder (FHVAE). After training FHVAE on multi-speaker training data, given arbitrary source and target speakers' utterance, we estimate those latent representations and then reconstruct the desired utterance of converted voice to that of target speaker. We investigate the effectiveness of the approach by conducting voice conversion experiments with varying size of training utterances and it was able to achieve reasonable performance with even just one training utterance. We also examine the speech representation and show that World vocoder outperforms Short-time Fourier Transform (STFT) used in [1]. Finally, in the subjective tests, for one language and cross-lingual voice conversion, our approach achieved significantly better or comparable results compared to VAE-STFT and GMM baselines in speech quality and similarity.
△ Less
Submitted 15 August, 2018;
originally announced August 2018.
-
Proximal algorithms for large-scale statistical modeling and sensor/actuator selection
Authors:
Armin Zare,
Hesameddin Mohammadi,
Neil K. Dhingra,
Tryphon T. Georgiou,
Mihailo R. Jovanović
Abstract:
Several problems in modeling and control of stochastically-driven dynamical systems can be cast as regularized semi-definite programs. We examine two such representative problems and show that they can be formulated in a similar manner. The first, in statistical modeling, seeks to reconcile observed statistics by suitably and minimally perturbing prior dynamics. The second seeks to optimally selec…
▽ More
Several problems in modeling and control of stochastically-driven dynamical systems can be cast as regularized semi-definite programs. We examine two such representative problems and show that they can be formulated in a similar manner. The first, in statistical modeling, seeks to reconcile observed statistics by suitably and minimally perturbing prior dynamics. The second seeks to optimally select a subset of available sensors and actuators for control purposes. To address modeling and control of large-scale systems we develop a unified algorithmic framework using proximal methods. Our customized algorithms exploit problem structure and allow handling statistical modeling, as well as sensor and actuator selection, for substantially larger scales than what is amenable to current general-purpose solvers. We establish linear convergence of the proximal gradient algorithm, draw contrast between the proposed proximal algorithms and alternating direction method of multipliers, and provide examples that illustrate the merits and effectiveness of our framework.
△ Less
Submitted 26 December, 2019; v1 submitted 4 July, 2018;
originally announced July 2018.
-
State Estimation For An Agonistic-Antagonistic Muscle System
Authors:
Thang Nguyen,
Holly Warner,
Hung La,
Hanieh Mohammadi,
Dan Simon,
Hanz Richter
Abstract:
Research on assistive technology, rehabilitation, and prosthesis requires the understanding of human machine interaction, in which human muscular properties play a pivotal role. This paper studies a nonlinear agonistic-antagonistic muscle system based on the Hill muscle model. To investigate the characteristics of the muscle model, the problem of estimating the state variables and activation signa…
▽ More
Research on assistive technology, rehabilitation, and prosthesis requires the understanding of human machine interaction, in which human muscular properties play a pivotal role. This paper studies a nonlinear agonistic-antagonistic muscle system based on the Hill muscle model. To investigate the characteristics of the muscle model, the problem of estimating the state variables and activation signals of the dual muscle system is considered. In this work, parameter uncertainty and unknown inputs are taken into account for the estimation problem. Three observers are presented: a high gain observer, a sliding mode observer, and an adaptive sliding mode observer. Theoretical analysis shows the convergence of the three observers. To facilitate numerical simulations, a backstep** controller is employed to drive the muscle system to track a desired trajectory. Numerical simulations reveal that the three observers are comparable and provide reliable estimates in noise free and noisy cases. The proposed schemes may serve as frameworks for estimation of complex multi-muscle systems, which could lead to intelligent exercise machines for adaptive training and rehabilitation, and adaptive prosthetics and exoskeletons.
△ Less
Submitted 1 December, 2017;
originally announced December 2017.
-
Beam Switching Techniques for Millimeter Wave Vehicle to Infrastructure Communications
Authors:
Hamed Mohammadi,
Reza Mohammadkhani
Abstract:
Beam alignment for millimeter wave (mm Wave) vehicular communications is challenging due to the high mobility of vehicles. Recent studies have proposed some beam switching techniques at Road Side Unit (RSU) for vehicle to infrastructure (V2I) communications, employing initial position and speed information of vehicles, that are sent through Dedicated Short Range Communications (DSRC) to the RSU. H…
▽ More
Beam alignment for millimeter wave (mm Wave) vehicular communications is challenging due to the high mobility of vehicles. Recent studies have proposed some beam switching techniques at Road Side Unit (RSU) for vehicle to infrastructure (V2I) communications, employing initial position and speed information of vehicles, that are sent through Dedicated Short Range Communications (DSRC) to the RSU. However, inaccuracies of the provided information lead to beam misalignment. Some beam design parameters are suggested in the literature to combat this effect. But how these parameters should be tuned? Here, we evaluate the effect of all these parameters, and propose a beam design efficiency metric to perform beam alignment in the presence of the estimation errors, and to improve the performance by choosing the right design parameters.
△ Less
Submitted 1 October, 2017;
originally announced October 2017.