-
Energy Aware Deep Reinforcement Learning Scheduling for Sensors Correlated in Time and Space
Authors:
Jernej Hribar,
Andrei Marinescu,
Alessandro Chiumento,
Luiz A. DaSilva
Abstract:
Millions of battery-powered sensors deployed for monitoring purposes in a multitude of scenarios, e.g., agriculture, smart cities, industry, etc., require energy-efficient solutions to prolong their lifetime. When these sensors observe a phenomenon distributed in space and evolving in time, it is expected that collected observations will be correlated in time and space. In this paper, we propose a…
▽ More
Millions of battery-powered sensors deployed for monitoring purposes in a multitude of scenarios, e.g., agriculture, smart cities, industry, etc., require energy-efficient solutions to prolong their lifetime. When these sensors observe a phenomenon distributed in space and evolving in time, it is expected that collected observations will be correlated in time and space. In this paper, we propose a Deep Reinforcement Learning (DRL) based scheduling mechanism capable of taking advantage of correlated information. We design our solution using the Deep Deterministic Policy Gradient (DDPG) algorithm. The proposed mechanism is capable of determining the frequency with which sensors should transmit their updates, to ensure accurate collection of observations, while simultaneously considering the energy available. To evaluate our scheduling mechanism, we use multiple datasets containing environmental observations obtained in multiple real deployments. The real observations enable us to model the environment with which the mechanism interacts as realistically as possible. We show that our solution can significantly extend the sensors' lifetime. We compare our mechanism to an idealized, all-knowing scheduler to demonstrate that its performance is near-optimal. Additionally, we highlight the unique feature of our design, energy-awareness, by displaying the impact of sensors' energy levels on the frequency of updates.
△ Less
Submitted 29 September, 2021; v1 submitted 19 November, 2020;
originally announced November 2020.
-
Using Deep Q-learning To Prolong the Lifetime of Correlated Internet of Things Devices
Authors:
Jernej Hribar,
Andrei Marinescu,
George A. Ropokis,
Luiz A. DaSilva
Abstract:
Battery-powered sensors deployed in the Internet of Things (IoT) require energy-efficient solutions to prolong their lifetime. When these sensors observe a physical phenomenon distributed in space and evolving in time, the collected observations are expected to be correlated. We take advantage of the exhibited correlation and propose an updating mechanism that employs deep Q-learning. Our mechanis…
▽ More
Battery-powered sensors deployed in the Internet of Things (IoT) require energy-efficient solutions to prolong their lifetime. When these sensors observe a physical phenomenon distributed in space and evolving in time, the collected observations are expected to be correlated. We take advantage of the exhibited correlation and propose an updating mechanism that employs deep Q-learning. Our mechanism is capable of determining the frequency with which sensors should transmit their updates while taking into the consideration an ever-changing environment. We evaluate our solution using observations obtained in a real deployment, and show that our proposed mechanism is capable of significantly extending battery-powered sensors' lifetime without compromising the accuracy of the observations provided to the IoT service.
△ Less
Submitted 7 February, 2019;
originally announced February 2019.
-
A Multi-Agent Neural Network for Dynamic Frequency Reuse in LTE Networks
Authors:
Andrei Marinescu,
Irene Macaluso,
Luiz A. DaSilva
Abstract:
Fractional Frequency Reuse techniques can be employed to address interference in mobile networks, improving throughput for edge users. There is a tradeoff between the coverage and overall throughput achievable, as interference avoidance techniques lead to a loss in a cell's overall throughput, with spectrum efficiency decreasing with the fencing off of orthogonal resources. In this paper we propos…
▽ More
Fractional Frequency Reuse techniques can be employed to address interference in mobile networks, improving throughput for edge users. There is a tradeoff between the coverage and overall throughput achievable, as interference avoidance techniques lead to a loss in a cell's overall throughput, with spectrum efficiency decreasing with the fencing off of orthogonal resources. In this paper we propose MANN, a dynamic multiagent frequency reuse scheme, where individual agents in charge of cells control their configurations based on input from neural networks. The agents' decisions are partially influenced by a coordinator agent, which attempts to maximise a global metric of the network (e.g., cell-edge performance). Each agent uses a neural network to estimate the best action (i.e., cell configuration) for its current environment setup, and attempts to maximise in turn a local metric, subject to the constraint imposed by the coordinator agent. Results show that our solution provides improved performance for edge users, increasing the throughput of the bottom 5% of users by 22%, while retaining 95% of a network's overall throughput from the full frequency reuse case. Furthermore, we show how our method improves on static fractional frequency reuse schemes.
△ Less
Submitted 16 January, 2018;
originally announced January 2018.
-
Decentralised Multi-Agent Reinforcement Learning for Dynamic and Uncertain Environments
Authors:
Andrei Marinescu,
Ivana Dusparic,
Adam Taylor,
Vinny Cahill,
Siobhán Clarke
Abstract:
Multi-Agent Reinforcement Learning (MARL) is a widely used technique for optimization in decentralised control problems. However, most applications of MARL are in static environments, and are not suitable when agent behaviour and environment conditions are dynamic and uncertain. Addressing uncertainty in such environments remains a challenging problem for MARL-based systems. The dynamic nature of…
▽ More
Multi-Agent Reinforcement Learning (MARL) is a widely used technique for optimization in decentralised control problems. However, most applications of MARL are in static environments, and are not suitable when agent behaviour and environment conditions are dynamic and uncertain. Addressing uncertainty in such environments remains a challenging problem for MARL-based systems. The dynamic nature of the environment causes previous knowledge of how agents interact to become outdated. Advanced knowledge of potential changes through prediction significantly supports agents converging to near-optimal control solutions. In this paper we propose P-MARL, a decentralised MARL algorithm enhanced by a prediction mechanism that provides accurate information regarding up-coming changes in the environment. This prediction is achieved by employing an Artificial Neural Network combined with a Self-Organising Map that detects and matches changes in the environment. The proposed algorithm is validated in a realistic smart-grid scenario, and provides a 92% Pareto efficient solution to an electric vehicle charging problem.
△ Less
Submitted 16 September, 2014;
originally announced September 2014.