-
Enhancing Solar Driver Forecasting with Multivariate Transformers
Authors:
Sergio Sanchez-Hurtado,
Victor Rodriguez-Fernandez,
Julia Briden,
Peng Mun Siew,
Richard Linares
Abstract:
In this work, we develop a comprehensive framework for F10.7, S10.7, M10.7, and Y10.7 solar driver forecasting with a time series Transformer (PatchTST). To ensure an equal representation of high and low levels of solar activity, we construct a custom loss function to weight samples based on the distance between the solar driver's historical distribution and the training set. The solar driver fore…
▽ More
In this work, we develop a comprehensive framework for F10.7, S10.7, M10.7, and Y10.7 solar driver forecasting with a time series Transformer (PatchTST). To ensure an equal representation of high and low levels of solar activity, we construct a custom loss function to weight samples based on the distance between the solar driver's historical distribution and the training set. The solar driver forecasting framework includes an 18-day lookback window and forecasts 6 days into the future. When benchmarked against the Space Environment Technologies (SET) dataset, our model consistently produces forecasts with a lower standard mean error in nearly all cases, with improved prediction accuracy during periods of high solar activity. All the code is available on Github https://github.com/ARCLab-MIT/sw-driver-forecaster.
△ Less
Submitted 22 June, 2024;
originally announced June 2024.
-
A New Monte-Carlo Model for the Space Environment
Authors:
Daniel Jang,
Davide Gusmini,
Peng Mun Siew,
Andrea D'Ambrosio,
Simone Servadio,
Pablo Machuca,
Richard Linares
Abstract:
This paper introduces a novel Monte Carlo (MC) method to simulate the evolution of the low-earth orbit environment, enhancing the MIT Orbital Capacity Analysis Tool (MOCAT). In recent decades, numerous space environment models have been developed by government agencies and research groups to understand and predict the dynamics of space debris. Our MC approach advances this by simulating the trajec…
▽ More
This paper introduces a novel Monte Carlo (MC) method to simulate the evolution of the low-earth orbit environment, enhancing the MIT Orbital Capacity Analysis Tool (MOCAT). In recent decades, numerous space environment models have been developed by government agencies and research groups to understand and predict the dynamics of space debris. Our MC approach advances this by simulating the trajectories of space objects and modeling their interactions, such as collisions and explosions. This aids in analyzing the trends of space-object and debris populations. A key innovation of our method is the computational efficiency in orbit propagation, which is crucial for handling potentially large numbers of objects over centuries. We present validation results against the IADC (Inter-Agency Space Debris Coordination Committee) study and explore various scenarios, including ones without future launches and those involving the launch of proposed megaconstellations with over 80,000 active payloads. With the improvement in computational efficiencies provided by this work, we can run these new scenarios that predict millions of trackable objects over a 200-year period. The previous state-of-the-art was 400,000 objects over the same period of time. Notably, while fewer megaconstellations are planned for altitudes above 800 km, even minimal failures in post-mission disposal or collision avoidance maneuvers can significantly impact orbital debris accumulation.
△ Less
Submitted 16 May, 2024;
originally announced May 2024.
-
Fair Concurrent Training of Multiple Models in Federated Learning
Authors:
Marie Siew,
Haoran Zhang,
Jong-Ik Park,
Yuezhou Liu,
Yichen Ruan,
Lili Su,
Stratis Ioannidis,
Edmund Yeh,
Carlee Joe-Wong
Abstract:
Federated learning (FL) enables collaborative learning across multiple clients. In most FL work, all clients train a single learning task. However, the recent proliferation of FL applications may increasingly require multiple FL tasks to be trained simultaneously, sharing clients' computing and communication resources, which we call Multiple-Model Federated Learning (MMFL). Current MMFL algorithms…
▽ More
Federated learning (FL) enables collaborative learning across multiple clients. In most FL work, all clients train a single learning task. However, the recent proliferation of FL applications may increasingly require multiple FL tasks to be trained simultaneously, sharing clients' computing and communication resources, which we call Multiple-Model Federated Learning (MMFL). Current MMFL algorithms use naive average-based client-task allocation schemes that can lead to unfair performance when FL tasks have heterogeneous difficulty levels, e.g., tasks with larger models may need more rounds and data to train. Just as naively allocating resources to generic computing jobs with heterogeneous resource needs can lead to unfair outcomes, naive allocation of clients to FL tasks can lead to unfairness, with some tasks having excessively long training times, or lower converged accuracies. Furthermore, in the FL setting, since clients are typically not paid for their training effort, we face a further challenge that some clients may not even be willing to train some tasks, e.g., due to high computational costs, which may exacerbate unfairness in training outcomes across tasks. We address both challenges by firstly designing FedFairMMFL, a difficulty-aware algorithm that dynamically allocates clients to tasks in each training round. We provide guarantees on airness and FedFairMMFL's convergence rate. We then propose a novel auction design that incentivizes clients to train multiple tasks, so as to fairly distribute clients' training efforts across the tasks. We show how our fairness-based learning and incentive mechanisms impact training convergence and finally evaluate our algorithm with multiple sets of learning tasks on real world datasets.
△ Less
Submitted 21 April, 2024;
originally announced April 2024.
-
Language Models are Spacecraft Operators
Authors:
Victor Rodriguez-Fernandez,
Alejandro Carrasco,
Jason Cheng,
Eli Scharf,
Peng Mun Siew,
Richard Linares
Abstract:
Recent trends are emerging in the use of Large Language Models (LLMs) as autonomous agents that take actions based on the content of the user text prompts. We intend to apply these concepts to the field of Guidance, Navigation, and Control in space, enabling LLMs to have a significant role in the decision-making process for autonomous satellite operations. As a first step towards this goal, we hav…
▽ More
Recent trends are emerging in the use of Large Language Models (LLMs) as autonomous agents that take actions based on the content of the user text prompts. We intend to apply these concepts to the field of Guidance, Navigation, and Control in space, enabling LLMs to have a significant role in the decision-making process for autonomous satellite operations. As a first step towards this goal, we have developed a pure LLM-based solution for the Kerbal Space Program Differential Games (KSPDG) challenge, a public software design competition where participants create autonomous agents for maneuvering satellites involved in non-cooperative space operations, running on the KSP game engine. Our approach leverages prompt engineering, few-shot prompting, and fine-tuning techniques to create an effective LLM-based agent that ranked 2nd in the competition. To the best of our knowledge, this work pioneers the integration of LLM agents into space research. Code is available at https://github.com/ARCLab-MIT/kspdg.
△ Less
Submitted 30 March, 2024;
originally announced April 2024.
-
An LLM-Based Digital Twin for Optimizing Human-in-the Loop Systems
Authors:
Hanqing Yang,
Marie Siew,
Carlee Joe-Wong
Abstract:
The increasing prevalence of Cyber-Physical Systems and the Internet of Things (CPS-IoT) applications and Foundation Models are enabling new applications that leverage real-time control of the environment. For example, real-time control of Heating, Ventilation and Air-Conditioning (HVAC) systems can reduce its usage when not needed for the comfort of human occupants, hence reducing energy consumpt…
▽ More
The increasing prevalence of Cyber-Physical Systems and the Internet of Things (CPS-IoT) applications and Foundation Models are enabling new applications that leverage real-time control of the environment. For example, real-time control of Heating, Ventilation and Air-Conditioning (HVAC) systems can reduce its usage when not needed for the comfort of human occupants, hence reducing energy consumption. Collecting real-time feedback on human preferences in such human-in-the-loop (HITL) systems, however, is difficult in practice. We propose the use of large language models (LLMs) to deal with the challenges of dynamic environments and difficult-to-obtain data in CPS optimization. In this paper, we present a case study that employs LLM agents to mimic the behaviors and thermal preferences of various population groups (e.g. young families, the elderly) in a shop** mall. The aggregated thermal preferences are integrated into an agent-in-the-loop based reinforcement learning algorithm AitL-RL, which employs the LLM as a dynamic simulation of the physical environment to learn how to balance between energy savings and occupant comfort. Our results show that LLMs are capable of simulating complex population movements within large open spaces. Besides, AitL-RL demonstrates superior performance compared to the popular existing policy of set point control, suggesting that adaptive and personalized decision-making is critical for efficient optimization in CPS-IoT applications. Through this case study, we demonstrate the potential of integrating advanced Foundation Models like LLMs into CPS-IoT to enhance system adaptability and efficiency. The project's code can be found on our GitHub repository.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
Towards a Machine Learning-Based Approach to Predict Space Object Density Distributions
Authors:
Victor Rodriguez-Fernandez,
Sumiyajav Sarangerel,
Peng Mun Siew,
Pablo Machuca,
Daniel Jang,
Richard Linares
Abstract:
With the rapid increase in the number of Anthropogenic Space Objects (ASOs), Low Earth Orbit (LEO) is facing significant congestion, thereby posing challenges to space operators and risking the viability of the space environment for varied uses. Current models for examining this evolution, while detailed, are computationally demanding. To address these issues, we propose a novel machine learning-b…
▽ More
With the rapid increase in the number of Anthropogenic Space Objects (ASOs), Low Earth Orbit (LEO) is facing significant congestion, thereby posing challenges to space operators and risking the viability of the space environment for varied uses. Current models for examining this evolution, while detailed, are computationally demanding. To address these issues, we propose a novel machine learning-based model, as an extension of the MIT Orbital Capacity Tool (MOCAT). This advanced model is designed to accelerate the propagation of ASO density distributions, and it is trained on hundreds of simulations generated by an established and accurate model of the space environment evolution. We study how different deep learning-based solutions can potentially be good candidates for ASO propagation and manage the high-dimensionality of the data. To assess the model's capabilities, we conduct experiments in long term forecasting scenarios (around 100 years), analyze how and why the performance degrades over time, and discuss potential solutions to make this solution better.
△ Less
Submitted 8 January, 2024;
originally announced January 2024.
-
Transformer-based Atmospheric Density Forecasting
Authors:
Julia Briden,
Peng Mun Siew,
Victor Rodriguez-Fernandez,
Richard Linares
Abstract:
As the peak of the solar cycle approaches in 2025 and the ability of a single geomagnetic storm to significantly alter the orbit of Resident Space Objects (RSOs), techniques for atmospheric density forecasting are vital for space situational awareness. While linear data-driven methods, such as dynamic mode decomposition with control (DMDc), have been used previously for forecasting atmospheric den…
▽ More
As the peak of the solar cycle approaches in 2025 and the ability of a single geomagnetic storm to significantly alter the orbit of Resident Space Objects (RSOs), techniques for atmospheric density forecasting are vital for space situational awareness. While linear data-driven methods, such as dynamic mode decomposition with control (DMDc), have been used previously for forecasting atmospheric density, deep learning-based forecasting has the ability to capture nonlinearities in data. By learning multiple layer weights from historical atmospheric density data, long-term dependencies in the dataset are captured in the map** between the current atmospheric density state and control input to the atmospheric density state at the next timestep. This work improves upon previous linear propagation methods for atmospheric density forecasting, by develo** a nonlinear transformer-based architecture for atmospheric density forecasting. Empirical NRLMSISE-00 and JB2008, as well as physics-based TIEGCM atmospheric density models are compared for forecasting with DMDc and with the transformer-based propagator.
△ Less
Submitted 25 October, 2023;
originally announced October 2023.
-
Differentially Private Deep Q-Learning for Pattern Privacy Preservation in MEC Offloading
Authors:
Shuying Gan,
Marie Siew,
Chao Xu,
Tony Q. S. Quek
Abstract:
Mobile edge computing (MEC) is a promising paradigm to meet the quality of service (QoS) requirements of latency-sensitive IoT applications. However, attackers may eavesdrop on the offloading decisions to infer the edge server's (ES's) queue information and users' usage patterns, thereby incurring the pattern privacy (PP) issue. Therefore, we propose an offloading strategy which jointly minimizes…
▽ More
Mobile edge computing (MEC) is a promising paradigm to meet the quality of service (QoS) requirements of latency-sensitive IoT applications. However, attackers may eavesdrop on the offloading decisions to infer the edge server's (ES's) queue information and users' usage patterns, thereby incurring the pattern privacy (PP) issue. Therefore, we propose an offloading strategy which jointly minimizes the latency, ES's energy consumption, and task drop** rate, while preserving PP. Firstly, we formulate the dynamic computation offloading procedure as a Markov decision process (MDP). Next, we develop a Differential Privacy Deep Q-learning based Offloading (DP-DQO) algorithm to solve this problem while addressing the PP issue by injecting noise into the generated offloading decisions. This is achieved by modifying the deep Q-network (DQN) with a Function-output Gaussian process mechanism. We provide a theoretical privacy guarantee and a utility guarantee (learning error bound) for the DP-DQO algorithm and finally, conduct simulations to evaluate the performance of our proposed algorithm by comparing it with greedy and DQN-based algorithms.
△ Less
Submitted 9 February, 2023;
originally announced February 2023.
-
FIRE: A Failure-Adaptive Reinforcement Learning Framework for Edge Computing Migrations
Authors:
Marie Siew,
Shikhar Sharma,
Zekai Li,
Kun Guo,
Chao Xu,
Tania Lorido-Botran,
Tony Q. S. Quek,
Carlee Joe-Wong
Abstract:
In edge computing, users' service profiles are migrated due to user mobility. Reinforcement learning (RL) frameworks have been proposed to do so, often trained on simulated data. However, existing RL frameworks overlook occasional server failures, which although rare, impact latency-sensitive applications like autonomous driving and real-time obstacle detection. Nevertheless, these failures (rare…
▽ More
In edge computing, users' service profiles are migrated due to user mobility. Reinforcement learning (RL) frameworks have been proposed to do so, often trained on simulated data. However, existing RL frameworks overlook occasional server failures, which although rare, impact latency-sensitive applications like autonomous driving and real-time obstacle detection. Nevertheless, these failures (rare events), being not adequately represented in historical training data, pose a challenge for data-driven RL algorithms. As it is impractical to adjust failure frequency in real-world applications for training, we introduce FIRE, a framework that adapts to rare events by training a RL policy in an edge computing digital twin environment. We propose ImRE, an importance sampling-based Q-learning algorithm, which samples rare events proportionally to their impact on the value function. FIRE considers delay, migration, failure, and backup placement costs across individual and shared service profiles. We prove ImRE's boundedness and convergence to optimality. Next, we introduce novel deep Q-learning (ImDQL) and actor critic (ImACRE) versions of our algorithm to enhance scalability. We extend our framework to accommodate users with varying risk tolerances. Through trace driven experiments, we show that FIRE reduces costs compared to vanilla RL and the greedy baseline in the event of failures.
△ Less
Submitted 7 March, 2024; v1 submitted 28 September, 2022;
originally announced September 2022.
-
Towards Effective Resource Procurement in MEC: a Resource Re-selling Framework
Authors:
Marie Siew,
Shikhar Sharma,
Kun Guo,
Desmond Cai,
Wanli Wen,
Carlee Joe-Wong,
Tony Q. S. Quek
Abstract:
On-demand and resource reservation pricing models have been widely used in cloud computing, catering to different user requirements. Nevertheless, in Multi-Access Edge Computing (MEC), as the edge has limited resources compared to the cloud, on-demand users may not get their jobs served on time, or at all, if too many resources were reserved by reservation plan users. Concurrently, reservation pla…
▽ More
On-demand and resource reservation pricing models have been widely used in cloud computing, catering to different user requirements. Nevertheless, in Multi-Access Edge Computing (MEC), as the edge has limited resources compared to the cloud, on-demand users may not get their jobs served on time, or at all, if too many resources were reserved by reservation plan users. Concurrently, reservation plan users may possess excess un-utilized quota. To optimize this resource mismatch scenario, we propose a Sharing Quota Model (SQM) where reservation plan users can re-sell unused resource quota to on-demand users, with the mobile network operator (MNO) taking a commission. To analyze the user's aggregate behavior at equilibrium and investigate the MNO's incentive of allowing re-selling, we formulate a 3-stage non-cooperative Stackelberg Game. Solving this game, we characterize the optimal strategies of buyers and re-sellers. We show that on aggregate, users' optimal strategies give rise to 4 disjoint regions, dependent on the MNO's prices and supply levels. Based on this, we characterise the MNO's optimal prices for on-demand users. Numerical results show that having both the sharing and on-demand pool gives the MNO an optimal revenue when the on-demand pool's supply is low, and when the MNO's commission is low.
△ Less
Submitted 8 November, 2023; v1 submitted 1 March, 2022;
originally announced March 2022.
-
Let's Share VMs: Optimal Placement and Pricing across Base Stations in MEC Systems
Authors:
Marie Siew,
Kun Guo,
Desmond Cai,
Lingxiang Li,
Tony Q. S. Quek
Abstract:
In mobile edge computing (MEC) systems, users offload computationally intensive tasks to edge servers at base stations. However, with unequal demand across the network, there might be excess demand at some locations and underutilized resources at other locations. To address such load-unbalanced problem in MEC systems, in this paper we propose virtual machines (VMs) sharing across base stations. Sp…
▽ More
In mobile edge computing (MEC) systems, users offload computationally intensive tasks to edge servers at base stations. However, with unequal demand across the network, there might be excess demand at some locations and underutilized resources at other locations. To address such load-unbalanced problem in MEC systems, in this paper we propose virtual machines (VMs) sharing across base stations. Specifically, we consider the joint VM placement and pricing problem across base stations to match demand and supply and maximize revenue at the network level. To make this problem tractable, we decompose it into master and slave problems. For the placement master problem, we propose a Markov approximation algorithm MAP on the design of a continuous time Markov chain. As for the pricing slave problem, we propose OPA - an optimal VM pricing auction, where all users are truthful. Furthermore, given users' potential untruthful behaviors, we propose an incentive compatible auction iCAT along with a partitioning mechanism PUFF, for which we prove incentive compatibility and revenue guarantees. Finally, we combine MAP and OPA or PUFF to solve the original problem, and analyze the optimality gap. Simulation results show that collaborative base stations increases revenue by up to 50%.
△ Less
Submitted 15 January, 2021;
originally announced January 2021.
-
Optimal Pricing for Job Offloading in the MEC System with Two Priority Classes
Authors:
Lingxiang Li,
Marie Siew,
Tony Q. S. Quek,
Zhi Chen
Abstract:
Multi-Access edge computing (MEC) is an emerging paradigm where users offload computationally intensive jobs to the Access Point (AP). Given that the AP's resources are shared by selfish users, pricing is a useful tool for incentivising users to internalize the negative externality of delay they cause to other users. Nevertheless, different users have different negative valuations towards delay as…
▽ More
Multi-Access edge computing (MEC) is an emerging paradigm where users offload computationally intensive jobs to the Access Point (AP). Given that the AP's resources are shared by selfish users, pricing is a useful tool for incentivising users to internalize the negative externality of delay they cause to other users. Nevertheless, different users have different negative valuations towards delay as some are more delay sensitive. To serve heterogeneous users, we propose a priority pricing scheme where users can get served first for a higher price. Our goal is to find the prices such that in decision making, users will choose the class and the offloading frequency that jointly maximize social welfare. With the assumption that the AP knows users' profit functions, we derive in semi-closed form the optimal prices. However in practice, the reporting of users's profit information incurs a large signalling overhead. Besides, in reality users might falsely report their private profit information. To overcome this, we further propose a learning-based pricing mechanism where no knowledge of individual user profit functions is required. At equilibrium, the optimal prices and average edge delays are learnt, and users have chosen the correct priority class and offload at the socially optimal frequency.
△ Less
Submitted 30 August, 2020; v1 submitted 19 May, 2019;
originally announced May 2019.