-
FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning
Authors:
Yuwei Fu,
Haichao Zhang,
Di Wu,
Wei Xu,
Benoit Boulet
Abstract:
In this work, we investigate how to leverage pre-trained visual-language models (VLM) for online Reinforcement Learning (RL). In particular, we focus on sparse reward tasks with pre-defined textual task descriptions. We first identify the problem of reward misalignment when applying VLM as a reward in RL tasks. To address this issue, we introduce a lightweight fine-tuning method, named Fuzzy VLM r…
▽ More
In this work, we investigate how to leverage pre-trained visual-language models (VLM) for online Reinforcement Learning (RL). In particular, we focus on sparse reward tasks with pre-defined textual task descriptions. We first identify the problem of reward misalignment when applying VLM as a reward in RL tasks. To address this issue, we introduce a lightweight fine-tuning method, named Fuzzy VLM reward-aided RL (FuRL), based on reward alignment and relay RL. Specifically, we enhance the performance of SAC/DrQ baseline agents on sparse reward tasks by fine-tuning VLM representations and using relay RL to avoid local minima. Extensive experiments on the Meta-world benchmark tasks demonstrate the efficacy of the proposed method. Code is available at: https://github.com/fuyw/FuRL.
△ Less
Submitted 4 June, 2024; v1 submitted 2 June, 2024;
originally announced June 2024.
-
An Online Spatial-Temporal Graph Trajectory Planner for Autonomous Vehicles
Authors:
Jilan Samiuddin,
Benoit Boulet,
Di Wu
Abstract:
The autonomous driving industry is expected to grow by over 20 times in the coming decade and, thus, motivate researchers to delve into it. The primary focus of their research is to ensure safety, comfort, and efficiency. An autonomous vehicle has several modules responsible for one or more of the aforementioned items. Among these modules, the trajectory planner plays a pivotal role in the safety…
▽ More
The autonomous driving industry is expected to grow by over 20 times in the coming decade and, thus, motivate researchers to delve into it. The primary focus of their research is to ensure safety, comfort, and efficiency. An autonomous vehicle has several modules responsible for one or more of the aforementioned items. Among these modules, the trajectory planner plays a pivotal role in the safety of the vehicle and the comfort of its passengers. The module is also responsible for respecting kinematic constraints and any applicable road constraints. In this paper, a novel online spatial-temporal graph trajectory planner is introduced to generate safe and comfortable trajectories. First, a spatial-temporal graph is constructed using the autonomous vehicle, its surrounding vehicles, and virtual nodes along the road with respect to the vehicle itself. Next, the graph is forwarded into a sequential network to obtain the desired states. To support the planner, a simple behavioral layer is also presented that determines kinematic constraints for the planner. Furthermore, a novel potential function is also proposed to train the network. Finally, the proposed planner is tested on three different complex driving tasks, and the performance is compared with two frequently used methods. The results show that the proposed planner generates safe and feasible trajectories while achieving similar or longer distances in the forward direction and comparable comfort ride.
△ Less
Submitted 18 April, 2024;
originally announced April 2024.
-
Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach
Authors:
Xingshuai Huang,
Di Wu,
Benoit Boulet
Abstract:
Efficient traffic signal control is critical for reducing traffic congestion and improving overall transportation efficiency. The dynamic nature of traffic flow has prompted researchers to explore Reinforcement Learning (RL) for traffic signal control (TSC). Compared with traditional methods, RL-based solutions have shown preferable performance. However, the application of RL-based traffic signal…
▽ More
Efficient traffic signal control is critical for reducing traffic congestion and improving overall transportation efficiency. The dynamic nature of traffic flow has prompted researchers to explore Reinforcement Learning (RL) for traffic signal control (TSC). Compared with traditional methods, RL-based solutions have shown preferable performance. However, the application of RL-based traffic signal controllers in the real world is limited by the low sample efficiency and high computational requirements of these solutions. In this work, we propose DTLight, a simple yet powerful lightweight Decision Transformer-based TSC method that can learn policy from easily accessible offline datasets. DTLight novelly leverages knowledge distillation to learn a lightweight controller from a well-trained larger teacher model to reduce implementation computation. Additionally, it integrates adapter modules to mitigate the expenses associated with fine-tuning, which makes DTLight practical for online adaptation with minimal computation and only a few fine-tuning steps during real deployment. Moreover, DTLight is further enhanced to be more applicable to real-world TSC problems. Extensive experiments on synthetic and real-world scenarios show that DTLight pre-trained purely on offline datasets can outperform state-of-the-art online RL-based methods in most scenarios. Experiment results also show that online fine-tuning further improves the performance of DTLight by up to 42.6% over the best online RL baseline methods. In this work, we also introduce Datasets specifically designed for TSC with offline RL (referred to as DTRL). Our datasets and code are publicly available.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
Anomaly Detection with Ensemble of Encoder and Decoder
Authors:
Xijuan Sun,
Di Wu,
Arnaud Zinflou,
Benoit Boulet
Abstract:
Hacking and false data injection from adversaries can threaten power grids' everyday operations and cause significant economic loss. Anomaly detection in power grids aims to detect and discriminate anomalies caused by cyber attacks against the power system, which is essential for kee** power grids working correctly and efficiently. Different methods have been applied for anomaly detection, such…
▽ More
Hacking and false data injection from adversaries can threaten power grids' everyday operations and cause significant economic loss. Anomaly detection in power grids aims to detect and discriminate anomalies caused by cyber attacks against the power system, which is essential for kee** power grids working correctly and efficiently. Different methods have been applied for anomaly detection, such as statistical methods and machine learning-based methods. Usually, machine learning-based methods need to model the normal data distribution. In this work, we propose a novel anomaly detection method by modeling the data distribution of normal samples via multiple encoders and decoders. Specifically, the proposed method maps input samples into a latent space and then reconstructs output samples from latent vectors. The extra encoder finally maps reconstructed samples to latent representations. During the training phase, we optimize parameters by minimizing the reconstruction loss and encoding loss. Training samples are re-weighted to focus more on missed correlations between features of normal data. Furthermore, we employ the long short-term memory model as encoders and decoders to test its effectiveness. We also investigate a meta-learning-based framework for hyper-parameter tuning of our approach. Experiment results on network intrusion and power system datasets demonstrate the effectiveness of our proposed method, where our models consistently outperform all baselines.
△ Less
Submitted 11 March, 2023;
originally announced March 2023.
-
Adaptive Aggregation for Safety-Critical Control
Authors:
Huiliang Zhang,
Di Wu,
Benoit Boulet
Abstract:
Safety has been recognized as the central obstacle to preventing the use of reinforcement learning (RL) for real-world applications. Different methods have been developed to deal with safety concerns in RL. However, learning reliable RL-based solutions usually require a large number of interactions with the environment. Likewise, how to improve the learning efficiency, specifically, how to utilize…
▽ More
Safety has been recognized as the central obstacle to preventing the use of reinforcement learning (RL) for real-world applications. Different methods have been developed to deal with safety concerns in RL. However, learning reliable RL-based solutions usually require a large number of interactions with the environment. Likewise, how to improve the learning efficiency, specifically, how to utilize transfer learning for safe reinforcement learning, has not been well studied. In this work, we propose an adaptive aggregation framework for safety-critical control. Our method comprises two key techniques: 1) we learn to transfer the safety knowledge by aggregating the multiple source tasks and a target task through the attention network; 2) we separate the goal of improving task performance and reducing constraint violations by utilizing a safeguard. Experiment results demonstrate that our algorithm can achieve fewer safety violations while showing better data efficiency compared with several baselines.
△ Less
Submitted 7 February, 2023;
originally announced February 2023.
-
MetaEMS: A Meta Reinforcement Learning-based Control Framework for Building Energy Management System
Authors:
Huiliang Zhang,
Di Wu,
Benoit Boulet
Abstract:
The building sector has been recognized as one of the primary sectors for worldwide energy consumption. Improving the energy efficiency of the building sector can help reduce the operation cost and reduce the greenhouse gas emission. The energy management system (EMS) can monitor and control the operations of built-in appliances in buildings, so an efficient EMS is of crucial importance to improve…
▽ More
The building sector has been recognized as one of the primary sectors for worldwide energy consumption. Improving the energy efficiency of the building sector can help reduce the operation cost and reduce the greenhouse gas emission. The energy management system (EMS) can monitor and control the operations of built-in appliances in buildings, so an efficient EMS is of crucial importance to improve the building operation efficiency and maintain safe operations. With the growing penetration of renewable energy and electrical appliances, increasing attention has been paid to the development of intelligent building EMS. Recently, reinforcement learning (RL) has been applied for building EMS and has shown promising potential. However, most of the current RL-based EMS solutions would need a large amount of data to learn a reliable control policy, which limits the applicability of these solutions in the real world. In this work, we propose MetaEMS, which can help achieve better energy management performance with the benefits of RL and meta-learning. Experiment results showcase that our proposed MetaEMS can adapt faster to environment changes and perform better in most situations compared with other baselines.
△ Less
Submitted 22 October, 2022;
originally announced October 2022.
-
Time Series Anomaly Detection via Reinforcement Learning-Based Model Selection
Authors:
Jiuqi Elise Zhang,
Di Wu,
Benoit Boulet
Abstract:
Time series anomaly detection has been recognized as of critical importance for the reliable and efficient operation of real-world systems. Many anomaly detection methods have been developed based on various assumptions on anomaly characteristics. However, due to the complex nature of real-world data, different anomalies within a time series usually have diverse profiles supporting different anoma…
▽ More
Time series anomaly detection has been recognized as of critical importance for the reliable and efficient operation of real-world systems. Many anomaly detection methods have been developed based on various assumptions on anomaly characteristics. However, due to the complex nature of real-world data, different anomalies within a time series usually have diverse profiles supporting different anomaly assumptions. This makes it difficult to find a single anomaly detector that can consistently outperform other models. In this work, to harness the benefits of different base models, we propose a reinforcement learning-based model selection framework. Specifically, we first learn a pool of different anomaly detection models, and then utilize reinforcement learning to dynamically select a candidate model from these base models. Experiments on real-world data have demonstrated that the proposed strategy can indeed outplay all baseline models in terms of overall performance.
△ Less
Submitted 27 July, 2022; v1 submitted 19 May, 2022;
originally announced May 2022.
-
An Early Fault Detection Method of Rotating Machines Based on Multiple Feature Fusion with Stacking Architecture
Authors:
Wenbin Song,
Di Wu,
Weiming Shen,
Benoit Boulet
Abstract:
Early fault detection (EFD) of rotating machines is important to decrease the maintenance cost and improve the mechanical system stability. One of the key points of EFD is develo** a generic model to extract robust and discriminative features from different equipment for early fault detection. Most existing EFD methods focus on learning fault representation by one type of feature. However, a com…
▽ More
Early fault detection (EFD) of rotating machines is important to decrease the maintenance cost and improve the mechanical system stability. One of the key points of EFD is develo** a generic model to extract robust and discriminative features from different equipment for early fault detection. Most existing EFD methods focus on learning fault representation by one type of feature. However, a combination of multiple features can capture a more comprehensive representation of system state. In this paper, we propose an EFD method based on multiple feature fusion with stacking architecture (M2FSA). The proposed method can extract generic and discriminiative features to detect early faults by combining time domain (TD), frequency domain (FD), and time-frequency domain (TFD) features. In order to unify the dimensions of the different domain features, Stacked Denoising Autoencoder (SDAE) is utilized to learn deep features in three domains. The architecture of the proposed M2FSA consists of two layers. The first layer contains three base models, whose corresponding inputs are different deep features. The outputs of the first layer are concatenated to generate the input to the second layer, which consists of a meta model. The proposed method is tested on three bearing datasets. The results demonstrate that the proposed method is better than existing methods both in sensibility and reliability.
△ Less
Submitted 28 February, 2023; v1 submitted 1 May, 2022;
originally announced May 2022.
-
Meta-Learning Based Early Fault Detection for Rolling Bearings via Few-Shot Anomaly Detection
Authors:
Wenbin Song,
Di Wu,
Weiming Shen,
Benoit Boulet
Abstract:
Early fault detection (EFD) of rolling bearings can recognize slight deviation of the health states and contribute to the stability of mechanical systems. In practice, very limited target bearing data are available to conduct EFD, which makes it hard to adapt to the EFD task of new bearings. To address this problem, many transfer learning based EFD methods utilize historical data to learn transfer…
▽ More
Early fault detection (EFD) of rolling bearings can recognize slight deviation of the health states and contribute to the stability of mechanical systems. In practice, very limited target bearing data are available to conduct EFD, which makes it hard to adapt to the EFD task of new bearings. To address this problem, many transfer learning based EFD methods utilize historical data to learn transferable domain knowledge and conduct early fault detection on new target bearings. However, most existing methods only consider the distribution drift across different working conditions but ignore the difference between bearings under the same working condition, which is called Unit-to-Unit Variability (UtUV). The setting of EFD with limited target data considering UtUV can be formulated as a Few-shot Anomaly Detection task. Therefore, this paper proposes a novel EFD method based on meta-learning considering UtUV. The proposed method can learn a generic metric based on Relation Network (RN) to measure the similarity between normal data and the new arrival target bearing data. Besides, the proposed method utilizes a health state embedding strategy to decrease false alarms. The performance of proposed method is tested on two bearing datasets. The results show that the proposed method can detect incipient faults earlier than the baselines with lower false alarms.
△ Less
Submitted 28 February, 2023; v1 submitted 26 April, 2022;
originally announced April 2022.
-
Structured learning of safety guarantees for the control of uncertain dynamical systems
Authors:
Marc-Antoine Beaudoin,
Benoit Boulet
Abstract:
Approaches to kee** a dynamical system within state constraints typically rely on a model-based safety condition to limit the control signals. In the face of significant modeling uncertainty, the system can suffer from important performance penalties due to the safety condition becoming overly conservative. Machine learning can be employed to reduce the uncertainty around the system dynamics, an…
▽ More
Approaches to kee** a dynamical system within state constraints typically rely on a model-based safety condition to limit the control signals. In the face of significant modeling uncertainty, the system can suffer from important performance penalties due to the safety condition becoming overly conservative. Machine learning can be employed to reduce the uncertainty around the system dynamics, and allow for higher performance. In this article, we propose the safe uncertainty learning principle, and argue that the learning must be properly structured to preserve safety guarantees. For instance, robust safety conditions are necessary, and they must be initialized with conservative uncertainty bounds prior to learning. Also, the uncertainty bounds should only be tightened if the collected data sufficiently capture the future system behavior. To support the principle, two example problems are solved with control barrier functions: a lane-change controller for an autonomous vehicle, and an adaptive cruise controller. This work offers a way to evaluate if machine learning preserves safety guarantees during the control of uncertain dynamical systems. It also highlights challenging aspects of learning for control.
△ Less
Submitted 29 January, 2022; v1 submitted 6 December, 2021;
originally announced December 2021.
-
Learning-based synthesis of robust linear time-invariant controllers
Authors:
Marc-Antoine Beaudoin,
Benoit Boulet
Abstract:
Recent advances in learning for control allow to synthesize vehicle controllers from learned system dynamics and maintain robust stability guarantees. However, no approach is well-suited for training linear time-invariant (LTI) controllers using arbitrary learned models of the dynamics. This article introduces a method to do so. It uses a robust control framework to derive robust stability criteri…
▽ More
Recent advances in learning for control allow to synthesize vehicle controllers from learned system dynamics and maintain robust stability guarantees. However, no approach is well-suited for training linear time-invariant (LTI) controllers using arbitrary learned models of the dynamics. This article introduces a method to do so. It uses a robust control framework to derive robust stability criteria. It also uses simulated policy rollouts to obtain gradients on the controller parameters, which serve to improve the closed-loop performance. By formulating the stability criteria as penalties with computable gradients, they can be used to guide the controller parameters toward robust stability during gradient descent. The approach is flexible as it does not restrict the type of learned model for the simulated rollouts. The robust control framework ensures that the controller is already robustly stabilizing when first implemented on the actual system and no data is yet collected. It also ensures that the system stays stable in the event of a shift in dynamics, given the system behavior remains within assumed uncertainty bounds. We demonstrate the approach by synthesizing a controller for simulated autonomous lane change maneuvers. This work thus presents a flexible approach to learning robustly stabilizing LTI controllers that take advantage of modern machine learning techniques.
△ Less
Submitted 9 May, 2022; v1 submitted 6 December, 2021;
originally announced December 2021.
-
Improving gearshift controllers for electric vehicles with reinforcement learning
Authors:
Marc-Antoine Beaudoin,
Benoit Boulet
Abstract:
During a multi-speed transmission development process, the final calibration of the gearshift controller parameters is usually performed on a physical test bench. Engineers typically treat the map** from the controller parameters to the gearshift quality as a black-box, and use methods rooted in experimental design -- a purely statistical approach -- to infer the parameter combination that will…
▽ More
During a multi-speed transmission development process, the final calibration of the gearshift controller parameters is usually performed on a physical test bench. Engineers typically treat the map** from the controller parameters to the gearshift quality as a black-box, and use methods rooted in experimental design -- a purely statistical approach -- to infer the parameter combination that will maximize a chosen gearshift performance indicator. This approach unfortunately requires thousands of gearshift trials, ultimately discouraging the exploration of different control strategies. In this work, we calibrate the feedforward and feedback parameters of a gearshift controller using a model-based reinforcement learning algorithm adapted from Pilco. Experimental results show that the method optimizes the controller parameters with few gearshift trials. This approach can accelerate the exploration of gearshift control strategies, which is especially important for the emerging technology of multi-speed transmissions for electric vehicles.
△ Less
Submitted 1 December, 2021;
originally announced December 2021.
-
ModelLight: Model-Based Meta-Reinforcement Learning for Traffic Signal Control
Authors:
Xingshuai Huang,
Di Wu,
Michael Jenkin,
Benoit Boulet
Abstract:
Traffic signal control is of critical importance for the effective use of transportation infrastructures. The rapid increase of vehicle traffic and changes in traffic patterns make traffic signal control more and more challenging. Reinforcement Learning (RL)-based algorithms have demonstrated their potential in dealing with traffic signal control. However, most existing solutions require a large a…
▽ More
Traffic signal control is of critical importance for the effective use of transportation infrastructures. The rapid increase of vehicle traffic and changes in traffic patterns make traffic signal control more and more challenging. Reinforcement Learning (RL)-based algorithms have demonstrated their potential in dealing with traffic signal control. However, most existing solutions require a large amount of training data, which is unacceptable for many real-world scenarios. This paper proposes a novel model-based meta-reinforcement learning framework (ModelLight) for traffic signal control. Within ModelLight, an ensemble of models for road intersections and the optimization-based meta-learning method are used to improve the data efficiency of an RL-based traffic light control method. Experiments on real-world datasets demonstrate that ModelLight can outperform state-of-the-art traffic light control algorithms while substantially reducing the number of required interactions with the real-world environment.
△ Less
Submitted 6 December, 2021; v1 submitted 15 November, 2021;
originally announced November 2021.
-
Time Series Anomaly Detection for Smart Grids: A Survey
Authors:
Jiuqi Elise Zhang,
Di Wu,
Benoit Boulet
Abstract:
With the rapid increase in the integration of renewable energy generation and the wide adoption of various electric appliances, power grids are now faced with more and more challenges. One prominent challenge is to implement efficient anomaly detection for different types of anomalous behaviors within power grids. These anomalous behaviors might be induced by unusual consumption patterns of the us…
▽ More
With the rapid increase in the integration of renewable energy generation and the wide adoption of various electric appliances, power grids are now faced with more and more challenges. One prominent challenge is to implement efficient anomaly detection for different types of anomalous behaviors within power grids. These anomalous behaviors might be induced by unusual consumption patterns of the users, faulty grid infrastructures, outages, external cyberattacks, or energy fraud. Identifying such anomalies is of critical importance for the reliable and efficient operation of modern power grids. Various methods have been proposed for anomaly detection on power grid time-series data. This paper presents a short survey of the recent advances in anomaly detection for power grid time-series data. Specifically, we first outline current research challenges in the power grid anomaly detection domain and further review the major anomaly detection approaches. Finally, we conclude the survey by identifying the potential directions for future research.
△ Less
Submitted 16 July, 2021;
originally announced July 2021.
-
Data-driven Model Predictive and Reinforcement Learning Based Control for Building Energy Management: a Survey
Authors:
Huiliang Zhang,
Sayani Seal,
Di Wu,
Benoit Boulet,
Francois Bouffard,
Geza Joos
Abstract:
Building energy management is one of the core problems in modern power grids to reduce energy consumption while ensuring occupants' comfort. However, the building energy management system (BEMS) is now facing more challenges and uncertainties with the increasing penetration of renewables and complicated interactions between humans and buildings. Classical model predictive control (MPC) has shown i…
▽ More
Building energy management is one of the core problems in modern power grids to reduce energy consumption while ensuring occupants' comfort. However, the building energy management system (BEMS) is now facing more challenges and uncertainties with the increasing penetration of renewables and complicated interactions between humans and buildings. Classical model predictive control (MPC) has shown its capacity to reduce building energy consumption, but it suffers from labor-intensive modelling and complex on-line control optimization. Recently, with the growing accessibility to the building control and automation data, data-driven solutions have attracted more research interest. This paper presents a compact review of the recent advances in data-driven MPC and reinforcement learning based control methods for BEMS. The main challenges in these approaches and insights on the selection of a control method are discussed.
△ Less
Submitted 28 June, 2021;
originally announced June 2021.
-
Image-based Intraluminal Contact Force Monitoring in Robotic Vascular Navigation
Authors:
Masoud Razban,
Javad Dargahi,
Benoit Boulet
Abstract:
Embolization, stroke, ischaemic lesion, and perforation remain significant concerns in endovascular interventions. Intravascular sensing of tool interaction with the arteries is advantageous to minimize such complications and enhance navigation safety. Intraluminal information is currently limited due to the lack of intravascular contact sensing technologies. We present monitoring of the intralumi…
▽ More
Embolization, stroke, ischaemic lesion, and perforation remain significant concerns in endovascular interventions. Intravascular sensing of tool interaction with the arteries is advantageous to minimize such complications and enhance navigation safety. Intraluminal information is currently limited due to the lack of intravascular contact sensing technologies. We present monitoring of the intraluminal tool interaction with the arterial wall using an image-based estimation approach within vascular robotic navigation. The proposed image-based method employs continuous finite element simulation of the tool using imaging data to estimate multi-point forces along tool-vessel wall interaction. We implemented imaging algorithms to detect and track contacts, and compute pose measurements. The model is constructed based on the nonlinear beam element and flexural rigidity profile over the tool length. During remote cannulation of aortic arteries, intraluminal monitoring achieved tracking local contact forces, building a contour map of force on the arterial wall and estimating tool structural stress. Results suggest that high risk intraluminal forces may happen even with low insertion force. The presented online monitoring system delivers insight into the intraluminal behavior of endovascular tools and is well suited for intraoperative visual guidance for the clinician, robotic control of vascular procedures and research on interventional device design.
△ Less
Submitted 14 February, 2021; v1 submitted 19 December, 2020;
originally announced December 2020.
-
Fundamental limitations to no-jerk gearshifts of multi-speed transmission architectures in electric vehicles
Authors:
Marc-Antoine Beaudoin,
Benoit Boulet
Abstract:
Multi-speed transmissions can enhance the performance and reduce the overall cost of an electric vehicle, but they also introduce a challenge: avoiding gearshift jerk, which may sometimes prove to be impossible in the presence of motor and clutch saturation. In this article, we introduce three theorems that explicitly define the fundamental limitations to no-jerk gearshifts resulting from motor or…
▽ More
Multi-speed transmissions can enhance the performance and reduce the overall cost of an electric vehicle, but they also introduce a challenge: avoiding gearshift jerk, which may sometimes prove to be impossible in the presence of motor and clutch saturation. In this article, we introduce three theorems that explicitly define the fundamental limitations to no-jerk gearshifts resulting from motor or actuator saturation. We compare gearshifts that consist of transferring transmission torque from one friction clutch to another, to the case in which one of the clutches is a one-way clutch. We show that systems with a one-way clutch are more prone to motor saturation, causing gearshift jerk to be more often inevitable. We also study the influence of planetary gearsets on the gearshift dynamical trajectories, and expose the impact on the no-jerk limitations. This work offers tools to compare transmission architectures during the conceptual design phase of a new electric vehicle.
△ Less
Submitted 16 February, 2021; v1 submitted 25 September, 2020;
originally announced September 2020.
-
Centralized Model Predictive Control Strategy for Thermal Comfort and Residential Energy Management
Authors:
Sayani Seal,
Benoit Boulet,
Vahid Raissi Dehkordi
Abstract:
A novel centralized model predictive control (MPC) is proposed for comfort and energy management in a residential building. The residential setup used here is equipped with a photovoltaic (PV) solar system and a stationary home battery unit. An air-to-air multi-split heat pump (HP) is used as the primary heating system. The electric baseboard (BB) unit in each zone is used as a secondary system. T…
▽ More
A novel centralized model predictive control (MPC) is proposed for comfort and energy management in a residential building. The residential setup used here is equipped with a photovoltaic (PV) solar system and a stationary home battery unit. An air-to-air multi-split heat pump (HP) is used as the primary heating system. The electric baseboard (BB) unit in each zone is used as a secondary system. The MPC is simultaneously responsible for controlling the heating inputs of the HP and BB units for comfort management, as well as for the control of energy flow between the PV, the home battery and the bidirectional grid system. Variable Time-of-Use (ToU) rates are considered for the energy cost calculation and Feed-in-Tariff (FiT) is considered for selling energy to the grid. A 13.5% reduction in the energy cost is achieved with the centralized MPC as compared to a rule based energy management strategy. The solar energy generation and battery storage contribute to approximately 31% saving.
△ Less
Submitted 14 December, 2019;
originally announced December 2019.