Search | arXiv e-print repository

FuRL: Visual-Language Models as Fuzzy Rewards for Reinforcement Learning

Authors: Yuwei Fu, Haichao Zhang, Di Wu, Wei Xu, Benoit Boulet

Abstract: In this work, we investigate how to leverage pre-trained visual-language models (VLM) for online Reinforcement Learning (RL). In particular, we focus on sparse reward tasks with pre-defined textual task descriptions. We first identify the problem of reward misalignment when applying VLM as a reward in RL tasks. To address this issue, we introduce a lightweight fine-tuning method, named Fuzzy VLM r… ▽ More In this work, we investigate how to leverage pre-trained visual-language models (VLM) for online Reinforcement Learning (RL). In particular, we focus on sparse reward tasks with pre-defined textual task descriptions. We first identify the problem of reward misalignment when applying VLM as a reward in RL tasks. To address this issue, we introduce a lightweight fine-tuning method, named Fuzzy VLM reward-aided RL (FuRL), based on reward alignment and relay RL. Specifically, we enhance the performance of SAC/DrQ baseline agents on sparse reward tasks by fine-tuning VLM representations and using relay RL to avoid local minima. Extensive experiments on the Meta-world benchmark tasks demonstrate the efficacy of the proposed method. Code is available at: https://github.com/fuyw/FuRL. △ Less

Submitted 4 June, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

Comments: ICML 2024

arXiv:2404.12256 [pdf, other]

doi 10.1109/TIV.2024.3389640

An Online Spatial-Temporal Graph Trajectory Planner for Autonomous Vehicles

Authors: Jilan Samiuddin, Benoit Boulet, Di Wu

Abstract: The autonomous driving industry is expected to grow by over 20 times in the coming decade and, thus, motivate researchers to delve into it. The primary focus of their research is to ensure safety, comfort, and efficiency. An autonomous vehicle has several modules responsible for one or more of the aforementioned items. Among these modules, the trajectory planner plays a pivotal role in the safety… ▽ More The autonomous driving industry is expected to grow by over 20 times in the coming decade and, thus, motivate researchers to delve into it. The primary focus of their research is to ensure safety, comfort, and efficiency. An autonomous vehicle has several modules responsible for one or more of the aforementioned items. Among these modules, the trajectory planner plays a pivotal role in the safety of the vehicle and the comfort of its passengers. The module is also responsible for respecting kinematic constraints and any applicable road constraints. In this paper, a novel online spatial-temporal graph trajectory planner is introduced to generate safe and comfortable trajectories. First, a spatial-temporal graph is constructed using the autonomous vehicle, its surrounding vehicles, and virtual nodes along the road with respect to the vehicle itself. Next, the graph is forwarded into a sequential network to obtain the desired states. To support the planner, a simple behavioral layer is also presented that determines kinematic constraints for the planner. Furthermore, a novel potential function is also proposed to train the network. Finally, the proposed planner is tested on three different complex driving tasks, and the performance is compared with two frequently used methods. The results show that the proposed planner generates safe and feasible trajectories while achieving similar or longer distances in the forward direction and comparable comfort ride. △ Less

Submitted 18 April, 2024; originally announced April 2024.

Comments: This is the accepted version and published in the "Early Access" area of IEEE Xplore for the IEEE Transactions on Intelligent Vehicles on 16 April 2024. Article statistics: 11 pages, 9 figures, 2 tables

arXiv:2312.07795 [pdf, other]

Traffic Signal Control Using Lightweight Transformers: An Offline-to-Online RL Approach

Authors: Xingshuai Huang, Di Wu, Benoit Boulet

Abstract: Efficient traffic signal control is critical for reducing traffic congestion and improving overall transportation efficiency. The dynamic nature of traffic flow has prompted researchers to explore Reinforcement Learning (RL) for traffic signal control (TSC). Compared with traditional methods, RL-based solutions have shown preferable performance. However, the application of RL-based traffic signal… ▽ More Efficient traffic signal control is critical for reducing traffic congestion and improving overall transportation efficiency. The dynamic nature of traffic flow has prompted researchers to explore Reinforcement Learning (RL) for traffic signal control (TSC). Compared with traditional methods, RL-based solutions have shown preferable performance. However, the application of RL-based traffic signal controllers in the real world is limited by the low sample efficiency and high computational requirements of these solutions. In this work, we propose DTLight, a simple yet powerful lightweight Decision Transformer-based TSC method that can learn policy from easily accessible offline datasets. DTLight novelly leverages knowledge distillation to learn a lightweight controller from a well-trained larger teacher model to reduce implementation computation. Additionally, it integrates adapter modules to mitigate the expenses associated with fine-tuning, which makes DTLight practical for online adaptation with minimal computation and only a few fine-tuning steps during real deployment. Moreover, DTLight is further enhanced to be more applicable to real-world TSC problems. Extensive experiments on synthetic and real-world scenarios show that DTLight pre-trained purely on offline datasets can outperform state-of-the-art online RL-based methods in most scenarios. Experiment results also show that online fine-tuning further improves the performance of DTLight by up to 42.6% over the best online RL baseline methods. In this work, we also introduce Datasets specifically designed for TSC with offline RL (referred to as DTRL). Our datasets and code are publicly available. △ Less

Submitted 12 December, 2023; originally announced December 2023.

arXiv:2303.06431 [pdf, other]

Anomaly Detection with Ensemble of Encoder and Decoder

Authors: Xijuan Sun, Di Wu, Arnaud Zinflou, Benoit Boulet

Abstract: Hacking and false data injection from adversaries can threaten power grids' everyday operations and cause significant economic loss. Anomaly detection in power grids aims to detect and discriminate anomalies caused by cyber attacks against the power system, which is essential for kee** power grids working correctly and efficiently. Different methods have been applied for anomaly detection, such… ▽ More Hacking and false data injection from adversaries can threaten power grids' everyday operations and cause significant economic loss. Anomaly detection in power grids aims to detect and discriminate anomalies caused by cyber attacks against the power system, which is essential for kee** power grids working correctly and efficiently. Different methods have been applied for anomaly detection, such as statistical methods and machine learning-based methods. Usually, machine learning-based methods need to model the normal data distribution. In this work, we propose a novel anomaly detection method by modeling the data distribution of normal samples via multiple encoders and decoders. Specifically, the proposed method maps input samples into a latent space and then reconstructs output samples from latent vectors. The extra encoder finally maps reconstructed samples to latent representations. During the training phase, we optimize parameters by minimizing the reconstruction loss and encoding loss. Training samples are re-weighted to focus more on missed correlations between features of normal data. Furthermore, we employ the long short-term memory model as encoders and decoders to test its effectiveness. We also investigate a meta-learning-based framework for hyper-parameter tuning of our approach. Experiment results on network intrusion and power system datasets demonstrate the effectiveness of our proposed method, where our models consistently outperform all baselines. △ Less

Submitted 11 March, 2023; originally announced March 2023.

arXiv:2302.03586 [pdf, other]

Adaptive Aggregation for Safety-Critical Control

Authors: Huiliang Zhang, Di Wu, Benoit Boulet

Abstract: Safety has been recognized as the central obstacle to preventing the use of reinforcement learning (RL) for real-world applications. Different methods have been developed to deal with safety concerns in RL. However, learning reliable RL-based solutions usually require a large number of interactions with the environment. Likewise, how to improve the learning efficiency, specifically, how to utilize… ▽ More Safety has been recognized as the central obstacle to preventing the use of reinforcement learning (RL) for real-world applications. Different methods have been developed to deal with safety concerns in RL. However, learning reliable RL-based solutions usually require a large number of interactions with the environment. Likewise, how to improve the learning efficiency, specifically, how to utilize transfer learning for safe reinforcement learning, has not been well studied. In this work, we propose an adaptive aggregation framework for safety-critical control. Our method comprises two key techniques: 1) we learn to transfer the safety knowledge by aggregating the multiple source tasks and a target task through the attention network; 2) we separate the goal of improving task performance and reducing constraint violations by utilizing a safeguard. Experiment results demonstrate that our algorithm can achieve fewer safety violations while showing better data efficiency compared with several baselines. △ Less

Submitted 7 February, 2023; originally announced February 2023.

arXiv:2210.12590 [pdf, other]

MetaEMS: A Meta Reinforcement Learning-based Control Framework for Building Energy Management System

Authors: Huiliang Zhang, Di Wu, Benoit Boulet

Abstract: The building sector has been recognized as one of the primary sectors for worldwide energy consumption. Improving the energy efficiency of the building sector can help reduce the operation cost and reduce the greenhouse gas emission. The energy management system (EMS) can monitor and control the operations of built-in appliances in buildings, so an efficient EMS is of crucial importance to improve… ▽ More The building sector has been recognized as one of the primary sectors for worldwide energy consumption. Improving the energy efficiency of the building sector can help reduce the operation cost and reduce the greenhouse gas emission. The energy management system (EMS) can monitor and control the operations of built-in appliances in buildings, so an efficient EMS is of crucial importance to improve the building operation efficiency and maintain safe operations. With the growing penetration of renewable energy and electrical appliances, increasing attention has been paid to the development of intelligent building EMS. Recently, reinforcement learning (RL) has been applied for building EMS and has shown promising potential. However, most of the current RL-based EMS solutions would need a large amount of data to learn a reliable control policy, which limits the applicability of these solutions in the real world. In this work, we propose MetaEMS, which can help achieve better energy management performance with the benefits of RL and meta-learning. Experiment results showcase that our proposed MetaEMS can adapt faster to environment changes and perform better in most situations compared with other baselines. △ Less

Submitted 22 October, 2022; originally announced October 2022.

Comments: arXiv admin note: text overlap with arXiv:1909.10165 by other authors

arXiv:2205.09884 [pdf, other]

Time Series Anomaly Detection via Reinforcement Learning-Based Model Selection

Authors: Jiuqi Elise Zhang, Di Wu, Benoit Boulet

Abstract: Time series anomaly detection has been recognized as of critical importance for the reliable and efficient operation of real-world systems. Many anomaly detection methods have been developed based on various assumptions on anomaly characteristics. However, due to the complex nature of real-world data, different anomalies within a time series usually have diverse profiles supporting different anoma… ▽ More Time series anomaly detection has been recognized as of critical importance for the reliable and efficient operation of real-world systems. Many anomaly detection methods have been developed based on various assumptions on anomaly characteristics. However, due to the complex nature of real-world data, different anomalies within a time series usually have diverse profiles supporting different anomaly assumptions. This makes it difficult to find a single anomaly detector that can consistently outperform other models. In this work, to harness the benefits of different base models, we propose a reinforcement learning-based model selection framework. Specifically, we first learn a pool of different anomaly detection models, and then utilize reinforcement learning to dynamically select a candidate model from these base models. Experiments on real-world data have demonstrated that the proposed strategy can indeed outplay all baseline models in terms of overall performance. △ Less

Submitted 27 July, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

Comments: Accepted by IEEE Canadian Conference on Electrical and Computer Engineering (CCECE) 2022

arXiv:2205.00511

An Early Fault Detection Method of Rotating Machines Based on Multiple Feature Fusion with Stacking Architecture

Authors: Wenbin Song, Di Wu, Weiming Shen, Benoit Boulet

Abstract: Early fault detection (EFD) of rotating machines is important to decrease the maintenance cost and improve the mechanical system stability. One of the key points of EFD is develo** a generic model to extract robust and discriminative features from different equipment for early fault detection. Most existing EFD methods focus on learning fault representation by one type of feature. However, a com… ▽ More Early fault detection (EFD) of rotating machines is important to decrease the maintenance cost and improve the mechanical system stability. One of the key points of EFD is develo** a generic model to extract robust and discriminative features from different equipment for early fault detection. Most existing EFD methods focus on learning fault representation by one type of feature. However, a combination of multiple features can capture a more comprehensive representation of system state. In this paper, we propose an EFD method based on multiple feature fusion with stacking architecture (M2FSA). The proposed method can extract generic and discriminiative features to detect early faults by combining time domain (TD), frequency domain (FD), and time-frequency domain (TFD) features. In order to unify the dimensions of the different domain features, Stacked Denoising Autoencoder (SDAE) is utilized to learn deep features in three domains. The architecture of the proposed M2FSA consists of two layers. The first layer contains three base models, whose corresponding inputs are different deep features. The outputs of the first layer are concatenated to generate the input to the second layer, which consists of a meta model. The proposed method is tested on three bearing datasets. The results demonstrate that the proposed method is better than existing methods both in sensibility and reliability. △ Less

Submitted 28 February, 2023; v1 submitted 1 May, 2022; originally announced May 2022.

Comments: The results require to be updated

arXiv:2204.12637

Meta-Learning Based Early Fault Detection for Rolling Bearings via Few-Shot Anomaly Detection

Authors: Wenbin Song, Di Wu, Weiming Shen, Benoit Boulet

Abstract: Early fault detection (EFD) of rolling bearings can recognize slight deviation of the health states and contribute to the stability of mechanical systems. In practice, very limited target bearing data are available to conduct EFD, which makes it hard to adapt to the EFD task of new bearings. To address this problem, many transfer learning based EFD methods utilize historical data to learn transfer… ▽ More Early fault detection (EFD) of rolling bearings can recognize slight deviation of the health states and contribute to the stability of mechanical systems. In practice, very limited target bearing data are available to conduct EFD, which makes it hard to adapt to the EFD task of new bearings. To address this problem, many transfer learning based EFD methods utilize historical data to learn transferable domain knowledge and conduct early fault detection on new target bearings. However, most existing methods only consider the distribution drift across different working conditions but ignore the difference between bearings under the same working condition, which is called Unit-to-Unit Variability (UtUV). The setting of EFD with limited target data considering UtUV can be formulated as a Few-shot Anomaly Detection task. Therefore, this paper proposes a novel EFD method based on meta-learning considering UtUV. The proposed method can learn a generic metric based on Relation Network (RN) to measure the similarity between normal data and the new arrival target bearing data. Besides, the proposed method utilizes a health state embedding strategy to decrease false alarms. The performance of proposed method is tested on two bearing datasets. The results show that the proposed method can detect incipient faults earlier than the baselines with lower false alarms. △ Less

Submitted 28 February, 2023; v1 submitted 26 April, 2022; originally announced April 2022.

Comments: The results require to be updated

arXiv:2112.03347 [pdf, other]

doi 10.1109/TIV.2022.3148212

Structured learning of safety guarantees for the control of uncertain dynamical systems

Authors: Marc-Antoine Beaudoin, Benoit Boulet

Abstract: Approaches to kee** a dynamical system within state constraints typically rely on a model-based safety condition to limit the control signals. In the face of significant modeling uncertainty, the system can suffer from important performance penalties due to the safety condition becoming overly conservative. Machine learning can be employed to reduce the uncertainty around the system dynamics, an… ▽ More Approaches to kee** a dynamical system within state constraints typically rely on a model-based safety condition to limit the control signals. In the face of significant modeling uncertainty, the system can suffer from important performance penalties due to the safety condition becoming overly conservative. Machine learning can be employed to reduce the uncertainty around the system dynamics, and allow for higher performance. In this article, we propose the safe uncertainty learning principle, and argue that the learning must be properly structured to preserve safety guarantees. For instance, robust safety conditions are necessary, and they must be initialized with conservative uncertainty bounds prior to learning. Also, the uncertainty bounds should only be tightened if the collected data sufficiently capture the future system behavior. To support the principle, two example problems are solved with control barrier functions: a lane-change controller for an autonomous vehicle, and an adaptive cruise controller. This work offers a way to evaluate if machine learning preserves safety guarantees during the control of uncertain dynamical systems. It also highlights challenging aspects of learning for control. △ Less

Submitted 29 January, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

arXiv:2112.03345 [pdf, other]

doi 10.1109/TIV.2022.3174029

Learning-based synthesis of robust linear time-invariant controllers

Authors: Marc-Antoine Beaudoin, Benoit Boulet

Abstract: Recent advances in learning for control allow to synthesize vehicle controllers from learned system dynamics and maintain robust stability guarantees. However, no approach is well-suited for training linear time-invariant (LTI) controllers using arbitrary learned models of the dynamics. This article introduces a method to do so. It uses a robust control framework to derive robust stability criteri… ▽ More Recent advances in learning for control allow to synthesize vehicle controllers from learned system dynamics and maintain robust stability guarantees. However, no approach is well-suited for training linear time-invariant (LTI) controllers using arbitrary learned models of the dynamics. This article introduces a method to do so. It uses a robust control framework to derive robust stability criteria. It also uses simulated policy rollouts to obtain gradients on the controller parameters, which serve to improve the closed-loop performance. By formulating the stability criteria as penalties with computable gradients, they can be used to guide the controller parameters toward robust stability during gradient descent. The approach is flexible as it does not restrict the type of learned model for the simulated rollouts. The robust control framework ensures that the controller is already robustly stabilizing when first implemented on the actual system and no data is yet collected. It also ensures that the system stays stable in the event of a shift in dynamics, given the system behavior remains within assumed uncertainty bounds. We demonstrate the approach by synthesizing a controller for simulated autonomous lane change maneuvers. This work thus presents a flexible approach to learning robustly stabilizing LTI controllers that take advantage of modern machine learning techniques. △ Less

Submitted 9 May, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

arXiv:2112.00529 [pdf, other]

doi 10.1016/j.mechmachtheory.2021.104654

Improving gearshift controllers for electric vehicles with reinforcement learning

Authors: Marc-Antoine Beaudoin, Benoit Boulet

Abstract: During a multi-speed transmission development process, the final calibration of the gearshift controller parameters is usually performed on a physical test bench. Engineers typically treat the map** from the controller parameters to the gearshift quality as a black-box, and use methods rooted in experimental design -- a purely statistical approach -- to infer the parameter combination that will… ▽ More During a multi-speed transmission development process, the final calibration of the gearshift controller parameters is usually performed on a physical test bench. Engineers typically treat the map** from the controller parameters to the gearshift quality as a black-box, and use methods rooted in experimental design -- a purely statistical approach -- to infer the parameter combination that will maximize a chosen gearshift performance indicator. This approach unfortunately requires thousands of gearshift trials, ultimately discouraging the exploration of different control strategies. In this work, we calibrate the feedforward and feedback parameters of a gearshift controller using a model-based reinforcement learning algorithm adapted from Pilco. Experimental results show that the method optimizes the controller parameters with few gearshift trials. This approach can accelerate the exploration of gearshift control strategies, which is especially important for the emerging technology of multi-speed transmissions for electric vehicles. △ Less

Submitted 1 December, 2021; originally announced December 2021.

arXiv:2111.08067 [pdf, other]

ModelLight: Model-Based Meta-Reinforcement Learning for Traffic Signal Control

Authors: Xingshuai Huang, Di Wu, Michael Jenkin, Benoit Boulet

Abstract: Traffic signal control is of critical importance for the effective use of transportation infrastructures. The rapid increase of vehicle traffic and changes in traffic patterns make traffic signal control more and more challenging. Reinforcement Learning (RL)-based algorithms have demonstrated their potential in dealing with traffic signal control. However, most existing solutions require a large a… ▽ More Traffic signal control is of critical importance for the effective use of transportation infrastructures. The rapid increase of vehicle traffic and changes in traffic patterns make traffic signal control more and more challenging. Reinforcement Learning (RL)-based algorithms have demonstrated their potential in dealing with traffic signal control. However, most existing solutions require a large amount of training data, which is unacceptable for many real-world scenarios. This paper proposes a novel model-based meta-reinforcement learning framework (ModelLight) for traffic signal control. Within ModelLight, an ensemble of models for road intersections and the optimization-based meta-learning method are used to improve the data efficiency of an RL-based traffic light control method. Experiments on real-world datasets demonstrate that ModelLight can outperform state-of-the-art traffic light control algorithms while substantially reducing the number of required interactions with the real-world environment. △ Less

Submitted 6 December, 2021; v1 submitted 15 November, 2021; originally announced November 2021.

arXiv:2107.08835 [pdf, other]

Time Series Anomaly Detection for Smart Grids: A Survey

Authors: Jiuqi Elise Zhang, Di Wu, Benoit Boulet

Abstract: With the rapid increase in the integration of renewable energy generation and the wide adoption of various electric appliances, power grids are now faced with more and more challenges. One prominent challenge is to implement efficient anomaly detection for different types of anomalous behaviors within power grids. These anomalous behaviors might be induced by unusual consumption patterns of the us… ▽ More With the rapid increase in the integration of renewable energy generation and the wide adoption of various electric appliances, power grids are now faced with more and more challenges. One prominent challenge is to implement efficient anomaly detection for different types of anomalous behaviors within power grids. These anomalous behaviors might be induced by unusual consumption patterns of the users, faulty grid infrastructures, outages, external cyberattacks, or energy fraud. Identifying such anomalies is of critical importance for the reliable and efficient operation of modern power grids. Various methods have been proposed for anomaly detection on power grid time-series data. This paper presents a short survey of the recent advances in anomaly detection for power grid time-series data. Specifically, we first outline current research challenges in the power grid anomaly detection domain and further review the major anomaly detection approaches. Finally, we conclude the survey by identifying the potential directions for future research. △ Less

Submitted 16 July, 2021; originally announced July 2021.

Comments: 6 pages; Preprint Submitted to IEEE Canadian Electrical Power and Energy Conference (EPEC2021)

arXiv:2106.14450 [pdf, other]

Data-driven Model Predictive and Reinforcement Learning Based Control for Building Energy Management: a Survey

Authors: Huiliang Zhang, Sayani Seal, Di Wu, Benoit Boulet, Francois Bouffard, Geza Joos

Abstract: Building energy management is one of the core problems in modern power grids to reduce energy consumption while ensuring occupants' comfort. However, the building energy management system (BEMS) is now facing more challenges and uncertainties with the increasing penetration of renewables and complicated interactions between humans and buildings. Classical model predictive control (MPC) has shown i… ▽ More Building energy management is one of the core problems in modern power grids to reduce energy consumption while ensuring occupants' comfort. However, the building energy management system (BEMS) is now facing more challenges and uncertainties with the increasing penetration of renewables and complicated interactions between humans and buildings. Classical model predictive control (MPC) has shown its capacity to reduce building energy consumption, but it suffers from labor-intensive modelling and complex on-line control optimization. Recently, with the growing accessibility to the building control and automation data, data-driven solutions have attracted more research interest. This paper presents a compact review of the recent advances in data-driven MPC and reinforcement learning based control methods for BEMS. The main challenges in these approaches and insights on the selection of a control method are discussed. △ Less

Submitted 28 June, 2021; originally announced June 2021.

arXiv:2012.10762 [pdf, other]

Image-based Intraluminal Contact Force Monitoring in Robotic Vascular Navigation

Authors: Masoud Razban, Javad Dargahi, Benoit Boulet

Abstract: Embolization, stroke, ischaemic lesion, and perforation remain significant concerns in endovascular interventions. Intravascular sensing of tool interaction with the arteries is advantageous to minimize such complications and enhance navigation safety. Intraluminal information is currently limited due to the lack of intravascular contact sensing technologies. We present monitoring of the intralumi… ▽ More Embolization, stroke, ischaemic lesion, and perforation remain significant concerns in endovascular interventions. Intravascular sensing of tool interaction with the arteries is advantageous to minimize such complications and enhance navigation safety. Intraluminal information is currently limited due to the lack of intravascular contact sensing technologies. We present monitoring of the intraluminal tool interaction with the arterial wall using an image-based estimation approach within vascular robotic navigation. The proposed image-based method employs continuous finite element simulation of the tool using imaging data to estimate multi-point forces along tool-vessel wall interaction. We implemented imaging algorithms to detect and track contacts, and compute pose measurements. The model is constructed based on the nonlinear beam element and flexural rigidity profile over the tool length. During remote cannulation of aortic arteries, intraluminal monitoring achieved tracking local contact forces, building a contour map of force on the arterial wall and estimating tool structural stress. Results suggest that high risk intraluminal forces may happen even with low insertion force. The presented online monitoring system delivers insight into the intraluminal behavior of endovascular tools and is well suited for intraoperative visual guidance for the clinician, robotic control of vascular procedures and research on interventional device design. △ Less

Submitted 14 February, 2021; v1 submitted 19 December, 2020; originally announced December 2020.

arXiv:2009.12410 [pdf, other]

doi 10.1016/j.mechmachtheory.2021.104290

Fundamental limitations to no-jerk gearshifts of multi-speed transmission architectures in electric vehicles

Authors: Marc-Antoine Beaudoin, Benoit Boulet

Abstract: Multi-speed transmissions can enhance the performance and reduce the overall cost of an electric vehicle, but they also introduce a challenge: avoiding gearshift jerk, which may sometimes prove to be impossible in the presence of motor and clutch saturation. In this article, we introduce three theorems that explicitly define the fundamental limitations to no-jerk gearshifts resulting from motor or… ▽ More Multi-speed transmissions can enhance the performance and reduce the overall cost of an electric vehicle, but they also introduce a challenge: avoiding gearshift jerk, which may sometimes prove to be impossible in the presence of motor and clutch saturation. In this article, we introduce three theorems that explicitly define the fundamental limitations to no-jerk gearshifts resulting from motor or actuator saturation. We compare gearshifts that consist of transferring transmission torque from one friction clutch to another, to the case in which one of the clutches is a one-way clutch. We show that systems with a one-way clutch are more prone to motor saturation, causing gearshift jerk to be more often inevitable. We also study the influence of planetary gearsets on the gearshift dynamical trajectories, and expose the impact on the no-jerk limitations. This work offers tools to compare transmission architectures during the conceptual design phase of a new electric vehicle. △ Less

Submitted 16 February, 2021; v1 submitted 25 September, 2020; originally announced September 2020.

arXiv:1912.06943 [pdf, other]

Centralized Model Predictive Control Strategy for Thermal Comfort and Residential Energy Management

Authors: Sayani Seal, Benoit Boulet, Vahid Raissi Dehkordi

Abstract: A novel centralized model predictive control (MPC) is proposed for comfort and energy management in a residential building. The residential setup used here is equipped with a photovoltaic (PV) solar system and a stationary home battery unit. An air-to-air multi-split heat pump (HP) is used as the primary heating system. The electric baseboard (BB) unit in each zone is used as a secondary system. T… ▽ More A novel centralized model predictive control (MPC) is proposed for comfort and energy management in a residential building. The residential setup used here is equipped with a photovoltaic (PV) solar system and a stationary home battery unit. An air-to-air multi-split heat pump (HP) is used as the primary heating system. The electric baseboard (BB) unit in each zone is used as a secondary system. The MPC is simultaneously responsible for controlling the heating inputs of the HP and BB units for comfort management, as well as for the control of energy flow between the PV, the home battery and the bidirectional grid system. Variable Time-of-Use (ToU) rates are considered for the energy cost calculation and Feed-in-Tariff (FiT) is considered for selling energy to the grid. A 13.5% reduction in the energy cost is achieved with the centralized MPC as compared to a rule based energy management strategy. The solar energy generation and battery storage contribute to approximately 31% saving. △ Less

Submitted 14 December, 2019; originally announced December 2019.

Comments: 13 pages, 11 figures, 8 tables. Submitted to Energy, Elsevier

Showing 1–18 of 18 results for author: Boulet, B