
arXiv:2406.19015 [pdf, other]

Lithium-Ion Battery System Health Monitoring and Fault Analysis from Field Data Using Gaussian Processes

Authors: Joachim Schaeffer, Eric Lenz, Duncan Gulla, Martin Z. Bazant, Richard D. Braatz, Rolf Findeisen

Abstract: Health monitoring, fault analysis, and detection are critical for the safe and sustainable operation of battery systems. We apply Gaussian process resistance models on lithium iron phosphate battery field data to effectively separate the time-dependent and operating point-dependent resistance. The data set contains 29 battery systems returned to the manufacturer for warranty, each with eight cells in series, totaling 232 cells and 131 million data rows. We develop probabilistic fault detection rules using recursive spatiotemporal Gaussian processes. These processes allow the quick processing of over a million data points, enabling advanced online monitoring and furthering the understanding of battery pack failure in the field. The analysis underlines that often, only a single cell shows abnormal behavior or a knee point, consistent with weakest-link failure for cells connected in series, amplified by local resistive heating. The results further the understanding of how batteries degrade and fail in the field and demonstrate the potential of efficient online monitoring based on data. We open-source the code and publish the large data set upon completion of the review of this article.

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2404.06125 [pdf, other]

Learning Model Predictive Control Parameters via Bayesian Optimization for Battery Fast Charging

Authors: Sebastian Hirt, Andreas Höhl, Joachim Schaeffer, Johannes Pohlodek, Richard D. Braatz, Rolf Findeisen

Abstract: Tuning parameters in model predictive control (MPC) presents significant challenges, particularly when there is a notable discrepancy between the controller's predictions and the actual behavior of the closed-loop plant. This mismatch may stem from factors like substantial model-plant differences, limited prediction horizons that do not cover the entire time of interest, or unforeseen system disturbances. Such mismatches can jeopardize both performance and safety, including constraint satisfaction. Traditional methods address this issue by modifying the finite horizon cost function to better reflect the overall operational cost, learning parts of the prediction model from data, or implementing robust MPC strategies, which might be either computationally intensive or overly cautious. As an alternative, directly optimizing or learning the controller parameters to enhance closed-loop performance has been proposed. We apply Bayesian optimization for efficient learning of unknown model parameters and parameterized constraint backoff terms, aiming to improve closed-loop performance of battery fast charging. This approach establishes a hierarchical control framework where Bayesian optimization directly fine-tunes closed-loop behavior towards a global and long-term objective, while MPC handles lower-level, short-term control tasks. For lithium-ion battery fast charging, we show that the learning approach not only ensures safe operation but also maximizes closed-loop performance. This includes maintaining the battery's operation below its maximum terminal voltage and reducing charging times, all achieved using a standard nominal MPC model with a short horizon and notable initial model-plant mismatch.

Submitted 9 April, 2024; originally announced April 2024.

Comments: 6 pages, 5 figures, accepted for ADCHEM 2024

arXiv:2404.04049 [pdf, other]

Cycle Life Prediction for Lithium-ion Batteries: Machine Learning and More

Authors: Joachim Schaeffer, Giacomo Galuppini, Jinwook Rhyu, Patrick A. Asinger, Robin Droop, Rolf Findeisen, Richard D. Braatz

Abstract: Batteries are dynamic systems with complicated nonlinear aging, highly dependent on cell design, chemistry, manufacturing, and operational conditions. Prediction of battery cycle life and estimation of aging states is important to accelerate battery R&D, testing, and to further the understanding of how batteries degrade. Beyond testing, battery management systems rely on real-time models and onboard diagnostics and prognostics for safe operation. Estimating the state of health and remaining useful life of a battery is important to optimize performance and use resources optimally. This tutorial begins with an overview of first-principles, machine learning, and hybrid battery models. Then, a typical pipeline for the development of interpretable machine learning models is explained and showcased for cycle life prediction from laboratory testing data. We highlight the challenges of machine learning models, motivating the incorporation of physics in hybrid modeling approaches, which are needed to decipher the aging trajectory of batteries but require more data and further work on the physics of battery degradation. The tutorial closes with a discussion on generalization and further research directions.

Submitted 5 April, 2024; originally announced April 2024.

Comments: 6 pages, 3 figures, accepted for ACC 2024

arXiv:2402.17120 [pdf, other]

LCEN: A Novel Feature Selection Algorithm for Nonlinear, Interpretable Machine Learning Models

Authors: Pedro Seber, Richard D. Braatz

Abstract: Interpretable architectures can have advantages over black-box architectures, and interpretability is essential for the application of machine learning in critical settings, such as aviation or medicine. However, the simplest, most commonly used interpretable architectures, such as LASSO or elastic net (EN), are limited to linear predictions and have poor feature selection capabilities. In this work, we introduce the LASSO-Clip-EN (LCEN) algorithm for the creation of nonlinear, interpretable machine learning models. LCEN is tested on a wide variety of artificial and empirical datasets, frequently creating more accurate, sparser models than other architectures, including those for building sparse, nonlinear models. LCEN is robust against many issues typically present in datasets and modeling, including noise, multicollinearity, data scarcity, and hyperparameter variance. LCEN is also able to rediscover multiple physical laws from empirical data and, for processes with no known physical laws, LCEN achieves better results than many other dense and sparse methods -- including using 10.8-fold fewer features than dense methods and 8.1-fold fewer features than EN on one dataset, and is comparable to or better than ANNs on multiple datasets.

Submitted 5 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

arXiv:2309.00564 [pdf, other]

doi 10.1016/j.compchemeng.2023.108471

Interpretation of High-Dimensional Linear Regression: Effects of Nullspace and Regularization Demonstrated on Battery Data

Authors: Joachim Schaeffer, Eric Lenz, William C. Chueh, Martin Z. Bazant, Rolf Findeisen, Richard D. Braatz

Abstract: High-dimensional linear regression is important in many scientific fields. This article considers discrete measured data of underlying smooth latent processes, as is often obtained from chemical or biological systems. Interpretation in high dimensions is challenging because the nullspace and its interplay with regularization shapes regression coefficients. The data's nullspace contains all coefficients that satisfy $\mathbf{Xw}=\mathbf{0}$, thus allowing very different coefficients to yield identical predictions. We developed an optimization formulation to compare regression coefficients and coefficients obtained by physical engineering knowledge to understand which part of the coefficient differences are close to the nullspace. This nullspace method is tested on a synthetic example and lithium-ion battery data. The case studies show that regularization and z-scoring are design choices that, if chosen corresponding to prior physical knowledge, lead to interpretable regression results. Otherwise, the combination of the nullspace and regularization hinders interpretability and can make it impossible to obtain regression coefficients close to the true coefficients when there is a true underlying linear model. Furthermore, we demonstrate that regression methods that do not produce coefficients orthogonal to the nullspace, such as fused lasso, can improve interpretability. In conclusion, the insights gained from the nullspace perspective help to make informed design choices for building regression models on high-dimensional data and reasoning about potential underlying linear models, which are important for system optimization and improving scientific understanding.

Submitted 6 September, 2023; v1 submitted 1 September, 2023; originally announced September 2023.

Comments: Manuscript: 14 pages, 7 figures; Supplementary Information: 4 pages, 2 figures; Code available: https://github.com/JoachimSchaeffer/HDRegAnalytics

MSC Class: 62J07; 62P99 ACM Class: I.2.6
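
Illustrative note on the entry above (arXiv:2309.00564), not taken from the paper or its repository: the abstract states that any coefficient vector satisfying $\mathbf{Xw}=\mathbf{0}$ can be added to the regression coefficients without changing the predictions. The NumPy sketch below uses assumed random data and hypothetical variable names to show two clearly different coefficient vectors that predict identically, and to check that the minimum-norm least-squares solution is orthogonal to the nullspace.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed toy data: 5 samples, 20 features, so X has a 15-dimensional nullspace.
X = rng.normal(size=(5, 20))
w = rng.normal(size=20)
y = X @ w

# Orthonormal nullspace basis from the SVD: right singular vectors beyond rank(X).
_, s, Vt = np.linalg.svd(X, full_matrices=True)
rank = int(np.sum(s > 1e-10))
N = Vt[rank:].T  # shape (20, 20 - rank)

# Add an arbitrary nullspace vector to w: coefficients change, predictions do not.
w_alt = w + N @ rng.normal(size=N.shape[1])
same_predictions = np.allclose(X @ w, X @ w_alt)
coef_gap = float(np.linalg.norm(w - w_alt))

# The minimum-norm least-squares solution has no component in the nullspace.
w_min = np.linalg.pinv(X) @ y
orth_to_null = np.allclose(N.T @ w_min, 0.0, atol=1e-8)

print(same_predictions, coef_gap > 0.0, orth_to_null)
```

The last check relates to the abstract's remark about methods whose coefficients are not orthogonal to the nullspace: the minimum-norm solution is one that is orthogonal, so it cannot recover any part of a true coefficient vector that lies in the nullspace.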

arXiv:2308.02104 [pdf, other]

Mechanistic Modeling and Analysis of Thermal Radiation in Conventional, Microwave-assisted, and Hybrid Freeze Drying for Biopharmaceutical Manufacturing

Authors: Prakitr Srisuma, George Barbastathis, Richard D. Braatz

Abstract: In freeze drying, thermal radiation has a significant effect on the drying process of vials located near the corner and edge of the trays, resulting in non-uniformity of the products. Understanding and being able to predict the impact of thermal radiation are therefore critical to accurate determination of the drying process endpoint given the variation in heat transfer of each vial. This article presents a new mechanistic model that describes complex thermal radiation during primary drying in conventional, microwave-assisted, and hybrid freeze drying. Modeling of thermal radiation employs the diffuse gray surface model and radiation network approach, which systematically and accurately incorporates simultaneous radiation exchange between every surface including the chamber wall and vials, allowing the framework to be seamlessly applied for analyzing various freeze-dryer designs. Model validation with data from the literature shows accurate prediction of the drying times for all vials, including inner, edge, and corner vials. The validated model is demonstrated for thermal radiation analysis and parametric studies to guide the design and optimization of freeze dryers.

Submitted 26 September, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

arXiv:2306.15079 [pdf, ps, other]

A direct optimization algorithm for input-constrained MPC

Authors: Liang Wu, Richard D. Braatz

Abstract: Providing an execution time certificate is a pressing requirement when deploying Model Predictive Control (MPC) in real-time embedded systems such as microcontrollers. Real-time MPC requires that its worst-case (maximum) execution time must be theoretically guaranteed to be smaller than the sampling time in closed-loop. This technical note considers input-constrained MPC problems and exploits the structure of the resulting box-constrained QPs. Then, we propose a cost-free and data-independent initialization strategy, which enables us, for the first time, to remove the initialization assumption of feasible full-Newton interior-point algorithms. We prove that the number of iterations of our proposed algorithm is only dimension-dependent (data-independent), simple-calculated, and exact (not worst-case) with the value $\left\lceil\frac{\log\left(\frac{2n}{\epsilon}\right)}{-2\log\left(\frac{\sqrt{2n}}{\sqrt{2n}+\sqrt{2}-1}\right)}\right\rceil + 1$, where $n$ denotes the problem dimension and $\epsilon$ denotes the constant stopping tolerance. These features enable our algorithm to trivially certify the execution time of nonlinear MPC (via online linearized schemes) or adaptive MPC problems. The execution-time-certified capability of our algorithm is theoretically and numerically validated through an open-loop unstable AFTI-16 example.

Submitted 30 March, 2024; v1 submitted 26 June, 2023; originally announced June 2023.

Comments: 8 pages, Resubmitted to IEEE TAC
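
Illustrative note on the entry above (arXiv:2306.15079): the iteration count quoted in the abstract depends only on the problem dimension $n$ and the stopping tolerance $\epsilon$, so it can be evaluated ahead of time. The Python sketch below simply evaluates that expression; the function name `certified_iterations` and the input checks are assumptions for illustration, not part of the authors' code.

```python
import math


def certified_iterations(n: int, eps: float) -> int:
    """Evaluate ceil(log(2n/eps) / (-2 log(sqrt(2n) / (sqrt(2n) + sqrt(2) - 1)))) + 1."""
    if n < 1 or eps <= 0.0:
        raise ValueError("n must be a positive integer and eps must be positive")
    root = math.sqrt(2 * n)
    ratio = root / (root + math.sqrt(2) - 1)  # always in (0, 1), so its log is negative
    return math.ceil(math.log(2 * n / eps) / (-2.0 * math.log(ratio))) + 1


# The count grows with the dimension and with a tighter tolerance.
for n, eps in [(10, 1e-6), (10, 1e-8), (100, 1e-6)]:
    print(n, eps, certified_iterations(n, eps))
```

Multiplying this count by a measured per-iteration cost on the target hardware would give an execution-time bound of the kind the abstract describes; that per-iteration cost is platform-specific and is not given in the listing.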

arXiv:2304.06835 [pdf, other]

doi 10.1016/j.cma.2023.116591

Automated Translation and Accelerated Solving of Differential Equations on Multiple GPU Platforms

Authors: Utkarsh Utkarsh, Valentin Churavy, Yingbo Ma, Tim Besard, Prakitr Srisuma, Tim Gymnich, Adam R. Gerlach, Alan Edelman, George Barbastathis, Richard D. Braatz, Christopher Rackauckas

Abstract: We demonstrate a high-performance vendor-agnostic method for massively parallel solving of ensembles of ordinary differential equations (ODEs) and stochastic differential equations (SDEs) on GPUs. The method is integrated with a widely used differential equation solver library in a high-level language (Julia's DifferentialEquations.jl) and enables GPU acceleration without requiring code changes by the user. Our approach achieves state-of-the-art performance compared to hand-optimized CUDA-C++ kernels while performing 20--100$\times$ faster than the vectorizing map (vmap) approach implemented in JAX and PyTorch. Performance evaluation on NVIDIA, AMD, Intel, and Apple GPUs demonstrates performance portability and vendor-agnosticism. We show composability with MPI to enable distributed multi-GPU workflows. The implemented solvers are fully featured -- supporting event handling, automatic differentiation, and incorporation of datasets via the GPU's texture memory -- allowing scientists to take advantage of GPU acceleration on all major current architectures without changing their model code and without loss of performance. We distribute the software as an open-source library https://github.com/SciML/DiffEqGPU.jl

Submitted 13 November, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

Comments: 14 figures

Journal ref: Computer Methods in Applied Mechanics and Engineering, Volume 419, 2024

arXiv:2302.03362 [pdf, other]

doi 10.1149/1945-7111/acd8fb

Machine Learning Benchmarks for the Classification of Equivalent Circuit Models from Electrochemical Impedance Spectra

Authors: Joachim Schaeffer, Paul Gasper, Esteban Garcia-Tamayo, Raymond Gasper, Masaki Adachi, Juan Pablo Gaviria-Cardona, Simon Montoya-Bedoya, Anoushka Bhutani, Andrew Schiek, Rhys Goodall, Rolf Findeisen, Richard D. Braatz, Simon Engelke

Abstract: Analysis of Electrochemical Impedance Spectroscopy (EIS) data for electrochemical systems often consists of defining an Equivalent Circuit Model (ECM) using expert knowledge and then optimizing the model parameters to deconvolute various resistance, capacitive, inductive, or diffusion responses. For small data sets, this procedure can be conducted manually; however, it is not feasible to manually define a proper ECM for extensive data sets with a wide range of EIS responses. Automatic identification of an ECM would substantially accelerate the analysis of large sets of EIS data. We showcase machine learning methods to classify the ECMs of 9,300 impedance spectra provided by QuantumScape for the BatteryDEV hackathon. The best-performing approach is a gradient-boosted tree model utilizing a library to automatically generate features, followed by a random forest model using the raw spectral data. A convolutional neural network using boolean images of Nyquist representations is presented as an alternative, although it achieves a lower accuracy. We publish the data and open source the associated code. The approaches described in this article can serve as benchmarks for further studies. A key remaining challenge is the identifiability of the labels, underlined by the model performances and the comparison of misclassified spectra.

Submitted 4 May, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

Comments: Manuscript: 17 pages, 9 figures; Supplementary Information: 9 pages, 6 figures

MSC Class: 68T10

arXiv:2206.12404 [pdf, other]

Efficient Numerical Schemes for Multidimensional Population Balance Models

Authors: Pavan Inguva, Richard D. Braatz

Abstract: Multidimensional population balance models (PBMs) describe chemical and biological processes having a distribution over two or more intrinsic properties (such as size and age, or two independent spatial variables). The incorporation of additional intrinsic variables into a PBM improves its descriptive capability and can be necessary to capture specific features of interest. As most PBMs of interest cannot be solved analytically, computationally expensive high-order finite difference or finite volume methods are frequently used to obtain an accurate numerical solution. We propose a finite difference scheme based on operator splitting and solving each sub-problem at the limit of numerical stability that achieves a discretization error that is zero for certain classes of PBMs and low enough to be acceptable for other classes. In conjunction with employing specially constructed meshes and variable transformations, the scheme exploits the commutative property of the differential operators present in many classes of PBMs. The scheme has very low computational cost -- potentially as low as just memory reallocation. Multiple case studies demonstrate the performance of the proposed scheme.

Submitted 20 June, 2022; originally announced June 2022.
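
Illustrative note on the entry above (arXiv:2206.12404): the abstract's claim of zero discretization error "at the limit of numerical stability" can be seen in a much simpler setting than the paper's. The sketch below assumes a one-dimensional constant-growth-rate advection equation $u_t + g u_x = 0$, a first-order upwind scheme, and a zero inflow boundary; none of these choices come from the listing. At a CFL number of 1 the update reduces to shifting the array by one cell, which is exact up to floating-point rounding, whereas a smaller CFL number introduces numerical diffusion.

```python
import numpy as np


def upwind_step(u: np.ndarray, cfl: float) -> np.ndarray:
    """One first-order upwind step for u_t + g u_x = 0 with g > 0 and zero inflow."""
    u_new = u.copy()
    u_new[1:] = u[1:] - cfl * (u[1:] - u[:-1])
    u_new[0] = (1.0 - cfl) * u[0]  # value entering from the left boundary is 0
    return u_new


# Assumed initial distribution: a narrow Gaussian on a uniform grid.
x = np.linspace(0.0, 1.0, 101)
u0 = np.exp(-(((x - 0.2) / 0.05) ** 2))

steps = 30  # number of cells the exact solution travels
u_shifted = np.concatenate([np.zeros(steps), u0[:-steps]])  # exact solution on the grid

# CFL = 1: each step moves the profile by exactly one cell.
u_cfl1 = u0.copy()
for _ in range(steps):
    u_cfl1 = upwind_step(u_cfl1, 1.0)

# CFL = 0.5: twice as many steps reach the same time, with numerical diffusion.
u_half = u0.copy()
for _ in range(2 * steps):
    u_half = upwind_step(u_half, 0.5)

err_cfl1 = float(np.max(np.abs(u_cfl1 - u_shifted)))
err_half = float(np.max(np.abs(u_half - u_shifted)))
print(err_cfl1, err_half)
```

Because the CFL = 1 update only moves values one index to the right, it could be implemented as an array shift with no arithmetic, which is one way to read the abstract's remark that the cost can be as low as memory reallocation.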

arXiv:1911.04386 [pdf, other]

doi 10.1016/j.compchemeng.2020.106991

Fault Detection and Identification using Bayesian Recurrent Neural Networks

Authors: Weike Sun, Antonio R. C. Paiva, Peng Xu, Anantha Sundaram, Richard D. Braatz

Abstract: In processing and manufacturing industries, there has been a large push to produce higher quality products and ensure maximum efficiency of processes. This requires approaches to effectively detect and resolve disturbances to ensure optimal operations. While the control system can compensate for many types of disturbances, there are changes to the process which it still cannot handle adequately. It is therefore important to further develop monitoring systems to effectively detect and identify those faults such that they can be quickly resolved by operators. In this paper, a novel probabilistic fault detection and identification method is proposed which adopts a newly developed deep learning approach using Bayesian recurrent neural networks (BRNNs) with variational dropout. The BRNN model is general and can model complex nonlinear dynamics. Moreover, compared to traditional statistic-based data-driven fault detection and identification methods, the proposed BRNN-based method yields uncertainty estimates which allow for simultaneous fault detection of chemical processes, direct fault identification, and fault propagation analysis. The outstanding performance of this method is demonstrated and contrasted with (dynamic) principal component analysis, which is widely applied in the industry, in the benchmark Tennessee Eastman process (TEP) and a real chemical manufacturing dataset.

Submitted 26 June, 2020; v1 submitted 11 November, 2019; originally announced November 2019.

Comments: 43 pages, 23 figures. Accepted for publication in Computers & Chemical Engineering

MSC Class: 68T05

Showing 1–11 of 11 results for author: Braatz, R D