-
Structural Health Monitoring with Functional Data: Two Case Studies
Authors:
Philipp Wittenberg,
Sven Knoth,
Jan Gertheiss
Abstract:
Structural Health Monitoring (SHM) is increasingly used in civil engineering. One of its main purposes is to detect and assess changes in infrastructure conditions to reduce possible maintenance downtime and increase safety. Ideally, this process should be automated and implemented in real-time. Recent advances in sensor technology facilitate data collection and process automation, resulting in ma…
▽ More
Structural Health Monitoring (SHM) is increasingly used in civil engineering. One of its main purposes is to detect and assess changes in infrastructure conditions to reduce possible maintenance downtime and increase safety. Ideally, this process should be automated and implemented in real-time. Recent advances in sensor technology facilitate data collection and process automation, resulting in massive data streams. Functional data analysis (FDA) can be used to model and aggregate the data obtained transparently and interpretably. In two real-world case studies of bridges in Germany and Belgium, this paper demonstrates how a function-on-function regression approach, combined with profile monitoring, can be applied to SHM data to adjust sensor/system outputs for environmental-induced variation and detect changes in construction. Specifically, we consider the R package \texttt{funcharts} and discuss some challenges when using this software on real-world SHM data. For instance, we show that pre-smoothing of the data can improve and extend its usability.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
Introducing ChatSQC: Enhancing Statistical Quality Control with Augmented AI
Authors:
Fadel M. Megahed,
Ying-Ju Chen,
Inez Zwetsloot,
Sven Knoth,
Douglas C. Montgomery,
L. Allison Jones-Farmer
Abstract:
We introduce ChatSQC, an innovative chatbot system that combines the power of OpenAI's Large Language Models (LLM) with a specific knowledge base in Statistical Quality Control (SQC). Our research focuses on enhancing LLMs using specific SQC references, shedding light on how data preprocessing parameters and LLM selection impact the quality of generated responses. By illustrating this process, we…
▽ More
We introduce ChatSQC, an innovative chatbot system that combines the power of OpenAI's Large Language Models (LLM) with a specific knowledge base in Statistical Quality Control (SQC). Our research focuses on enhancing LLMs using specific SQC references, shedding light on how data preprocessing parameters and LLM selection impact the quality of generated responses. By illustrating this process, we hope to motivate wider community engagement to refine LLM design and output appraisal techniques. We also highlight potential research opportunities within the SQC domain that can be facilitated by leveraging ChatSQC, thereby broadening the application spectrum of SQC. A primary goal of our work is to provide a template and proof-of-concept on how LLMs can be utilized by our community. To continuously improve ChatSQC, we ask the SQC community to provide feedback, highlight potential issues, request additional features, and/or contribute via pull requests through our public GitHub repository. Additionally, the team will continue to explore adding supplementary reference material that would further improve the contextual understanding of the chatbot. Overall, ChatSQC serves as a testament to the transformative potential of AI within SQC, and we hope it will spur further advancements in the integration of AI in this field.
△ Less
Submitted 28 March, 2024; v1 submitted 22 August, 2023;
originally announced August 2023.
-
How Generative AI models such as ChatGPT can be (Mis)Used in SPC Practice, Education, and Research? An Exploratory Study
Authors:
Fadel M. Megahed,
Ying-Ju Chen,
Joshua A. Ferris,
Sven Knoth,
L. Allison Jones-Farmer
Abstract:
Generative Artificial Intelligence (AI) models such as OpenAI's ChatGPT have the potential to revolutionize Statistical Process Control (SPC) practice, learning, and research. However, these tools are in the early stages of development and can be easily misused or misunderstood. In this paper, we give an overview of the development of Generative AI. Specifically, we explore ChatGPT's ability to pr…
▽ More
Generative Artificial Intelligence (AI) models such as OpenAI's ChatGPT have the potential to revolutionize Statistical Process Control (SPC) practice, learning, and research. However, these tools are in the early stages of development and can be easily misused or misunderstood. In this paper, we give an overview of the development of Generative AI. Specifically, we explore ChatGPT's ability to provide code, explain basic concepts, and create knowledge related to SPC practice, learning, and research. By investigating responses to structured prompts, we highlight the benefits and limitations of the results. Our study indicates that the current version of ChatGPT performs well for structured tasks, such as translating code from one language to another and explaining well-known concepts but struggles with more nuanced tasks, such as explaining less widely known terms and creating code from scratch. We find that using new AI tools may help practitioners, educators, and researchers to be more efficient and productive. However, in their current stages of development, some results are misleading and wrong. Overall, the use of generative AI models in SPC must be properly validated and used in conjunction with other methods to ensure accurate results.
△ Less
Submitted 17 February, 2023;
originally announced February 2023.
-
Another look at synthetic-type control charts
Authors:
Sven Knoth
Abstract:
During the last two decades, in statistical process monitoring plentiful new methods appeared with synthetic-type control charts being a prominent constituent. These charts became popular designs for several reasons. The two most important ones are simplicity and proclaimed excellent change point detection performance. Whereas there is no doubt about the former, we deal here with the latter. We wi…
▽ More
During the last two decades, in statistical process monitoring plentiful new methods appeared with synthetic-type control charts being a prominent constituent. These charts became popular designs for several reasons. The two most important ones are simplicity and proclaimed excellent change point detection performance. Whereas there is no doubt about the former, we deal here with the latter. We will demonstrate that their performance is questionable. Expanding on some previous skeptical articles we want to critically reflect upon recently developed variants of synthetic-type charts in order to emphasize that there is little reason to apply and to push this special class of control charts.
△ Less
Submitted 5 December, 2021;
originally announced December 2021.
-
A Critique of a Variety of "Memory-Based'' Process Monitoring Methods
Authors:
Sven Knoth,
Nesma A. Saleh,
Mahmoud A. Mahmoud,
William H. Woodall,
Victor G. Tercero-Gomez
Abstract:
Many extensions and modifications have been made to standard process monitoring methods such as the exponentially weighted moving average (EWMA) chart and the cumulative sum (CUSUM) chart. In addition, new schemes have been proposed based on alternative weighting of past data, usually to put greater emphasis on past data and less weight on current and recent data. In other cases, the output of one…
▽ More
Many extensions and modifications have been made to standard process monitoring methods such as the exponentially weighted moving average (EWMA) chart and the cumulative sum (CUSUM) chart. In addition, new schemes have been proposed based on alternative weighting of past data, usually to put greater emphasis on past data and less weight on current and recent data. In other cases, the output of one process monitoring method, such as the EWMA statistic, is used as the input to another method, such as the CUSUM chart. Often the recursive formula for a control chart statistic is itself used recursively to form a new control chart statistic. We find the use of these ad hoc methods to be unjustified. Statistical performance comparisons justifying the use of these methods have been either flawed by focusing only on zero-state run length metrics or by making comparisons to an unnecessarily weak competitor.
△ Less
Submitted 20 October, 2021;
originally announced October 2021.
-
A Review and Critique of Auxiliary Information-Based Process Monitoring Methods
Authors:
Nesma A. Saleh,
Mahmoud A. Mahmoud,
William H. Woodall,
Sven Knoth
Abstract:
We review the rapidly growing literature on auxiliary information-based (AIB) process monitoring methods. Under this approach, there is an assumption that the auxiliary variable, which is correlated with the quality variable of interest, has a known mean, or some other parameter, which cannot change over time. We demonstrate that violations of this assumption can have serious adverse effects both…
▽ More
We review the rapidly growing literature on auxiliary information-based (AIB) process monitoring methods. Under this approach, there is an assumption that the auxiliary variable, which is correlated with the quality variable of interest, has a known mean, or some other parameter, which cannot change over time. We demonstrate that violations of this assumption can have serious adverse effects both when the process is stable and when there has been a process shift. Some process shifts can become undetectable. We also show that the basic AIB approach is a special case of simple linear regression profile monitoring. The AIB charting techniques require strong assumptions. Based on our results, we warn against the use of AIB approach in quality control applications.
△ Less
Submitted 30 September, 2021;
originally announced October 2021.
-
The Case against Generally Weighted Moving Average (GWMA) Control Charts
Authors:
Sven Knoth,
William H. Woodall,
Víctor G. Tercero-Gómez
Abstract:
We argue against the use of generally weighted moving average (GWMA) control charts. Our primary reasons are the following: 1) There is no recursive formula for the GWMA control chart statistic, so all previous data must be stored and used in the calculation of each chart statistic. 2) The Markovian property does not apply to the GWMA statistics, so computer simulation must be used to determine co…
▽ More
We argue against the use of generally weighted moving average (GWMA) control charts. Our primary reasons are the following: 1) There is no recursive formula for the GWMA control chart statistic, so all previous data must be stored and used in the calculation of each chart statistic. 2) The Markovian property does not apply to the GWMA statistics, so computer simulation must be used to determine control limits and the statistical performance. 3) An appropriately designed, and much simpler, exponentially weighted moving average (EWMA) chart provides as good or better statistical performance. 4) In some cases the GWMA chart gives more weight to past data values than to current values.
△ Less
Submitted 1 July, 2021;
originally announced July 2021.
-
Controlling the EWMA $S^2$ control chart false alarm behavior when the in-control variance level must be estimated
Authors:
Sven Knoth
Abstract:
Investigating the problem of setting control limits in the case of parameter uncertainty is more accessible when monitoring the variance because only one parameter has to be estimated. Simply ignoring the induced uncertainty frequently leads to control charts with poor false alarm performances. Adjusting the unconditional in-control (IC) average run length (ARL) makes the situation even worse. Gua…
▽ More
Investigating the problem of setting control limits in the case of parameter uncertainty is more accessible when monitoring the variance because only one parameter has to be estimated. Simply ignoring the induced uncertainty frequently leads to control charts with poor false alarm performances. Adjusting the unconditional in-control (IC) average run length (ARL) makes the situation even worse. Guaranteeing a minimum conditional IC ARL with some given probability is another very popular approach to solving these difficulties. However, it is very conservative as well as more complex and more difficult to communicate. We utilize the probability of a false alarm within the planned number of points to be plotted on the control chart. It turns out that adjusting this probability produces notably different limit adjustments compared to controlling the unconditional IC ARL. We then develop numerical algorithms to determine the respective modifications of the upper and two-sided exponentially weighted moving average (EWMA) charts based on the sample variance for normally distributed data. These algorithms are made available within an R package. Finally, the impacts of the EWMA smoothing constant and the size of the preliminary sample on the control chart design and its performance are studied.
△ Less
Submitted 11 January, 2021;
originally announced January 2021.
-
The Steady-State Behavior of Multivariate Exponentially Weighted Moving Average Control Charts
Authors:
Sven Knoth
Abstract:
Multivariate Exponentially Weighted Moving Average, MEWMA, charts are popular, handy and effective procedures to detect distributional changes in a stream of multivariate data. For doing appropriate performance analysis, dealing with the steady-state behavior of the MEWMA statistic is essential. Going beyond early papers, we derive quite accurate approximations of the respective steady-state densi…
▽ More
Multivariate Exponentially Weighted Moving Average, MEWMA, charts are popular, handy and effective procedures to detect distributional changes in a stream of multivariate data. For doing appropriate performance analysis, dealing with the steady-state behavior of the MEWMA statistic is essential. Going beyond early papers, we derive quite accurate approximations of the respective steady-state densities of the MEWMA statistic. It turns out that these densities could be rewritten as the product of two functions depending on one argument only which allows feasible calculation. For proving the related statements, the presentation of the non-central chisquare density deploying the confluent hypergeometric limit function is applied. Using the new methods it was found that for large dimensions, the steady-state behavior becomes different to what one might expect from the univariate monitoring field. Based on the integral equation driven methods, steady-state and worst-case average run lengths are calculated with higher accuracy than before. Eventually, optimal MEWMA smoothing constants are derived for all considered measures.
△ Less
Submitted 15 August, 2018;
originally announced August 2018.