-
Multistep Criticality Search and Power Sha** in Microreactors with Reinforcement Learning
Authors:
Majdi I. Radaideh,
Leo Tunkle,
Dean Price,
Kamal Abdulraheem,
Linyu Lin,
Moutaz Elias
Abstract:
Reducing operation and maintenance costs is a key objective for advanced reactors in general and microreactors in particular. To achieve this reduction, develo** robust autonomous control algorithms is essential to ensure safe and autonomous reactor operation. Recently, artificial intelligence and machine learning algorithms, specifically reinforcement learning (RL) algorithms, have seen rapid i…
▽ More
Reducing operation and maintenance costs is a key objective for advanced reactors in general and microreactors in particular. To achieve this reduction, develo** robust autonomous control algorithms is essential to ensure safe and autonomous reactor operation. Recently, artificial intelligence and machine learning algorithms, specifically reinforcement learning (RL) algorithms, have seen rapid increased application to control problems, such as plasma control in fusion tokamaks and building energy management. In this work, we introduce the use of RL for intelligent control in nuclear microreactors. The RL agent is trained using proximal policy optimization (PPO) and advantage actor-critic (A2C), cutting-edge deep RL techniques, based on a high-fidelity simulation of a microreactor design inspired by the Westinghouse eVinci\textsuperscript{TM} design. We utilized a Serpent model to generate data on drum positions, core criticality, and core power distribution for training a feedforward neural network surrogate model. This surrogate model was then used to guide a PPO and A2C control policies in determining the optimal drum position across various reactor burnup states, ensuring critical core conditions and symmetrical power distribution across all six core portions. The results demonstrate the excellent performance of PPO in identifying optimal drum positions, achieving a hextant power tilt ratio of approximately 1.002 (within the limit of $<$ 1.02) and maintaining criticality within a 10 pcm range. A2C did not provide as competitive of a performance as PPO in terms of performance metrics for all burnup steps considered in the cycle. Additionally, the results highlight the capability of well-trained RL control policies to quickly identify control actions, suggesting a promising approach for enabling real-time autonomous control through digital twins.
△ Less
Submitted 22 June, 2024;
originally announced June 2024.
-
Probabilistic Reconstruction of Paleodemographic Signals
Authors:
L. M. Arthur,
F. Chelazzi,
D. Lawrence,
M. D. Price
Abstract:
We present a comprehensive Bayesian approach to paleodemography, emphasizing the proper handling of uncertainties. We then apply that framework to survey data from Cyprus, and quantify the uncertainties in the paleodemographic estimates to demonstrate the applicability of the Bayesian approach and to show the large uncertainties present in current paleodemographic models and data. We also discuss…
▽ More
We present a comprehensive Bayesian approach to paleodemography, emphasizing the proper handling of uncertainties. We then apply that framework to survey data from Cyprus, and quantify the uncertainties in the paleodemographic estimates to demonstrate the applicability of the Bayesian approach and to show the large uncertainties present in current paleodemographic models and data. We also discuss methods to reduce the uncertainties and improve the efficacy of paleodemographic models.
△ Less
Submitted 11 June, 2024; v1 submitted 8 December, 2023;
originally announced December 2023.
-
Optimal Bayesian design for model discrimination via classification
Authors:
Markus Hainy,
David J. Price,
Olivier Restif,
Christopher Drovandi
Abstract:
Performing optimal Bayesian design for discriminating between competing models is computationally intensive as it involves estimating posterior model probabilities for thousands of simulated datasets. This issue is compounded further when the likelihood functions for the rival models are computationally expensive. A new approach using supervised classification methods is developed to perform Bayes…
▽ More
Performing optimal Bayesian design for discriminating between competing models is computationally intensive as it involves estimating posterior model probabilities for thousands of simulated datasets. This issue is compounded further when the likelihood functions for the rival models are computationally expensive. A new approach using supervised classification methods is developed to perform Bayesian optimal model discrimination design. This approach requires considerably fewer simulations from the candidate models than previous approaches using approximate Bayesian computation. Further, it is easy to assess the performance of the optimal design through the misclassification error rate. The approach is particularly useful in the presence of models with intractable likelihoods but can also provide computational advantages when the likelihoods are manageable.
△ Less
Submitted 6 April, 2022; v1 submitted 14 September, 2018;
originally announced September 2018.
-
An Induced Natural Selection Heuristic for Finding Optimal Bayesian Experimental Designs
Authors:
David J. Price,
Nigel G. Bean,
Joshua V. Ross,
Jonathan Tuke
Abstract:
Bayesian optimal experimental design has immense potential to inform the collection of data so as to subsequently enhance our understanding of a variety of processes. However, a major impediment is the difficulty in evaluating optimal designs for problems with large, or high-dimensional, design spaces. We propose an efficient search heuristic suitable for general optimisation problems, with a part…
▽ More
Bayesian optimal experimental design has immense potential to inform the collection of data so as to subsequently enhance our understanding of a variety of processes. However, a major impediment is the difficulty in evaluating optimal designs for problems with large, or high-dimensional, design spaces. We propose an efficient search heuristic suitable for general optimisation problems, with a particular focus on optimal Bayesian experimental design problems. The heuristic evaluates the objective (utility) function at an initial, randomly generated set of input values. At each generation of the algorithm, input values are "accepted" if their corresponding objective (utility) function satisfies some acceptance criteria, and new inputs are sampled about these accepted points. We demonstrate the new algorithm by evaluating the optimal Bayesian experimental designs for the previously considered death, pharmacokinetic and logistic regression models. Comparisons to the current "gold-standard" method are given to demonstrate the proposed algorithm as a computationally-efficient alternative for moderately-large design problems (i.e., up to approximately 40-dimensions).
△ Less
Submitted 13 March, 2018; v1 submitted 16 March, 2017;
originally announced March 2017.