-
Synthesizing data products, mathematical models, and observational measurements for lake temperature forecasting
Authors:
Maike F. Holthuijzen,
Robert B. Gramacy,
Cayelan C. Carey,
Dave M. Higdon,
R. Quinn Thomas
Abstract:
We present a novel forecasting framework for lake water temperature profiles, crucial for managing lake ecosystems and drinking water resources. The General Lake Model (GLM), a one-dimensional process-based model, has been widely used for this purpose, but, similar to many process-based simulation models, it: requires a large number of input variables, many of which are stochastic; presents challe…
▽ More
We present a novel forecasting framework for lake water temperature profiles, crucial for managing lake ecosystems and drinking water resources. The General Lake Model (GLM), a one-dimensional process-based model, has been widely used for this purpose, but, similar to many process-based simulation models, it: requires a large number of input variables, many of which are stochastic; presents challenges for uncertainty quantification (UQ); and can exhibit model bias. To address these issues, we propose a Gaussian process (GP) surrogate-based forecasting approach that efficiently handles large, high-dimensional data and accounts for input-dependent variability and systematic GLM bias. We validate the proposed approach and compare it with other forecasting methods, including a climatological model and raw GLM simulations. Our results demonstrate that our bias-corrected GP surrogate (GPBC) can outperform competing approaches in terms of forecast accuracy and UQ up to two weeks into the future.
△ Less
Submitted 3 July, 2024;
originally announced July 2024.
-
Embedding Positive Process Models into Lognormal Bayesian State Space Frameworks using Moment Matching
Authors:
John W. Smith,
Leah R. Johnson,
R. Quinn Thomas
Abstract:
In ecology it is common for processes to be bounded based on physical constraints of the system. One common example is the positivity constraint, which applies to phenomena such as duration times, population sizes, and total stock of a system's commodity. In this paper, we propose a novel method for embedding these dynamical systems into a lognormal state space model using an approach based on mom…
▽ More
In ecology it is common for processes to be bounded based on physical constraints of the system. One common example is the positivity constraint, which applies to phenomena such as duration times, population sizes, and total stock of a system's commodity. In this paper, we propose a novel method for embedding these dynamical systems into a lognormal state space model using an approach based on moment matching. Our method enforces the positivity constraint, allows for embedding of arbitrary mean evolution and variance structure, and has a closed-form Markov transition density which allows for more flexibility in fitting techniques. We discuss two existing lognormal state space models, and examine how they differ from the method presented here. We use 180 synthetic datasets to compare the forecasting performance under model misspecification and assess estimability of precision parameters between our method and existing methods. We find that our models well under misspecification, and that fixing the observation variance both helps to improve estimation of the process variance and improves forecast performance. To test our method on a difficult problem, we compare the predictive performance of two lognormal state space models in predicting Leaf Area Index over a 151 day horizon by embedding a process-based ecosystem model. We find that our moment matching model performs better than its competitor, and is better suited for long predictive horizons. Overall, our study helps to inform practitioners about the importance of embedding sensible dynamics when using models complex systems to predict out of sample.
△ Less
Submitted 20 December, 2022;
originally announced December 2022.
-
Assessing Ecosystem State Space Models: Identifiability and Estimation
Authors:
John W. Smith,
Leah R. Johnson,
Robert Q. Thomas
Abstract:
Bayesian methods are increasingly being applied to parameterize mechanistic process models used in environmental prediction and forecasting. In particular, models describing ecosystem dynamics with multiple states that are linear and autoregressive at each step in time can be treated as statistical state space models. In this paper we examine this subset of ecosystem models, giving closed form Gib…
▽ More
Bayesian methods are increasingly being applied to parameterize mechanistic process models used in environmental prediction and forecasting. In particular, models describing ecosystem dynamics with multiple states that are linear and autoregressive at each step in time can be treated as statistical state space models. In this paper we examine this subset of ecosystem models, giving closed form Gibbs sampling updates for latent states and process precision parameters when process and observation errors are normally distributed. We use simulated data from an example model (DALECev) to assess the performance of parameter estimation and identifiability under scenarios of gaps in observations. We show that process precision estimates become unreliable as temporal gaps between observed state data increase. To improve estimates, particularly precisions, we introduce a method of tuning the timestep of the latent states to leverage higher-frequency driver information. Further, we show that data cloning is a suitable method for assessing parameter identifiability in this class of models. Overall, our study helps inform the application of state space models to ecological forecasting applications where 1) data are not available for all states and transfers at the operational timestep for the ecosystem model and 2) process uncertainty estimation is desired.
△ Less
Submitted 17 October, 2021;
originally announced October 2021.
-
Physics-Guided Architecture (PGA) of Neural Networks for Quantifying Uncertainty in Lake Temperature Modeling
Authors:
Arka Daw,
R. Quinn Thomas,
Cayelan C. Carey,
Jordan S. Read,
Alison P. Appling,
Anuj Karpatne
Abstract:
To simultaneously address the rising need of expressing uncertainties in deep learning models along with producing model outputs which are consistent with the known scientific knowledge, we propose a novel physics-guided architecture (PGA) of neural networks in the context of lake temperature modeling where the physical constraints are hard coded in the neural network architecture. This allows us…
▽ More
To simultaneously address the rising need of expressing uncertainties in deep learning models along with producing model outputs which are consistent with the known scientific knowledge, we propose a novel physics-guided architecture (PGA) of neural networks in the context of lake temperature modeling where the physical constraints are hard coded in the neural network architecture. This allows us to integrate such models with state of the art uncertainty estimation approaches such as Monte Carlo (MC) Dropout without sacrificing the physical consistency of our results. We demonstrate the effectiveness of our approach in ensuring better generalizability as well as physical consistency in MC estimates over data collected from Lake Mendota in Wisconsin and Falling Creek Reservoir in Virginia, even with limited training data. We further show that our MC estimates correctly match the distribution of ground-truth observations, thus making the PGA paradigm amenable to physically grounded uncertainty quantification.
△ Less
Submitted 6 November, 2019;
originally announced November 2019.