Search | arXiv e-print repository

Machine Learning Applied to the Detection of Mycotoxin in Food: A Review

Authors: Alan Inglis, Andrew Parnell, Natarajan Subramani, Fiona Doohan

Abstract: Mycotoxins, toxic secondary metabolites produced by certain fungi, pose significant threats to global food safety and public health. These compounds can contaminate a variety of crops, leading to economic losses and health risks to both humans and animals. Traditional lab analysis methods for mycotoxin detection can be time-consuming and may not always be suitable for large-scale screenings. Howev… ▽ More Mycotoxins, toxic secondary metabolites produced by certain fungi, pose significant threats to global food safety and public health. These compounds can contaminate a variety of crops, leading to economic losses and health risks to both humans and animals. Traditional lab analysis methods for mycotoxin detection can be time-consuming and may not always be suitable for large-scale screenings. However, in recent years, machine learning (ML) methods have gained popularity for use in the detection of mycotoxins and in the food safety industry in general, due to their accurate and timely predictions. We provide a systematic review on some of the recent ML applications for detecting/predicting the presence of mycotoxin on a variety of food ingredients, highlighting their advantages, challenges, and potential for future advancements. We address the need for reproducibility and transparency in ML research through open access to data and code. An observation from our findings is the frequent lack of detailed reporting on hyperparameters in many studies as well as a lack of open source code, which raises concerns about the reproducibility and optimisation of the ML models used. The findings reveal that while the majority of studies predominantly utilised neural networks for mycotoxin detection, there was a notable diversity in the types of neural network architectures employed, with convolutional neural networks being the most popular. △ Less

Submitted 23 April, 2024; originally announced April 2024.

Comments: 39 pages, 8 figures, review paper

arXiv:2404.02228 [pdf, other]

Seemingly unrelated Bayesian additive regression trees for cost-effectiveness analyses in healthcare

Authors: Jonas Esser, Mateus Maia, Andrew C. Parnell, Judith Bosmans, Hanneke van Dongen, Thomas Klausch, Keefe Murphy

Abstract: In recent years, theoretical results and simulation evidence have shown Bayesian additive regression trees to be a highly-effective method for nonparametric regression. Motivated by cost-effectiveness analyses in health economics, where interest lies in jointly modelling the costs of healthcare treatments and the associated health-related quality of life experienced by a patient, we propose a mult… ▽ More In recent years, theoretical results and simulation evidence have shown Bayesian additive regression trees to be a highly-effective method for nonparametric regression. Motivated by cost-effectiveness analyses in health economics, where interest lies in jointly modelling the costs of healthcare treatments and the associated health-related quality of life experienced by a patient, we propose a multivariate extension of BART applicable in regression and classification analyses with several correlated outcome variables. Our framework overcomes some key limitations of existing multivariate BART models by allowing each individual response to be associated with different ensembles of trees, while still handling dependencies between the outcomes. In the case of continuous outcomes, our model is essentially a nonparametric version of seemingly unrelated regression. Likewise, our proposal for binary outcomes is a nonparametric generalisation of the multivariate probit model. We give suggestions for easily interpretable prior distributions, which allow specification of both informative and uninformative priors. We provide detailed discussions of MCMC sampling methods to conduct posterior inference. Our methods are implemented in the R package `suBART'. We showcase their performance through extensive simulations and an application to an empirical case study from health economics. By also accommodating propensity scores in a manner befitting a causal analysis, we find substantial evidence for a novel trauma care intervention's cost-effectiveness. △ Less

Submitted 10 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

arXiv:2310.03599 [pdf, other]

Output Feedback Reinforcement Learning with Parameter Optimisation for Temperature Control in a Material Extrusion Additive Manufacturing system

Authors: Eleni Zavrakli, Andrew Parnell, Subhrakanti Dey

Abstract: With the rapid development of Additive Manufacturing (AM) comes an urgent need for advanced monitoring and control of the process. Many aspects of the AM process play a significant role in the efficiency, accuracy and repeatability of the process, with temperature regulation being one of the most important ones. In this work, we solve the problem of optimal tracking control for a state space tempe… ▽ More With the rapid development of Additive Manufacturing (AM) comes an urgent need for advanced monitoring and control of the process. Many aspects of the AM process play a significant role in the efficiency, accuracy and repeatability of the process, with temperature regulation being one of the most important ones. In this work, we solve the problem of optimal tracking control for a state space temperature model of a Big Area Additive Manufacturing (BAAM) system. In particular, we address the problem of designing a Linear Quadratic Tracking (LQT) controller when access to the exact system state is not possible, except in the form of measurements. We initially solve the problem with a model-based approach based on reinforcement learning concepts, with state estimation through an observer. We then design a model-free reinforcement-learning based controller with an internal state estimation step and demonstrate its performance through a simulator of the systems' behaviour. Our results showcase the possibility of achieving comparable results while learning optimal policies directly from process data, without the need for an accurate, intricate model of the process. We consider this outcome to be a significant stride towards autonomous intelligent manufacturing. △ Less

Submitted 5 October, 2023; originally announced October 2023.

arXiv:2307.07039 [pdf, other]

Data-driven Linear Quadratic Tracking based Temperature Control of a Big Area Additive Manufacturing System

Authors: Eleni Zavrakli, Andrew Parnell, Andrew Dickson, Subhrakanti Dey

Abstract: Designing efficient closed-loop control algorithms is a key issue in Additive Manufacturing (AM), as various aspects of the AM process require continuous monitoring and regulation, with temperature being a particularly significant factor. Here we study closed-loop control of a state space temperature model with a focus on both model-based and data-driven methods. We demonstrate these approaches us… ▽ More Designing efficient closed-loop control algorithms is a key issue in Additive Manufacturing (AM), as various aspects of the AM process require continuous monitoring and regulation, with temperature being a particularly significant factor. Here we study closed-loop control of a state space temperature model with a focus on both model-based and data-driven methods. We demonstrate these approaches using a simulator of the temperature evolution in the extruder of a Big Area Additive Manufacturing system (BAAM). We perform an in-depth comparison of the performance of these methods using the simulator. We find that we can learn an effective controller using solely simulated process data. Our approach achieves parity in performance compared to model-based controllers and so lessens the need for estimating a large number of parameters of the intricate and complicated process model. We believe this result is an important step towards autonomous intelligent manufacturing. △ Less

Submitted 13 July, 2023; originally announced July 2023.

arXiv:2306.10847 [pdf, other]

reslr: An R package for relative sea level modelling

Authors: Maeve Upton, Andrew Parnell, Niamh Cahill

Abstract: We present reslr, an R package to perform Bayesian modelling of relative sea level data. We include a variety of different statistical models previously proposed in the literature, with a unifying framework for loading data, fitting models, and summarising the results. Relative sea-level data often contain measurement error in multiple dimensions and so our package allows for these to be included… ▽ More We present reslr, an R package to perform Bayesian modelling of relative sea level data. We include a variety of different statistical models previously proposed in the literature, with a unifying framework for loading data, fitting models, and summarising the results. Relative sea-level data often contain measurement error in multiple dimensions and so our package allows for these to be included in the statistical models. When plotting the output sea level curves, the focus is often on comparing rates of change, and so our package allows for computation of the derivative of sea level curves with appropriate consideration of the uncertainty. We provide a large example dataset from the Atlantic coast of North America and show some of the results that might be obtained from our package. △ Less

Submitted 19 June, 2023; originally announced June 2023.

arXiv:2306.07817 [pdf, other]

simmr: A package for fitting Stable Isotope Mixing Models in R

Authors: Emma Govan, Andrew L. Jackson, Richard Inger, Stuart Bearhop, Andrew C. Parnell

Abstract: We introduce an R package for fitting Stable Isotope Mixing Models (SIMMs) via both Markov chain Monte Carlo and Variational Bayes. The package is mainly used for estimating dietary contributions from food sources taken via measurements of stable isotope ratios from animals. It can also be used to estimate proportional contributions of a mixture from known sources, for example apportionment of riv… ▽ More We introduce an R package for fitting Stable Isotope Mixing Models (SIMMs) via both Markov chain Monte Carlo and Variational Bayes. The package is mainly used for estimating dietary contributions from food sources taken via measurements of stable isotope ratios from animals. It can also be used to estimate proportional contributions of a mixture from known sources, for example apportionment of river sediment, amongst many other use cases. The package contains a simple structure which allows non-expert users to interface with the package, with most of the computational complexity hidden behind the main fitting functions. In this paper we detail the background to these functions and provide case studies on how the package should be used. Further examples are available in the online package vignettes. △ Less

Submitted 13 June, 2023; originally announced June 2023.

Comments: 27 pages, 9 figures

arXiv:2306.03042 [pdf, other]

SERT: A Transfomer Based Model for Spatio-Temporal Sensor Data with Missing Values for Environmental Monitoring

Authors: Amin Shoari Nejad, Rocío Alaiz-Rodríguez, Gerard D. McCarthy, Brian Kelleher, Anthony Grey, Andrew Parnell

Abstract: Environmental monitoring is crucial to our understanding of climate change, biodiversity loss and pollution. The availability of large-scale spatio-temporal data from sources such as sensors and satellites allows us to develop sophisticated models for forecasting and understanding key drivers. However, the data collected from sensors often contain missing values due to faulty equipment or maintena… ▽ More Environmental monitoring is crucial to our understanding of climate change, biodiversity loss and pollution. The availability of large-scale spatio-temporal data from sources such as sensors and satellites allows us to develop sophisticated models for forecasting and understanding key drivers. However, the data collected from sensors often contain missing values due to faulty equipment or maintenance issues. The missing values rarely occur simultaneously leading to data that are multivariate misaligned sparse time series. We propose two models that are capable of performing multivariate spatio-temporal forecasting while handling missing data naturally without the need for imputation. The first model is a transformer-based model, which we name SERT (Spatio-temporal Encoder Representations from Transformers). The second is a simpler model named SST-ANN (Sparse Spatio-Temporal Artificial Neural Network) which is capable of providing interpretable results. We conduct extensive experiments on two different datasets for multivariate spatio-temporal forecasting and show that our models have competitive or superior performance to those at the state-of-the-art. △ Less

Submitted 9 June, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

Comments: 11 pages, 7 figures

arXiv:2303.04874 [pdf, other]

Bayesian Causal Forests for Multivariate Outcomes: Application to Irish Data From an International Large Scale Education Assessment

Authors: Nathan McJames, Andrew Parnell, Yong Chen Goh, Ann O'Shea

Abstract: Bayesian Causal Forests (BCF) is a causal inference machine learning model based on a highly flexible non-parametric regression and classification tool called Bayesian Additive Regression Trees (BART). Motivated by data from the Trends in International Mathematics and Science Study (TIMSS), which includes data on student achievement in both mathematics and science, we present a multivariate extens… ▽ More Bayesian Causal Forests (BCF) is a causal inference machine learning model based on a highly flexible non-parametric regression and classification tool called Bayesian Additive Regression Trees (BART). Motivated by data from the Trends in International Mathematics and Science Study (TIMSS), which includes data on student achievement in both mathematics and science, we present a multivariate extension of the BCF algorithm. With the help of simulation studies we show that our approach can accurately estimate causal effects for multiple outcomes subject to the same treatment. We also apply our model to Irish data from TIMSS 2019. Our findings reveal the positive effects of having access to a study desk at home (Mathematics ATE 95% CI: [0.20, 11.67]) while also highlighting the negative consequences of students often feeling hungry at school (Mathematics ATE 95% CI: [-11.15, -2.78] , Science ATE 95% CI: [-10.82,-1.72]) or often being absent (Mathematics ATE 95% CI: [-12.47, -1.55]). △ Less

Submitted 8 March, 2023; originally announced March 2023.

Comments: 26 pages, 6 figures

arXiv:2301.09556 [pdf, other]

A noisy-input generalised additive model for relative sea-level change along the Atlantic coast of North America

Authors: Maeve Upton, Andrew Parnell, Andrew Kemp, Erica Ashe, Gerard McCarthy, Niamh Cahill

Abstract: We propose a Bayesian, noisy-input, spatial-temporal generalised additive model to examine regional relative sea-level (RSL) changes over time. The model provides probabilistic estimates of component drivers of regional RSL change via the combination of a univariate spline capturing a common regional signal over time, random slopes and intercepts capturing site-specific (local), long-term linear t… ▽ More We propose a Bayesian, noisy-input, spatial-temporal generalised additive model to examine regional relative sea-level (RSL) changes over time. The model provides probabilistic estimates of component drivers of regional RSL change via the combination of a univariate spline capturing a common regional signal over time, random slopes and intercepts capturing site-specific (local), long-term linear trends and a spatial-temporal spline capturing residual, non-linear, local variations. Proxy and instrumental records of RSL and corresponding measurement errors inform the model and a noisy-input method accounts for proxy temporal uncertainties. Results focus on the decomposition of RSL over the past 3000 years along the Atlantic coast of North America. △ Less

Submitted 23 January, 2023; originally announced January 2023.

arXiv:2301.03655 [pdf, other]

Bayesian Additive Main Effects and Multiplicative Interaction Models using Tensor Regression for Multi-environmental Trials

Authors: Antonia A. L. Dos Santos, Danilo A. Sarti, Rafael A. Moral, Andrew C. Parnell

Abstract: We propose a Bayesian tensor regression model to accommodate the effect of multiple factors on phenotype prediction. We adopt a set of prior distributions that resolve identifiability issues that may arise between the parameters in the model. Simulation experiments show that our method out-performs previous related models and machine learning algorithms under different sample sizes and degrees of… ▽ More We propose a Bayesian tensor regression model to accommodate the effect of multiple factors on phenotype prediction. We adopt a set of prior distributions that resolve identifiability issues that may arise between the parameters in the model. Simulation experiments show that our method out-performs previous related models and machine learning algorithms under different sample sizes and degrees of complexity. We further explore the applicability of our model by analysing real-world data related to wheat production across Ireland from 2010 to 2019. Our model performs competitively and overcomes key limitations found in other analogous approaches. Finally, we adapt a set of visualisations for the posterior distribution of the tensor effects that facilitate the identification of optimal interactions between the tensor variables whilst accounting for the uncertainty in the posterior distribution. △ Less

Submitted 9 January, 2023; originally announced January 2023.

arXiv:2212.01457 [pdf, other]

NEAL: An open-source tool for audio annotation

Authors: Anthony Gibbons, Ian Donohue, Courtney E. Gorman, Emma King, Andrew Parnell

Abstract: Passive acoustic monitoring is used widely in ecology, biodiversity, and conservation studies. Data sets collected via acoustic monitoring are often extremely large and built to be processed automatically using Artificial Intelligence and Machine learning models, which aim to replicate the work of domain experts. These models, being supervised learning algorithms, need to be trained on high qualit… ▽ More Passive acoustic monitoring is used widely in ecology, biodiversity, and conservation studies. Data sets collected via acoustic monitoring are often extremely large and built to be processed automatically using Artificial Intelligence and Machine learning models, which aim to replicate the work of domain experts. These models, being supervised learning algorithms, need to be trained on high quality annotations produced by experts. Since the experts are often resource-limited, a cost-effective process for annotating audio is needed to get maximal use out of the data. We present an open-source interactive audio data annotation tool, NEAL (Nature+Energy Audio Labeller). Built using R and the associated Shiny framework, the tool provides a reactive environment where users can quickly annotate audio files and adjust settings that automatically change the corresponding elements of the user interface. The app has been designed with the goal of having both expert birders and citizen scientists contribute to acoustic annotation projects. The popularity and flexibility of R programming in bioacoustics means that the Shiny app can be modified for other bird labelling data sets, or even to generic audio labelling tasks. We demonstrate the app by labelling data collected from wind farm sites across Ireland. △ Less

Submitted 8 December, 2022; v1 submitted 2 December, 2022; originally announced December 2022.

arXiv:2210.11391 [pdf, other]

vivid: An R package for Variable Importance and Variable Interactions Displays for Machine Learning Models

Authors: Alan Inglis, Andrew Parnell, Catherine Hurley

Abstract: We present vivid, an R package for visualizing variable importance and variable interactions in machine learning models. The package provides a range of displays including heatmap and graph-based displays for viewing variable importance and interaction jointly and partial dependence plots in both a matrix layout and an alternative layout emphasizing important variable subsets. With the intention o… ▽ More We present vivid, an R package for visualizing variable importance and variable interactions in machine learning models. The package provides a range of displays including heatmap and graph-based displays for viewing variable importance and interaction jointly and partial dependence plots in both a matrix layout and an alternative layout emphasizing important variable subsets. With the intention of increasing a machine learning models' interpretability and making the work applicable to a wider readership, we discuss the design choices behind our implementation by focusing on the package structure and providing an in-depth look at the package functions and key features. We also provide a practical illustration of the software in use on a data set. △ Less

Submitted 23 June, 2023; v1 submitted 20 October, 2022; originally announced October 2022.

Comments: 15 pages, 7 figures

arXiv:2210.00847 [pdf, other]

Review of Clustering Methods for Functional Data

Authors: Mimi Zhang, Andrew Parnell

Abstract: Functional data clustering is to identify heterogeneous morphological patterns in the continuous functions underlying the discrete measurements/observations. Application of functional data clustering has appeared in many publications across various fields of sciences, including but not limited to biology, (bio)chemistry, engineering, environmental science, medical science, psychology, social scien… ▽ More Functional data clustering is to identify heterogeneous morphological patterns in the continuous functions underlying the discrete measurements/observations. Application of functional data clustering has appeared in many publications across various fields of sciences, including but not limited to biology, (bio)chemistry, engineering, environmental science, medical science, psychology, social science, etc. The phenomenal growth of the application of functional data clustering indicates the urgent need for a systematic approach to develop efficient clustering methods and scalable algorithmic implementations. On the other hand, there is abundant literature on the cluster analysis of time series, trajectory data, spatio-temporal data, etc., which are all related to functional data. Therefore, an overarching structure of existing functional data clustering methods will enable the cross-pollination of ideas across various research fields. We here conduct a comprehensive review of original clustering methods for functional data. We propose a systematic taxonomy that explores the connections and differences among the existing functional data clustering methods and relates them to the conventional multivariate clustering methods. The structure of the taxonomy is built on three main attributes of a functional data clustering method and therefore is more reliable than existing categorizations. The review aims to bridge the gap between the functional data analysis community and the clustering community and to generate new principles for functional data clustering. △ Less

Submitted 3 October, 2022; originally announced October 2022.

arXiv:2209.06880 [pdf, other]

Vector Time Series Modelling of Turbidity in Dublin Bay

Authors: Amin Shoari Nejad, Gerard D. McCarthy, Brian Kelleher, Anthony Grey, Andrew Parnell

Abstract: Turbidity is commonly monitored as an important water quality index. Human activities, such as dredging and dum** operations, can disrupt turbidity levels and should be monitored and analyzed for possible effects. In this paper, we model the variations of turbidity in Dublin Bay over space and time to investigate the effects of dum** and dredging while controlling for the effect of wind speed… ▽ More Turbidity is commonly monitored as an important water quality index. Human activities, such as dredging and dum** operations, can disrupt turbidity levels and should be monitored and analyzed for possible effects. In this paper, we model the variations of turbidity in Dublin Bay over space and time to investigate the effects of dum** and dredging while controlling for the effect of wind speed as a common atmospheric effect. We develop a novel Vector Auto-Regressive Conditional Heteroskedasticity (VARCH) approach to modelling the dynamical behaviour of turbidity over different locations and at different water depths. We use daily values of turbidity during the years 2017-2018 to fit the model. We show that the results of our fitted model are in line with the observed data and that the uncertainties, measured through Bayesian credible intervals, are well calibrated. Furthermore, we show that the daily effects of dredging and dum** on turbidity are negligible in comparison to that of wind speed. △ Less

Submitted 14 September, 2022; originally announced September 2022.

Comments: 11 pages, 9 figures

arXiv:2208.08966 [pdf, other]

Visualizations for Bayesian Additive Regression Trees

Authors: Alan Inglis, Andrew Parnell, Catherine Hurley

Abstract: Tree-based regression and classification has become a standard tool in modern data science. Bayesian Additive Regression Trees (BART) has in particular gained wide popularity due its flexibility in dealing with interactions and non-linear effects. BART is a Bayesian tree-based machine learning method that can be applied to both regression and classification problems and yields competitive or super… ▽ More Tree-based regression and classification has become a standard tool in modern data science. Bayesian Additive Regression Trees (BART) has in particular gained wide popularity due its flexibility in dealing with interactions and non-linear effects. BART is a Bayesian tree-based machine learning method that can be applied to both regression and classification problems and yields competitive or superior results when compared to other predictive models. As a Bayesian model, BART allows the practitioner to explore the uncertainty around predictions through the posterior distribution. In this paper, we present new visualization techniques for exploring BART models. We construct conventional plots to analyze a model's performance and stability as well as create new tree-based plots to analyze variable importance, interaction, and tree structure. We employ Value Suppressing Uncertainty Palettes (VSUP) to construct heatmaps that display variable importance and interactions jointly using color scale to represent posterior uncertainty. Our new visualizations are designed to work with the most popular BART R packages available, namely BART, dbarts, and bartMachine. Our approach is implemented in the R package bartMan (BART Model ANalysis). △ Less

Submitted 11 September, 2022; v1 submitted 18 August, 2022; originally announced August 2022.

Comments: 25 pages, 15 figures

arXiv:2207.00011 [pdf, other]

Variational Inference for Additive Main and Multiplicative Interaction Effects Models

Authors: AntÔnia A. L. Dos Santos, Rafael A. Moral, Danilo A. Sarti, Andrew C. Parnell

Abstract: In plant breeding the presence of a genotype by environment (GxE) interaction has a strong impact on cultivation decision making and the introduction of new crop cultivars. The combination of linear and bilinear terms has been shown to be very useful in modelling this type of data. A widely-used approach to identify GxE is the Additive Main Effects and Multiplicative Interaction Effects (AMMI) mod… ▽ More In plant breeding the presence of a genotype by environment (GxE) interaction has a strong impact on cultivation decision making and the introduction of new crop cultivars. The combination of linear and bilinear terms has been shown to be very useful in modelling this type of data. A widely-used approach to identify GxE is the Additive Main Effects and Multiplicative Interaction Effects (AMMI) model. However, as data frequently can be high-dimensional, Markov chain Monte Carlo (MCMC) approaches can be computationally infeasible. In this article, we consider a variational inference approach for such a model. We derive variational approximations for estimating the parameters and we compare the approximations to MCMC using both simulated and real data. The new inferential framework we propose is on average two times faster whilst maintaining the same predictive performance as MCMC. △ Less

Submitted 29 June, 2022; originally announced July 2022.

arXiv:2204.07207 [pdf, other]

Hierarchical Embedded Bayesian Additive Regression Trees

Authors: Bruna Wundervald, Andrew Parnell, Katarina Domijan

Abstract: We propose a simple yet powerful extension of Bayesian Additive Regression Trees which we name Hierarchical Embedded BART (HE-BART). The model allows for random effects to be included at the terminal node level of a set of regression trees, making HE-BART a non-parametric alternative to mixed effects models which avoids the need for the user to specify the structure of the random effects in the mo… ▽ More We propose a simple yet powerful extension of Bayesian Additive Regression Trees which we name Hierarchical Embedded BART (HE-BART). The model allows for random effects to be included at the terminal node level of a set of regression trees, making HE-BART a non-parametric alternative to mixed effects models which avoids the need for the user to specify the structure of the random effects in the model, whilst maintaining the prediction and uncertainty calibration properties of standard BART. Using simulated and real-world examples, we demonstrate that this new extension yields superior predictions for many of the standard mixed effects models' example data sets, and yet still provides consistent estimates of the random effect variances. In a future version of this paper, we outline its use in larger, more advanced data sets and structures. △ Less

Submitted 24 April, 2023; v1 submitted 14 April, 2022; originally announced April 2022.

arXiv:2204.02112 [pdf, other]

GP-BART: a novel Bayesian additive regression trees approach using Gaussian processes

Authors: Mateus Maia, Keefe Murphy, Andrew C. Parnell

Abstract: The Bayesian additive regression trees (BART) model is an ensemble method extensively and successfully used in regression tasks due to its consistently strong predictive performance and its ability to quantify uncertainty. BART combines "weak" tree models through a set of shrinkage priors, whereby each tree explains a small portion of the variability in the data. However, the lack of smoothness an… ▽ More The Bayesian additive regression trees (BART) model is an ensemble method extensively and successfully used in regression tasks due to its consistently strong predictive performance and its ability to quantify uncertainty. BART combines "weak" tree models through a set of shrinkage priors, whereby each tree explains a small portion of the variability in the data. However, the lack of smoothness and the absence of an explicit covariance structure over the observations in standard BART can yield poor performance in cases where such assumptions would be necessary. The Gaussian processes Bayesian additive regression trees (GP-BART) model is an extension of BART which addresses this limitation by assuming Gaussian process (GP) priors for the predictions of each terminal node among all trees. The model's effectiveness is demonstrated through applications to simulated and real-world data, surpassing the performance of traditional modeling approaches in various scenarios. △ Less

Submitted 14 September, 2023; v1 submitted 5 April, 2022; originally announced April 2022.

arXiv:2202.09383 [pdf, other]

A Bayesian Hierarchical Time Series Model for Reconstructing Hydroclimate from Multiple Proxies

Authors: Niamh Cahill, Jacky Croke, Micheline Campbell, Kate Hughes, John Vitkovsky, Jack Eaton Kilgallen, Andrew Parnell

Abstract: We propose a Bayesian hierarchical model which produces probabilistic reconstructions of hydroclimatic variability in Queensland Australia. The model provides a standardised approach to hydroclimate reconstruction using multiple palaeoclimate proxy records derived from natural archives such as speleothems, ice cores and tree rings. The method combines time-series modelling with inverse prediction… ▽ More We propose a Bayesian hierarchical model which produces probabilistic reconstructions of hydroclimatic variability in Queensland Australia. The model provides a standardised approach to hydroclimate reconstruction using multiple palaeoclimate proxy records derived from natural archives such as speleothems, ice cores and tree rings. The method combines time-series modelling with inverse prediction to quantify the relationships between a given hydroclimate index and relevant proxies over an instrumental period and subsequently reconstruct the hydroclimate back through time. We present case studies for Brisbane and Fitzroy catchments focusing on two hydroclimate indices, the Rainfall Index (RFI) and the Standardised Precipitation-Evapotranspiration Index (SPEI). The probabilistic nature of the reconstructions allows us to estimate the probability that a hydroclimate index in any reconstruction year was lower (higher) than the minimum (maximum) value observed over the instrumental period. In Brisbane, the RFI is unlikely (probabilities < 20%) to have exhibited extremes beyond the minimum/maximum values observed between 1889 and 2017. However, in Fitzroy there are several years during the reconstruction period where the RFI is likely (> 50% probability) to have exhibited behaviour beyond the minimum/maximum of what has been observed. For SPEI, the probability of observing such extremes since the end of the instrumental period in 1889 doesn't exceed 50% in any reconstruction year in Brisbane or Fitzroy. △ Less

Submitted 8 August, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

Comments: 20 pages, 10 figures

MSC Class: 62P12

arXiv:2111.08616 [pdf, other]

Inference for extreme spatial temperature events in a changing climate with application to Ireland

Authors: Dáire Healy, Jonathan Tawn, Peter Thorne, Andrew Parnell

Abstract: We investigate the changing nature of the frequency, magnitude and spatial extent of extreme temperatures in Ireland from 1931 to 2022. We develop an extreme value model that captures spatial and temporal non-stationarity in extreme daily maximum temperature data. We model the tails of the marginal variables using the generalised Pareto distribution and the spatial dependence of extreme events by… ▽ More We investigate the changing nature of the frequency, magnitude and spatial extent of extreme temperatures in Ireland from 1931 to 2022. We develop an extreme value model that captures spatial and temporal non-stationarity in extreme daily maximum temperature data. We model the tails of the marginal variables using the generalised Pareto distribution and the spatial dependence of extreme events by a semi-parametric Brown-Resnick r-generalised Pareto process, with parameters of each model allowed to change over time. We use weather station observations for modelling extreme events since data from climate models (not conditioned on observational data) can over-smooth these events and have trends determined by the specific climate model configuration. However, climate models do provide valuable information about the detailed physiography over Ireland and the associated climate response. We propose novel methods which exploit the climate model data to overcome issues linked to the sparse and biased sampling of the observations. Our analysis identifies a temporal change in the marginal behaviour of extreme temperature events over the study domain, which is much larger than the change in mean temperature levels over this time window. We illustrate how these characteristics result in increased spatial coverage of the events that exceed critical temperatures. △ Less

Submitted 31 March, 2023; v1 submitted 16 November, 2021; originally announced November 2021.

arXiv:2109.14966 [pdf, other]

Bayesian Multi-Species N-Mixture Models for Unmarked Animal Communities

Authors: Niamh Mimnagh, Andrew Parnell, Estevao Prado, Rafael de Andrade Moral

Abstract: We propose an extension of the N-mixture model which allows for the estimation of both abundances of multiple species simultaneously and their inter-species correlations. We also propose further extensions to this multi-species N-mixture model, one of which permits us to examine data which has an excess of zero counts, and another which allows us to relax the assumption of closure inherent in N-mi… ▽ More We propose an extension of the N-mixture model which allows for the estimation of both abundances of multiple species simultaneously and their inter-species correlations. We also propose further extensions to this multi-species N-mixture model, one of which permits us to examine data which has an excess of zero counts, and another which allows us to relax the assumption of closure inherent in N-mixture models through the incorporation of an AR term in the abundance. The inclusion of a multivariate normal distribution as prior on the random effect in the abundance facilitates the estimation of a matrix of interspecies correlations. Each model is also fitted to avian point data collected as part of the NABBS 2010-2019. Results of simulation studies reveal that these models produce accurate estimates of abundance, inter-species correlations and detection probabilities at both small and large sample sizes, in scenarios with small, large and no zero inflation. Results of model-fitting to the North American Breeding Bird Survey data reveal an increase in Bald Eagle population size in southeastern Alaska in the decade examined.Our novel multi-species N-mixture model accounts for full communities, allowing us to examine abundances of every species present in a study area and, as these species do not exist in a vacuum, allowing us to estimate correlations between species' abundances.While previous multi-species abundance models have allowed for the estimation of abundance and detection probability, ours is the first to address the estimation of both positive and negative inter-species correlations, which allows us to begin to make inferences as to the effect that these species' abundances have on one another. Our modelling approach provides a method of quantifying the strength of association between species' population sizes, and is of practical use to population and conservation ecologists. △ Less

Submitted 15 August, 2022; v1 submitted 30 September, 2021; originally announced September 2021.

Comments: 60 pages, 4 figures

arXiv:2108.07636 [pdf, other]

Accounting for shared covariates in semi-parametric Bayesian additive regression trees

Authors: Estevão B. Prado, Andrew C. Parnell, Keefe Murphy, Nathan McJames, Ann O'Shea, Rafael A. Moral

Abstract: We propose some extensions to semi-parametric models based on Bayesian additive regression trees (BART). In the semi-parametric BART paradigm, the response variable is approximated by a linear predictor and a BART model, where the linear component is responsible for estimating the main effects and BART accounts for non-specified interactions and non-linearities. Previous semi-parametric models bas… ▽ More We propose some extensions to semi-parametric models based on Bayesian additive regression trees (BART). In the semi-parametric BART paradigm, the response variable is approximated by a linear predictor and a BART model, where the linear component is responsible for estimating the main effects and BART accounts for non-specified interactions and non-linearities. Previous semi-parametric models based on BART have assumed that the set of covariates in the linear predictor and the BART model are mutually exclusive in an attempt to avoid poor coverage properties and reduce bias in the estimates of the parameters in the linear predictor. The main novelty in our approach lies in the way we change the tree-generation moves in BART to deal with this bias and resolve non-identifiability issues between the parametric and non-parametric components, even when they have covariates in common. This allows us to model complex interactions involving the covariates of primary interest, both among themselves and with those in the BART component. Our novel method is developed with a view to analysing data from an international education assessment, where certain predictors of students' achievements in mathematics are of particular interpretational interest. Through additional simulation studies and another application to a well-known benchmark dataset, we also show competitive performance when compared to regression models, alternative formulations of semi-parametric BART, and other tree-based methods. The implementation of the proposed method is available at \url{https://github.com/ebprado/CSP-BART}. △ Less

Submitted 3 June, 2022; v1 submitted 17 August, 2021; originally announced August 2021.

arXiv:2108.04310 [pdf, other]

Visualizing Variable Importance and Variable Interaction Effects in Machine Learning Models

Authors: Alan Inglis, Andrew Parnell, Catherine Hurley

Abstract: Variable importance, interaction measures, and partial dependence plots are important summaries in the interpretation of statistical and machine learning models. In this paper we describe new visualization techniques for exploring these model summaries. We construct heatmap and graph-based displays showing variable importance and interaction jointly, which are carefully designed to highlight impor… ▽ More Variable importance, interaction measures, and partial dependence plots are important summaries in the interpretation of statistical and machine learning models. In this paper we describe new visualization techniques for exploring these model summaries. We construct heatmap and graph-based displays showing variable importance and interaction jointly, which are carefully designed to highlight important aspects of the fit. We describe a new matrix-type layout showing all single and bivariate partial dependence plots, and an alternative layout based on graph Eulerians focusing on key subsets. Our new visualizations are model-agnostic and are applicable to regression and classification supervised learning settings. They enhance interpretation even in situations where the number of variables is large. Our R package vivid (variable importance and variable interaction displays) provides an implementation. △ Less

Submitted 11 October, 2021; v1 submitted 9 August, 2021; originally announced August 2021.

Comments: 31 pages, 10 figures

arXiv:2107.12809 [pdf, other]

Bayesian Optimisation for Sequential Experimental Design with Applications in Additive Manufacturing

Authors: Mimi Zhang, Andrew Parnell, Dermot Brabazon, Alessio Benavoli

Abstract: Bayesian optimization (BO) is an approach to globally optimizing black-box objective functions that are expensive to evaluate. BO-powered experimental design has found wide application in materials science, chemistry, experimental physics, drug development, etc. This work aims to bring attention to the benefits of applying BO in designing experiments and to provide a BO manual, covering both metho… ▽ More Bayesian optimization (BO) is an approach to globally optimizing black-box objective functions that are expensive to evaluate. BO-powered experimental design has found wide application in materials science, chemistry, experimental physics, drug development, etc. This work aims to bring attention to the benefits of applying BO in designing experiments and to provide a BO manual, covering both methodology and software, for the convenience of anyone who wants to apply or learn BO. In particular, we briefly explain the BO technique, review all the applications of BO in additive manufacturing, compare and exemplify the features of different open BO libraries, unlock new potential applications of BO to other types of data (e.g., preferential output). This article is aimed at readers with some understanding of Bayesian methods, but not necessarily with knowledge of additive manufacturing; the software performance overview and implementation instructions are instrumental for any experimental-design practitioner. Moreover, our review in the field of additive manufacturing highlights the current knowledge and technological trends of BO. This article has a supplementary material online. △ Less

Submitted 8 October, 2023; v1 submitted 27 July, 2021; originally announced July 2021.

arXiv:2104.15088 [pdf, other]

Optimal age-specific vaccination control for COVID-19: an Irish case study

Authors: Eleni Zavrakli, Andrew Parnell, David Malone, Ken Duffy, Subhrakanti Dey

Abstract: The outbreak of a novel coronavirus causing severe acute respiratory syndrome in December 2019 has escalated into a worldwide pandemic. In this work, we propose a compartmental model to describe the dynamics of transmission of infection and use it to obtain the optimal vaccination control. The model accounts for the various stages of the vaccination and the optimisation is focused on minimising th… ▽ More The outbreak of a novel coronavirus causing severe acute respiratory syndrome in December 2019 has escalated into a worldwide pandemic. In this work, we propose a compartmental model to describe the dynamics of transmission of infection and use it to obtain the optimal vaccination control. The model accounts for the various stages of the vaccination and the optimisation is focused on minimising the infections to protect the population and relieve the healthcare system. As a case study we selected the Republic of Ireland. We use data provided by Ireland's COVID-19 Data-Hub and simulate the evolution of the pandemic with and without the vaccination in place for two different scenarios, one representative of a national lockdown situation and the other indicating looser restrictions in place. One of the main findings of our work is that the optimal approach would involve a vaccination programme where the older population is vaccinated in larger numbers earlier while simultaneously part of the younger population also gets vaccinated to lower the risk of transmission between groups. We compare our simulated results with that of the vaccination policy taken by the Irish government to explore the advantages of our optimisation method. Our comparison suggests that a similar reduction in cases may have been possible even with a reduced set of vaccinations being available for use. △ Less

Submitted 24 October, 2022; v1 submitted 30 April, 2021; originally announced April 2021.

MSC Class: 34H05; 49L12

arXiv:2102.03880 [pdf, other]

doi 10.1103/PhysRevLett.126.193902

X-ray Ptychography with a Laboratory Source

Authors: Darren J. Batey, Frederic Van Assche, Sander Vanheule, Matthieu N. Boone, Andrew J. Parnell, Oleksandr O. Mykhaylyk, Christoph Rau, Silvia Cipiccia

Abstract: X-ray ptychography has revolutionised nanoscale phase contrast imaging at large-scale synchrotron sources in recent years. We present here the first successful demonstration of the technique in a small-scale laboratory setting. We conducted an experiment with a liquid metal-jet X-ray source and a single photon-counting detector with a high spectral resolution. The experiment used a spot size of 5… ▽ More X-ray ptychography has revolutionised nanoscale phase contrast imaging at large-scale synchrotron sources in recent years. We present here the first successful demonstration of the technique in a small-scale laboratory setting. We conducted an experiment with a liquid metal-jet X-ray source and a single photon-counting detector with a high spectral resolution. The experiment used a spot size of 5 microns to produce a ptychographic phase image of a Siemens star test pattern with a sub-micron spatial resolution. The result and methodology presented show how high-resolution phase contrast imaging can now be performed at small-scale laboratory sources worldwide. △ Less

Submitted 10 February, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

Comments: 14 pages, 4 figures

Journal ref: Phys. Rev. Lett. 126, 193902 (2021)

arXiv:2007.04177 [pdf, other]

Modelling excess zeros in count data: A new perspective on modelling approaches

Authors: John Haslett, Andrew C. Parnell, John Hinde, Rafael A. Moral

Abstract: We consider the analysis of count data in which the observed frequency of zero counts is unusually large, typically with respect to the Poisson distribution. We focus on two alternative modelling approaches: Over-Dispersion (OD) models, and Zero-Inflation (ZI) models, both of which can be seen as generalisations of the Poisson distribution; we refer to these as Implicit and Explicit ZI models, res… ▽ More We consider the analysis of count data in which the observed frequency of zero counts is unusually large, typically with respect to the Poisson distribution. We focus on two alternative modelling approaches: Over-Dispersion (OD) models, and Zero-Inflation (ZI) models, both of which can be seen as generalisations of the Poisson distribution; we refer to these as Implicit and Explicit ZI models, respectively. Although sometimes seen as competing approaches, they can be complementary; OD is a consequence of ZI modelling, and ZI is a by-product of OD modelling. The central objective in such analyses is often concerned with inference on the effect of covariates on the mean, in light of the apparent excess of zeros in the counts. Typically the modelling of the excess zeros per se is a secondary objective and there are choices to be made between, and within, the OD and ZI approaches. The contribution of this paper is primarily conceptual. We contrast, descriptively, the impact on zeros of the two approaches. We further offer a novel descriptive characterisation of alternative ZI models, including the classic hurdle and mixture models, by providing a unifying theoretical framework for their comparison. This in turn leads to a novel and technically simpler ZI model. We develop the underlying theory for univariate counts and touch on its implication for multivariate count data. △ Less

Submitted 29 July, 2021; v1 submitted 8 July, 2020; originally announced July 2020.

Comments: 41 pages, 3 figures, 1 table

arXiv:2006.07515 [pdf, other]

Generalizing Gain Penalization for Feature Selection in Tree-based Models

Authors: Bruna Wundervald, Andrew Parnell, Katarina Domijan

Abstract: We develop a new approach for feature selection via gain penalization in tree-based models. First, we show that previous methods do not perform sufficient regularization and often exhibit sub-optimal out-of-sample performance, especially when correlated features are present. Instead, we develop a new gain penalization idea that exhibits a general local-global regularization for tree-based models.… ▽ More We develop a new approach for feature selection via gain penalization in tree-based models. First, we show that previous methods do not perform sufficient regularization and often exhibit sub-optimal out-of-sample performance, especially when correlated features are present. Instead, we develop a new gain penalization idea that exhibits a general local-global regularization for tree-based models. The new method allows for more flexibility in the choice of feature-specific importance weights. We validate our method on both simulated and real data and implement itas an extension of the popular R package ranger. △ Less

Submitted 12 June, 2020; originally announced June 2020.

Comments: 13 pages, 2 figures

arXiv:2006.07493 [pdf, other]

doi 10.1007/s11222-021-09997-3

Bayesian Additive Regression Trees with Model Trees

Authors: Estevão B. Prado, Rafael A. Moral, Andrew C. Parnell

Abstract: Bayesian Additive Regression Trees (BART) is a tree-based machine learning method that has been successfully applied to regression and classification problems. BART assumes regularisation priors on a set of trees that work as weak learners and is very flexible for predicting in the presence of non-linearity and high-order interactions. In this paper, we introduce an extension of BART, called Model… ▽ More Bayesian Additive Regression Trees (BART) is a tree-based machine learning method that has been successfully applied to regression and classification problems. BART assumes regularisation priors on a set of trees that work as weak learners and is very flexible for predicting in the presence of non-linearity and high-order interactions. In this paper, we introduce an extension of BART, called Model Trees BART (MOTR-BART), that considers piecewise linear functions at node levels instead of piecewise constants. In MOTR-BART, rather than having a unique value at node level for the prediction, a linear predictor is estimated considering the covariates that have been used as the split variables in the corresponding tree. In our approach, local linearities are captured more efficiently and fewer trees are required to achieve equal or better performance than BART. Via simulation studies and real data applications, we compare MOTR-BART to its main competitors. R code for MOTR-BART implementation is available at https://github.com/ebprado/MOTR-BART. △ Less

Submitted 10 March, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

Journal ref: Statistics and Computing 31, 20 (2021)

arXiv:1911.05376 [pdf, other]

Real-Time Anomaly Detection for Advanced Manufacturing: Improving on Twitter's State of the Art

Authors: Caitríona M. Ryan, Andrew Parnell, Catherine Mahoney

Abstract: The detection of anomalies in real time is paramount to maintain performance and efficiency across a wide range of applications including web services and smart manufacturing. This paper presents a novel algorithm to detect anomalies in streaming time series data via statistical learning. We adapt the generalised extreme studentised deviate test [1] to streaming data by using a sliding window appr… ▽ More The detection of anomalies in real time is paramount to maintain performance and efficiency across a wide range of applications including web services and smart manufacturing. This paper presents a novel algorithm to detect anomalies in streaming time series data via statistical learning. We adapt the generalised extreme studentised deviate test [1] to streaming data by using a sliding window approach. This is made computationally feasible by recursive updates of the Grubbs test statistic [2]. Moreover, a priority queue [3] is employed to reduce memory requirements, where subsets of the required data streaming window are maintained in the algorithm rather than the full list. Our method is statistically principled. It is suitable for streaming data and it outperforms the AnomalyDetection software package, recently released by Twitter Inc. (Twitter) [4] and used by multiple teams at Twitter as their state of the art on a daily basis [5]. The methodology is demonstrated using an example of unlabelled data from the Twitter AnomalyDetection GitHub repository and using a real manufacturing example with labelled anomalies. △ Less

Submitted 21 July, 2020; v1 submitted 13 November, 2019; originally announced November 2019.

arXiv:1906.06744 [pdf, other]

Bayesian spatial extreme value analysis of maximum temperatures in County Dublin, Ireland

Authors: John O'Sullivan, Conor Sweeney, Andrew C. Parnell

Abstract: In this study, we begin a comprehensive characterisation of temperature extremes in Ireland for the period 1981-2010. We produce return levels of anomalies of daily maximum temperature extremes for an area over Ireland, for the 30-year period 1981-2010. We employ extreme value theory (EVT) to model the data using the generalised Pareto distribution (GPD) as part of a three-level Bayesian hierarchi… ▽ More In this study, we begin a comprehensive characterisation of temperature extremes in Ireland for the period 1981-2010. We produce return levels of anomalies of daily maximum temperature extremes for an area over Ireland, for the 30-year period 1981-2010. We employ extreme value theory (EVT) to model the data using the generalised Pareto distribution (GPD) as part of a three-level Bayesian hierarchical model. We use predictive processes in order to solve the computationally difficult problem of modelling data over a very dense spatial field. To our knowledge, this is the first study to combine predictive processes and EVT in this manner. The model is fit using Markov chain Monte Carlo (MCMC) algorithms. Posterior parameter estimates and return level surfaces are produced, in addition to specific site analysis at synoptic stations, including Casement Aerodrome and Dublin Airport. Observational data from the period 2011-2018 is included in this site analysis to determine if there is evidence of a change in the observed extremes. An increase in the frequency of extreme anomalies, but not the severity, is observed for this period. We found that the frequency of observed extreme anomalies from 2011-2018 at the Casement Aerodrome and Phoenix Park synoptic stations exceed the upper bounds of the credible intervals from the model by 20% and 7% respectively. △ Less

Submitted 16 June, 2019; originally announced June 2019.

arXiv:1812.09178 [pdf]

An Evaluation of Methods for Real-Time Anomaly Detection using Force Measurements from the Turning Process

Authors: Yuanzhi Huang, Eamonn Ahearne, Szymon Baron, Andrew Parnell

Abstract: We examined the use of three conventional anomaly detection methods and assess their potential for on-line tool wear monitoring. Through efficient data processing and transformation of the algorithm proposed here, in a real-time environment, these methods were tested for fast evaluation of cutting tools on CNC machines. The three-dimensional force data streams we used were extracted from a turning… ▽ More We examined the use of three conventional anomaly detection methods and assess their potential for on-line tool wear monitoring. Through efficient data processing and transformation of the algorithm proposed here, in a real-time environment, these methods were tested for fast evaluation of cutting tools on CNC machines. The three-dimensional force data streams we used were extracted from a turning experiment of 21 runs for which a tool was run until it generally satisfied an end-of-life criterion. Our real-time anomaly detection algorithm was scored and optimised according to how precisely it can predict the progressive wear of the tool flank. Most of our tool wear predictions were accurate and reliable as illustrated in our off-line simulation results. Particularly when the multivariate analysis was applied, the algorithm we develop was found to be very robust across different scenarios and against parameter changes. It shall be reasonably easy to apply our approach elsewhere for real-time tool wear analytics. △ Less

Submitted 20 December, 2018; originally announced December 2018.

MSC Class: 60G35; 62P30; 68T10 ACM Class: G.3; I.5.0; I.6.0

arXiv:1810.10488 [pdf, other]

Statistical modeling of rates and trends in Holocene relative sea level

Authors: Erica L. Ashe, Niamh Cahill, Carling Hay, Nicole S. Khan, Andrew Kemp, Simon Engelhart, Benjamin P. Horton, Andrew Parnell, Robert E. Kopp

Abstract: Characterizing the spatio-temporal variability of relative sea level (RSL) and estimating local, regional, and global RSL trends requires statistical analysis of RSL data. Formal statistical treatments, needed to account for the spatially and temporally sparse distribution of data and for geochronological and elevational uncertainties, have advanced considerably over the last decade. Time-series m… ▽ More Characterizing the spatio-temporal variability of relative sea level (RSL) and estimating local, regional, and global RSL trends requires statistical analysis of RSL data. Formal statistical treatments, needed to account for the spatially and temporally sparse distribution of data and for geochronological and elevational uncertainties, have advanced considerably over the last decade. Time-series models have adopted more flexible and physically-informed specifications with more rigorous quantification of uncertainties. Spatio-temporal models have evolved from simple regional averaging to frameworks that more richly represent the correlation structure of RSL across space and time. More complex statistical approaches enable rigorous quantification of spatial and temporal variability, the combination of geographically disparate data, and the separation of the RSL field into various components associated with different driving processes. We review the range of statistical modeling and analysis choices used in the literature, reformulating them for ease of comparison in a common hierarchical statistical framework. The hierarchical framework separates each model into different levels, clearly partitioning measurement and inferential uncertainty from process variability. Placing models in a hierarchical framework enables us to highlight both the similarities and differences among modeling and analysis choices. We illustrate the implications of some modeling and analysis choices currently used in the literature by comparing the results of their application to common datasets within a hierarchical framework. In light of the complex patterns of spatial and temporal variability exhibited by RSL, we recommend non-parametric approaches for modeling temporal and spatio-temporal RSL. △ Less

Submitted 24 October, 2018; originally announced October 2018.

Comments: 30 pages, 7 figures

arXiv:1805.00555 [pdf, other]

A general framework for modelling zero inflation

Authors: John Haslett, Andrew Parnell, James Sweeney

Abstract: We propose a new framework for the modelling of count data exhibiting zero inflation (ZI). The main part of this framework includes a new and more general parameterisation for ZI models which naturally includes both over- and under-inflation. It further sheds new theoretical light on modelling and inference and permits a simpler alternative, which we term as multiplicative, in contrast to the domi… ▽ More We propose a new framework for the modelling of count data exhibiting zero inflation (ZI). The main part of this framework includes a new and more general parameterisation for ZI models which naturally includes both over- and under-inflation. It further sheds new theoretical light on modelling and inference and permits a simpler alternative, which we term as multiplicative, in contrast to the dominant mixture and hurdle models. Our approach gives the statistician access to new types of ZI of which mixture and hurdle are special cases. We outline a simple parameterised modelling approach which can help to infer both ZI type and degree and provide an underlying treatment that shows that current ZI models are themselves typically within the exponential family, thus permitting much simpler theory, computation and classical inference. We outline some possibilities for a natural Bayesian framework for inference; and a rich basis for work on correlated ZI counts. The present paper is an incomplete report on the underlying theory. A later version will include computational issues and provide further examples. △ Less

Submitted 1 May, 2018; originally announced May 2018.

Comments: 24 pages, 3 figures

arXiv:1612.05735 [pdf, other]

Contrasting Prediction Methods for Early Warning Systems at Undergraduate Level

Authors: Emma Howard, Maria Meehan, Andrew Parnell

Abstract: In this study, we investigate prediction methods for an early warning system for a large STEM undergraduate course. Recent studies have provided evidence in favour of adopting early warning systems as a means of identifying at-risk students. Many of these early warning systems rely on data from students' engagement with Learning Management Systems (LMSs). Our study examines eight prediction method… ▽ More In this study, we investigate prediction methods for an early warning system for a large STEM undergraduate course. Recent studies have provided evidence in favour of adopting early warning systems as a means of identifying at-risk students. Many of these early warning systems rely on data from students' engagement with Learning Management Systems (LMSs). Our study examines eight prediction methods, and investigates the optimal time in a course to apply an early warning system. We present findings from a statistics university course which has a large proportion of resources on the LMS Blackboard and weekly continuous assessment. We identify weeks 5-6 of our course (half way through the semester) as an optimal time to implement an early warning system, as it allows time for the students to make changes to their study patterns whilst retaining reasonable prediction accuracy. Using detailed (fine-grained) variables, clustering and our final prediction method of BART (Bayesian Additive Regressive Trees) we are able to predict students' final grade by week 6 based on mean absolute error (MAE) to 6.5 percentage points. We provide our R code for implementation of the prediction methods used in a GitHub repository. △ Less

Submitted 20 August, 2017; v1 submitted 17 December, 2016; originally announced December 2016.

arXiv:1608.03091 [pdf, other]

Prediction of tool-wear in turning of medical grade cobalt chromium molybdenum alloy (ASTM F75) using non-parametric Bayesian models

Authors: Damien McParland, Szymon Baron, Sarah O'Rourke, Denis Dowling, Eamonn Ahearne, Andrew Parnell

Abstract: We present a novel approach to estimating the effect of control parameters on tool wear rates and related changes in the three force components in turning of medical grade Co-Cr-Mo (ASTM F75) alloy. Co-Cr-Mo is known to be a difficult to cut material which, due to a combination of mechanical and physical properties, is used for the critical structural components of implantable medical prosthetics.… ▽ More We present a novel approach to estimating the effect of control parameters on tool wear rates and related changes in the three force components in turning of medical grade Co-Cr-Mo (ASTM F75) alloy. Co-Cr-Mo is known to be a difficult to cut material which, due to a combination of mechanical and physical properties, is used for the critical structural components of implantable medical prosthetics. We run a designed experiment which enables us to estimate tool wear from feed rate and cutting speed, and constrain them using a Bayesian hierarchical Gaussian Process model which enables prediction of tool wear rates for untried experimental settings. The predicted tool wear rates are non-linear and, using our models, we can identify experimental settings which optimise the life of the tool. This approach has potential in the future for realtime application of data analytics to machining processes. △ Less

Submitted 10 August, 2016; originally announced August 2016.

arXiv:1512.09163 [pdf]

doi 10.1364/OE.24.011808

Dynamic lens and monovision 3D displays to improve viewer comfort

Authors: Paul V. Johnson, Jared A. Q. Parnell, Joowan Kim, Christopher D. Saunter, Gordon D. Love, Martin S. Banks

Abstract: Stereoscopic 3D (S3D) displays provide an additional sense of depth compared to non-stereoscopic displays by sending slightly different images to the two eyes. But conventional S3D displays do not reproduce all natural depth cues. In particular, focus cues are incorrect causing mismatches between accommodation and vergence: The eyes must accommodate to the display screen to create sharp retinal im… ▽ More Stereoscopic 3D (S3D) displays provide an additional sense of depth compared to non-stereoscopic displays by sending slightly different images to the two eyes. But conventional S3D displays do not reproduce all natural depth cues. In particular, focus cues are incorrect causing mismatches between accommodation and vergence: The eyes must accommodate to the display screen to create sharp retinal images even when binocular disparity drives the eyes to converge to other distances. This mismatch causes visual discomfort and reduces visual performance. We propose and assess two new techniques that are designed to reduce the vergence-accommodation conflict and thereby decrease discomfort and increase visual performance. These techniques are much simpler to implement than previous conflict-reducing techniques. △ Less

Submitted 30 December, 2015; originally announced December 2015.

arXiv:1508.02010 [pdf, other]

doi 10.5194/cp-12-525-2016

A Bayesian Hierarchical Model for Reconstructing Sea Levels: From Raw Data to Rates of Change

Authors: Niamh Cahill, Andrew C. Kemp, Benjamin P. Horton, Andrew C. Parnell

Abstract: We present a holistic Bayesian hierarchical model for reconstructing the continuous and dynamic evolution of relative sea-level (RSL) change with fully quantified uncertainty. The reconstruction is produced from biological (foraminifera) and geochemical (δ13C) sea-level indicators preserved in dated cores of salt-marsh sediment. Our model is comprised of three modules: (1) A Bayesian transfer func… ▽ More We present a holistic Bayesian hierarchical model for reconstructing the continuous and dynamic evolution of relative sea-level (RSL) change with fully quantified uncertainty. The reconstruction is produced from biological (foraminifera) and geochemical (δ13C) sea-level indicators preserved in dated cores of salt-marsh sediment. Our model is comprised of three modules: (1) A Bayesian transfer function for the calibration of foraminifera into tidal elevation, which is flexible enough to formally accommodate additional proxies (in this case bulk-sediment δ13C values); (2) A chronology developed from an existing Bchron age-depth model, and (3) An existing errors-in-variables integrated Gaussian process (EIV-IGP) model for estimating rates of sea-level change. We illustrate our approach using a case study of Common Era sea-level variability from New Jersey, U.S.A. We develop a new Bayesian transfer function (B-TF), with and without the δ13C proxy and compare our results to those from a widely-used weighted-averaging transfer function (WA-TF). The formal incorporation of a second proxy into the B-TF model results in smaller vertical uncertainties and improved accuracy for reconstructed RSL. The vertical uncertainty from the multi-proxy B-TF is ~28% smaller on average compared to the WA-TF. When evaluated against historic tide-gauge measurements, the multi-proxy B-TF most accurately reconstructs the RSL changes observed in the instrumental record (MSE = 0.003). The holistic model provides a single, unifying framework for reconstructing and analysing sea level through time. This approach is suitable for reconstructing other paleoenvironmental variables using biological proxies. △ Less

Submitted 9 August, 2015; originally announced August 2015.

Comments: 27 pages, 7 figures

arXiv:1507.00181 [pdf, other]

Bayesian Additive Regression Trees using Bayesian Model Averaging

Authors: Belinda Hernández, Adrian E. Raftery, Stephen R. Pennington, Andrew C. Parnell

Abstract: Bayesian Additive Regression Trees (BART) is a statistical sum of trees model. It can be considered a Bayesian version of machine learning tree ensemble methods where the individual trees are the base learners. However for data sets where the number of variables $p$ is large (e.g. $p>5,000$) the algorithm can become prohibitively expensive, computationally. Another method which is popular for hi… ▽ More Bayesian Additive Regression Trees (BART) is a statistical sum of trees model. It can be considered a Bayesian version of machine learning tree ensemble methods where the individual trees are the base learners. However for data sets where the number of variables $p$ is large (e.g. $p>5,000$) the algorithm can become prohibitively expensive, computationally. Another method which is popular for high dimensional data is random forests, a machine learning algorithm which grows trees using a greedy search for the best split points. However, as it is not a statistical model, it cannot produce probabilistic estimates or predictions. We propose an alternative algorithm for BART called BART-BMA, which uses Bayesian Model Averaging and a greedy search algorithm to produce a model which is much more efficient than BART for datasets with large $p$. BART-BMA incorporates elements of both BART and random forests to offer a model-based algorithm which can deal with high-dimensional data. We have found that BART-BMA can be run in a reasonable time on a standard laptop for the "small $n$ large $p$" scenario which is common in many areas of bioinformatics. We showcase this method using simulated data and data from two real proteomic experiments; one to distinguish between patients with cardiovascular disease and controls and another to classify agressive from non-agressive prostate cancer. We compare our results to their main competitors. Open source code written in R and Rcpp to run BART-BMA can be found at: https://github.com/BelindaHernandez/BART-BMA.git △ Less

Submitted 8 July, 2015; v1 submitted 1 July, 2015; originally announced July 2015.

arXiv:1407.6242 [pdf, ps, other]

Frequency behaviour for multinomial counts of fisheries discards via a nested wavelet zero and N inflated binomial model

Authors: Andrew C. Parnell, Norman Graham, Andrew L. Jackson, Mafalda Viana

Abstract: In this paper we identify the changing frequency behaviour of multinomial counts of fish species discarded by vessels in the Irish Sea. We use a Bayesian hierarchical model which captures dynamic frequency changes via a shrinkage model applied to wavelet basis functions. Wavelets are known for capturing data features at different temporal scales; we use a recently-proposed shrinkage prior from the… ▽ More In this paper we identify the changing frequency behaviour of multinomial counts of fish species discarded by vessels in the Irish Sea. We use a Bayesian hierarchical model which captures dynamic frequency changes via a shrinkage model applied to wavelet basis functions. Wavelets are known for capturing data features at different temporal scales; we use a recently-proposed shrinkage prior from the factor analysis literature so that features at the finest levels of detail exhibit the greatest shrinkage. Rather than using a multinomial distribution for monitoring the changes in discards over time, which can be slow to fit and inflexible, we use a nested zero-and-N inflated (ZaNI) binomial distribution which enables much faster computation with no obvious deterioration in model flexibility. Our results show that seasonal behaviour in these data are not regular and occur at different frequencies. We also show that the nested ZaNI binomial distribution is a good fit to multinomial count data of this sort when an informative nested structure is applied. △ Less

Submitted 23 July, 2014; originally announced July 2014.

Comments: 24 pages, 9 figures

arXiv:1407.0064 [pdf, ps, other]

The zero & $N$-inflated binomial distribution with applications

Authors: James Sweeney, John Haslett, Andrew C. Parnell

Abstract: In this article we consider the distribution arising when two zero-inflated Poisson count processes are constrained by their sum total, resulting in a novel zero & $N$-inflated binomial distribution. This result motivates a general class of model for applications in which a sum-constrained count response is subject to multiple sources of heterogeneity, principally an excess of zeroes and $N$'s in… ▽ More In this article we consider the distribution arising when two zero-inflated Poisson count processes are constrained by their sum total, resulting in a novel zero & $N$-inflated binomial distribution. This result motivates a general class of model for applications in which a sum-constrained count response is subject to multiple sources of heterogeneity, principally an excess of zeroes and $N$'s in the underlying count generating process. Two examples from the ecological regression literature are used to illustrate the wide applicability of the proposed model, and serve to detail its substantial superiority in modelling performance as compared to competing models. We also present an extension to the modelling framework for more complex cases, considering a gender study dataset which is overdispersed relative to the new likelihood, and conclude the article with the description of a general framework for a zero & $N$-inflated multinomial distribution. △ Less

Submitted 17 February, 2016; v1 submitted 30 June, 2014; originally announced July 2014.

arXiv:1402.3014 [pdf, other]

Joint Inference of Misaligned Irregular Time Series with Application to Greenland Ice Core Data

Authors: Thinh K. Doan, Andrew C. Parnell, John Haslett

Abstract: Ice cores provide insight into the past climate over many millennia. Due to ice compaction, the raw data for any single core are irregular in time. Multiple cores have different irregularities; jointly these series are misaligned. After processing, such data are made available to researchers as regular time series: a data product. Typically, these cores are independently processed. In this paper,… ▽ More Ice cores provide insight into the past climate over many millennia. Due to ice compaction, the raw data for any single core are irregular in time. Multiple cores have different irregularities; jointly these series are misaligned. After processing, such data are made available to researchers as regular time series: a data product. Typically, these cores are independently processed. In this paper, we consider a fast Bayesian method for the joint processing of multiple irregular series. This is shown to be more efficient. Further, our approach permits a realistic modelling of the impact of the multiple sources of uncertainty. The methodology is illustrated with the analysis of a pair of ice cores (GISP2 and GRIP). Our data products, in the form of marginal posterior distributions on an arbitrary temporal grid, are finite Gaussian mixtures. We can also produce sample paths from the joint posterior distribution to study non-linear functionals of interest. More generally, the concept of joint analysis via hierarchical Gaussian process model can be widely extended as the models used can be viewed within the larger context of continuous space-time processes. △ Less

Submitted 22 September, 2014; v1 submitted 12 February, 2014; originally announced February 2014.

Comments: 14 pages, 8 figures

arXiv:1312.6761 [pdf, ps, other]

doi 10.1214/15-AOAS824

Modeling sea-level change using errors-in-variables integrated Gaussian processes

Authors: Niamh Cahill, Andrew C. Kemp, Benjamin P. Horton, Andrew C. Parnell

Abstract: We perform Bayesian inference on historical and late Holocene (last 2000 years) rates of sea-level change. The input data to our model are tide-gauge measurements and proxy reconstructions from cores of coastal sediment. These data are complicated by multiple sources of uncertainty, some of which arise as part of the data collection exercise. Notably, the proxy reconstructions include temporal unc… ▽ More We perform Bayesian inference on historical and late Holocene (last 2000 years) rates of sea-level change. The input data to our model are tide-gauge measurements and proxy reconstructions from cores of coastal sediment. These data are complicated by multiple sources of uncertainty, some of which arise as part of the data collection exercise. Notably, the proxy reconstructions include temporal uncertainty from dating of the sediment core using techniques such as radiocarbon. The model we propose places a Gaussian process prior on the rate of sea-level change, which is then integrated and set in an errors-in-variables framework to take account of age uncertainty. The resulting model captures the continuous and dynamic evolution of sea-level change with full consideration of all sources of uncertainty. We demonstrate the performance of our model using two real (and previously published) example data sets. The global tide-gauge data set indicates that sea-level rise increased from a rate with a posterior mean of 1.13 mm$/$yr in 1880 AD (0.89 to 1.28 mm$/$yr 95% credible interval for the posterior mean) to a posterior mean rate of 1.92 mm$/$yr in 2009 AD (1.84 to 2.03 mm$/$yr 95% credible interval for the posterior mean). The proxy reconstruction from North Carolina (USA) after correction for land-level change shows the 2000 AD rate of rise to have a posterior mean of 2.44 mm$/$yr (1.91 to 3.01 mm$/$yr 95% credible interval). This is unprecedented in at least the last 2000 years. △ Less

Submitted 11 September, 2015; v1 submitted 24 December, 2013; originally announced December 2013.

Comments: Published at http://dx.doi.org/10.1214/15-AOAS824 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOAS-AOAS824

Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 2, 547-571

arXiv:1209.6457 [pdf, other]

Bayesian Stable Isotope Mixing Models

Authors: Andrew C. Parnell, Donald L. Phillips, Stuart Bearhop, Brice X. Semmens, Eric J. Ward, Jonathan W. Moore, Andrew L. Jackson, Richard Inger

Abstract: In this paper we review recent advances in Stable Isotope Mixing Models (SIMMs) and place them into an over-arching Bayesian statistical framework which allows for several useful extensions. SIMMs are used to quantify the proportional contributions of various sources to a mixture. The most widely used application is quantifying the diet of organisms based on the food sources they have been observe… ▽ More In this paper we review recent advances in Stable Isotope Mixing Models (SIMMs) and place them into an over-arching Bayesian statistical framework which allows for several useful extensions. SIMMs are used to quantify the proportional contributions of various sources to a mixture. The most widely used application is quantifying the diet of organisms based on the food sources they have been observed to consume. At the centre of the multivariate statistical model we propose is a compositional mixture of the food sources corrected for various metabolic factors. The compositional component of our model is based on the isometric log ratio (ilr) transform of Egozcue (2003). Through this transform we can apply a range of time series and non-parametric smoothing relationships. We illustrate our models with 3 case studies based on real animal dietary behaviour. △ Less

Submitted 28 September, 2012; originally announced September 2012.

Comments: 16 pages, 9 Figures, 1 Table

arXiv:1206.5009 [pdf, other]

On Bayesian Modelling of the Uncertainties in Palaeoclimate Reconstruction

Authors: Andrew C. Parnell, James Sweeney, Thinh K. Doan, Michael Salter-Townshend, Judy R. M. Allen, Brian Huntley, John Haslett

Abstract: We outline a model and algorithm to perform inference on the palaeoclimate and palaeoclimate volatility from pollen proxy data. We use a novel multivariate non-linear non-Gaussian state space model consisting of an observation equation linking climate to proxy data and an evolution equation driving climate change over time. The link from climate to proxy data is defined by a pre-calibrated forward… ▽ More We outline a model and algorithm to perform inference on the palaeoclimate and palaeoclimate volatility from pollen proxy data. We use a novel multivariate non-linear non-Gaussian state space model consisting of an observation equation linking climate to proxy data and an evolution equation driving climate change over time. The link from climate to proxy data is defined by a pre-calibrated forward model, as developed in Salter-Townshend and Haslett (2012) and Sweeney (2012). Climatic change is represented by a temporally-uncertain Normal-Inverse Gaussian Levy process, being able to capture large jumps in multivariate climate whilst remaining temporally consistent. The pre-calibrated nature of the forward model allows us to cut feedback between the observation and evolution equations and thus integrate out the state variable entirely whilst making minimal simplifying assumptions. A key part of this approach is the creation of mixtures of marginal data posteriors representing the information obtained about climate from each individual time point. Our approach allows for an extremely efficient MCMC algorithm, which we demonstrate with a pollen core from Sluggan Bog, County Antrim, Northern Ireland. △ Less

Submitted 21 June, 2012; originally announced June 2012.

Comments: 25 pages, 7 figures

Showing 1–45 of 45 results for author: Parnell, A