Skip to main content

Showing 1–15 of 15 results for author: Santos-Fernandez, E

.
  1. arXiv:2407.07120  [pdf

    stat.AP

    An Analysis of Pacing Profiles in Sprint Kayak Racing Using Functional Principal Components and Hidden Markov Models

    Authors: Harry Estreich, Nicola Bullock, Mark Osborne, Edgar Santos-Fernandez, Paul Pao-Yen Wu

    Abstract: This study analysed sprint kayak pacing profiles in order to categorise and compare an athlete's race profile throughout their career. We used functional principal component analysis of normalised velocity data for 500m and 1000m races to quantify pacing. The first four principal components explained 90.77% of the variation over 500m and 78.80% over 1000m. These principal components were then asso… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 17 Pages, 7 Figures, 3 Tables

  2. arXiv:2403.10791  [pdf, other

    stat.AP

    Bayesian Design for Sampling Anomalous Spatio-Temporal Data

    Authors: Katie Buchhorn, Kerrie Mengersen, Edgar Santos-Fernandez, James McGree

    Abstract: Data collected from arrays of sensors are essential for informed decision-making in various systems. However, the presence of anomalies can compromise the accuracy and reliability of insights drawn from the collected data or information obtained via statistical analysis. This study aims to develop a robust Bayesian optimal experimental design (BOED) framework with anomaly detection methods for hig… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  3. arXiv:2305.12651  [pdf, other

    stat.ME stat.AP stat.CO

    Conditional normalization in time series analysis

    Authors: Puwasala Gamakumara, Edgar Santos-Fernandez, Priyanga Dilini Talagala, Rob J. Hyndman, Kerrie Mengersen, Catherine Leigh

    Abstract: Time series often reflect variation associated with other related variables. Controlling for the effect of these variables is useful when modeling or analysing the time series. We introduce a novel approach to normalize time series data conditional on a set of covariates. We do this by modeling the conditional mean and the conditional variance of the time series with generalized additive models us… ▽ More

    Submitted 21 May, 2023; originally announced May 2023.

    Comments: 36 pages, 26 Figures, Journal Article

  4. arXiv:2305.01144  [pdf, other

    stat.AP

    Increasing trust in new data sources: crowdsourcing image classification for ecology

    Authors: Edgar Santos-Fernandez, Julie Vercelloni, Aiden Price, Grace Heron, Bryce Christensen, Erin E. Peterson, Kerrie Mengersen

    Abstract: Crowdsourcing methods facilitate the production of scientific information by non-experts. This form of citizen science (CS) is becoming a key source of complementary data in many fields to inform data-driven decisions and study challenging problems. However, concerns about the validity of these data often constrain their utility. In this paper, we focus on the use of citizen science data in addres… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: 25 pages, 10 figures

  5. arXiv:2304.09367  [pdf, other

    cs.LG stat.AP

    Graph Neural Network-Based Anomaly Detection for River Network Systems

    Authors: Katie Buchhorn, Edgar Santos-Fernandez, Kerrie Mengersen, Robert Salomone

    Abstract: Water is the lifeblood of river networks, and its quality plays a crucial role in sustaining both aquatic ecosystems and human societies. Real-time monitoring of water quality is increasingly reliant on in-situ sensor technology. Anomaly detection is crucial for identifying erroneous patterns in sensor data, but can be a challenging task due to the complexity and variability of the data, even unde… ▽ More

    Submitted 31 May, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

  6. arXiv:2211.10029  [pdf, other

    stat.AP

    Being Bayesian in the 2020s: opportunities and challenges in the practice of modern applied Bayesian statistics

    Authors: Joshua J. Bon, Adam Bretherton, Katie Buchhorn, Susanna Cramb, Christopher Drovandi, Conor Hassan, Adrianne L. Jenner, Helen J. Mayfield, James M. McGree, Kerrie Mengersen, Aiden Price, Robert Salomone, Edgar Santos-Fernandez, Julie Vercelloni, Xiaoyu Wang

    Abstract: Building on a strong foundation of philosophy, theory, methods and computation over the past three decades, Bayesian approaches are now an integral part of the toolkit for most statisticians and data scientists. Whether they are dedicated Bayesians or opportunistic users, applied professionals can now reap many of the benefits afforded by the Bayesian paradigm. In this paper, we touch on six moder… ▽ More

    Submitted 17 January, 2023; v1 submitted 17 November, 2022; originally announced November 2022.

    Comments: 27 pages, 8 figures

  7. arXiv:2209.04117  [pdf, other

    stat.ME stat.AP stat.ML

    clusterBMA: Bayesian model averaging for clustering

    Authors: Owen Forbes, Edgar Santos-Fernandez, Paul Pao-Yen Wu, Hong-Bo Xie, Paul E. Schwenn, Jim Lagopoulos, Lia Mills, Dashiell D. Sacks, Daniel F. Hermens, Kerrie Mengersen

    Abstract: Various methods have been developed to combine inference across multiple sets of results for unsupervised clustering, within the ensemble clustering literature. The approach of reporting results from one `best' model out of several candidate clustering models generally ignores the uncertainty that arises from model selection, and results in inferences that are sensitive to the particular model and… ▽ More

    Submitted 25 March, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

  8. arXiv:2206.05369  [pdf, other

    stat.ME stat.AP

    Bayesian Design with Sampling Windows for Complex Spatial Processes

    Authors: Katie Buchhorn, Kerrie Mengersen, Edgar Santos-Fernandez, Erin E. Peterson, James M. McGree

    Abstract: Optimal design facilitates intelligent data collection. In this paper, we introduce a fully Bayesian design approach for spatial processes with complex covariance structures, like those typically exhibited in natural ecosystems. Coordinate Exchange algorithms are commonly used to find optimal design points. However, collecting data at specific points is often infeasible in practice. Currently, the… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

  9. arXiv:2203.04165  [pdf, other

    stat.AP stat.CO stat.ML

    On the intrinsic dimensionality of Covid-19 data: a global perspective

    Authors: Abhishek Varghese, Edgar Santos-Fernandez, Francesco Denti, Antonietta Mira, Kerrie Mengersen

    Abstract: This paper aims to develop a global perspective of the complexity of the relationship between the standardised per-capita growth rate of Covid-19 cases, deaths, and the OxCGRT Covid-19 Stringency Index, a measure describing a country's stringency of lockdown policies. To achieve our goal, we use a heterogeneous intrinsic dimension estimator implemented as a Bayesian mixture model, called Hidalgo.… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

    MSC Class: 62P10

  10. arXiv:2202.07166  [pdf, other

    stat.CO stat.ME

    SSNbayes: An R package for Bayesian spatio-temporal modelling on stream networks

    Authors: Edgar Santos-Fernandez, Jay M. Ver Hoef, James M. McGree, Daniel J. Isaak, Kerrie Mengersen, Erin E. Peterson

    Abstract: Spatio-temporal models are widely used in many research areas from ecology to epidemiology. However, most covariance functions describe spatial relationships based on Euclidean distance only. In this paper, we introduce the R package SSNbayes for fitting Bayesian spatio-temporal models and making predictions on branching stream networks. SSNbayes provides a linear regression framework with multipl… ▽ More

    Submitted 14 February, 2022; originally announced February 2022.

  11. Bayesian spatio-temporal models for stream networks

    Authors: Edgar Santos-Fernandez, Jay M. Ver Hoef, Erin E. Peterson, James McGree, Daniel Isaak, Kerrie Mengersen

    Abstract: Spatio-temporal models are widely used in many research areas including ecology. The recent proliferation of the use of in-situ sensors in streams and rivers supports space-time water quality modelling and monitoring in near real-time. A new family of spatio-temporal models is introduced. These models incorporate spatial dependence using stream distance while temporal autocorrelation is captured u… ▽ More

    Submitted 14 February, 2022; v1 submitted 5 March, 2021; originally announced March 2021.

    Comments: 30 pages, 10 figs

  12. arXiv:2006.00741  [pdf, other

    stat.AP stat.OT

    Correcting misclassification errors in crowdsourced ecological data: A Bayesian perspective

    Authors: Edgar Santos-Fernandez, Erin E. Peterson, Julie Vercelloni, Em Rushworth, Kerrie Mengersen

    Abstract: Many research domains use data elicited from "citizen scientists" when a direct measure of a process is expensive or infeasible. However, participants may report incorrect estimates or classifications due to their lack of skill. We demonstrate how Bayesian hierarchical models can be used to learn about latent variables of interest, while accounting for the participants' abilities. The model is des… ▽ More

    Submitted 1 June, 2020; originally announced June 2020.

    Comments: 18 figures, 5 tables

  13. arXiv:2003.06966  [pdf, other

    stat.AP

    Bayesian item response models for citizen science ecological data

    Authors: Edgar Santos-Fernandez, Kerrie Mengersen

    Abstract: So-called 'citizen science' data elicited from crowds has become increasingly popular in many fields including ecology. However, the quality of this information is being frequently debated by many within the scientific community. Therefore, modern citizen science implementations require measures of the users' proficiency that account for the difficulty of the tasks. We introduce a new methodologic… ▽ More

    Submitted 25 May, 2020; v1 submitted 15 March, 2020; originally announced March 2020.

    Comments: under review, 24 pages, 10 figures

  14. arXiv:2002.04148  [pdf, other

    stat.AP

    The role of intrinsic dimension in high-resolution player tracking data -- Insights in basketball

    Authors: Edgar Santos-Fernandez, Francesco Denti, Kerrie Mengersen, Antonietta Mira

    Abstract: A new range of statistical analysis has emerged in sports after the introduction of the high-resolution player tracking technology, specifically in basketball. However, this high dimensional data is often challenging for statistical inference and decision making. In this article, we employ Hidalgo, a state-of-the-art Bayesian mixture model that allows the estimation of heterogeneous intrinsic dime… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: 21 pages, 16 figures, Codes + data + results can be found in https://github.com/EdgarSantos-Fernandez/id_basketball, Submitted

  15. arXiv:1808.05298  [pdf

    stat.AP

    Monitoring through many eyes: Integrating disparate datasets to improve monitoring of the Great Barrier Reef

    Authors: Erin E Peterson, Edgar Santos-Fernández, Carla Chen, Sam Clifford, Julie Vercelloni, Alan Pearse, Ross Brown, Bryce Christensen, Allan James, Ken Anthony, Jennifer Loder, Manuel González-Rivero, Chris Roelfsema, M. Julian Caley, Tomasz Bednarz, Kerrie Mengersen

    Abstract: Numerous organisations collect data in the Great Barrier Reef (GBR), but they are rarely analysed together due to different program objectives, methods, and data quality. We developed a weighted spatiotemporal Bayesian model and used it to integrate image based hard coral data collected by professional and citizen scientists, who captured and or classified underwater images. We used the model to p… ▽ More

    Submitted 27 March, 2019; v1 submitted 15 August, 2018; originally announced August 2018.