Search | arXiv e-print repository

Symmetry: a General Structure in Nonparametric Regression

Authors: Louis G. Christie, John A. D. Aston

Abstract: In this paper we present the framework of symmetry in nonparametric regression. This generalises the framework of covariate sparsity, where the regression function depends only on at most $s < d$ of the covariates, which is a special case of translation symmetry with linear orbits. In general this extends to other types of functions that capture lower dimensional behavior even when these structure… ▽ More In this paper we present the framework of symmetry in nonparametric regression. This generalises the framework of covariate sparsity, where the regression function depends only on at most $s < d$ of the covariates, which is a special case of translation symmetry with linear orbits. In general this extends to other types of functions that capture lower dimensional behavior even when these structures are non-linear. We show both that known symmetries of regression functions can be exploited to give similarly faster rates, and that unknown symmetries with Lipschitz actions can be estimated sufficiently quickly to obtain the same rates. This is done by explicit constructions of partial symmetrisation operators that are then applied to usual estimators, and with a two step M-estimator of the maximal symmetry of the regression function. We also demonstrate the finite sample performance of these estimators on synthetic data. △ Less

Submitted 19 April, 2024; originally announced April 2024.

Comments: 29 Pages, 4 Figures, 2 Appendices

MSC Class: 62G08

arXiv:2310.02874 [pdf, other]

Recent Methodological Advances in Federated Learning for Healthcare

Authors: Fan Zhang, Daniel Kreuter, Yichen Chen, Sören Dittmer, Samuel Tull, Tolou Shadbahr, BloodCounts! Collaboration, Jacobus Preller, James H. F. Rudd, John A. D. Aston, Carola-Bibiane Schönlieb, Nicholas Gleadall, Michael Roberts

Abstract: For healthcare datasets, it is often not possible to combine data samples from multiple sites due to ethical, privacy or logistical concerns. Federated learning allows for the utilisation of powerful machine learning algorithms without requiring the pooling of data. Healthcare data has many simultaneous challenges which require new methodologies to address, such as highly-siloed data, class imbala… ▽ More For healthcare datasets, it is often not possible to combine data samples from multiple sites due to ethical, privacy or logistical concerns. Federated learning allows for the utilisation of powerful machine learning algorithms without requiring the pooling of data. Healthcare data has many simultaneous challenges which require new methodologies to address, such as highly-siloed data, class imbalance, missing data, distribution shifts and non-standardised variables. Federated learning adds significant methodological complexity to conventional centralised machine learning, requiring distributed optimisation, communication between nodes, aggregation of models and redistribution of models. In this systematic review, we consider all papers on Scopus that were published between January 2015 and February 2023 and which describe new federated learning methodologies for addressing challenges with healthcare data. We performed a detailed review of the 89 papers which fulfilled these criteria. Significant systemic issues were identified throughout the literature which compromise the methodologies in many of the papers reviewed. We give detailed recommendations to help improve the quality of the methodology development for federated learning in healthcare. △ Less

Submitted 4 October, 2023; originally announced October 2023.

Comments: Supplementary table of extracted data at the end of the document

arXiv:2307.13579 [pdf, other]

Reinterpreting survival analysis in the universal approximator age

Authors: Sören Dittmer, Michael Roberts, Jacobus Preller, AIX COVNET, James H. F. Rudd, John A. D. Aston, Carola-Bibiane Schönlieb

Abstract: Survival analysis is an integral part of the statistical toolbox. However, while most domains of classical statistics have embraced deep learning, survival analysis only recently gained some minor attention from the deep learning community. This recent development is likely in part motivated by the COVID-19 pandemic. We aim to provide the tools needed to fully harness the potential of survival ana… ▽ More Survival analysis is an integral part of the statistical toolbox. However, while most domains of classical statistics have embraced deep learning, survival analysis only recently gained some minor attention from the deep learning community. This recent development is likely in part motivated by the COVID-19 pandemic. We aim to provide the tools needed to fully harness the potential of survival analysis in deep learning. On the one hand, we discuss how survival analysis connects to classification and regression. On the other hand, we provide technical tools. We provide a new loss function, evaluation metrics, and the first universal approximating network that provably produces survival curves without numeric integration. We show that the loss function and model outperform other approaches using a large numerical study. △ Less

Submitted 25 July, 2023; originally announced July 2023.

arXiv:2306.09177 [pdf, other]

Dis-AE: Multi-domain & Multi-task Generalisation on Real-World Clinical Data

Authors: Daniel Kreuter, Samuel Tull, Julian Gilbey, Jacobus Preller, BloodCounts! Consortium, John A. D. Aston, James H. F. Rudd, Suthesh Sivapalaratnam, Carola-Bibiane Schönlieb, Nicholas Gleadall, Michael Roberts

Abstract: Clinical data is often affected by clinically irrelevant factors such as discrepancies between measurement devices or differing processing methods between sites. In the field of machine learning (ML), these factors are known as domains and the distribution differences they cause in the data are known as domain shifts. ML models trained using data from one domain often perform poorly when applied t… ▽ More Clinical data is often affected by clinically irrelevant factors such as discrepancies between measurement devices or differing processing methods between sites. In the field of machine learning (ML), these factors are known as domains and the distribution differences they cause in the data are known as domain shifts. ML models trained using data from one domain often perform poorly when applied to data from another domain, potentially leading to wrong predictions. As such, develo** machine learning models that can generalise well across multiple domains is a challenging yet essential task in the successful application of ML in clinical practice. In this paper, we propose a novel disentangled autoencoder (Dis-AE) neural network architecture that can learn domain-invariant data representations for multi-label classification of medical measurements even when the data is influenced by multiple interacting domain shifts at once. The model utilises adversarial training to produce data representations from which the domain can no longer be determined. We evaluate the model's domain generalisation capabilities on synthetic datasets and full blood count (FBC) data from blood donors as well as primary and secondary care patients, showing that Dis-AE improves model generalisation on multiple domains simultaneously while preserving clinically relevant information. △ Less

Submitted 15 June, 2023; originally announced June 2023.

Comments: 17 pages main body, 5 figures, 18 pages of appendix

arXiv:2304.02577 [pdf, other]

ECG Feature Importance Rankings: Cardiologists vs. Algorithms

Authors: Temesgen Mehari, Ashish Sundar, Alen Bosnjakovic, Peter Harris, Steven E. Williams, Axel Loewe, Olaf Doessel, Claudia Nagel, Nils Strodthoff, Philip J. Aston

Abstract: Feature importance methods promise to provide a ranking of features according to importance for a given classification task. A wide range of methods exist but their rankings often disagree and they are inherently difficult to evaluate due to a lack of ground truth beyond synthetic datasets. In this work, we put feature importance methods to the test on real-world data in the domain of cardiology,… ▽ More Feature importance methods promise to provide a ranking of features according to importance for a given classification task. A wide range of methods exist but their rankings often disagree and they are inherently difficult to evaluate due to a lack of ground truth beyond synthetic datasets. In this work, we put feature importance methods to the test on real-world data in the domain of cardiology, where we try to distinguish three specific pathologies from healthy subjects based on ECG features comparing to features used in cardiologists' decision rules as ground truth. Some methods generally performed well and others performed poorly, while some methods did well on some but not all of the problems considered. △ Less

Submitted 5 April, 2023; originally announced April 2023.

arXiv:2304.01789 [pdf]

Communication of Statistics and Evidence in Times of Crisis

Authors: Claudia R Schneider, John R Kerr, Sarah Dryhurst, John A D Aston

Abstract: This review provides an overview of concepts relating to the communication of statistical and empirical evidence in times of crisis, with a special focus on COVID-19. In it, we consider topics relating both to the communication of numbers -- such as the role of format, context, comparisons, and visualization -- and the communication of evidence more broadly -- such as evidence quality, the influen… ▽ More This review provides an overview of concepts relating to the communication of statistical and empirical evidence in times of crisis, with a special focus on COVID-19. In it, we consider topics relating both to the communication of numbers -- such as the role of format, context, comparisons, and visualization -- and the communication of evidence more broadly -- such as evidence quality, the influence of changes in available evidence, transparency, and repeated decision making. A central focus is on the communication of the inherent uncertainties in statistical analysis, especially in rapidly changing informational environments during crises. We present relevant literature on these topics and draw connections to the communication of statistics and empirical evidence during the COVID-19 pandemic and beyond. We finish by suggesting some considerations for those faced with communicating statistics and evidence in times of crisis. △ Less

Submitted 4 April, 2023; originally announced April 2023.

Comments: 33 pages; 4 figures

arXiv:2303.13616 [pdf, other]

Estimating Maximal Symmetries of Regression Functions via Subgroup Lattices

Authors: Louis G. Christie, John A. D. Aston

Abstract: We present a method for estimating the maximal symmetry of a continuous regression function. Knowledge of such a symmetry can be used to significantly improve modelling by removing the modes of variation resulting from the symmetries. Symmetry estimation is carried out using hypothesis testing for invariance strategically over the subgroup lattice of a search group G acting on the feature space. W… ▽ More We present a method for estimating the maximal symmetry of a continuous regression function. Knowledge of such a symmetry can be used to significantly improve modelling by removing the modes of variation resulting from the symmetries. Symmetry estimation is carried out using hypothesis testing for invariance strategically over the subgroup lattice of a search group G acting on the feature space. We show that the estimation of the unique maximal invariant subgroup of G generalises useful tools from linear dimension reduction to a non linear context. We show that the estimation is consistent when the subgroup lattice chosen is finite, even when some of the subgroups themselves are infinite. We demonstrate the performance of this estimator in synthetic settings and apply the methods to two data sets: satellite measurements of the earth's magnetic field intensity; and the distribution of sunspots. △ Less

Submitted 19 December, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

Comments: 47 Pages, 16 figures

MSC Class: 62F10 (primary) 62G08; 22A26 (secondary)

arXiv:2211.14212 [pdf, other]

doi 10.1088/1361-6560/acd616

On Krylov Methods for Large Scale CBCT Reconstruction

Authors: Malena Sabate Landman, Ander Biguri, Sepideh Hatamikia, Richard Boardman, John Aston, Carola-Bibiane Schonlieb

Abstract: Krylov subspace methods are a powerful family of iterative solvers for linear systems of equations, which are commonly used for inverse problems due to their intrinsic regularization properties. Moreover, these methods are naturally suited to solve large-scale problems, as they only require matrix-vector products with the system matrix (and its adjoint) to compute approximate solutions, and they d… ▽ More Krylov subspace methods are a powerful family of iterative solvers for linear systems of equations, which are commonly used for inverse problems due to their intrinsic regularization properties. Moreover, these methods are naturally suited to solve large-scale problems, as they only require matrix-vector products with the system matrix (and its adjoint) to compute approximate solutions, and they display a very fast convergence. Even if this class of methods has been widely researched and studied in the numerical linear algebra community, its use in applied medical physics and applied engineering is still very limited. e.g. in realistic large-scale Computed Tomography (CT) problems, and more specifically in Cone Beam CT (CBCT). This work attempts to breach this gap by providing a general framework for the most relevant Krylov subspace methods applied to 3D CT problems, including the most well-known Krylov solvers for non-square systems (CGLS, LSQR, LSMR), possibly in combination with Tikhonov regularization, and methods that incorporate total variation (TV) regularization. This is provided within an open source framework: the Tomographic Iterative GPU-based Reconstruction (TIGRE) toolbox, with the idea of promoting accessibility and reproducibility of the results for the algorithms presented. Finally, numerical results in synthetic and real-world 3D CT applications (medical CBCT and μ-CT datasets) are provided to showcase and compare the different Krylov subspace methods presented in the paper, as well as their suitability for different kinds of problems. △ Less

Submitted 25 November, 2022; originally announced November 2022.

Comments: submitted

arXiv:2210.13191 [pdf, other]

doi 10.1038/s42256-023-00665-x

Navigating the challenges in creating complex data systems: a development philosophy

Authors: Sören Dittmer, Michael Roberts, Julian Gilbey, Ander Biguri, AIX-COVNET Collaboration, Jacobus Preller, James H. F. Rudd, John A. D. Aston, Carola-Bibiane Schönlieb

Abstract: In this perspective, we argue that despite the democratization of powerful tools for data science and machine learning over the last decade, develo** the code for a trustworthy and effective data science system (DSS) is getting harder. Perverse incentives and a lack of widespread software engineering (SE) skills are among many root causes we identify that naturally give rise to the current syste… ▽ More In this perspective, we argue that despite the democratization of powerful tools for data science and machine learning over the last decade, develo** the code for a trustworthy and effective data science system (DSS) is getting harder. Perverse incentives and a lack of widespread software engineering (SE) skills are among many root causes we identify that naturally give rise to the current systemic crisis in reproducibility of DSSs. We analyze why SE and building large complex systems is, in general, hard. Based on these insights, we identify how SE addresses those difficulties and how we can apply and generalize SE methods to construct DSSs that are fit for purpose. We advocate two key development philosophies, namely that one should incrementally grow -- not biphasically plan and build -- DSSs, and one should always employ two types of feedback loops during development: one which tests the code's correctness and another that evaluates the code's efficacy. △ Less

Submitted 21 October, 2022; originally announced October 2022.

arXiv:2210.05797 [pdf, other]

Joint Modeling for Geometry and Functionality of Cerebral Cortical Surface Images

Authors: **g**g Zou, Chi-Hua Chen, John A. D. Aston

Abstract: We propose a framework for jointly modeling the geometry and functionality in high dimensional functional surfaces. The proposed mixed effects model characterizes effects of subject-specific covariates and exogenous stimuli on functional surfaces while accounting for potential mutual-influence of their geometry and functionality. This is achieved through a computationally efficient estimation meth… ▽ More We propose a framework for jointly modeling the geometry and functionality in high dimensional functional surfaces. The proposed mixed effects model characterizes effects of subject-specific covariates and exogenous stimuli on functional surfaces while accounting for potential mutual-influence of their geometry and functionality. This is achieved through a computationally efficient estimation method that incorporates regularized estimation of the precision matrix of the random effects. We perform a thorough analysis of cerebral cortical surface structural MRI and task fMRI data from the Human Connectome Project and discover relationships between the geometric shapes of cortical surface and neuronal activation responding to task stimuli. Our findings highlight new modes of correspondence between cortical surface shape and functional activation relevant to emotion processing. △ Less

Submitted 9 February, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

arXiv:2207.06530 [pdf]

Estimation of Soft Robotic Bladder Compression for Smart Helmets using IR Range Finding and Hall Effect Magnetic Sensing

Authors: Colin Pollard, Jonathan Aston, Mark A. Minor

Abstract: This research focuses on soft robotic bladders that are used to monitor and control the interaction between a user's head and the shell of a Smart Helmet. Compression of these bladders determines impact dissipation; hence the focus of this paper is sensing and estimation of bladder compression. An IR rangefinder-based solution is evaluated using regression techniques as well as a Neural Network to… ▽ More This research focuses on soft robotic bladders that are used to monitor and control the interaction between a user's head and the shell of a Smart Helmet. Compression of these bladders determines impact dissipation; hence the focus of this paper is sensing and estimation of bladder compression. An IR rangefinder-based solution is evaluated using regression techniques as well as a Neural Network to estimate bladder compression. A Hall-Effect (HE) magnetic sensing system is also examined where HE sensors embedded in the base of the bladder sense the position of a magnet in the top of the bladder. The paper presents the HE sensor array, signal processing of HE voltage data, and then a Neural Network (NN) for predicting bladder compression. Efficacy of different training data sets on NN performance is studied. Different NN configurations are examined to determine a configuration that provides accurate estimates with as few nodes as possible. Different bladder compression profiles are evaluated to characterize IR range finding and HE based techniques in application scenarios. △ Less

Submitted 13 July, 2022; originally announced July 2022.

arXiv:2206.13897 [pdf, other]

Statistical Depth for Big Functional Data with Application to Neuroimaging

Authors: Alicia Nieto-Reyes, John A. D. Aston

Abstract: Functional depth is the functional data analysis technique that orders a functional data set. Unlike the case of data on the real line, defining this order is non-trivial, and particularly, with functional data, there are a number of properties that any depth should satisfy. We propose a new depth which both satisfies the properties required of a functional depth but also one which can be used in… ▽ More Functional depth is the functional data analysis technique that orders a functional data set. Unlike the case of data on the real line, defining this order is non-trivial, and particularly, with functional data, there are a number of properties that any depth should satisfy. We propose a new depth which both satisfies the properties required of a functional depth but also one which can be used in the case where there are a very large number of functional observations or in the case where the observations are functions of several continuous variables (such as images, for example). We give theoretical justification for our choice, and evaluate our proposed depth through simulation. We finally apply the proposed depth to the problem of yielding a completely non-parametric deconvolution of Positron Emission Tomography (PET) data for a very large number of curves across the image, as well as to the problem of finding a representative subject from a set of PET scans. △ Less

Submitted 28 June, 2022; originally announced June 2022.

arXiv:2206.08478 [pdf, other]

doi 10.1038/s43856-023-00356-z

Classification of datasets with imputed missing values: does imputation quality matter?

Authors: Tolou Shadbahr, Michael Roberts, Jan Stanczuk, Julian Gilbey, Philip Teare, Sören Dittmer, Matthew Thorpe, Ramon Vinas Torne, Evis Sala, Pietro Lio, Mishal Patel, AIX-COVNET Collaboration, James H. F. Rudd, Tuomas Mirtti, Antti Rannikko, John A. D. Aston, **g Tang, Carola-Bibiane Schönlieb

Abstract: Classifying samples in incomplete datasets is a common aim for machine learning practitioners, but is non-trivial. Missing data is found in most real-world datasets and these missing values are typically imputed using established methods, followed by classification of the now complete, imputed, samples. The focus of the machine learning researcher is then to optimise the downstream classification… ▽ More Classifying samples in incomplete datasets is a common aim for machine learning practitioners, but is non-trivial. Missing data is found in most real-world datasets and these missing values are typically imputed using established methods, followed by classification of the now complete, imputed, samples. The focus of the machine learning researcher is then to optimise the downstream classification performance. In this study, we highlight that it is imperative to consider the quality of the imputation. We demonstrate how the commonly used measures for assessing quality are flawed and propose a new class of discrepancy scores which focus on how well the method recreates the overall distribution of the data. To conclude, we highlight the compromised interpretability of classifier models trained using poorly imputed data. △ Less

Submitted 16 June, 2022; originally announced June 2022.

Comments: 17 pages, 10 figures, 30 supplementary pages

arXiv:2205.15280 [pdf, other]

Testing for Geometric Invariance and Equivariance

Authors: Louis G. Christie, John A. D. Aston

Abstract: Invariant and equivariant models incorporate the symmetry of an object to be estimated (here non-parametric regression functions $f : \mathcal{X} \rightarrow \mathbb{R}$). These models perform better (with respect to $L^2$ loss) and are increasingly being used in practice, but encounter problems when the symmetry is falsely assumed. In this paper we present a framework for testing for $G$-equivari… ▽ More Invariant and equivariant models incorporate the symmetry of an object to be estimated (here non-parametric regression functions $f : \mathcal{X} \rightarrow \mathbb{R}$). These models perform better (with respect to $L^2$ loss) and are increasingly being used in practice, but encounter problems when the symmetry is falsely assumed. In this paper we present a framework for testing for $G$-equivariance for any semi-group $G$. This will give confidence to the use of such models when the symmetry is not known a priori. These tests are independent of the model and are computationally quick, so can be easily used before model fitting to test their validity. △ Less

Submitted 30 May, 2022; originally announced May 2022.

Comments: 15 Pages, 6 Figures

arXiv:2204.05622 [pdf, other]

Eigen-Adjusted Functional Principal Component Analysis

Authors: Ci-Ren Jiang, Eardi Lila, John AD Aston, Jane-Ling Wang

Abstract: Functional Principal Component Analysis (FPCA) has become a widely-used dimension reduction tool for functional data analysis. When additional covariates are available, existing FPCA models integrate them either in the mean function or in both the mean function and the covariance function. However, methods of the first kind are not suitable for data that display second-order variation, while those… ▽ More Functional Principal Component Analysis (FPCA) has become a widely-used dimension reduction tool for functional data analysis. When additional covariates are available, existing FPCA models integrate them either in the mean function or in both the mean function and the covariance function. However, methods of the first kind are not suitable for data that display second-order variation, while those of the second kind are time-consuming and make it difficult to perform subsequent statistical analyses on the dimension-reduced representations. To tackle these issues, we introduce an eigen-adjusted FPCA model that integrates covariates in the covariance function only through its eigenvalues. In particular, different structures on the covariate-specific eigenvalues -- corresponding to different practical problems -- are discussed to illustrate the model's flexibility as well as utility. To handle functional observations under different sampling schemes, we employ local linear smoothers to estimate the mean function and the pooled covariance function, and a weighted least square approach to estimate the covariate-specific eigenvalues. The convergence rates of the proposed estimators are further investigated under the different sampling schemes. In addition to simulation studies, the proposed model is applied to functional Magnetic Resonance Imaging scans, collected within the Human Connectome Project, for functional connectivity investigation. △ Less

Submitted 12 April, 2022; originally announced April 2022.

Comments: 31 pages, 4 figures

arXiv:2108.01995 [pdf, other]

doi 10.1098/rsta.2020.0262

Robustness of convolutional neural networks to physiological ECG noise

Authors: J. Venton, P. M. Harris, A. Sundar, N. A. S. Smith, P. J. Aston

Abstract: The electrocardiogram (ECG) is one of the most widespread diagnostic tools in healthcare and supports the diagnosis of cardiovascular disorders. Deep learning methods are a successful and popular technique to detect indications of disorders from an ECG signal. However, there are open questions around the robustness of these methods to various factors, including physiological ECG noise. In this stu… ▽ More The electrocardiogram (ECG) is one of the most widespread diagnostic tools in healthcare and supports the diagnosis of cardiovascular disorders. Deep learning methods are a successful and popular technique to detect indications of disorders from an ECG signal. However, there are open questions around the robustness of these methods to various factors, including physiological ECG noise. In this study we generate clean and noisy versions of an ECG dataset before applying Symmetric Projection Attractor Reconstruction (SPAR) and scalogram image transformations. A pretrained convolutional neural network is trained using transfer learning to classify these image transforms. For the clean ECG dataset, F1 scores for SPAR attractor and scalogram transforms were 0.70 and 0.79, respectively, and the scores decreased by less than 0.05 for the noisy ECG datasets. Notably, when the network trained on clean data was used to classify the noisy datasets, performance decreases of up to 0.18 in F1 scores were seen. However, when the network trained on the noisy data was used to classify the clean dataset, the performance decrease was less than 0.05. We conclude that physiological ECG noise impacts classification using deep learning methods and careful consideration should be given to the inclusion of noisy ECG signals in the training data when develo** supervised networks for ECG classification. △ Less

Submitted 2 August, 2021; originally announced August 2021.

Comments: 16 pages, 7 figures

arXiv:2009.06059 [pdf, other]

doi 10.1214/21-AOAS1572

Functional random effects modeling of brain shape and connectivity

Authors: Eardi Lila, John A. D. Aston

Abstract: We present a statistical framework that jointly models brain shape and functional connectivity, which are two complex aspects of the brain that have been classically studied independently. We adopt a Riemannian modeling approach to account for the non-Euclidean geometry of the space of shapes and the space of connectivity that constrains trajectories of co-variation to be valid statistical estimat… ▽ More We present a statistical framework that jointly models brain shape and functional connectivity, which are two complex aspects of the brain that have been classically studied independently. We adopt a Riemannian modeling approach to account for the non-Euclidean geometry of the space of shapes and the space of connectivity that constrains trajectories of co-variation to be valid statistical estimates. In order to disentangle genetic sources of variability from those driven by unique environmental factors, we embed a functional random effects model in the Riemannian framework. We apply the proposed model to the Human Connectome Project dataset to explore spontaneous co-variation between brain shape and connectivity in young healthy individuals. △ Less

Submitted 26 January, 2022; v1 submitted 13 September, 2020; originally announced September 2020.

Comments: 27 pages

Journal ref: Ann. Appl. Stat. 16 (4) 2122 - 2144, December 2022

arXiv:1907.01840 [pdf, other]

A Variational Model Dedicated to Joint Segmentation, Registration and Atlas Generation for Shape Analysis

Authors: Noémie Debroux, John Aston, Fabien Bonardi, Alistair Forbes, Carole Le Guyader, Marina Romanchikova, Carola Schönlieb

Abstract: In medical image analysis, constructing an atlas, i.e. a mean representative of an ensemble of images, is a critical task for practitioners to estimate variability of shapes inside a population, and to characterise and understand how structural shape changes have an impact on health. This involves identifying significant shape constituents of a set of images, a process called segmentation, and map… ▽ More In medical image analysis, constructing an atlas, i.e. a mean representative of an ensemble of images, is a critical task for practitioners to estimate variability of shapes inside a population, and to characterise and understand how structural shape changes have an impact on health. This involves identifying significant shape constituents of a set of images, a process called segmentation, and map** this group of images to an unknown mean image, a task called registration, making a statistical analysis of the image population possible. To achieve this goal, we propose treating these operations jointly to leverage their positive mutual influence, in a hyperelasticity setting, by viewing the shapes to be matched as Ogden materials. The approach is complemented by novel hard constraints on the $L^\infty$ norm of both the Jacobian and its inverse, ensuring that the deformation is a bi-Lipschitz homeomorphism. Segmentation is based on the Potts model, which allows for a partition into more than two regions, i.e. more than one shape. The connection to the registration problem is ensured by the dissimilarity measure that aims to align the segmented shapes. A representation of the deformation field in a linear space equipped with a scalar product is then computed in order to perform a geometry-driven Principal Component Analysis (PCA) and to extract the main modes of variations inside the image population. Theoretical results emphasizing the mathematical soundness of the model are provided, among which existence of minimisers, analysis of a numerical method of resolution, asymptotic results and a PCA analysis, as well as numerical simulations demonstrating the ability of the modeling to produce an atlas exhibiting sharp edges, high contrast and a consistent shape. △ Less

Submitted 3 July, 2019; originally announced July 2019.

arXiv:1903.00288 [pdf, other]

Detecting changes in the covariance structure of functional time series with application to fMRI data

Authors: Christina Stoehr, John A D Aston, Claudia Kirch

Abstract: Functional magnetic resonance imaging (fMRI) data provides information concerning activity in the brain and in particular the interactions between brain regions. Resting state fMRI data is widely used for inferring connectivities in the brain which are not due to external factors. As such analyzes strongly rely on stationarity, change point procedures can be applied in order to detect possible dev… ▽ More Functional magnetic resonance imaging (fMRI) data provides information concerning activity in the brain and in particular the interactions between brain regions. Resting state fMRI data is widely used for inferring connectivities in the brain which are not due to external factors. As such analyzes strongly rely on stationarity, change point procedures can be applied in order to detect possible deviations from this crucial assumption. In this paper, we model fMRI data as functional time series and develop tools for the detection of deviations from covariance stationarity via change point alternatives. We propose a nonparametric procedure which is based on dimension reduction techniques. However, as the projection of the functional time series on a finite and rather low-dimensional subspace involves the risk of missing changes which are orthogonal to the projection space, we also consider two test statistics which take the full functional structure into account. The proposed methods are compared in a simulation study and applied to more than 100 resting state fMRI data sets. △ Less

Submitted 1 March, 2019; originally announced March 2019.

Comments: 39 pages, 11 figures

MSC Class: 62P10

arXiv:1806.03954 [pdf, other]

doi 10.1088/1361-6420/ab8713

Representation and reconstruction of covariance operators in linear inverse problems

Authors: Eardi Lila, Simon Arridge, John A. D. Aston

Abstract: We introduce a framework for the reconstruction and representation of functions in a setting where these objects cannot be directly observed, but only indirect and noisy measurements are available, namely an inverse problem setting. The proposed methodology can be applied either to the analysis of indirectly observed functional images or to the associated covariance operators, representing second-… ▽ More We introduce a framework for the reconstruction and representation of functions in a setting where these objects cannot be directly observed, but only indirect and noisy measurements are available, namely an inverse problem setting. The proposed methodology can be applied either to the analysis of indirectly observed functional images or to the associated covariance operators, representing second-order information, and thus lying on a non-Euclidean space. To deal with the ill-posedness of the inverse problem, we exploit the spatial structure of the sample data by introducing a flexible regularizing term embedded in the model. Thanks to its efficiency, the proposed model is applied to MEG data, leading to a novel approach to the investigation of functional connectivity. △ Less

Submitted 13 September, 2020; v1 submitted 11 June, 2018; originally announced June 2018.

Comments: 40 pages

arXiv:1711.09877 [pdf, other]

doi 10.1038/s41467-019-09230-w

Accurate autocorrelation modeling substantially improves fMRI reliability

Authors: Wiktor Olszowy, John Aston, Catarina Rua, Guy B. Williams

Abstract: Given the recent controversies in some neuroimaging statistical methods, we compare the most frequently used functional Magnetic Resonance Imaging (fMRI) analysis packages: AFNI, FSL and SPM, with regard to temporal autocorrelation modeling. This process, sometimes known as pre-whitening, is conducted in virtually all task fMRI studies. We employ eleven datasets containing 980 scans corresponding… ▽ More Given the recent controversies in some neuroimaging statistical methods, we compare the most frequently used functional Magnetic Resonance Imaging (fMRI) analysis packages: AFNI, FSL and SPM, with regard to temporal autocorrelation modeling. This process, sometimes known as pre-whitening, is conducted in virtually all task fMRI studies. We employ eleven datasets containing 980 scans corresponding to different fMRI protocols and subject populations. Though autocorrelation modeling in AFNI is not perfect, its performance is much higher than the performance of autocorrelation modeling in FSL and SPM. The residual autocorrelated noise in FSL and SPM leads to heavily confounded first level results, particularly for low-frequency experimental designs. Our results show superior performance of SPM's alternative pre-whitening: FAST, over SPM's default. The reliability of task fMRI studies would increase with more accurate autocorrelation modeling. Furthermore, reliability could increase if the packages provided diagnostic plots. This way the investigator would be aware of pre-whitening problems. △ Less

Submitted 6 September, 2018; v1 submitted 27 November, 2017; originally announced November 2017.

Comments: compared to the third version, we investigated: (1) the impact of slice timing correction on pre-whitening and (2) the impact of pre-whitening on group results using the mixed effects model 3dMEMA

Journal ref: Nature Communications, volume 10, Article number: 1220 (2019)

arXiv:1709.00623 [pdf, other]

Estimation of temperature-dependent growth profiles for the assessment of time of hatching in forensic entomology

Authors: D. Pigoli, J. A. D. Aston, F. Ferraty, A. Mazumder, C. Richards, M. J. R. Hall

Abstract: Forensic entomology contributes important information to crime scene investigations. In this paper, we propose a method to estimate the hatching time of larvae (or maggots) based on their lengths, the temperature profile at the crime scene and experimental data on larval development. This requires the estimation of a time-dependent growth curve from experiments where larvae have been exposed to a… ▽ More Forensic entomology contributes important information to crime scene investigations. In this paper, we propose a method to estimate the hatching time of larvae (or maggots) based on their lengths, the temperature profile at the crime scene and experimental data on larval development. This requires the estimation of a time-dependent growth curve from experiments where larvae have been exposed to a relatively small number of constant temperature profiles. Since the temperature influences the developmental speed, a crucial step is the time alignment of the curves at different temperatures. We propose a model for time varying temperature profiles based on the local growth rate estimated from the experimental data. This allows us to estimate the most likely hatching time for a sample of larvae from the crime scene. Asymptotic properties are provided for the estimators of the growth curves and the hatching time. We explore via simulations the robustness of the method to errors in the estimated temperature profile. We also apply the methodology to data from two criminal cases from the United Kingdom. △ Less

Submitted 4 November, 2021; v1 submitted 2 September, 2017; originally announced September 2017.

Comments: 23 pages; 12 figures

arXiv:1707.00453 [pdf, other]

doi 10.1080/01621459.2019.1635479

Statistical Analysis of Functions on Surfaces, with an application to Medical Imaging

Authors: Eardi Lila, John A. D. Aston

Abstract: In Functional Data Analysis, data are commonly assumed to be smooth functions on a fixed interval of the real line. In this work, we introduce a comprehensive framework for the analysis of functional data, whose domain is a two-dimensional manifold and the domain itself is subject to variability from sample to sample. We formulate a statistical model for such data, here called Functions on Surface… ▽ More In Functional Data Analysis, data are commonly assumed to be smooth functions on a fixed interval of the real line. In this work, we introduce a comprehensive framework for the analysis of functional data, whose domain is a two-dimensional manifold and the domain itself is subject to variability from sample to sample. We formulate a statistical model for such data, here called Functions on Surfaces, which enables a joint representation of the geometric and functional aspects, and propose an associated estimation framework. We assess the validity of the framework by performing a simulation study and we finally apply it to the analysis of neuroimaging data of cortical thickness, acquired from the brains of different subjects, and thus lying on domains with different geometries. △ Less

Submitted 1 August, 2019; v1 submitted 3 July, 2017; originally announced July 2017.

Comments: 42 pages

arXiv:1706.05148 [pdf, other]

Hidden Talents of the Variational Autoencoder

Authors: Bin Dai, Yu Wang, John Aston, Gang Hua, David Wipf

Abstract: Variational autoencoders (VAE) represent a popular, flexible form of deep generative model that can be stochastically fit to samples from a given random process using an information-theoretic variational bound on the true underlying distribution. Once so-obtained, the model can be putatively used to generate new samples from this distribution, or to provide a low-dimensional latent representation… ▽ More Variational autoencoders (VAE) represent a popular, flexible form of deep generative model that can be stochastically fit to samples from a given random process using an information-theoretic variational bound on the true underlying distribution. Once so-obtained, the model can be putatively used to generate new samples from this distribution, or to provide a low-dimensional latent representation of existing samples. While quite effective in numerous application domains, certain important mechanisms which govern the behavior of the VAE are obfuscated by the intractable integrals and resulting stochastic approximations involved. Moreover, as a highly non-convex model, it remains unclear exactly how minima of the underlying energy relate to original design purposes. We attempt to better quantify these issues by analyzing a series of tractable special cases of increasing complexity. In doing so, we unveil interesting connections with more traditional dimensionality reduction models, as well as an intrinsic yet underappreciated propensity for robustly dismissing sparse outliers when estimating latent manifolds. With respect to the latter, we demonstrate that the VAE can be viewed as the natural evolution of recent robust PCA models, capable of learning nonlinear manifolds of unknown dimension obscured by gross corruptions. △ Less

Submitted 7 October, 2019; v1 submitted 16 June, 2017; originally announced June 2017.

Journal ref: The Journal of Machine Learning Research, Volume 19 Issue 1, January 2018 Pages 1573-1614

arXiv:1610.10040 [pdf, other]

A Spatial Modeling Approach for Linguistic Object Data: Analysing dialect sound variations across Great Britain

Authors: Shahin Tavakoli, Davide Pigoli, John A. D. Aston, John S. Coleman

Abstract: Dialect variation is of considerable interest in linguistics and other social sciences. However, traditionally it has been studied using proxies (transcriptions) rather than acoustic recordings directly. We introduce novel statistical techniques to analyse geolocalised speech recordings and to explore the spatial variation of pronunciations continuously over the region of interest, as opposed to t… ▽ More Dialect variation is of considerable interest in linguistics and other social sciences. However, traditionally it has been studied using proxies (transcriptions) rather than acoustic recordings directly. We introduce novel statistical techniques to analyse geolocalised speech recordings and to explore the spatial variation of pronunciations continuously over the region of interest, as opposed to traditional isoglosses, which provide a discrete partition of the region. Data of this type require an explicit modeling of the variation in the mean and the covariance. Usual Euclidean metrics are not appropriate, and we therefore introduce the concept of $d$-covariance, which allows consistent estimation both in space and at individual locations. We then propose spatial smoothing for these objects which accounts for the possibly non convex geometry of the domain of interest. We apply the proposed method to data from the spoken part of the British National Corpus, deposited at the British Library, London, and we produce maps of the dialect variation over Great Britain. In addition, the methods allow for acoustic reconstruction across the domain of interest, allowing researchers to listen to the statistical analysis. △ Less

Submitted 28 June, 2018; v1 submitted 31 October, 2016; originally announced October 2016.

Comments: 18 figures

MSC Class: 62G08; 62M30

arXiv:1606.02186 [pdf, other]

Stable and predictive functional domain selection with application to brain images

Authors: Ah Yeon Park, John A. D. Aston, Frederic Ferraty

Abstract: Motivated by increasing trends of relating brain images to a clinical outcome of interest, we propose a functional domain selection (FuDoS) method that effectively selects subregions of the brain associated with the outcome. View each individual's brain as a 3D functional object, the statistical aim is to distinguish the region where a regression coefficient $β(t)=0$ from $β(t)\neq0$, where $t$ de… ▽ More Motivated by increasing trends of relating brain images to a clinical outcome of interest, we propose a functional domain selection (FuDoS) method that effectively selects subregions of the brain associated with the outcome. View each individual's brain as a 3D functional object, the statistical aim is to distinguish the region where a regression coefficient $β(t)=0$ from $β(t)\neq0$, where $t$ denotes spatial location. FuDoS is composed of two stages of estimation. We first segment the brain into several small parts based on the correlation structure. Then, potential subsets are built using the obtained segments and their predictive performance are evaluated to select the best subset, augmented by a stability selection criterion. We conduct extensive simulations both for 1D and 3D functional data, and evaluate its effectiveness in selecting the true subregion. We also investigate predictive ability of the selected stable regions. To find the brain regions related to cognitive ability, FuDoS is applied to the ADNI's PET data. Due to the induced sparseness, the results naturally provide more interpretable information about the relations between the regions and the outcome. Moreover, the selected regions from our analysis show high associations with the expected anatomical brain areas known to have memory-related functions. △ Less

Submitted 7 June, 2016; originally announced June 2016.

arXiv:1604.06310 [pdf, other]

doi 10.1007/s13171-018-0143-9

Inference on covariance operators via concentration inequalities: k-sample tests, classification, and clustering via Rademacher complexities

Authors: Adam B. Kashlak, John A. D. Aston, Richard Nickl

Abstract: We propose a novel approach to the analysis of covariance operators making use of concentration inequalities. First, non-asymptotic confidence sets are constructed for such operators. Then, subsequent applications including a k sample test for equality of covariance, a functional data classifier, and an expectation-maximization style clustering algorithm are derived and tested on both simulated an… ▽ More We propose a novel approach to the analysis of covariance operators making use of concentration inequalities. First, non-asymptotic confidence sets are constructed for such operators. Then, subsequent applications including a k sample test for equality of covariance, a functional data classifier, and an expectation-maximization style clustering algorithm are derived and tested on both simulated and phoneme data. △ Less

Submitted 21 April, 2016; originally announced April 2016.

Comments: 15 pages, 2 figures, 6 tables

MSC Class: 62G05

Journal ref: Sankhya A 81 (2019) 214-243

arXiv:1601.03670 [pdf, other]

doi 10.1214/16-AOAS975

Smooth Principal Component Analysis over two-dimensional manifolds with an application to Neuroimaging

Authors: Eardi Lila, John A. D. Aston, Laura M. Sangalli

Abstract: Motivated by the analysis of high-dimensional neuroimaging signals located over the cortical surface, we introduce a novel Principal Component Analysis technique that can handle functional data located over a two-dimensional manifold. For this purpose a regularization approach is adopted, introducing a smoothing penalty coherent with the geodesic distance over the manifold. The model introduced ca… ▽ More Motivated by the analysis of high-dimensional neuroimaging signals located over the cortical surface, we introduce a novel Principal Component Analysis technique that can handle functional data located over a two-dimensional manifold. For this purpose a regularization approach is adopted, introducing a smoothing penalty coherent with the geodesic distance over the manifold. The model introduced can be applied to any manifold topology, can naturally handle missing data and functional samples evaluated in different grids of points. We approach the discretization task by means of finite element analysis and propose an efficient iterative algorithm for its resolution. We compare the performances of the proposed algorithm with other approaches classically adopted in literature. We finally apply the proposed method to resting state functional magnetic resonance imaging data from the Human Connectome Project, where the method shows substantial differential variations between brain regions that were not apparent with other approaches. △ Less

Submitted 12 September, 2016; v1 submitted 14 January, 2016; originally announced January 2016.

Comments: 33 pages

arXiv:1508.00436 [pdf, other]

The correlation space of Gaussian latent tree models and model selection without fitting

Authors: Nathaniel Shiers, Piotr Zwiernik, John A. D. Aston, Jim Q. Smith

Abstract: We provide a complete description of possible covariance matrices consistent with a Gaussian latent tree model for any tree. We then present techniques for utilising these constraints to assess whether observed data is compatible with that Gaussian latent tree model. Our method does not require us first to fit such a tree. We demonstrate the usefulness of the inverse-Wishart distribution for perfo… ▽ More We provide a complete description of possible covariance matrices consistent with a Gaussian latent tree model for any tree. We then present techniques for utilising these constraints to assess whether observed data is compatible with that Gaussian latent tree model. Our method does not require us first to fit such a tree. We demonstrate the usefulness of the inverse-Wishart distribution for performing preliminary assessments of tree-compatibility using semialgebraic constraints. Using results from Drton et al. (2008) we then provide the appropriate moments required for test statistics for assessing adherence to these equality constraints. These are shown to be effective even for small sample sizes and can be easily adjusted to test either the entire model or only certain macrostructures hypothesized within the tree. We illustrate our exploratory tetrad analysis using a linguistic application and our confirmatory tetrad analysis using a biological application. △ Less

Submitted 11 April, 2016; v1 submitted 3 August, 2015; originally announced August 2015.

Comments: 15 pages

arXiv:1507.07587 [pdf, other]

The statistical analysis of acoustic phonetic data: exploring differences between spoken Romance languages

Authors: Davide Pigoli, Pantelis Z. Hadjipantelis, John S. Coleman, John A. D. Aston

Abstract: The historical and geographical spread from older to more modern languages has long been studied by examining textual changes and in terms of changes in phonetic transcriptions. However, it is more difficult to analyze language change from an acoustic point of view, although this is usually the dominant mode of transmission. We propose a novel analysis approach for acoustic phonetic data, where th… ▽ More The historical and geographical spread from older to more modern languages has long been studied by examining textual changes and in terms of changes in phonetic transcriptions. However, it is more difficult to analyze language change from an acoustic point of view, although this is usually the dominant mode of transmission. We propose a novel analysis approach for acoustic phonetic data, where the aim will be to statistically model the acoustic properties of spoken words. We explore phonetic variation and change using a time-frequency representation, namely the log-spectrograms of speech recordings. We identify time and frequency covariance functions as a feature of the language; in contrast, mean spectrograms depend mostly on the particular word that has been uttered. We build models for the mean and covariances (taking into account the restrictions placed on the statistical analysis of such objects) and use these to define a phonetic transformation that models how an individual speaker would sound in a different language, allowing the exploration of phonetic differences between languages. Finally, we map back these transformations to the domain of sound recordings, allowing us to listen to the output of the statistical analysis. The proposed approach is demonstrated using recordings of the words corresponding to the numbers from "one" to "ten" as pronounced by speakers from five different Romance languages. △ Less

Submitted 18 May, 2017; v1 submitted 27 July, 2015; originally announced July 2015.

MSC Class: 62P99

arXiv:1505.02023 [pdf, other]

doi 10.1214/16-AOS1495

Tests for separability in nonparametric covariance operators of random surfaces

Authors: John A. D. Aston, Davide Pigoli, Shahin Tavakoli

Abstract: The assumption of separability of the covariance operator for a random image or hypersurface can be of substantial use in applications, especially in situations where the accurate estimation of the full covariance structure is unfeasible, either for computational reasons, or due to a small sample size. However, inferential tools to verify this assumption are somewhat lacking in high-dimensional or… ▽ More The assumption of separability of the covariance operator for a random image or hypersurface can be of substantial use in applications, especially in situations where the accurate estimation of the full covariance structure is unfeasible, either for computational reasons, or due to a small sample size. However, inferential tools to verify this assumption are somewhat lacking in high-dimensional or functional {data analysis} settings, where this assumption is most relevant. We propose here to test separability by focusing on $K$-dimensional projections of the difference between the covariance operator and a nonparametric separable approximation. The subspace we project onto is one generated by the eigenfunctions of the covariance operator estimated under the separability hypothesis, negating the need to ever estimate the full non-separable covariance. We show that the rescaled difference of the sample covariance operator with its separable approximation is asymptotically Gaussian. As a by-product of this result, we derive asymptotically pivotal tests under Gaussian assumptions, and propose bootstrap methods for approximating the distribution of the test statistics. We probe the finite sample performance through simulations studies, and present an application to log-spectrogram images from a phonetic linguistics dataset. △ Less

Submitted 3 June, 2016; v1 submitted 8 May, 2015; originally announced May 2015.

Comments: 47 pages, 10 figures, 4 tables

MSC Class: 62G10; 62G20

Journal ref: Annals of Statistics, Vol. 45, No. 4, 1431-1461 (2017)

arXiv:1411.2051 [pdf, ps, other]

A functional approach to deconvolve dynamic neuroimaging data

Authors: Ci-Ren Jiang, John A D Aston, Jane-Ling Wang

Abstract: Positron Emission Tomography (PET) is an imaging technique which can be used to investigate chemical changes in human biological processes such as cancer development or neurochemical reactions. Most dynamic PET scans are currently analyzed based on the assumption that linear first order kinetics can be used to adequately describe the system under observation. However, there has recently been stron… ▽ More Positron Emission Tomography (PET) is an imaging technique which can be used to investigate chemical changes in human biological processes such as cancer development or neurochemical reactions. Most dynamic PET scans are currently analyzed based on the assumption that linear first order kinetics can be used to adequately describe the system under observation. However, there has recently been strong evidence that this is not the case. In order to provide an analysis of PET data which is free from this compartmental assumption, we propose a nonparametric deconvolution and analysis model for dynamic PET data based on functional principal component analysis. This yields flexibility in the possible deconvolved functions while still performing well when a linear compartmental model setup is the true data generating mechanism. As the deconvolution needs to be performed on only a relative small number of basis functions rather than voxel by voxel in the entire 3-D volume, the methodology is both robust to typical brain imaging noise levels while also being computationally efficient. The new methodology is investigated through simulations in both 1-D functions and 2-D images and also applied to a neuroimaging study whose goal is the quantification of opioid receptor concentration in the brain. △ Less

Submitted 7 November, 2014; originally announced November 2014.

Comments: 33 pages, 10 figures

arXiv:1410.7148 [pdf, other]

An Introduction to Applications of Wavelet Benchmarking with Seasonal Adjustment

Authors: Homesh Sayal, John A. D. Aston, Duncan Elliott, Hernando Ombao

Abstract: Prior to adjustment, accounting conditions between national accounts data sets are frequently violated. Benchmarking is the procedure used by economic agencies to make such data sets consistent. It typically involves adjusting a high frequency time series (e.g. quarterly data) so it becomes consistent with a lower frequency version (e.g. annual data). Various methods have been developed to approac… ▽ More Prior to adjustment, accounting conditions between national accounts data sets are frequently violated. Benchmarking is the procedure used by economic agencies to make such data sets consistent. It typically involves adjusting a high frequency time series (e.g. quarterly data) so it becomes consistent with a lower frequency version (e.g. annual data). Various methods have been developed to approach this problem of inconsistency between data sets. This paper introduces a new statistical procedure; namely wavelet benchmarking. Wavelet properties allow high and low frequency processes to be jointly analysed and we show that benchmarking can be formulated and approached succinctly in the wavelet domain. Furthermore the time and frequency localisation properties of wavelets are ideal for handling more complicated benchmarking problems. The versatility of the procedure is demonstrated using simulation studies where we provide evidence showing it substantially outperforms currently used methods. Finally, we apply this novel method of wavelet benchmarking to official Office of National Statistics (ONS) data. △ Less

Submitted 27 October, 2014; originally announced October 2014.

Comments: 33 pages, 6 figures

arXiv:1410.0813 [pdf, other]

Gaussian Tree Constraints Applied to Acoustic Linguistic Functional Data

Authors: Nathaniel Shiers, John A. D. Aston, Jim Q. Smith, John S. Coleman

Abstract: Evolutionary models of languages are usually considered to take the form of trees. With the development of so-called tree constraints the plausibility of the tree model assumptions can be addressed by checking whether the moments of observed variables lie within regions consistent with trees. In our linguistic application, the data set comprises acoustic samples (audio recordings) from speakers of… ▽ More Evolutionary models of languages are usually considered to take the form of trees. With the development of so-called tree constraints the plausibility of the tree model assumptions can be addressed by checking whether the moments of observed variables lie within regions consistent with trees. In our linguistic application, the data set comprises acoustic samples (audio recordings) from speakers of five Romance languages or dialects. We wish to assess these functional data for compatibility with a hereditary tree model at the language level. A novel combination of canonical function analysis (CFA) with a separable covariance structure provides a method for generating a representative basis for the data. This resulting basis is formed of components which emphasize language differences whilst maintaining the integrity of the observational language-grou**s. A previously unexploited Gaussian tree constraint is then applied to component-by-component projections of the data to investigate adherence to an evolutionary tree. The results indicate that while a tree model is unlikely to be suitable for modeling all aspects of the acoustic linguistic data, certain features of the spoken Romance languages highlighted by the separable-CFA basis may indeed be suitably modeled as a tree. △ Less

Submitted 3 October, 2014; originally announced October 2014.

Comments: 48 pages

arXiv:1409.1771 [pdf, other]

Efficiency of change point tests in high dimensional settings

Authors: John A. D. Aston, Claudia Kirch

Abstract: While there is considerable work on change point analysis in univariate time series, more and more data being collected comes from high dimensional multivariate settings. This paper introduces the asymptotic concept of high dimensional efficiency which quantifies the detection power of different statistics in such situations. While being related to classic asymptotic relative efficiency, it is dif… ▽ More While there is considerable work on change point analysis in univariate time series, more and more data being collected comes from high dimensional multivariate settings. This paper introduces the asymptotic concept of high dimensional efficiency which quantifies the detection power of different statistics in such situations. While being related to classic asymptotic relative efficiency, it is different in that it provides the rate at which the change can get smaller with dimension while still being detectable. This also allows for comparisons of different methods with different null asymptotics as is for example the case in high-dimensional change point settings. Based on this new concept we investigate change point detection procedures using projections and develop asymptotic theory for how full panel (multivariate) tests compare with both oracle and random projections. Furthermore, for each given projection we can quantify a cone such that the corresponding projection statistic yields better power behavior if the true change direction is within this cone. The effect of misspecification of the covariance on the power of the tests is investigated, because in many high dimensional situations estimation of the full dependency (covariance) between the multivariate observations in the panel is often either computationally or even theoretically infeasible. It turns out that the projection statistic is much more robust in this respect in terms of size and somewhat more robust in terms of power. The theoretic quantification by the theory is accompanied by simulation results which confirm the theoretic (asymptotic) findings for surprisingly small samples. This shows in particular that the concept of high dimensional efficiency is indeed suitable to describe small sample power, and this is demonstrated in a multivariate example of market index data. △ Less

Submitted 25 June, 2016; v1 submitted 5 September, 2014; originally announced September 2014.

Comments: 37 pages, 6 figures

MSC Class: 62M10

arXiv:1406.4993 [pdf, other]

doi 10.1080/10618600.2016.1237363

Divide-and-Conquer with Sequential Monte Carlo

Authors: Fredrik Lindsten, Adam M. Johansen, Christian A. Naesseth, Bonnie Kirkpatrick, Thomas B. Schön, John Aston, Alexandre Bouchard-Côté

Abstract: We propose a novel class of Sequential Monte Carlo (SMC) algorithms, appropriate for inference in probabilistic graphical models. This class of algorithms adopts a divide-and-conquer approach based upon an auxiliary tree-structured decomposition of the model of interest, turning the overall inferential task into a collection of recursively solved sub-problems. The proposed method is applicable to… ▽ More We propose a novel class of Sequential Monte Carlo (SMC) algorithms, appropriate for inference in probabilistic graphical models. This class of algorithms adopts a divide-and-conquer approach based upon an auxiliary tree-structured decomposition of the model of interest, turning the overall inferential task into a collection of recursively solved sub-problems. The proposed method is applicable to a broad class of probabilistic graphical models, including models with loops. Unlike a standard SMC sampler, the proposed Divide-and-Conquer SMC employs multiple independent populations of weighted particles, which are resampled, merged, and propagated as the method progresses. We illustrate empirically that this approach can outperform standard methods in terms of the accuracy of the posterior expectation and marginal likelihood approximations. Divide-and-Conquer SMC also opens up novel parallel implementation options and the possibility of concentrating the computational effort on the most challenging sub-problems. We demonstrate its performance on a Markov random field and on a hierarchical logistic regression problem. △ Less

Submitted 30 June, 2015; v1 submitted 19 June, 2014; originally announced June 2014.

Journal ref: Journal of Computational and Graphical Statistics, 26(2):445-458, 2017

arXiv:1308.0868 [pdf, other]

Unifying Amplitude and Phase Analysis: A Compositional Data Approach to Functional Multivariate Mixed-Effects Modeling of Mandarin Chinese

Authors: Pantelis Z. Hadjipantelis, John A. D. Aston, Hans-Georg Müller, Jonathan P. Evans

Abstract: Mandarin Chinese is characterized by being a tonal language; the pitch (or $F_0$) of its utterances carries considerable linguistic information. However, speech samples from different individuals are subject to changes in amplitude and phase which must be accounted for in any analysis which attempts to provide a linguistically meaningful description of the language. A joint model for amplitude, ph… ▽ More Mandarin Chinese is characterized by being a tonal language; the pitch (or $F_0$) of its utterances carries considerable linguistic information. However, speech samples from different individuals are subject to changes in amplitude and phase which must be accounted for in any analysis which attempts to provide a linguistically meaningful description of the language. A joint model for amplitude, phase and duration is presented which combines elements from Functional Data Analysis, Compositional Data Analysis and Linear Mixed Effects Models. By decomposing functions via a functional principal component analysis, and connecting registration functions to compositional data analysis, a joint multivariate mixed effect model can be formulated which gives insights into the relationship between the different modes of variation as well as their dependence on linguistic and non-linguistic covariates. The model is applied to the COSPRO-1 data set, a comprehensive database of spoken Taiwanese Mandarin, containing approximately 50 thousand phonetically diverse sample $F_0$ contours (syllables), and reveals that phonetic information is jointly carried by both amplitude and phase variation. △ Less

Submitted 28 December, 2014; v1 submitted 4 August, 2013; originally announced August 2013.

Comments: 49 pages, 13 figures, small changes to discussion

arXiv:1303.3123 [pdf, other]

Towards Automatic Model Comparison: An Adaptive Sequential Monte Carlo Approach

Authors: Yan Zhou, Adam M Johansen, John A D Aston

Abstract: Model comparison for the purposes of selection, averaging and validation is a problem found throughout statistics. Within the Bayesian paradigm, these problems all require the calculation of the posterior probabilities of models within a particular class. Substantial progress has been made in recent years, but difficulties remain in the implementation of existing schemes. This paper presents adapt… ▽ More Model comparison for the purposes of selection, averaging and validation is a problem found throughout statistics. Within the Bayesian paradigm, these problems all require the calculation of the posterior probabilities of models within a particular class. Substantial progress has been made in recent years, but difficulties remain in the implementation of existing schemes. This paper presents adaptive sequential Monte Carlo (\smc) sampling strategies to characterise the posterior distribution of a collection of models, as well as the parameters of those models. Both a simple product estimator and a combination of \smc and a path sampling estimator are considered and existing theoretical results are extended to include the path sampling variant. A novel approach to the automatic specification of distributions within \smc algorithms is presented and shown to outperform the state of the art in this area. The performance of the proposed strategies is demonstrated via an extensive empirical study. Comparisons with state of the art algorithms show that the proposed algorithms are always competitive, and often substantially superior to alternative techniques, at equal computational cost and considerably less application-specific implementation effort. △ Less

Submitted 5 June, 2015; v1 submitted 13 March, 2013; originally announced March 2013.

Comments: 31 pages; 2 figures

arXiv:1301.2894 [pdf, ps, other]

doi 10.1214/12-AOAS565

Evaluating stationarity via change-point alternatives with applications to fMRI data

Authors: John A. D. Aston, Claudia Kirch

Abstract: Functional magnetic resonance imaging (fMRI) is now a well-established technique for studying the brain. However, in many situations, such as when data are acquired in a resting state, it is difficult to know whether the data are truly stationary or if level shifts have occurred. To this end, change-point detection in sequences of functional data is examined where the functional observations are d… ▽ More Functional magnetic resonance imaging (fMRI) is now a well-established technique for studying the brain. However, in many situations, such as when data are acquired in a resting state, it is difficult to know whether the data are truly stationary or if level shifts have occurred. To this end, change-point detection in sequences of functional data is examined where the functional observations are dependent and where the distributions of change-points from multiple subjects are required. Of particular interest is the case where the change-point is an epidemic change---a change occurs and then the observations return to baseline at a later time. The case where the covariance can be decomposed as a tensor product is considered with particular attention to the power analysis for detection. This is of interest in the application to fMRI, where the estimation of a full covariance structure for the three-dimensional image is not computationally feasible. Using the developed methods, a large study of resting state fMRI data is conducted to determine whether the subjects undertaking the resting scan have nonstationarities present in their time courses. It is found that a sizeable proportion of the subjects studied are not stationary. The change-point distribution for those subjects is empirically determined, as well as its theoretical properties examined. △ Less

Submitted 14 January, 2013; originally announced January 2013.

Comments: Published in at http://dx.doi.org/10.1214/12-AOAS565 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOAS-AOAS565

Journal ref: Annals of Applied Statistics 2012, Vol. 6, No. 4, 1906-1948

arXiv:1209.0514 [pdf, other]

doi 10.1007/s00285-016-0995-3

Monotonicity of Fitness Landscapes and Mutation Rate Control

Authors: Roman V. Belavkin, Alastair Channon, Elizabeth Aston, John Aston, Rok Krasovec, Christopher G. Knight

Abstract: A common view in evolutionary biology is that mutation rates are minimised. However, studies in combinatorial optimisation and search have shown a clear advantage of using variable mutation rates as a control parameter to optimise the performance of evolutionary algorithms. Much biological theory in this area is based on Ronald Fisher's work, who used Euclidean geometry to study the relation betwe… ▽ More A common view in evolutionary biology is that mutation rates are minimised. However, studies in combinatorial optimisation and search have shown a clear advantage of using variable mutation rates as a control parameter to optimise the performance of evolutionary algorithms. Much biological theory in this area is based on Ronald Fisher's work, who used Euclidean geometry to study the relation between mutation size and expected fitness of the offspring in infinite phenotypic spaces. Here we reconsider this theory based on the alternative geometry of discrete and finite spaces of DNA sequences. First, we consider the geometric case of fitness being isomorphic to distance from an optimum, and show how problems of optimal mutation rate control can be solved exactly or approximately depending on additional constraints of the problem. Then we consider the general case of fitness communicating only partial information about the distance. We define weak monotonicity of fitness landscapes and prove that this property holds in all landscapes that are continuous and open at the optimum. This theoretical result motivates our hypothesis that optimal mutation rate functions in such landscapes will increase when fitness decreases in some neighbourhood of an optimum, resembling the control functions derived in the geometric case. We test this hypothesis experimentally by analysing approximately optimal mutation rate control functions in 115 complete landscapes of binding scores between DNA sequences and transcription factors. Our findings support the hypothesis and find that the increase of mutation rate is more rapid in landscapes that are less monotonic (more rugged). We discuss the relevance of these findings to living organisms. △ Less

Submitted 24 August, 2019; v1 submitted 3 September, 2012; originally announced September 2012.

MSC Class: 05B25 26A48 68W20 68T05 92B20 93E35 93B27

Journal ref: J. Math. Biol. (2016) 73: 1491

arXiv:1205.6310 [pdf, ps, other]

doi 10.1214/12-AOAS611

Dynamic filtering of static dipoles in magnetoencephalography

Authors: Alberto Sorrentino, Adam M. Johansen, John A. D. Aston, Thomas E. Nichols, Wilfrid S. Kendall

Abstract: We consider the problem of estimating neural activity from measurements of the magnetic fields recorded by magnetoencephalography. We exploit the temporal structure of the problem and model the neural current as a collection of evolving current dipoles, which appear and disappear, but whose locations are constant throughout their lifetime. This fully reflects the physiological interpretation of th… ▽ More We consider the problem of estimating neural activity from measurements of the magnetic fields recorded by magnetoencephalography. We exploit the temporal structure of the problem and model the neural current as a collection of evolving current dipoles, which appear and disappear, but whose locations are constant throughout their lifetime. This fully reflects the physiological interpretation of the model. In order to conduct inference under this proposed model, it was necessary to develop an algorithm based around state-of-the-art sequential Monte Carlo methods employing carefully designed importance distributions. Previous work employed a bootstrap filter and an artificial dynamic structure where dipoles performed a random walk in space, yielding nonphysical artefacts in the reconstructions; such artefacts are not observed when using the proposed model. The algorithm is validated with simulated data, in which it provided an average localisation error which is approximately half that of the bootstrap filter. An application to complex real data derived from a somatosensory experiment is presented. Assessment of model fit via marginal likelihood showed a clear preference for the proposed model and the associated reconstructions show better localisation. △ Less

Submitted 6 December, 2013; v1 submitted 29 May, 2012; originally announced May 2012.

Comments: Published in at http://dx.doi.org/10.1214/12-AOAS611 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOAS-AOAS611

Journal ref: Annals of Applied Statistics 2013, Vol. 7, No. 2, 955-988

arXiv:1204.5953 [pdf, ps, other]

doi 10.1209/0295-5075/97/52001

Is Radioactive Decay Really Exponential?

Authors: Philip J. Aston

Abstract: Radioactive decay of an unstable isotope is widely believed to be exponential. This view is supported by experiments on rapidly decaying isotopes but is more difficult to verify for slowly decaying isotopes. The decay of 14C can be calibrated over a period of 12,550 years by comparing radiocarbon dates with dates obtained from dendrochronology. It is well known that this approach shows that radioc… ▽ More Radioactive decay of an unstable isotope is widely believed to be exponential. This view is supported by experiments on rapidly decaying isotopes but is more difficult to verify for slowly decaying isotopes. The decay of 14C can be calibrated over a period of 12,550 years by comparing radiocarbon dates with dates obtained from dendrochronology. It is well known that this approach shows that radiocarbon dates of over 3,000 years are in error, which is generally attributed to past variation in atmospheric levels of 14C. We note that predicted atmospheric variation (assuming exponential decay) does not agree with results from modelling, and that theoretical quantum mechanics does not predict exact exponential decay. We give mathematical arguments that non-exponential decay should be expected for slowly decaying isotopes and explore the consequences of non-exponential decay. We propose an experimental test of this prediction of non-exponential decay for 14C. If confirmed, a foundation stone of current dating methods will have been removed, requiring a radical reappraisal both of radioisotope dating methods and of currently predicted dates obtained using these methods. △ Less

Submitted 26 April, 2012; originally announced April 2012.

Journal ref: EPL, 97, 52001, 2012

arXiv:1111.5947 [pdf, other]

Computing the Invariant Measure and the Lyapunov Exponent for One-Dimensional Maps using a Measure-Preserving Polynomial Basis

Authors: Philip J. Aston, Oliver Junge

Abstract: We consider a generalisation of Ulam's method for approximating invariant densities of one-dimensional chaotic maps. Rather than use piecewise constant polynomials to approximate the density, we use polynomials of degree n which are defined by the requirement that they preserve the measure on n+1 neighbouring subintervals. Over the whole interval, this results in a discontinuous piecewise polynomi… ▽ More We consider a generalisation of Ulam's method for approximating invariant densities of one-dimensional chaotic maps. Rather than use piecewise constant polynomials to approximate the density, we use polynomials of degree n which are defined by the requirement that they preserve the measure on n+1 neighbouring subintervals. Over the whole interval, this results in a discontinuous piecewise polynomial approximation to the density. We prove error results where this approach is used to approximate smooth densities. We also consider the computation of the Lyapunov exponent using the polynomial density and show that the order of convergence is one order better than for the density itself. Together with using cubic polynomials in the density approximation, this yields a very efficient method for computing highly accurate estimates of the Lyapunov exponent. We illustrate the theoretical findings with some examples. △ Less

Submitted 25 November, 2011; originally announced November 2011.

MSC Class: 37M25; 65P20

arXiv:1106.4317 [pdf, ps, other]

doi 10.1371/journal.pcbi.1002401

A self-organizing state-space-model approach for parameter estimation in Hodgkin-Huxley-type models of single neurons

Authors: Dimitrios V. Vavoulis, Volko A. Straub, John A. D. Aston, Jianfeng Feng

Abstract: Traditionally, parameter estimation in biophysical neuron and neural network models usually adopts a global search algorithm, often combined with a local search method in order to minimize the value of a cost function, which measures the discrepancy between various features of the available experimental data and model output. In this study, we approach the problem of parameter estimation in conduc… ▽ More Traditionally, parameter estimation in biophysical neuron and neural network models usually adopts a global search algorithm, often combined with a local search method in order to minimize the value of a cost function, which measures the discrepancy between various features of the available experimental data and model output. In this study, we approach the problem of parameter estimation in conductance-based models of single neurons from a different perspective. By adopting a hidden-dynamical-systems formalism, we expressed parameter estimation as an inference problem in these systems, which can then be tackled using well-established statistical inference methods. The particular method we used was Kitagawa's self-organizing state-space model, which was applied on a number of Hodgkin-Huxley models using simulated or actual electrophysiological data. We showed that the algorithm can be used to estimate a large number of parameters, including maximal conductances, reversal potentials, kinetics of ionic currents and measurement noise, based on low-dimensional experimental data and sufficiently informative priors in the form of pre-defined constraints imposed on model parameters. The algorithm remained operational even when very noisy experimental data were used. Importantly, by combining the self-organizing state-space model with an adaptive sampling algorithm akin to the Covariance Matrix Adaptation Evolution Strategy we achieved a significant reduction in the variance of parameter estimates. The algorithm did not require the explicit formulation of a cost function and it was straightforward to apply on compartmental models and multiple data sets. Overall, the proposed methodology is particularly suitable for resolving high-dimensional inference problems based on noisy electrophysiological data and, therefore, a potentially useful tool in the construction of biophysical neuron models. △ Less

Submitted 29 October, 2011; v1 submitted 21 June, 2011; originally announced June 2011.

Journal ref: Vavoulis DV, Straub VA, Aston JAD, Feng J (2012) A Self-Organizing State-Space-Model Approach for Parameter Estimation in Hodgkin-Huxley-Type Models of Single Neurons. PLoS Comput Biol 8(3): e1002401

arXiv:0706.3985 [pdf, ps, other]

doi 10.1214/07-AOAS125

Distributions associated with general runs and patterns in hidden Markov models

Authors: John A. D. Aston, Donald E. K. Martin

Abstract: This paper gives a method for computing distributions associated with patterns in the state sequence of a hidden Markov model, conditional on observing all or part of the observation sequence. Probabilities are computed for very general classes of patterns (competing patterns and generalized later patterns), and thus, the theory includes as special cases results for a large class of problems tha… ▽ More This paper gives a method for computing distributions associated with patterns in the state sequence of a hidden Markov model, conditional on observing all or part of the observation sequence. Probabilities are computed for very general classes of patterns (competing patterns and generalized later patterns), and thus, the theory includes as special cases results for a large class of problems that have wide application. The unobserved state sequence is assumed to be Markovian with a general order of dependence. An auxiliary Markov chain is associated with the state sequence and is used to simplify the computations. Two examples are given to illustrate the use of the methodology. Whereas the first application is more to illustrate the basic steps in applying the theory, the second is a more detailed application to DNA sequences, and shows that the methods can be adapted to include restrictions related to biological knowledge. △ Less

Submitted 13 December, 2007; v1 submitted 27 June, 2007; originally announced June 2007.

Comments: Published in at http://dx.doi.org/10.1214/07-AOAS125 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-AOAS-AOAS125

Journal ref: Annals of Applied Statistics 2007, Vol. 1, No. 2, 585-611

arXiv:0706.3443 [pdf, other]

The SSM Toolbox for Matlab

Authors: Jyh-Ying Peng, John A. D. Aston

Abstract: State Space Models (SSM) is a MATLAB 7.0 software toolbox for doing time series analysis by state space methods. The software features fully interactive construction and combination of models, with support for univariate and multivariate models, complex time-varying (dynamic) models, non-Gaussian models, and various standard models such as ARIMA and structural time-series models. The software in… ▽ More State Space Models (SSM) is a MATLAB 7.0 software toolbox for doing time series analysis by state space methods. The software features fully interactive construction and combination of models, with support for univariate and multivariate models, complex time-varying (dynamic) models, non-Gaussian models, and various standard models such as ARIMA and structural time-series models. The software includes standard functions for Kalman filtering and smoothing, simulation smoothing, likelihood evaluation, parameter estimation, signal extraction and forecasting, with incorporation of exact initialization for filters and smoothers, and support for missing observations and multiple time series input with common analysis structure. The software also includes implementations of TRAMO model selection and Hillmer-Tiao decomposition for ARIMA models. The software will provide a general toolbox for doing time series analysis on the MATLAB platform, allowing users to take advantage of its readily available graph plotting and general matrix computation capabilities. △ Less

Submitted 23 June, 2007; originally announced June 2007.

Comments: Software available from authors

Report number: C-2007-02

arXiv:math/0702844 [pdf, ps, other]

doi 10.1214/074921706000001003

Modeling macroeconomic time series via heavy tailed distributions

Authors: J. A. D. Aston

Abstract: It has been shown that some macroeconomic time series, especially those where outliers could be present, can be well modelled using heavy tailed distributions for the noise components. Methods for deciding when and where heavy-tailed models should be preferred are investigated. These investigations primarily focus on automatic methods for model identification and selection. Current methods are e… ▽ More It has been shown that some macroeconomic time series, especially those where outliers could be present, can be well modelled using heavy tailed distributions for the noise components. Methods for deciding when and where heavy-tailed models should be preferred are investigated. These investigations primarily focus on automatic methods for model identification and selection. Current methods are extended to incorporate a non-Gaussian selection element, and various different criteria for deciding on which overall model should be used are examined. △ Less

Submitted 27 February, 2007; originally announced February 2007.

Comments: Published at http://dx.doi.org/10.1214/074921706000001003 in the IMS Lecture Notes Monograph Series (http://www.imstat.org/publications/lecnotes.htm) by the Institute of Mathematical Statistics (http://www.imstat.org)

Report number: IMS-LNMS52-LNMS5209 MSC Class: 91B82 (Primary) 62M10 (Secondary)

Journal ref: IMS Lecture Notes Monograph Series 2006, Vol. 52, 138-148

Showing 1–47 of 47 results for author: Aston, J