-
TrendLSW: Trend and Spectral Estimation of Nonstationary Time Series in R
Authors:
Euan T. McGonigle,
Rebecca Killick,
Matthew A. Nunes
Abstract:
The TrendLSW R package has been developed to provide users with a suite of wavelet-based techniques to analyse the statistical properties of nonstationary time series. The key components of the package are (a) two approaches for the estimation of the evolutionary wavelet spectrum in the presence of trend; and (b) wavelet-based trend estimation in the presence of locally stationary wavelet errors v…
▽ More
The TrendLSW R package has been developed to provide users with a suite of wavelet-based techniques to analyse the statistical properties of nonstationary time series. The key components of the package are (a) two approaches for the estimation of the evolutionary wavelet spectrum in the presence of trend; and (b) wavelet-based trend estimation in the presence of locally stationary wavelet errors via both linear and nonlinear wavelet thresholding; and (c) the calculation of associated pointwise confidence intervals. Lastly, the package directly implements boundary handling methods that enable the methods to be performed on data of arbitrary length, not just dyadic length as is common for wavelet-based methods, ensuring no pre-processing of data is necessary. The key functionality of the package is demonstrated through two data examples, arising from biology and activity monitoring.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
Automatic Locally Stationary Time Series Forecasting with application to predicting U.K. Gross Value Added Time Series under sudden shocks caused by the COVID pandemic
Authors:
Rebecca Killick,
Marina I. Knight,
Guy P. Nason,
Matthew A. Nunes,
Idris A. Eckley
Abstract:
Accurate forecasting of the U.K. gross value added (GVA) is fundamental for measuring the growth of the U.K. economy. A common nonstationarity in GVA data, such as the ABML series, is its increase in variance over time due to inflation. Transformed or inflation-adjusted series can still be challenging for classical stationarity-assuming forecasters. We adopt a different approach that works directl…
▽ More
Accurate forecasting of the U.K. gross value added (GVA) is fundamental for measuring the growth of the U.K. economy. A common nonstationarity in GVA data, such as the ABML series, is its increase in variance over time due to inflation. Transformed or inflation-adjusted series can still be challenging for classical stationarity-assuming forecasters. We adopt a different approach that works directly with the GVA series by advancing recent forecasting methods for locally stationary time series. Our approach results in more accurate and reliable forecasts, and continues to work well even when the ABML series becomes highly variable during the COVID pandemic.
△ Less
Submitted 14 March, 2023;
originally announced March 2023.
-
Data Centred Intelligent Geosciences: Research Agenda and Opportunities, Position Paper
Authors:
Aderson Farias do Nascimento,
Martin A. Musicante,
Umberto Souza da Costa,
Bruno M. Carvalho,
Marcus Alexandre Nunes,
Genoveva Vargas-Solar
Abstract:
This paper describes and discusses our vision to develop and reason about best practices and novel ways of curating data-centric geosciences knowledge (data, experiments, models, methods, conclusions, and interpretations). This knowledge is produced from applying statistical modelling, Machine Learning, and modern data analytics methods on geo-data collections. The problems address open methodolog…
▽ More
This paper describes and discusses our vision to develop and reason about best practices and novel ways of curating data-centric geosciences knowledge (data, experiments, models, methods, conclusions, and interpretations). This knowledge is produced from applying statistical modelling, Machine Learning, and modern data analytics methods on geo-data collections. The problems address open methodological questions in model building, models' assessment, prediction, and forecasting workflows.
△ Less
Submitted 20 August, 2022;
originally announced September 2022.
-
Modelling Time-Varying First and Second-Order Structure of Time Series via Wavelets and Differencing
Authors:
Euan T. McGonigle,
Rebecca Killick,
Matthew A. Nunes
Abstract:
Most time series observed in practice exhibit time-varying trend (first-order) and autocovariance (second-order) behaviour. Differencing is a commonly-used technique to remove the trend in such series, in order to estimate the time-varying second-order structure (of the differenced series). However, often we require inference on the second-order behaviour of the original series, for example, when…
▽ More
Most time series observed in practice exhibit time-varying trend (first-order) and autocovariance (second-order) behaviour. Differencing is a commonly-used technique to remove the trend in such series, in order to estimate the time-varying second-order structure (of the differenced series). However, often we require inference on the second-order behaviour of the original series, for example, when performing trend estimation. In this article, we propose a method, using differencing, to jointly estimate the time-varying trend and second-order structure of a nonstationary time series, within the locally stationary wavelet modelling framework. We develop a wavelet-based estimator of the second-order structure of the original time series based on the differenced estimate, and show how this can be incorporated into the estimation of the trend of the time series. We perform a simulation study to investigate the performance of the methodology, and demonstrate the utility of the method by analysing data examples from environmental and biomedical science.
△ Less
Submitted 6 April, 2022; v1 submitted 17 August, 2021;
originally announced August 2021.
-
A Music Classification Model based on Metric Learning and Feature Extraction from MP3 Audio Files
Authors:
Angelo C. Mendes da Silva,
Mauricio A. Nunes,
Raul Fonseca Neto
Abstract:
The development of models for learning music similarity and feature extraction from audio media files is an increasingly important task for the entertainment industry. This work proposes a novel music classification model based on metric learning and feature extraction from MP3 audio files. The metric learning process considers the learning of a set of parameterized distances employing a structure…
▽ More
The development of models for learning music similarity and feature extraction from audio media files is an increasingly important task for the entertainment industry. This work proposes a novel music classification model based on metric learning and feature extraction from MP3 audio files. The metric learning process considers the learning of a set of parameterized distances employing a structured prediction approach from a set of MP3 audio files containing several music genres. The main objective of this work is to make possible learning a personalized metric for each customer. To extract the acoustic information we use the Mel-Frequency Cepstral Coefficient (MFCC) and make a dimensionality reduction with the use of Principal Components Analysis. We attest the model validity performing a set of experiments and comparing the training and testing results with baseline algorithms, such as K-means and Soft Margin Linear Support Vector Machine (SVM). Experiments show promising results and encourage the future development of an online version of the learning model.
△ Less
Submitted 17 September, 2019; v1 submitted 29 May, 2019;
originally announced May 2019.
-
Dynamic detection of anomalous regions within distributed acoustic sensing data streams using locally stationary wavelet time series
Authors:
Rebecca E. Wilson,
Idris A. Eckley,
Matthew A. Nunes,
Timothy Park
Abstract:
Distributed acoustic sensing technology is increasingly being used to support production and well management within the oil and gas sector, for example to improve flow monitoring and production profiling. This sensing technology is capable of recording substantial data volumes at multiple depths within an oil well, giving unprecedented insights into production behaviour. However the technology is…
▽ More
Distributed acoustic sensing technology is increasingly being used to support production and well management within the oil and gas sector, for example to improve flow monitoring and production profiling. This sensing technology is capable of recording substantial data volumes at multiple depths within an oil well, giving unprecedented insights into production behaviour. However the technology is also prone to recording periods of anomalous behaviour, where the same physical features are concurrently observed at multiple depths. Such features are called `stripes' and are undesirable, detrimentally affecting well performance modelling. This paper focuses on the important challenge of develo** a principled approach to identifying such anomalous periods within distributed acoustic signals. We extend recent work on classifying locally stationary wavelet time series to an online setting and, in so doing, introduce a computationally-efficient online procedure capable of accurately identifying anomalous regions within multivariate time series.
△ Less
Submitted 25 September, 2018;
originally announced September 2018.
-
Modelling, Detrending and Decorrelation of Network Time Series
Authors:
M. I. Knight,
M. A. Nunes,
G. P. Nason
Abstract:
A network time series is a multivariate time series augmented by a graph that describes how variables (or nodes) are connected. We introduce the network autoregressive (integrated) moving average (NARIMA) processes: a set of flexible models for network time series. For fixed networks the NARIMA models are essentially equivalent to vector autoregressive moving average-type models. However, NARIMA m…
▽ More
A network time series is a multivariate time series augmented by a graph that describes how variables (or nodes) are connected. We introduce the network autoregressive (integrated) moving average (NARIMA) processes: a set of flexible models for network time series. For fixed networks the NARIMA models are essentially equivalent to vector autoregressive moving average-type models. However, NARIMA models are especially useful when the structure of the graph, associated with the multivariate time series, changes over time. Such network topology changes are invisible to standard VARMA-like models. For integrated NARIMA models we introduce network differencing, based on the network lifting (wavelet) transform, which removes trend. We exhibit our techniques on a network time series describing the evolution of mumps throughout counties of England and Wales weekly during 2005. We further demonstrate the action of network lifting on a simple bivariate VAR(1) model with associated two-node graph. We show theoretically that decorrelation occurs only in certain circumstances and maybe less than expected. This suggests that the time-decorrelation properties of spatial network lifting are due more to the trend removal properties of lifting rather than any kind of stochastic decorrelation.
△ Less
Submitted 10 March, 2016;
originally announced March 2016.
-
A Comparative Review of Dimension Reduction Methods in Approximate Bayesian Computation
Authors:
M. G. B. Blum,
M. A. Nunes,
D. Prangle,
S. A. Sisson
Abstract:
Approximate Bayesian computation (ABC) methods make use of comparisons between simulated and observed summary statistics to overcome the problem of computationally intractable likelihood functions. As the practical implementation of ABC requires computations based on vectors of summary statistics, rather than full data sets, a central question is how to derive low-dimensional summary statistics fr…
▽ More
Approximate Bayesian computation (ABC) methods make use of comparisons between simulated and observed summary statistics to overcome the problem of computationally intractable likelihood functions. As the practical implementation of ABC requires computations based on vectors of summary statistics, rather than full data sets, a central question is how to derive low-dimensional summary statistics from the observed data with minimal loss of information. In this article we provide a comprehensive review and comparison of the performance of the principal methods of dimension reduction proposed in the ABC literature. The methods are split into three nonmutually exclusive classes consisting of best subset selection methods, projection techniques and regularization. In addition, we introduce two new methods of dimension reduction. The first is a best subset selection method based on Akaike and Bayesian information criteria, and the second uses ridge regression as a regularization procedure. We illustrate the performance of these dimension reduction techniques through the analysis of three challenging models and data sets.
△ Less
Submitted 11 June, 2013; v1 submitted 16 February, 2012;
originally announced February 2012.