Skip to main content

Showing 1–23 of 23 results for author: Petitjean, F

.
  1. arXiv:2309.16353  [pdf, other

    cs.LG

    ShapeDBA: Generating Effective Time Series Prototypes using ShapeDTW Barycenter Averaging

    Authors: Ali Ismail-Fawaz, Hassan Ismail Fawaz, François Petitjean, Maxime Devanne, Jonathan Weber, Stefano Berretti, Geoffrey I. Webb, Germain Forestier

    Abstract: Time series data can be found in almost every domain, ranging from the medical field to manufacturing and wireless communication. Generating realistic and useful exemplars and prototypes is a fundamental data analysis task. In this paper, we investigate a novel approach to generating realistic and useful exemplars and prototypes for time series data. Our approach uses a new form of time series ave… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

    Comments: Published in AALTD workshop at ECML/PKDD 2023

  2. Computing Divergences between Discrete Decomposable Models

    Authors: Loong Kuan Lee, Nico Piatkowski, François Petitjean, Geoffrey I. Webb

    Abstract: There are many applications that benefit from computing the exact divergence between 2 discrete probability measures, including machine learning. Unfortunately, in the absence of any assumptions on the structure or independencies within these distributions, computing the divergence between them is an intractable problem in high dimensions. We show that we are able to compute a wide family of funct… ▽ More

    Submitted 30 November, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: 13 pages, 4 Figures, 3 Tables. Accepted to the 37th AAAI Conference on Artificial Intelligence (AAAI 2023)

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence. 37, 10 (Jun. 2023), 12243-12251

  3. arXiv:2102.10231  [pdf, other

    cs.LG stat.ML

    Elastic Similarity and Distance Measures for Multivariate Time Series

    Authors: Ahmed Shifaz, Charlotte Pelletier, Francois Petitjean, Geoffrey I. Webb

    Abstract: This paper contributes multivariate versions of seven commonly used elastic similarity and distance measures for time series data analytics. Elastic similarity and distance measures are a class of similarity measures that can compensate for misalignments in the time axis of time series data. We adapt two existing strategies used in a multivariate version of the well-known Dynamic Time War** (DTW… ▽ More

    Submitted 17 January, 2023; v1 submitted 19 February, 2021; originally announced February 2021.

    Comments: 40 pages, 12 figures

    ACM Class: I.5.0; I.5.2; I.5.3

  4. Tight lower bounds for Dynamic Time War**

    Authors: Geoffrey I. Webb, Francois Petitjean

    Abstract: Dynamic Time War** (DTW) is a popular similarity measure for aligning and comparing time series. Due to DTW's high computation time, lower bounds are often employed to screen poor matches. Many alternative lower bounds have been proposed, providing a range of different trade-offs between tightness and computational efficiency. LB Keogh provides a useful trade-off in many applications. Two recent… ▽ More

    Submitted 1 March, 2021; v1 submitted 14 February, 2021; originally announced February 2021.

    Comments: 26 pages, 23 figures, expanded version of a paper accepted for publication in Pattern Recognition. This revision fixed minor typos in the two algorithms

    MSC Class: 68T10 ACM Class: I.5.5

    Journal ref: Pattern Recognition, Volume 115, 2021, 107895, ISSN 0031-3203

  5. arXiv:2011.06428  [pdf, other

    cs.LG

    Discriminative, Generative and Self-Supervised Approaches for Target-Agnostic Learning

    Authors: Yuan **, Wray Buntine, Francois Petitjean, Geoffrey I. Webb

    Abstract: Supervised learning, characterized by both discriminative and generative learning, seeks to predict the values of single (or sometimes multiple) predefined target attributes based on a predefined set of predictor attributes. For applications where the information available and predictions to be made may vary from instance to instance, we propose the task of target-agnostic learning where arbitrary… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

  6. arXiv:2006.15311  [pdf, other

    cs.LG cs.AI stat.AP stat.ML

    Seasonal Averaged One-Dependence Estimators: A Novel Algorithm to Address Seasonal Concept Drift in High-Dimensional Stream Classification

    Authors: Rakshitha Godahewa, Trevor Yann, Christoph Bergmeir, Francois Petitjean

    Abstract: Stream classification methods classify a continuous stream of data as new labelled samples arrive. They often also have to deal with concept drift. This paper focuses on seasonal drift in stream classification, which can be found in many real-world application data sources. Traditional approaches of stream classification consider seasonal drift by including seasonal dummy/indicator variables or bu… ▽ More

    Submitted 27 June, 2020; originally announced June 2020.

    Journal ref: International Joint Conference on Neural Networks (IJCNN 2020)

  7. arXiv:2006.12672  [pdf, other

    cs.LG stat.ML

    Time Series Extrinsic Regression

    Authors: Chang Wei Tan, Christoph Bergmeir, Francois Petitjean, Geoffrey I. Webb

    Abstract: This paper studies Time Series Extrinsic Regression (TSER): a regression task of which the aim is to learn the relationship between a time series and a continuous scalar variable; a task closely related to time series classification (TSC), which aims to learn the relationship between a time series and a categorical class label. This task generalizes time series forecasting (TSF), relaxing the requ… ▽ More

    Submitted 3 February, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

  8. arXiv:2006.10996  [pdf, other

    cs.LG stat.ML

    Monash University, UEA, UCR Time Series Extrinsic Regression Archive

    Authors: Chang Wei Tan, Christoph Bergmeir, Francois Petitjean, Geoffrey I. Webb

    Abstract: Time series research has gathered lots of interests in the last decade, especially for Time Series Classification (TSC) and Time Series Forecasting (TSF). Research in TSC has greatly benefited from the University of California Riverside and University of East Anglia (UCR/UEA) Time Series Archives. On the other hand, the advancement in Time Series Forecasting relies on time series forecasting compe… ▽ More

    Submitted 19 October, 2020; v1 submitted 19 June, 2020; originally announced June 2020.

  9. arXiv:2005.11930  [pdf, ps, other

    cs.LG cs.CV stat.ML

    A Bayesian-inspired, deep learning-based, semi-supervised domain adaptation technique for land cover map**

    Authors: Benjamin Lucas, Charlotte Pelletier, Daniel Schmidt, Geoffrey I. Webb, François Petitjean

    Abstract: Land cover maps are a vital input variable to many types of environmental research and management. While they can be produced automatically by machine learning techniques, these techniques require substantial training data to achieve high levels of accuracy, which are not always available. One technique researchers use when labelled training data are scarce is domain adaptation (DA) -- where data… ▽ More

    Submitted 10 March, 2021; v1 submitted 25 May, 2020; originally announced May 2020.

  10. ROCKET: Exceptionally fast and accurate time series classification using random convolutional kernels

    Authors: Angus Dempster, François Petitjean, Geoffrey I. Webb

    Abstract: Most methods for time series classification that attain state-of-the-art accuracy have high computational complexity, requiring significant training time even for smaller datasets, and are intractable for larger datasets. Additionally, many existing methods focus on a single type of feature such as shape or frequency. Building on the recent success of convolutional neural networks for time series… ▽ More

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: 27 pages, 23 figures

  11. arXiv:1910.04341  [pdf, other

    cs.LG stat.ML

    Time series classification for varying length series

    Authors: Chang Wei Tan, Francois Petitjean, Eamonn Keogh, Geoffrey I. Webb

    Abstract: Research into time series classification has tended to focus on the case of series of uniform length. However, it is common for real-world time series data to have unequal lengths. Differing time series lengths may arise from a number of fundamentally different mechanisms. In this work, we identify and evaluate two classes of such mechanisms -- variations in sampling rate relative to the relevant… ▽ More

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: 23 pages

  12. InceptionTime: Finding AlexNet for Time Series Classification

    Authors: Hassan Ismail Fawaz, Benjamin Lucas, Germain Forestier, Charlotte Pelletier, Daniel F. Schmidt, Jonathan Weber, Geoffrey I. Webb, Lhassane Idoumghar, Pierre-Alain Muller, François Petitjean

    Abstract: This paper brings deep learning at the forefront of research into Time Series Classification (TSC). TSC is the area of machine learning tasked with the categorization (or labelling) of time series. The last few decades of work in this area have led to significant progress in the accuracy of classifiers, with the state of the art now represented by the HIVE-COTE algorithm. While extremely accurate,… ▽ More

    Submitted 5 December, 2020; v1 submitted 11 September, 2019; originally announced September 2019.

  13. TS-CHIEF: A Scalable and Accurate Forest Algorithm for Time Series Classification

    Authors: Ahmed Shifaz, Charlotte Pelletier, Francois Petitjean, Geoffrey I. Webb

    Abstract: Time Series Classification (TSC) has seen enormous progress over the last two decades. HIVE-COTE (Hierarchical Vote Collective of Transformation-based Ensembles) is the current state of the art in terms of classification accuracy. HIVE-COTE recognizes that time series data are a specific data type for which the traditional attribute-value representation, used predominantly in machine learning, fai… ▽ More

    Submitted 13 February, 2020; v1 submitted 25 June, 2019; originally announced June 2019.

    Comments: 37 pages, 10 figures

    Journal ref: Data Mining and Knowledge Discovery 34 (2020) 742-775

  14. arXiv:1904.07302  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Automatic alignment of surgical videos using kinematic data

    Authors: Hassan Ismail Fawaz, Germain Forestier, Jonathan Weber, François Petitjean, Lhassane Idoumghar, Pierre-Alain Muller

    Abstract: Over the past one hundred years, the classic teaching methodology of "see one, do one, teach one" has governed the surgical education systems worldwide. With the advent of Operation Room 2.0, recording video, kinematic and many other types of data during the surgery became an easy task, thus allowing artificial intelligence systems to be deployed and used in surgical and medical practice. Recently… ▽ More

    Submitted 26 April, 2019; v1 submitted 3 April, 2019; originally announced April 2019.

    Comments: Accepted at AIME 2019

  15. arXiv:1811.10166  [pdf, other

    cs.CV

    Temporal Convolutional Neural Network for the Classification of Satellite Image Time Series

    Authors: Charlotte Pelletier, Geoffrey I. Webb, Francois Petitjean

    Abstract: New remote sensing sensors now acquire high spatial and spectral Satellite Image Time Series (SITS) of the world. These series of images are a key component of classification systems that aim at obtaining up-to-date and accurate land cover maps of the Earth's surfaces. More specifically, the combination of the temporal, spectral and spatial resolutions of new SITS makes possible to monitor vegetat… ▽ More

    Submitted 30 January, 2019; v1 submitted 25 November, 2018; originally announced November 2018.

  16. Proximity Forest: An effective and scalable distance-based classifier for time series

    Authors: Benjamin Lucas, Ahmed Shifaz, Charlotte Pelletier, Lachlan O'Neill, Nayyar Zaidi, Bart Goethals, Francois Petitjean, Geoffrey I. Webb

    Abstract: Research into the classification of time series has made enormous progress in the last decade. The UCR time series archive has played a significant role in challenging and guiding the development of new learners for time series classification. The largest dataset in the UCR archive holds 10 thousand time series only; which may explain why the primary research focus has been in creating algorithms… ▽ More

    Submitted 12 December, 2018; v1 submitted 31 August, 2018; originally announced August 2018.

    Comments: 30 pages, 12 figures

  17. arXiv:1808.09617  [pdf, other

    cs.LG stat.ML

    Elastic bands across the path: A new framework and methods to lower bound DTW

    Authors: Chang Wei Tan, Francois Petitjean, Geoffrey I. Webb

    Abstract: There has been renewed recent interest in develo** effective lower bounds for Dynamic Time War** (DTW) distance between time series. These have many applications in time series indexing, clustering, forecasting, regression and classification. One of the key time series classification algorithms, the nearest neighbor algorithm with DTW distance (NN-DTW) is very expensive to compute, due to the… ▽ More

    Submitted 14 February, 2019; v1 submitted 28 August, 2018; originally announced August 2018.

  18. arXiv:1801.09354  [pdf, other

    cs.LG cs.AI

    On the Inter-relationships among Drift rate, Forgetting rate, Bias/variance profile and Error

    Authors: Nayyar A. Zaidi, Geoffrey I. Webb, Francois Petitjean, Germain Forestier

    Abstract: We propose two general and falsifiable hypotheses about expectations on generalization error when learning in the context of concept drift. One posits that as drift rate increases, the forgetting rate that minimizes generalization error will also increase and vice versa. The other posits that as a learner's forgetting rate increases, the bias/variance profile that minimizes generalization error wi… ▽ More

    Submitted 3 February, 2018; v1 submitted 28 January, 2018; originally announced January 2018.

  19. arXiv:1708.07581  [pdf, other

    cs.LG stat.ML

    Accurate parameter estimation for Bayesian Network Classifiers using Hierarchical Dirichlet Processes

    Authors: Francois Petitjean, Wray Buntine, Geoffrey I. Webb, Nayyar Zaidi

    Abstract: This paper introduces a novel parameter estimation method for the probability tables of Bayesian network classifiers (BNCs), using hierarchical Dirichlet processes (HDPs). The main result of this paper is to show that improved parameter estimation allows BNCs to outperform leading learning methods such as Random Forest for both 0-1 loss and RMSE, albeit just on categorical datasets. As data asse… ▽ More

    Submitted 8 May, 2018; v1 submitted 24 August, 2017; originally announced August 2017.

    MSC Class: 68Q87

  20. arXiv:1704.00362  [pdf, other

    cs.LG

    Understanding Concept Drift

    Authors: Geoffrey I. Webb, Loong Kuan Lee, François Petitjean, Bart Goethals

    Abstract: Concept drift is a major issue that greatly affects the accuracy and reliability of many real-world applications of machine learning. We argue that to tackle concept drift it is important to develop the capacity to describe and analyze it. We propose tools for this purpose, arguing for the importance of quantitative descriptions of drift in marginal distributions. We present quantitative drift ana… ▽ More

    Submitted 2 April, 2017; originally announced April 2017.

  21. Characterizing Concept Drift

    Authors: Geoffrey I. Webb, Roy Hyde, Hong Cao, Hai Long Nguyen, Francois Petitjean

    Abstract: Most machine learning models are static, but the world is dynamic, and increasing online deployment of learned models gives increasing urgency to the development of efficient and effective mechanisms to address learning in the context of non-stationary distributions, or as it is commonly called concept drift. However, the key issue of characterizing the different types of drift that can occur has… ▽ More

    Submitted 8 April, 2016; v1 submitted 12 November, 2015; originally announced November 2015.

    Comments: Accepted for publication in Data Mining and Knowledge Discovery

    Journal ref: Data Mining and Knowledge Discovery, 30(4), 964-994, 2016

  22. arXiv:1509.01346  [pdf, other

    cs.LG

    Deep Broad Learning - Big Models for Big Data

    Authors: Nayyar A. Zaidi, Geoffrey I. Webb, Mark J. Carman, Francois Petitjean

    Abstract: Deep learning has demonstrated the power of detailed modeling of complex high-order (multivariate) interactions in data. For some learning tasks there is power in learning models that are not only Deep but also Broad. By Broad, we mean models that incorporate evidence from large numbers of features. This is of especial value in applications where many different features and combinations of feature… ▽ More

    Submitted 4 September, 2015; originally announced September 2015.

  23. arXiv:1506.08009  [pdf, other

    cs.AI cs.LG stat.ML

    Skopus: Mining top-k sequential patterns under leverage

    Authors: Francois Petitjean, Tao Li, Nikolaj Tatti, Geoffrey I. Webb

    Abstract: This paper presents a framework for exact discovery of the top-k sequential patterns under Leverage. It combines (1) a novel definition of the expected support for a sequential pattern - a concept on which most interestingness measures directly rely - with (2) SkOPUS: a new branch-and-bound algorithm for the exact discovery of top-k sequential patterns under a given measure of interest. Our intere… ▽ More

    Submitted 4 February, 2018; v1 submitted 26 June, 2015; originally announced June 2015.

    Journal ref: Data Mining and Knowledge Discovery, September 2016, Volume 30, Issue 5, pp 1086-1111