-
Open challenges for Machine Learning based Early Decision-Making research
Authors:
Alexis Bondu,
Youssef Achenchabe,
Albert Bifet,
Fabrice Clérot,
Antoine Cornuéjols,
Joao Gama,
Georges Hébrail,
Vincent Lemaire,
Pierre-François Marteau
Abstract:
More and more applications require early decisions, i.e. taken as soon as possible from partially observed data. However, the later a decision is made, the more its accuracy tends to improve, since the description of the problem to hand is enriched over time. Such a compromise between the earliness and the accuracy of decisions has been particularly studied in the field of Early Time Series Classi…
▽ More
More and more applications require early decisions, i.e. taken as soon as possible from partially observed data. However, the later a decision is made, the more its accuracy tends to improve, since the description of the problem to hand is enriched over time. Such a compromise between the earliness and the accuracy of decisions has been particularly studied in the field of Early Time Series Classification. This paper introduces a more general problem, called Machine Learning based Early Decision Making (ML-EDM), which consists in optimizing the decision times of models in a wide range of settings where data is collected over time. After defining the ML-EDM problem, ten challenges are identified and proposed to the scientific community to further research in this area. These challenges open important application perspectives, discussed in this paper.
△ Less
Submitted 20 May, 2022; v1 submitted 27 April, 2022;
originally announced April 2022.
-
Autoencoder-based time series clustering with energy applications
Authors:
Guillaume Richard,
Benoît Grossin,
Guillaume Germaine,
Georges Hébrail,
Anne de Moliner
Abstract:
Time series clustering is a challenging task due to the specific nature of the data. Classical approaches do not perform well and need to be adapted either through a new distance measure or a data transformation. In this paper we investigate the combination of a convolutional autoencoder and a k-medoids algorithm to perfom time series clustering. The convolutional autoencoder allows to extract mea…
▽ More
Time series clustering is a challenging task due to the specific nature of the data. Classical approaches do not perform well and need to be adapted either through a new distance measure or a data transformation. In this paper we investigate the combination of a convolutional autoencoder and a k-medoids algorithm to perfom time series clustering. The convolutional autoencoder allows to extract meaningful features and reduce the dimension of the data, leading to an improvement of the subsequent clustering. Using simulation and energy related data to validate the approach, experimental results show that the clustering is robust to outliers thus leading to finer clusters than with standard methods.
△ Less
Submitted 10 February, 2020;
originally announced February 2020.
-
Exploratory Analysis of Functional Data via Clustering and Optimal Segmentation
Authors:
Georges Hébrail,
Bernard Hugueney,
Yves Lechevallier,
Fabrice Rossi
Abstract:
We propose in this paper an exploratory analysis algorithm for functional data. The method partitions a set of functions into $K$ clusters and represents each cluster by a simple prototype (e.g., piecewise constant). The total number of segments in the prototypes, $P$, is chosen by the user and optimally distributed among the clusters via two dynamic programming algorithms. The practical relevance…
▽ More
We propose in this paper an exploratory analysis algorithm for functional data. The method partitions a set of functions into $K$ clusters and represents each cluster by a simple prototype (e.g., piecewise constant). The total number of segments in the prototypes, $P$, is chosen by the user and optimally distributed among the clusters via two dynamic programming algorithms. The practical relevance of the method is shown on two real world datasets.
△ Less
Submitted 3 April, 2010;
originally announced April 2010.