-
Greedy feature selection: Classifier-dependent feature selection via greedy methods
Authors:
Fabiana Camattari,
Sabrina Guastavino,
Francesco Marchetti,
Michele Piana,
Emma Perracchione
Abstract:
The purpose of this study is to introduce a new approach to feature ranking for classification tasks, called in what follows greedy feature selection. In statistical learning, feature selection is usually realized by means of methods that are independent of the classifier applied to perform the prediction using that reduced number of features. Instead, greedy feature selection identifies the most…
▽ More
The purpose of this study is to introduce a new approach to feature ranking for classification tasks, called in what follows greedy feature selection. In statistical learning, feature selection is usually realized by means of methods that are independent of the classifier applied to perform the prediction using that reduced number of features. Instead, greedy feature selection identifies the most important feature at each step and according to the selected classifier. In the paper, the benefits of such scheme are investigated theoretically in terms of model capacity indicators, such as the Vapnik-Chervonenkis (VC) dimension or the kernel alignment, and tested numerically by considering its application to the problem of predicting geo-effective manifestations of the active Sun.
△ Less
Submitted 8 March, 2024;
originally announced March 2024.
-
A comprehensive theoretical framework for the optimization of neural networks classification performance with respect to weighted metrics
Authors:
Francesco Marchetti,
Sabrina Guastavino,
Cristina Campi,
Federico Benvenuto,
Michele Piana
Abstract:
In many contexts, customized and weighted classification scores are designed in order to evaluate the goodness of the predictions carried out by neural networks. However, there exists a discrepancy between the maximization of such scores and the minimization of the loss function in the training phase. In this paper, we provide a complete theoretical setting that formalizes weighted classification…
▽ More
In many contexts, customized and weighted classification scores are designed in order to evaluate the goodness of the predictions carried out by neural networks. However, there exists a discrepancy between the maximization of such scores and the minimization of the loss function in the training phase. In this paper, we provide a complete theoretical setting that formalizes weighted classification metrics and then allows the construction of losses that drive the model to optimize these metrics of interest. After a detailed theoretical analysis, we show that our framework includes as particular instances well-established approaches such as classical cost-sensitive learning, weighted cross entropy loss functions and value-weighted skill scores.
△ Less
Submitted 22 May, 2023;
originally announced May 2023.
-
Implementation paradigm for supervised flare forecasting studies: a deep learning application with video data
Authors:
Sabrina Guastavino,
Francesco Marchetti,
Federico Benvenuto,
Cristina Campi,
Michele Piana
Abstract:
Solar flare forecasting can be realized by means of the analysis of magnetic data through artificial intelligence techniques. The aim is to predict whether a magnetic active region (AR) will originate solar flares above a certain class within a certain amount of time. A crucial issue is concerned with the way the adopted machine learning method is implemented, since forecasting results strongly de…
▽ More
Solar flare forecasting can be realized by means of the analysis of magnetic data through artificial intelligence techniques. The aim is to predict whether a magnetic active region (AR) will originate solar flares above a certain class within a certain amount of time. A crucial issue is concerned with the way the adopted machine learning method is implemented, since forecasting results strongly depend on the criterion with which training, validation, and test sets are populated. In this paper we propose a general paradigm to generate these sets in such a way that they are independent from each other and internally well-balanced in terms of AR flaring effectiveness. This set generation process provides a ground for comparison for the performance assessment of machine learning algorithms. Finally, we use this implementation paradigm in the case of a deep neural network, which takes as input videos of magnetograms recorded by the Helioseismic and Magnetic Imager on-board the Solar Dynamics Observatory (SDO/HMI). To our knowledge, this is the first time that the solar flare forecasting problem is addressed by means of a deep neural network for video classification, which does not require any a priori extraction of features from the HMI magnetograms.
△ Less
Submitted 24 October, 2021;
originally announced October 2021.
-
Score-oriented loss (SOL) functions
Authors:
Francesco Marchetti,
Sabrina Guastavino,
Michele Piana,
Cristina Campi
Abstract:
Loss functions engineering and the assessment of forecasting performances are two crucial and intertwined aspects of supervised machine learning. This paper focuses on binary classification to introduce a class of loss functions that are defined on probabilistic confusion matrices and that allow an automatic and a priori maximization of the skill scores. The performances of these loss functions ar…
▽ More
Loss functions engineering and the assessment of forecasting performances are two crucial and intertwined aspects of supervised machine learning. This paper focuses on binary classification to introduce a class of loss functions that are defined on probabilistic confusion matrices and that allow an automatic and a priori maximization of the skill scores. The performances of these loss functions are validated during the training phase of two experimental forecasting problems, thus showing that the probability distribution function associated with the confusion matrices significantly impacts the outcome of the score maximization process.
△ Less
Submitted 29 March, 2021;
originally announced March 2021.
-
Desaturating EUV observations of solar flaring storms
Authors:
Sabrina Guastavino,
Michele Piana,
Anna Maria Massone,
Richard Schwartz,
Federico Benvenuto
Abstract:
Image saturation has been an issue for several instruments in solar astronomy, mainly at EUV wavelengths. However, with the launch of the Atmospheric Imaging Assembly (AIA) as part of the payload of the Solar Dynamic Observatory (SDO) image saturation has become a big data issue, involving around 10^$ frames of the impressive dataset this beautiful telescope has been providing every year since Feb…
▽ More
Image saturation has been an issue for several instruments in solar astronomy, mainly at EUV wavelengths. However, with the launch of the Atmospheric Imaging Assembly (AIA) as part of the payload of the Solar Dynamic Observatory (SDO) image saturation has become a big data issue, involving around 10^$ frames of the impressive dataset this beautiful telescope has been providing every year since February 2010. This paper introduces a novel desaturation method, which is able to recover the signal in the saturated region of any AIA image by exploiting no other information but the one contained in the image itself. This peculiar methodological property, jointly with the unprecedented statistical reliability of the desaturated images, could make this algorithm the perfect tool for the realization of a reconstruction pipeline for AIA data, able to work properly even in the case of long-lasting, very energetic flaring events.
△ Less
Submitted 8 April, 2019;
originally announced April 2019.
-
On the connection between supervised learning and linear inverse problems
Authors:
Sabrina Guastavino,
Federico Benvenuto
Abstract:
In this paper we investigate the connection between supervised learning and linear inverse problems. We first show that a linear inverse problem can be view as a function approximation problem in a reproducing kernel Hilbert space (RKHS) and then we prove that to each of these approximation problems corresponds a class of inverse problems. Analogously, we show that Tikhonov solutions of this class…
▽ More
In this paper we investigate the connection between supervised learning and linear inverse problems. We first show that a linear inverse problem can be view as a function approximation problem in a reproducing kernel Hilbert space (RKHS) and then we prove that to each of these approximation problems corresponds a class of inverse problems. Analogously, we show that Tikhonov solutions of this class correspond to the Tikhonov solution of the approximation problem. Thanks to this correspondence, we show that supervised learning and linear discrete inverse problems can be thought of as two instances of the approximation problem in a RKHS. These instances are formalized by means of a sampling operator which takes into account both deterministic and random samples and leads to discretized problems. We then analyze the discretized problems and we study the convergence of their solutions to the ones of the approximation problem in a RKHS, both in the deterministic and statistical framework. Finally, we prove there exists a relation between the convergence rates computed with respect to the noise level and the ones computed with respect to the number of samples. This allows us to compare upper and lower bounds given in the statistical learning and in the deterministic infinite dimensional inverse problems theory.
△ Less
Submitted 30 July, 2018;
originally announced July 2018.