Search | arXiv e-print repository

Greedy feature selection: Classifier-dependent feature selection via greedy methods

Authors: Fabiana Camattari, Sabrina Guastavino, Francesco Marchetti, Michele Piana, Emma Perracchione

Abstract: The purpose of this study is to introduce a new approach to feature ranking for classification tasks, called in what follows greedy feature selection. In statistical learning, feature selection is usually realized by means of methods that are independent of the classifier applied to perform the prediction using that reduced number of features. Instead, greedy feature selection identifies the most… ▽ More The purpose of this study is to introduce a new approach to feature ranking for classification tasks, called in what follows greedy feature selection. In statistical learning, feature selection is usually realized by means of methods that are independent of the classifier applied to perform the prediction using that reduced number of features. Instead, greedy feature selection identifies the most important feature at each step and according to the selected classifier. In the paper, the benefits of such scheme are investigated theoretically in terms of model capacity indicators, such as the Vapnik-Chervonenkis (VC) dimension or the kernel alignment, and tested numerically by considering its application to the problem of predicting geo-effective manifestations of the active Sun. △ Less

Submitted 8 March, 2024; originally announced March 2024.

MSC Class: 68Q32; 68T07; 65D12

arXiv:2305.13472 [pdf, other]

A comprehensive theoretical framework for the optimization of neural networks classification performance with respect to weighted metrics

Authors: Francesco Marchetti, Sabrina Guastavino, Cristina Campi, Federico Benvenuto, Michele Piana

Abstract: In many contexts, customized and weighted classification scores are designed in order to evaluate the goodness of the predictions carried out by neural networks. However, there exists a discrepancy between the maximization of such scores and the minimization of the loss function in the training phase. In this paper, we provide a complete theoretical setting that formalizes weighted classification… ▽ More In many contexts, customized and weighted classification scores are designed in order to evaluate the goodness of the predictions carried out by neural networks. However, there exists a discrepancy between the maximization of such scores and the minimization of the loss function in the training phase. In this paper, we provide a complete theoretical setting that formalizes weighted classification metrics and then allows the construction of losses that drive the model to optimize these metrics of interest. After a detailed theoretical analysis, we show that our framework includes as particular instances well-established approaches such as classical cost-sensitive learning, weighted cross entropy loss functions and value-weighted skill scores. △ Less

Submitted 22 May, 2023; originally announced May 2023.

arXiv:2110.12554 [pdf, other]

doi 10.1051/0004-6361/202243617

Implementation paradigm for supervised flare forecasting studies: a deep learning application with video data

Authors: Sabrina Guastavino, Francesco Marchetti, Federico Benvenuto, Cristina Campi, Michele Piana

Abstract: Solar flare forecasting can be realized by means of the analysis of magnetic data through artificial intelligence techniques. The aim is to predict whether a magnetic active region (AR) will originate solar flares above a certain class within a certain amount of time. A crucial issue is concerned with the way the adopted machine learning method is implemented, since forecasting results strongly de… ▽ More Solar flare forecasting can be realized by means of the analysis of magnetic data through artificial intelligence techniques. The aim is to predict whether a magnetic active region (AR) will originate solar flares above a certain class within a certain amount of time. A crucial issue is concerned with the way the adopted machine learning method is implemented, since forecasting results strongly depend on the criterion with which training, validation, and test sets are populated. In this paper we propose a general paradigm to generate these sets in such a way that they are independent from each other and internally well-balanced in terms of AR flaring effectiveness. This set generation process provides a ground for comparison for the performance assessment of machine learning algorithms. Finally, we use this implementation paradigm in the case of a deep neural network, which takes as input videos of magnetograms recorded by the Helioseismic and Magnetic Imager on-board the Solar Dynamics Observatory (SDO/HMI). To our knowledge, this is the first time that the solar flare forecasting problem is addressed by means of a deep neural network for video classification, which does not require any a priori extraction of features from the HMI magnetograms. △ Less

Submitted 24 October, 2021; originally announced October 2021.

Journal ref: A&A 662, A105 (2022)

arXiv:2103.15522 [pdf, other]

Score-oriented loss (SOL) functions

Authors: Francesco Marchetti, Sabrina Guastavino, Michele Piana, Cristina Campi

Abstract: Loss functions engineering and the assessment of forecasting performances are two crucial and intertwined aspects of supervised machine learning. This paper focuses on binary classification to introduce a class of loss functions that are defined on probabilistic confusion matrices and that allow an automatic and a priori maximization of the skill scores. The performances of these loss functions ar… ▽ More Loss functions engineering and the assessment of forecasting performances are two crucial and intertwined aspects of supervised machine learning. This paper focuses on binary classification to introduce a class of loss functions that are defined on probabilistic confusion matrices and that allow an automatic and a priori maximization of the skill scores. The performances of these loss functions are validated during the training phase of two experimental forecasting problems, thus showing that the probability distribution function associated with the confusion matrices significantly impacts the outcome of the score maximization process. △ Less

Submitted 29 March, 2021; originally announced March 2021.

arXiv:1904.04211 [pdf, other]

doi 10.3847/1538-4357/ab35d8

Desaturating EUV observations of solar flaring storms

Authors: Sabrina Guastavino, Michele Piana, Anna Maria Massone, Richard Schwartz, Federico Benvenuto

Abstract: Image saturation has been an issue for several instruments in solar astronomy, mainly at EUV wavelengths. However, with the launch of the Atmospheric Imaging Assembly (AIA) as part of the payload of the Solar Dynamic Observatory (SDO) image saturation has become a big data issue, involving around 10^$ frames of the impressive dataset this beautiful telescope has been providing every year since Feb… ▽ More Image saturation has been an issue for several instruments in solar astronomy, mainly at EUV wavelengths. However, with the launch of the Atmospheric Imaging Assembly (AIA) as part of the payload of the Solar Dynamic Observatory (SDO) image saturation has become a big data issue, involving around 10^$ frames of the impressive dataset this beautiful telescope has been providing every year since February 2010. This paper introduces a novel desaturation method, which is able to recover the signal in the saturated region of any AIA image by exploiting no other information but the one contained in the image itself. This peculiar methodological property, jointly with the unprecedented statistical reliability of the desaturated images, could make this algorithm the perfect tool for the realization of a reconstruction pipeline for AIA data, able to work properly even in the case of long-lasting, very energetic flaring events. △ Less

Submitted 8 April, 2019; originally announced April 2019.

MSC Class: 85-08; 65J22; 68U10

arXiv:1807.11406 [pdf, ps, other]

On the connection between supervised learning and linear inverse problems

Authors: Sabrina Guastavino, Federico Benvenuto

Abstract: In this paper we investigate the connection between supervised learning and linear inverse problems. We first show that a linear inverse problem can be view as a function approximation problem in a reproducing kernel Hilbert space (RKHS) and then we prove that to each of these approximation problems corresponds a class of inverse problems. Analogously, we show that Tikhonov solutions of this class… ▽ More In this paper we investigate the connection between supervised learning and linear inverse problems. We first show that a linear inverse problem can be view as a function approximation problem in a reproducing kernel Hilbert space (RKHS) and then we prove that to each of these approximation problems corresponds a class of inverse problems. Analogously, we show that Tikhonov solutions of this class correspond to the Tikhonov solution of the approximation problem. Thanks to this correspondence, we show that supervised learning and linear discrete inverse problems can be thought of as two instances of the approximation problem in a RKHS. These instances are formalized by means of a sampling operator which takes into account both deterministic and random samples and leads to discretized problems. We then analyze the discretized problems and we study the convergence of their solutions to the ones of the approximation problem in a RKHS, both in the deterministic and statistical framework. Finally, we prove there exists a relation between the convergence rates computed with respect to the noise level and the ones computed with respect to the number of samples. This allows us to compare upper and lower bounds given in the statistical learning and in the deterministic infinite dimensional inverse problems theory. △ Less

Submitted 30 July, 2018; originally announced July 2018.

Comments: 32 pages

MSC Class: 65F22; 65J22; 62G08; 62G20

Showing 1–6 of 6 results for author: Guastavino, S