Search | arXiv e-print repository

Signed-Perturbed Sums Estimation of ARX Systems: Exact Coverage and Strong Consistency (Extended Version)

Authors: Algo Carè, Erik Weyer, Balázs Cs. Csáji, Marco C. Campi

Abstract: Sign-Perturbed Sums (SPS) is a system identification method that constructs confidence regions for the unknown system parameters. In this paper, we study SPS for ARX systems, and establish that the confidence regions are guaranteed to include the true model parameter with exact, user-chosen, probability under mild statistical assumptions, a property that holds true for any finite number of observe… ▽ More Sign-Perturbed Sums (SPS) is a system identification method that constructs confidence regions for the unknown system parameters. In this paper, we study SPS for ARX systems, and establish that the confidence regions are guaranteed to include the true model parameter with exact, user-chosen, probability under mild statistical assumptions, a property that holds true for any finite number of observed input-output data. Furthermore, we prove the strong consistency of the method, that is, as the number of data points increases, the confidence region gets smaller and smaller and will asymptotically almost surely exclude any parameter value different from the true one. In addition, we also show that, asymptotically, the SPS region is included in an ellipsoid which is marginally larger than the confidence ellipsoid obtained from the asymptotic theory of system identification. The results are theoretically proven and illustrated in a simulation example. △ Less

Submitted 18 February, 2024; originally announced February 2024.

arXiv:2312.05887 [pdf, other]

Three-dimensional numerical schemes for the segmentation of the psoas muscle in X-ray computed tomography images

Authors: Giulio Paolucci, Isabella Cama, Cristina Campi, Michele Piana

Abstract: The analysis of the psoas muscle in morphological and functional imaging has proved to be an accurate approach to assess sarcopenia, i.e. a systemic loss of skeletal muscle mass and function that may be correlated to multifactorial etiological aspects. The inclusion of sarcopenia assessment into a radiological workflow would need the implementation of computational pipelines for image processing t… ▽ More The analysis of the psoas muscle in morphological and functional imaging has proved to be an accurate approach to assess sarcopenia, i.e. a systemic loss of skeletal muscle mass and function that may be correlated to multifactorial etiological aspects. The inclusion of sarcopenia assessment into a radiological workflow would need the implementation of computational pipelines for image processing that guarantee segmentation reliability and a significant degree of automation. The present study utilizes three-dimensional numerical schemes for psoas segmentation in low-dose X-ray computed tomography images. Specifically, here we focused on the level set methodology and compared the performances of two standard approaches, a classical evolution model and a three-dimension geodesic model, with the performances of an original first-order modification of this latter one. The results of this analysis show that these gradient-based schemes guarantee reliability with respect to manual segmentation and that the first-order scheme requires a computational burden that is significantly smaller than the one needed by the second-order approach. △ Less

Submitted 10 December, 2023; originally announced December 2023.

MSC Class: 65M06; 92C55; 68U10

arXiv:2305.13472 [pdf, other]

A comprehensive theoretical framework for the optimization of neural networks classification performance with respect to weighted metrics

Authors: Francesco Marchetti, Sabrina Guastavino, Cristina Campi, Federico Benvenuto, Michele Piana

Abstract: In many contexts, customized and weighted classification scores are designed in order to evaluate the goodness of the predictions carried out by neural networks. However, there exists a discrepancy between the maximization of such scores and the minimization of the loss function in the training phase. In this paper, we provide a complete theoretical setting that formalizes weighted classification… ▽ More In many contexts, customized and weighted classification scores are designed in order to evaluate the goodness of the predictions carried out by neural networks. However, there exists a discrepancy between the maximization of such scores and the minimization of the loss function in the training phase. In this paper, we provide a complete theoretical setting that formalizes weighted classification metrics and then allows the construction of losses that drive the model to optimize these metrics of interest. After a detailed theoretical analysis, we show that our framework includes as particular instances well-established approaches such as classical cost-sensitive learning, weighted cross entropy loss functions and value-weighted skill scores. △ Less

Submitted 22 May, 2023; originally announced May 2023.

arXiv:2301.12767 [pdf, other]

Compression, Generalization and Learning

Authors: Marco C. Campi, Simone Garatti

Abstract: A compression function is a map that slims down an observational set into a subset of reduced size, while preserving its informational content. In multiple applications, the condition that one new observation makes the compressed set change is interpreted that this observation brings in extra information and, in learning theory, this corresponds to misclassification, or misprediction. In this pape… ▽ More A compression function is a map that slims down an observational set into a subset of reduced size, while preserving its informational content. In multiple applications, the condition that one new observation makes the compressed set change is interpreted that this observation brings in extra information and, in learning theory, this corresponds to misclassification, or misprediction. In this paper, we lay the foundations of a new theory that allows one to keep control on the probability of change of compression (which maps into the statistical "risk" in learning applications). Under suitable conditions, the cardinality of the compressed set is shown to be a consistent estimator of the probability of change of compression (without any upper limit on the size of the compressed set); moreover, unprecedentedly tight finite-sample bounds to evaluate the probability of change of compression are obtained under a generally applicable condition of preference. All results are usable in a fully agnostic setup, i.e., without requiring any a priori knowledge on the probability distribution of the observations. Not only these results offer a valid support to develop trust in observation-driven methodologies, they also play a fundamental role in learning techniques as a tool for hyper-parameter tuning. △ Less

Submitted 8 January, 2024; v1 submitted 30 January, 2023; originally announced January 2023.

Comments: https://www.jmlr.org/papers/v24/22-0605.html

Journal ref: Journal of Machine Learning Research, 24(339):1-74, 2023

arXiv:2110.12554 [pdf, other]

doi 10.1051/0004-6361/202243617

Implementation paradigm for supervised flare forecasting studies: a deep learning application with video data

Authors: Sabrina Guastavino, Francesco Marchetti, Federico Benvenuto, Cristina Campi, Michele Piana

Abstract: Solar flare forecasting can be realized by means of the analysis of magnetic data through artificial intelligence techniques. The aim is to predict whether a magnetic active region (AR) will originate solar flares above a certain class within a certain amount of time. A crucial issue is concerned with the way the adopted machine learning method is implemented, since forecasting results strongly de… ▽ More Solar flare forecasting can be realized by means of the analysis of magnetic data through artificial intelligence techniques. The aim is to predict whether a magnetic active region (AR) will originate solar flares above a certain class within a certain amount of time. A crucial issue is concerned with the way the adopted machine learning method is implemented, since forecasting results strongly depend on the criterion with which training, validation, and test sets are populated. In this paper we propose a general paradigm to generate these sets in such a way that they are independent from each other and internally well-balanced in terms of AR flaring effectiveness. This set generation process provides a ground for comparison for the performance assessment of machine learning algorithms. Finally, we use this implementation paradigm in the case of a deep neural network, which takes as input videos of magnetograms recorded by the Helioseismic and Magnetic Imager on-board the Solar Dynamics Observatory (SDO/HMI). To our knowledge, this is the first time that the solar flare forecasting problem is addressed by means of a deep neural network for video classification, which does not require any a priori extraction of features from the HMI magnetograms. △ Less

Submitted 24 October, 2021; originally announced October 2021.

Journal ref: A&A 662, A105 (2022)

arXiv:2103.15522 [pdf, other]

Score-oriented loss (SOL) functions

Authors: Francesco Marchetti, Sabrina Guastavino, Michele Piana, Cristina Campi

Abstract: Loss functions engineering and the assessment of forecasting performances are two crucial and intertwined aspects of supervised machine learning. This paper focuses on binary classification to introduce a class of loss functions that are defined on probabilistic confusion matrices and that allow an automatic and a priori maximization of the skill scores. The performances of these loss functions ar… ▽ More Loss functions engineering and the assessment of forecasting performances are two crucial and intertwined aspects of supervised machine learning. This paper focuses on binary classification to introduce a class of loss functions that are defined on probabilistic confusion matrices and that allow an automatic and a priori maximization of the skill scores. The performances of these loss functions are validated during the training phase of two experimental forecasting problems, thus showing that the probability distribution function associated with the confusion matrices significantly impacts the outcome of the score maximization process. △ Less

Submitted 29 March, 2021; originally announced March 2021.

arXiv:2103.05964 [pdf, other]

doi 10.13140/RG.2.2.30924.13446

Oversampling errors in multimodal medical imaging are due to the Gibbs effect

Authors: Davide Poggiali, Diego Cecchin, Cristina Campi, Stefano De Marchi

Abstract: To analyse multimodal 3-dimensional medical images, interpolation is required for resampling which - unavoidably - introduces an interpolation error. In this work we consider three segmented 3-dimensional images resampled with three different neuroimaging software tools for comparing undersampling and oversampling strategies and to identify where the oversampling error lies. The results indicate t… ▽ More To analyse multimodal 3-dimensional medical images, interpolation is required for resampling which - unavoidably - introduces an interpolation error. In this work we consider three segmented 3-dimensional images resampled with three different neuroimaging software tools for comparing undersampling and oversampling strategies and to identify where the oversampling error lies. The results indicate that undersampling to the lowest image size is advantageous in terms of mean value per segment errors and that the oversampling error is larger where the gradient is steeper, showing a Gibbs effect. △ Less

Submitted 7 May, 2021; v1 submitted 10 March, 2021; originally announced March 2021.

MSC Class: 68U10; 65D05; 41A15

arXiv:2004.05839 [pdf, other]

A Theory of the Risk for Optimization with Relaxation and its Application to Support Vector Machines

Authors: Marco C. Campi, Simone Garatti

Abstract: In this paper we consider optimization with relaxation, an ample paradigm to make data-driven designs. This approach was previously considered by the same authors of this work in Garatti and Campi (2019), a study that revealed a deep-seated connection between two concepts: risk (probability of not satisfying a new, out-of-sample, constraint) and complexity (according to a definition introduced in… ▽ More In this paper we consider optimization with relaxation, an ample paradigm to make data-driven designs. This approach was previously considered by the same authors of this work in Garatti and Campi (2019), a study that revealed a deep-seated connection between two concepts: risk (probability of not satisfying a new, out-of-sample, constraint) and complexity (according to a definition introduced in paper Garatti and Campi (2019)). This connection was shown to have profound implications in applications because it implied that the risk can be estimated from the complexity, a quantity that can be measured from the data without any knowledge of the data-generation mechanism. In the present work we establish new results. First, we expand the scope of Garatti and Campi (2019) so as to embrace a more general setup that covers various algorithms in machine learning. Then, we study classical support vector methods - including SVM (Support Vector Machine), SVR (Support Vector Regression) and SVDD (Support Vector Data Description) - and derive new results for the ability of these methods to generalize. All results are valid for any finite size of the data set. When the sample size tends to infinity, we establish the unprecedented result that the risk approaches the ratio between the complexity and the cardinality of the data sample, regardless of the value of the complexity. △ Less

Submitted 8 January, 2024; v1 submitted 13 April, 2020; originally announced April 2020.

Comments: https://www.jmlr.org/papers/v22/21-0641.html

Journal ref: Journal of Machine Learning Research 22(288):1-38, 2021

arXiv:1903.06762 [pdf, ps, other]

doi 10.1109/CDC40024.2019.9030247

The scenario approach meets uncertain variational inequalities and game theory

Authors: Dario Paccagnan, Marco C. Campi

Abstract: Variational inequalities are modelling tools used to capture a variety of decision-making problems arising in mathematical optimization, operations research, game theory. The scenario approach is a set of techniques developed to tackle stochastic optimization problems, take decisions based on historical data, and quantify their risk. The overarching goal of this manuscript is to bridge these two a… ▽ More Variational inequalities are modelling tools used to capture a variety of decision-making problems arising in mathematical optimization, operations research, game theory. The scenario approach is a set of techniques developed to tackle stochastic optimization problems, take decisions based on historical data, and quantify their risk. The overarching goal of this manuscript is to bridge these two areas of research, and thus broaden the class of problems amenable to be studied under the lens of the scenario approach. First and foremost, we provide out-of-samples feasibility guarantees for the solution of variational and quasi variational inequality problems. Second, we apply these results to two classes of uncertain games. In the first class, the uncertainty enters in the constraint sets, while in the second class the uncertainty enters in the cost functions. Finally, we exemplify the quality and relevance of our bounds through numerical simulations on a demand-response model. △ Less

Submitted 15 March, 2019; originally announced March 2019.

Comments: 8 pages, 3 figures

arXiv:1811.02502 [pdf, ps, other]

A dynamical model of opinion formation in voting processes under bounded confidence

Authors: Sergei Yu. Pilyugin, M. C. Campi

Abstract: In recent years, opinion dynamics has received an increasing attention, and various models have been introduced and evaluated mainly by simulation. In this study, we introduce and study a dynamical model inspired by the so-called `bounded confidence' approach where voters engaged in an electoral decision with two options are influenced by individuals sharing an opinion similar to their own. This m… ▽ More In recent years, opinion dynamics has received an increasing attention, and various models have been introduced and evaluated mainly by simulation. In this study, we introduce and study a dynamical model inspired by the so-called `bounded confidence' approach where voters engaged in an electoral decision with two options are influenced by individuals sharing an opinion similar to their own. This model allows one to capture salient features of the evolution of opinions and results in final clusters of voters. The model is nonlinear and discontinuous. We provide a detailed study of the model, including a complete classification of fixed points of the appearing dynamical system and analysis of their stability. It is shown that any trajectory tends to a fixed point. The model highlights that the final electoral outcome depends on the level of interaction in the society, besides the initial opinion of each individual, so that a strongly interconnected society can reverse the electoral outcome as compared to a society with looser exchange. △ Less

Submitted 6 November, 2018; originally announced November 2018.

Comments: 18 pages

MSC Class: 37N35 91D10 91C20

arXiv:1509.04774 [pdf, other]

Sign-Perturbed Sums (SPS) with Instrumental Variables for the Identification of ARX Systems - Extended Version

Authors: Valerio Volpe, Balázs Cs. Csáji, Algo Carè, Erik Weyer, Marco C. Campi

Abstract: We propose a generalization of the recently developed system identification method called Sign-Perturbed Sums (SPS). The proposed construction is based on the instrumental variables estimate and, unlike the original SPS, it can construct non-asymptotic confidence regions for linear regression models where the regressors contain past values of the output. Hence, it is applicable to ARX systems, as… ▽ More We propose a generalization of the recently developed system identification method called Sign-Perturbed Sums (SPS). The proposed construction is based on the instrumental variables estimate and, unlike the original SPS, it can construct non-asymptotic confidence regions for linear regression models where the regressors contain past values of the output. Hence, it is applicable to ARX systems, as well as systems with feedback. We show that this approach provides regions with exact confidence under weak assumptions, i.e., the true parameter is included in the regions with a (user-chosen) exact probability for any finite sample. The paper also proves the strong consistency of the method and proposes a computationally efficient generalization of the previously proposed ellipsoidal outer-approximation. Finally, the new method is demonstrated through numerical experiments, using both real-world and simulated data. △ Less

Submitted 15 September, 2015; originally announced September 2015.

Showing 1–11 of 11 results for author: Campi, C