Mitigating subjectivity and bias in AI development indices: A robust approach to redefining country rankings

Authors: Betania Silva C Campello, Guilherme Dean Pelegrina, Renata Pelissari, Ricardo Suyama, Leonardo Tomazeli Duarte

Abstract: Countries worldwide have been implementing different actions and national strategies for Artificial Intelligence (AI) to shape policy priorities and guide their development concerning AI. Several AI indices have emerged to assess countries' progress in AI development, aiding decision-making on investments and policy choices. Typically, these indices combine multiple indicators using linear additive methods such as weighted sums, although they are limited in their ability to account for interactions among indicators. Another limitation concerns the use of deterministic weights, which can be perceived as subjective and vulnerable to debate and scrutiny, especially by nations that feel disadvantaged. Aiming at mitigating these problems, we conduct a methodological analysis to derive AI indices based on multiple criteria decision analysis. Initially, we assess correlations between different AI dimensions and employ the Choquet integral to model them. Then, we apply the Stochastic Multicriteria Acceptability Analysis (SMAA) to conduct a sensitivity analysis using both weighted sum and Choquet integral in order to evaluate the stability of the indices with regard to the weights. Finally, we introduce a novel ranking methodology based on SMAA, which considers several sets of weights to derive the ranking of countries. As a result, instead of using predefined weights, in the proposed approach, the ranking is achieved based on the probabilities of countries occupying a specific position.
In the computational analysis, we utilize the data employed in The Global AI Index proposed by Tortoise. Results reveal correlations in the data, and our approach effectively mitigates bias. In the sensitivity analysis, we scrutinize changes in the ranking resulting from weight adjustments. We demonstrate that our proposed rankings closely align with those derived from weight variations, proving to be more robust.

Submitted 15 February, 2024; originally announced February 2024.
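
The rank-probability idea in the abstract above can be illustrated with a minimal Monte Carlo sketch of SMAA rank acceptability indices. This is not the authors' implementation: it covers only the weighted-sum case (not the Choquet integral), samples weights uniformly from the simplex, and uses made-up scores for three hypothetical countries. The names `sample_simplex`, `rank_acceptability`, and `scores` are illustrative assumptions.

```python
import random

def sample_simplex(n, rng):
    """Draw a weight vector uniformly from the unit simplex (weights sum to 1)."""
    draws = [rng.expovariate(1.0) for _ in range(n)]
    total = sum(draws)
    return [d / total for d in draws]

def rank_acceptability(scores, n_samples=5000, seed=0):
    """Estimate acc[i][r]: share of sampled weight vectors that place
    alternative i at rank r (r = 0 is the best position)."""
    rng = random.Random(seed)
    m, n = len(scores), len(scores[0])
    counts = [[0] * m for _ in range(m)]
    for _ in range(n_samples):
        w = sample_simplex(n, rng)
        # Weighted-sum value of each alternative under the sampled weights.
        values = [sum(wj * xj for wj, xj in zip(w, row)) for row in scores]
        order = sorted(range(m), key=lambda i: -values[i])
        for rank, alt in enumerate(order):
            counts[alt][rank] += 1
    return [[c / n_samples for c in row] for row in counts]

# Hypothetical normalized indicator scores (rows: countries, columns: dimensions).
scores = [
    [0.9, 0.8, 0.7],  # country A, better than B and C on every dimension
    [0.6, 0.2, 0.5],  # country B
    [0.3, 0.7, 0.4],  # country C
]
acc = rank_acceptability(scores)
```

Each row of `acc` is an estimated probability distribution over positions for one country, so a ranking can be read from these probabilities rather than from a single predefined weight vector. How the paper turns the probabilities into a final ranking is not specified in the abstract and is not reproduced here.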

arXiv:2305.06994

A statistical approach to detect sensitive features in a group fairness setting

Authors: Guilherme Dean Pelegrina, Miguel Couceiro, Leonardo Tomazeli Duarte

Abstract: The use of machine learning models in decision support systems with high societal impact has raised concerns about unfair (disparate) results for different groups of people. When evaluating such unfair decisions, one generally relies on predefined groups that are determined by a set of features that are considered sensitive. However, such an approach is subjective and does not guarantee that these features are the only ones to be considered as sensitive nor that they entail unfair (disparate) outcomes. In this paper, we propose a preprocessing step to address the task of automatically recognizing sensitive features that does not require a trained model to verify unfair results. Our proposal is based on the Hilbert-Schmidt independence criterion, which measures the statistical dependence of variable distributions. We hypothesize that if the dependence between the label vector and a candidate is high for a sensitive feature, then the information provided by this feature will entail disparate performance measures between groups. Our empirical results support our hypothesis and show that several features considered as sensitive in the literature do not necessarily entail disparate (unfair) results.

Submitted 11 May, 2023; originally announced May 2023.
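
As a rough illustration of the dependence measure named in the abstract above, the sketch below computes the standard biased empirical Hilbert-Schmidt independence criterion, trace(KHLH)/(n-1)^2, with Gaussian kernels on scalar samples. The kernel choice, the bandwidth `sigma=1.0`, and the synthetic `feature`, `noise`, and `labels` data are assumptions made for this example; the paper's actual statistical procedure is not described in the abstract and is not reproduced here.

```python
import math
import random

def gaussian_gram(values, sigma=1.0):
    """Gram matrix of a Gaussian (RBF) kernel on scalar samples."""
    return [[math.exp(-((a - b) ** 2) / (2 * sigma ** 2)) for b in values]
            for a in values]

def center(gram):
    """Double-center a symmetric Gram matrix: H K H with H = I - (1/n) 1 1^T."""
    n = len(gram)
    row_means = [sum(row) / n for row in gram]
    total_mean = sum(row_means) / n
    return [[gram[i][j] - row_means[i] - row_means[j] + total_mean
             for j in range(n)] for i in range(n)]

def hsic(x, y, sigma=1.0):
    """Biased empirical HSIC between two scalar samples of equal length."""
    n = len(x)
    k_centered = center(gaussian_gram(x, sigma))
    l_gram = gaussian_gram(y, sigma)
    # trace(K H L H) equals the elementwise product sum of (H K H) and L.
    total = sum(k_centered[i][j] * l_gram[i][j]
                for i in range(n) for j in range(n))
    return total / (n - 1) ** 2

rng = random.Random(0)
feature = [rng.gauss(0.0, 1.0) for _ in range(80)]   # synthetic candidate feature
noise = [rng.gauss(0.0, 1.0) for _ in range(80)]     # synthetic unrelated feature
labels = [1.0 if v > 0 else 0.0 for v in feature]    # label driven by `feature`

dependent_score = hsic(feature, labels)
independent_score = hsic(noise, labels)
```

In this toy setup `dependent_score` should come out clearly larger than `independent_score`, which is the kind of contrast a feature-screening step could use before any model is trained.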

arXiv:2211.02166

A $k$-additive Choquet integral-based approach to approximate the SHAP values for local interpretability in machine learning

Authors: Guilherme Dean Pelegrina, Leonardo Tomazeli Duarte, Michel Grabisch

Abstract: Besides accuracy, recent studies on machine learning models have been addressing the question of how the obtained results can be interpreted. Indeed, while complex machine learning models are able to provide very good results in terms of accuracy even in challenging applications, it is difficult to interpret them. Aiming at providing some interpretability for such models, one of the most famous methods, called SHAP, borrows the Shapley value concept from game theory in order to locally explain the predicted outcome of an instance of interest. As the SHAP values calculation needs previous computations on all possible coalitions of attributes, its computational cost can be very high. Therefore, a SHAP-based method called Kernel SHAP adopts an efficient strategy that approximates such values with less computational effort. In this paper, we also address local interpretability in machine learning based on Shapley values. Firstly, we provide a straightforward formulation of a SHAP-based method for local interpretability by using the Choquet integral, which leads to both Shapley values and Shapley interaction indices. Moreover, we also adopt the concept of $k$-additive games from game theory, which helps reduce the computational effort when estimating the SHAP values. The obtained results attest that our proposal needs fewer computations on coalitions of attributes to approximate the SHAP values.

Submitted 3 November, 2022; originally announced November 2022.
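
To make concrete why the abstract above calls exact computation costly, here is a minimal sketch of the exact Shapley value obtained by enumerating all coalitions. It is the textbook game-theoretic formula, not the paper's $k$-additive Choquet method or Kernel SHAP, and `toy_game` is a hypothetical three-player game rather than a model's prediction function.

```python
from itertools import combinations
from math import factorial

def shapley_values(n_players, value):
    """Exact Shapley values by enumerating every coalition.

    `value` maps a frozenset of player indices to a real payoff.
    Each player needs 2^(n-1) marginal contributions, hence the cost.
    """
    players = range(n_players)
    phi = [0.0] * n_players
    for i in players:
        others = [p for p in players if p != i]
        for size in range(n_players):
            # Shapley weight |S|! (n - |S| - 1)! / n! for coalitions of this size.
            weight = (factorial(size) * factorial(n_players - size - 1)
                      / factorial(n_players))
            for subset in combinations(others, size):
                s = frozenset(subset)
                phi[i] += weight * (value(s | {i}) - value(s))
    return phi

def toy_game(coalition):
    """Hypothetical game: payoff 1 only when players 0 and 1 cooperate."""
    return 1.0 if {0, 1} <= coalition else 0.0

phi = shapley_values(3, toy_game)
```

In this toy game players 0 and 1 share the payoff equally and player 2, who never changes the payoff, receives zero. The number of coalitions doubles with each added attribute, which is the cost that approximation schemes aim to reduce.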

arXiv:2209.04254

Shapley value-based approaches to explain the robustness of classifiers in machine learning

Authors: Guilherme Dean Pelegrina, Sajid Siraj

Abstract: The use of algorithm-agnostic approaches is an emerging area of research for explaining the contribution of individual features towards the predicted outcome. Whilst there is a focus on explaining the prediction itself, little has been done on explaining the robustness of these models, that is, how each feature contributes towards achieving that robustness. In this paper, we propose the use of Shapley values to explain the contribution of each feature towards the model's robustness, measured in terms of the Receiver Operating Characteristic (ROC) curve and the Area under the ROC curve (AUC). With the help of an illustrative example, we demonstrate the proposed idea of explaining the ROC curve, and visualising the uncertainties in these curves. For imbalanced datasets, the use of the Precision-Recall Curve (PRC) is considered more appropriate; therefore, we also demonstrate how to explain the PRCs with the help of Shapley values. The explanation of robustness can help analysts in a number of ways; for example, it can help in feature selection by identifying the irrelevant features that can be removed to reduce the computational complexity. It can also help in identifying the features having critical contributions or negative contributions towards robustness.

Submitted 3 November, 2022; v1 submitted 9 September, 2022; originally announced September 2022.

arXiv:2208.11362

A novel approach for Fair Principal Component Analysis based on eigendecomposition

Authors: Guilherme Dean Pelegrina, Leonardo Tomazeli Duarte

Abstract: Principal component analysis (PCA), a ubiquitous dimensionality reduction technique in signal processing, searches for a projection matrix that minimizes the mean squared error between the reduced dataset and the original one. Since classical PCA is not tailored to address concerns related to fairness, its application to actual problems may lead to disparity in the reconstruction errors of different groups (e.g., men and women, whites and blacks, etc.), with potentially harmful consequences such as the introduction of bias towards sensitive groups. Although several fair versions of PCA have been proposed recently, there still remains a fundamental gap in the search for algorithms that are simple enough to be deployed in real systems. To address this, we propose a novel PCA algorithm which tackles fairness issues by means of a simple strategy comprising a one-dimensional search which exploits the closed-form solution of PCA. As attested by numerical experiments, the proposal can significantly improve fairness with a very small loss in the overall reconstruction error and without resorting to complex optimization schemes. Moreover, our findings are consistent in several real situations as well as in scenarios with both unbalanced and balanced datasets.

Submitted 24 August, 2022; originally announced August 2022.

arXiv:2012.04091

doi 10.1007/978-3-030-57524-3_6

An unsupervised capacity identification approach based on Sobol' indices

Authors: Guilherme D. Pelegrina, Leonardo T. Duarte, Michel Grabisch, João M. T. Romano

Abstract: In many ranking problems, some particular aspects of the addressed situation should be taken into account in the aggregation process. An example is the presence of correlations between criteria, which may introduce bias in the derived ranking. In these cases, aggregation functions based on a capacity, such as the Choquet integral or the multilinear model, may be used to overcome this inconvenience. The adoption of such strategies requires a stage to estimate the parameters of these aggregation operators. This task may be difficult in situations in which we have neither further information about these parameters nor preferences given by the decision maker. Therefore, the aim of this paper is to deal with such situations through an unsupervised approach for capacity identification based on the multilinear model. Our goal is to estimate a capacity that can mitigate the bias introduced by correlations in the decision data and, therefore, to provide a fairer result. The viability of our proposal is attested by numerical experiments with synthetic data.

Submitted 7 December, 2020; originally announced December 2020.

Journal ref: In: Torra V., Narukawa Y., Nin J., Agell N. (eds). Modeling Decisions for Artificial Intelligence (MDAI 2020). Lecture Notes in Computer Science, vol 12256. Springer, Cham
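
For readers unfamiliar with capacity-based aggregation as mentioned in the abstract above, the following is a minimal sketch of the discrete Choquet integral for nonnegative criterion values. The capacity values in `capacity` are invented for illustration; the paper's unsupervised identification procedure (based on the multilinear model and Sobol' indices) is not reproduced here.

```python
def choquet_integral(values, capacity):
    """Discrete Choquet integral of nonnegative `values` w.r.t. `capacity`.

    `capacity` maps frozensets of criterion indices to [0, 1], with
    capacity[frozenset()] == 0 and the full set mapped to 1.
    """
    # Visit criteria from the smallest value to the largest.
    order = sorted(range(len(values)), key=lambda i: values[i])
    total, previous = 0.0, 0.0
    for position, idx in enumerate(order):
        # Coalition of criteria whose value is at least values[idx].
        coalition = frozenset(order[position:])
        total += (values[idx] - previous) * capacity[coalition]
        previous = values[idx]
    return total

# Hypothetical capacity on two criteria. The singletons sum to less than 1,
# so the pair is worth more together than separately.
capacity = {
    frozenset(): 0.0,
    frozenset({0}): 0.3,
    frozenset({1}): 0.5,
    frozenset({0, 1}): 1.0,
}
score = choquet_integral([0.2, 0.8], capacity)
```

Here the result is 0.2 * 1.0 + (0.8 - 0.2) * 0.5 = 0.5, whereas an additive capacity would reduce the same function to an ordinary weighted sum.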

arXiv:2012.04085

doi 10.1007/978-3-319-93764-9_52

Multicriteria decision making based on independent component analysis: A preliminary investigation considering the TOPSIS approach

Authors: Guilherme D. Pelegrina, Leonardo T. Duarte, João M. T. Romano

Abstract: This work proposes the application of independent component analysis to the problem of ranking different alternatives by considering criteria that are not necessarily statistically independent. In this case, the observed data (the criteria values for all alternatives) can be modeled as mixtures of latent variables. Therefore, in the proposed approach, we perform ranking by means of the TOPSIS approach and based on the independent components extracted from the collected decision data. Numerical experiments attest to the usefulness of the proposed approach, as they show that working with latent variables leads to better results compared to already existing methods.

Submitted 7 December, 2020; originally announced December 2020.

Journal ref: In: Deville Y., Gannot S., Mason R., Plumbley M., Ward D. (eds). Latent Variable Analysis and Signal Separation (LVA/ICA 2018). Lecture Notes in Computer Science, vol 10891. Springer, Cham

arXiv:2006.06137

doi 10.1109/IJCNN55064.2022.9892809

Analysis of Trade-offs in Fair Principal Component Analysis Based on Multi-objective Optimization

Authors: Guilherme D. Pelegrina, Renan D. B. Brotto, Leonardo T. Duarte, Romis Attux, João M. T. Romano

Abstract: In dimensionality reduction problems, the adopted technique may produce disparities between the representation errors of different groups. For instance, in the projected space, a specific class can be better represented in comparison with another one. In some situations, this unfair result may introduce ethical concerns. Aiming at overcoming this inconvenience, a fairness measure can be considered when performing dimensionality reduction through Principal Component Analysis. However, a solution that increases fairness tends to increase the overall reconstruction error. In this context, this paper proposes to address this trade-off by means of a multi-objective-based approach. For this purpose, we adopt a fairness measure associated with the disparity between the representation errors of different groups. Moreover, we investigate whether the solution of a classical Principal Component Analysis can be used to find a fair projection. Numerical experiments attest that a fairer result can be achieved with a very small loss in the overall reconstruction error.

Submitted 3 October, 2022; v1 submitted 10 June, 2020; originally announced June 2020.

Journal ref: IEEE 2022 International Joint Conference on Neural Networks (IJCNN), 2022, pp. 1-8

arXiv:2002.02257

doi 10.1016/j.eswa.2019.01.008

Application of independent component analysis and TOPSIS to deal with dependent criteria in multicriteria decision problems

Authors: Guilherme Dean Pelegrina, Leonardo Tomazeli Duarte, João Marcos Travassos Romano

Abstract: A vast number of multicriteria decision making methods have been developed to deal with the problem of ranking a set of alternatives evaluated in a multicriteria fashion. Very often, these methods assume that the evaluation among criteria is statistically independent. However, in actual problems, the observed data may comprise dependent criteria, which, among other problems, may result in biased rankings. In order to deal with this issue, we propose a novel approach whose aim is to estimate, from the observed data, a set of independent latent criteria, which can be seen as an alternative representation of the original decision matrix. A central element of our approach is to formulate the decision problem as a blind source separation problem, which allows us to apply independent component analysis techniques to estimate the latent criteria. Moreover, we consider TOPSIS-based approaches to obtain the ranking of alternatives from the latent criteria. Results on both synthetic and actual data attest to the relevance of the proposed approach.

Submitted 6 February, 2020; originally announced February 2020.

Journal ref: Expert Systems with Applications, Volume 122, Pages 262–280, May 2019
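
The ranking stage mentioned in the abstract above can be sketched with a plain TOPSIS implementation. This is the standard textbook procedure under the assumption that every criterion is a benefit criterion. It is applied directly to a made-up decision matrix, so the independent component analysis step that the paper uses to extract latent criteria is not included; `matrix` and `weights` are hypothetical.

```python
import math

def topsis(matrix, weights):
    """Closeness coefficients of each alternative (higher is better).

    Assumes all criteria are benefit criteria and that the alternatives
    are not all identical (otherwise both distances would be zero).
    """
    n_crit = len(weights)
    # Vector normalization per criterion, followed by weighting.
    norms = [math.sqrt(sum(row[j] ** 2 for row in matrix)) for j in range(n_crit)]
    weighted = [[weights[j] * row[j] / norms[j] for j in range(n_crit)]
                for row in matrix]
    ideal = [max(row[j] for row in weighted) for j in range(n_crit)]
    anti_ideal = [min(row[j] for row in weighted) for j in range(n_crit)]
    closeness = []
    for row in weighted:
        d_pos = math.dist(row, ideal)       # distance to the ideal solution
        d_neg = math.dist(row, anti_ideal)  # distance to the anti-ideal solution
        closeness.append(d_neg / (d_pos + d_neg))
    return closeness

# Hypothetical decision matrix (rows: alternatives, columns: criteria).
matrix = [
    [9.0, 8.0, 7.0],
    [6.0, 2.0, 5.0],
    [3.0, 7.0, 4.0],
]
weights = [0.5, 0.3, 0.2]
closeness = topsis(matrix, weights)
ranking = sorted(range(len(matrix)), key=lambda i: -closeness[i])
```

Because the first alternative is best on every criterion it coincides with the ideal solution and is ranked first. With correlated criteria, the distances can count the same underlying information more than once, which is the bias the abstract sets out to address.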

arXiv:2002.02241

doi 10.1016/j.eswa.2019.04.041

Application of multi-objective optimization to blind source separation

Authors: Guilherme Dean Pelegrina, Romis Attux, Leonardo Tomazeli Duarte

Abstract: Several problems in signal processing are addressed by expert systems which take into account a set of priors on the sought signals and systems. For instance, blind source separation is often tackled by means of a mono-objective formulation which relies on a separation criterion associated with a given property of the sought signals (sources). However, in many practical situations, there is more than one property to be exploited and, as a consequence, a set of separation criteria may be used to recover the original signals. In this context, this paper addresses the separation problem by means of an approach based on multi-objective optimization. Unlike the existing methods, which provide only one estimate for the original signals, our proposal leads to a set of solutions that can be utilized by the system user to make his/her decision. Results obtained through numerical experiments over a set of biomedical signals highlight the viability of the proposed approach, which provides estimations closer to the mean squared error solutions compared to the ones achieved via a mono-objective formulation. Moreover, since our proposal is quite general, this work also encourages future research to develop expert systems that exploit the multi-objective formulation in different source separation problems.

Submitted 6 February, 2020; originally announced February 2020.

Journal ref: Expert Systems with Applications, Volume 131, Pages 60–70, October 2019

arXiv:2002.01261

doi 10.1109/TCSII.2018.2821920

A Multi-Objective Approach for Post-Nonlinear Source Separation and its Application to Ion-Selective Electrodes

Authors: Guilherme Dean Pelegrina, Leonardo Tomazeli Duarte

Abstract: Blind source separation (BSS) methods have been applied to deal with the lack of selectivity of ion-selective electrodes (ISE). In this paper, differently from the standard BSS solutions, which are based on the optimization of a mono-objective cost function associated with a given property of the sought signals, we introduce a novel approach by relying on multi-objective optimization. Numerical experiments with actual data attested that our proposal allows the incorporation of additional information on the interference model and also provides the user a set of solutions from which he/she can select a proper one according to his/her prior knowledge on the problem.

Submitted 4 February, 2020; originally announced February 2020.

Journal ref: IEEE Transactions on Circuits and Systems II: Express Briefs, Volume 65, Issue 12, Pages 2067–2071, December 2018

Showing 1–11 of 11 results for author: Pelegrina, G D