-
Systematic analysis of the impact of label noise correction on ML Fairness
Authors:
I. Oliveira e Silva,
C. Soares,
I. Sousa,
R. Ghani
Abstract:
Arbitrary, inconsistent, or faulty decision-making raises serious concerns, and preventing unfair models is an increasingly important challenge in Machine Learning. Data often reflect past discriminatory behavior, and models trained on such data may reflect bias on sensitive attributes, such as gender, race, or age. One approach to develo** fair models is to preprocess the training data to remov…
▽ More
Arbitrary, inconsistent, or faulty decision-making raises serious concerns, and preventing unfair models is an increasingly important challenge in Machine Learning. Data often reflect past discriminatory behavior, and models trained on such data may reflect bias on sensitive attributes, such as gender, race, or age. One approach to develo** fair models is to preprocess the training data to remove the underlying biases while preserving the relevant information, for example, by correcting biased labels. While multiple label noise correction methods are available, the information about their behavior in identifying discrimination is very limited. In this work, we develop an empirical methodology to systematically evaluate the effectiveness of label noise correction techniques in ensuring the fairness of models trained on biased datasets. Our methodology involves manipulating the amount of label noise and can be used with fairness benchmarks but also with standard ML datasets. We apply the methodology to analyze six label noise correction methods according to several fairness metrics on standard OpenML datasets. Our results suggest that the Hybrid Label Noise Correction method achieves the best trade-off between predictive performance and fairness. Clustering-Based Correction can reduce discrimination the most, however, at the cost of lower predictive performance.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
Evolved Explainable Classifications for Lymph Node Metastases
Authors:
Iam Palatnik de Sousa,
Marley Maria Bernardes Rebuzzi Vellasco,
Eduardo Costa da Silva
Abstract:
A novel evolutionary approach for Explainable Artificial Intelligence is presented: the "Evolved Explanations" model (EvEx). This methodology consists in combining Local Interpretable Model Agnostic Explanations (LIME) with Multi-Objective Genetic Algorithms to allow for automated segmentation parameter tuning in image classification tasks. In this case, the dataset studied is Patch-Camelyon, comp…
▽ More
A novel evolutionary approach for Explainable Artificial Intelligence is presented: the "Evolved Explanations" model (EvEx). This methodology consists in combining Local Interpretable Model Agnostic Explanations (LIME) with Multi-Objective Genetic Algorithms to allow for automated segmentation parameter tuning in image classification tasks. In this case, the dataset studied is Patch-Camelyon, comprised of patches from pathology whole slide images. A publicly available Convolutional Neural Network (CNN) was trained on this dataset to provide a binary classification for presence/absence of lymph node metastatic tissue. In turn, the classifications are explained by means of evolving segmentations, seeking to optimize three evaluation goals simultaneously. The final explanation is computed as the mean of all explanations generated by Pareto front individuals, evolved by the developed genetic algorithm. To enhance reproducibility and traceability of the explanations, each of them was generated from several different seeds, randomly chosen. The observed results show remarkable agreement between different seeds. Despite the stochastic nature of LIME explanations, regions of high explanation weights proved to have good agreement in the heat maps, as computed by pixel-wise relative standard deviations. The found heat maps coincide with expert medical segmentations, which demonstrates that this methodology can find high quality explanations (according to the evaluation metrics), with the novel advantage of automated parameter fine tuning. These results give additional insight into the inner workings of neural network black box decision making for medical data.
△ Less
Submitted 14 May, 2020;
originally announced May 2020.
-
A Survey on QoE-oriented Wireless Resources Scheduling
Authors:
Ivo Sousa,
Maria Paula Queluz,
António Rodrigues
Abstract:
Future wireless systems are expected to provide a wide range of services to more and more users. Advanced scheduling strategies thus arise not only to perform efficient radio resource management, but also to provide fairness among the users. On the other hand, the users' perceived quality, i.e., Quality of Experience (QoE), is becoming one of the main drivers within the schedulers design. In this…
▽ More
Future wireless systems are expected to provide a wide range of services to more and more users. Advanced scheduling strategies thus arise not only to perform efficient radio resource management, but also to provide fairness among the users. On the other hand, the users' perceived quality, i.e., Quality of Experience (QoE), is becoming one of the main drivers within the schedulers design. In this context, this paper starts by providing a comprehension of what is QoE and an overview of the evolution of wireless scheduling techniques. Afterwards, a survey on the most recent QoE-based scheduling strategies for wireless systems is presented, highlighting the application/service of the different approaches reported in the literature, as well as the parameters that were taken into account for QoE optimization. Therefore, this paper aims at hel** readers interested in learning the basic concepts of QoE-oriented wireless resources scheduling, as well as getting in touch with its current research frontier.
△ Less
Submitted 14 March, 2020; v1 submitted 22 May, 2017;
originally announced May 2017.
-
Full-Duplex Relaying in MIMO-OFDM Frequency-Selective Channels with Optimal Adaptive Filtering
Authors:
João S. Lemos,
Francisco A. Monteiro,
Ivo Sousa,
António Rodrigues
Abstract:
In-band full-duplex transmission allows a relay station to theoretically double its spectral efficiency by simultaneously receiving and transmitting in the same frequency band, when compared to the traditional half-duplex or out-of-band full-duplex counterpart. Consequently, the induced self-interference suffered by the relay may reach considerable power levels, which decreases the signal-to-inter…
▽ More
In-band full-duplex transmission allows a relay station to theoretically double its spectral efficiency by simultaneously receiving and transmitting in the same frequency band, when compared to the traditional half-duplex or out-of-band full-duplex counterpart. Consequently, the induced self-interference suffered by the relay may reach considerable power levels, which decreases the signal-to-interference-plus-noise ratio (SINR) in a decode-and-forward (DF) relay, leading to a degradation of the relay performance. This paper presents a technique to cope with the problem of self-interference in broadband multiple-input multiple-output (MIMO) relays. The proposed method uses a time-domain cancellation in a DF relay, where a replica of the interfering signal is created with the help of a recursive least squares (RLS) algorithm that estimates the interference frequency-selective channel. Its convergence mean time is shown to be negligible by simulation results, when compared to the length of a typical orthogonal-frequency division multiplexing (OFDM) sequences. Moreover, the bit-error-rate (BER) and the SINR in a OFDM transmission are evaluated, confirming that the proposed method extends significantly the range of self-interference power to which the relay is resilient to, when compared with other mitigation schemes.
△ Less
Submitted 29 July, 2015; v1 submitted 4 May, 2015;
originally announced May 2015.