HTML conversions sometimes display errors due to content that did not convert correctly from the source. This paper uses the following packages that are not yet supported by the HTML conversion tool. Feedback on these issues are not necessary; they are known and are being worked on.

  • failed: perpage

Authors: achieve the best HTML results from your LaTeX submissions by following these best practices.

License: arXiv.org perpetual non-exclusive license
arXiv:2404.05678v2 [stat.ML] 09 Apr 2024
\MakePerPage

footnote

Flexible Fairness Learning via Inverse Conditional Permutation

Yuheng Lai, Leying Guan Dept. of Mathematics, Renmin University of ChinaDept. of Biostatistics, Yale University.
Correspondence to: [email protected]
This work is supported by NSF DMS-2310836.
Abstract

Equalized odds, as a popular notion of algorithmic fairness, aims to ensure that sensitive variables, such as race and gender, do not unfairly influence the algorithm prediction when conditioning on the true outcome. Despite rapid advancements, most of the current research focuses on the violation of equalized odds caused by one sensitive attribute, leaving the challenge of simultaneously accounting for multiple attributes under-addressed. We address this gap by introducing a fairness learning approach that integrates adversarial learning with a novel inverse conditional permutation. This approach effectively and flexibly handles multiple sensitive attributes, potentially of mixed data types. The efficacy and flexibility of our method are demonstrated through both simulation studies and empirical analysis of real-world datasets.

1 Introduction

Machine learning models have become important tools for aiding decision-making in various applications. One of the challenges in applying machine learning is ensuring that the models are fair, i.e., they do not discriminate against minorities or other protected groups (Mehrabi et al., 2021). Several fairness concepts have been developed in the literature to address different practical needs (Mehrabi et al., 2021; Castelnovo et al., 2022). In this work, we consider the equalized odds criterion (Hardt et al., 2016), defined as

Y^AY.perpendicular-toabsentperpendicular-to^𝑌conditional𝐴𝑌\hat{Y}\mathchoice{\mathrel{\hbox to 0.0pt{$\displaystyle\perp$\hss}\mkern 2.0% mu{\displaystyle\perp}}}{\mathrel{\hbox to 0.0pt{$\textstyle\perp$\hss}\mkern 2% .0mu{\textstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptstyle\perp$\hss}\mkern 2% .0mu{\scriptstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptscriptstyle\perp$% \hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}A\mid Y.over^ start_ARG italic_Y end_ARG start_RELOP ⟂ ⟂ end_RELOP italic_A ∣ italic_Y . (1)

Here, Y𝑌Yitalic_Y is the response variable, A𝐴Aitalic_A is the sensitive attribute(s) that we care to protect(e.g. gender / race / income), and Y^^𝑌\hat{Y}over^ start_ARG italic_Y end_ARG is the prediction given by any model. Notice that, when drop** the conditional term, (1) becomes unconditional independence as Y^Aperpendicular-toabsentperpendicular-to^𝑌𝐴\hat{Y}\mathchoice{\mathrel{\hbox to 0.0pt{$\displaystyle\perp$\hss}\mkern 2.0% mu{\displaystyle\perp}}}{\mathrel{\hbox to 0.0pt{$\textstyle\perp$\hss}\mkern 2% .0mu{\textstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptstyle\perp$\hss}\mkern 2% .0mu{\scriptstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptscriptstyle\perp$% \hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}Aover^ start_ARG italic_Y end_ARG start_RELOP ⟂ ⟂ end_RELOP italic_A to accommodate the need of demographic parity. The requirements for a learned model to satisfy certain independence relations are not limited to the realm of fairness: in this broader context, statisticians have long been working on robust inference techniques based on the concept of a pivot – a quantity whose distribution is invariant with respect to the nuisance parameters (see, e.g., Keener (2010)).

Despite the exciting progress, most existing algorithms aiming for equalized odds can only handle one protected attribute. However, in fields including clinical research, there is a growing need to mitigate biases related to multiple sensitive attributes (Yang et al., 2022). It has also been pointed out that fairness gerrymandering can occur when algorithmic decision-making considers only a single sensitive attribute at a time (Kearns et al., 2018). Additionally, the equalized-odds problem in the context of continuous sensitive attributes is much less explored.

Here, we alleviate these two limitations by proposing a versatile equalized odds training scheme, FairICP, as illustrated by Figure 1: Building on the sensitive attribute resampling framework (Romano et al., 2020), we generate A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG using a novel inverse conditional permutation (ICP) strategy, conditional permutations of A𝐴Aitalic_A given Y𝑌Yitalic_Y, and construct a more fair model through regularizing the distribution of (Y^,A,Y)^𝑌𝐴𝑌(\hat{Y},A,Y)( over^ start_ARG italic_Y end_ARG , italic_A , italic_Y ) toward the distribution of (Y^,A~,Y)^𝑌~𝐴𝑌(\hat{Y},\tilde{A},Y)( over^ start_ARG italic_Y end_ARG , over~ start_ARG italic_A end_ARG , italic_Y ) (see Figure 1). Our contributions are summarized as below.

  • We propose a novel inverse conditional permutation (ICP) strategy to generate A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG, conditional permutations of A𝐴Aitalic_A, without estimating the multi-dimensional conditional density of A|Yconditional𝐴𝑌A|Yitalic_A | italic_Y.

  • We show theoretically that the equalized odds condition holds asymptotically for (Y^,A~,Y)^𝑌~𝐴𝑌(\hat{Y},\tilde{A},Y)( over^ start_ARG italic_Y end_ARG , over~ start_ARG italic_A end_ARG , italic_Y ) when the A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG is generated according to ICP.

  • We propose examining the fairness level with a recently developed non-parametric conditional dependence measure.

  • We demonstrate experimentally that FairICP enjoys improved efficacy and flexibility.

Refer to caption
Figure 1: Illustration of the FairICP framework. A𝐴Aitalic_A, X𝑋Xitalic_X, and Y𝑌Yitalic_Y denote the sensitive attributes, other features, and labels.
Related work

Existing fairness concepts can be divided into different categories, including statistical/group fairness (Hardt et al., 2016; Zafar et al., 2017), which aims to ensure similar predictions across different groups; individual fairness (Dwork et al., 2012), which targets similar predictions for similar individuals; and causality-based fairness (Kusner et al., 2017), which tries to reveal causal relationships. More comprehensive discussions can be found in (Mehrabi et al., 2021; Castelnovo et al., 2022). Prominent statistical fairness measures include demographic parity (Zafar et al., 2019), equal opportunity (Hardt et al., 2016), and equalized odds (Hardt et al., 2016), which can all be articulated as (conditional) independence relations from a statistical perspective. Given the fairness concept, the associated procedures can be generally categorized into three types: (1) pre-processing, (2) post-processing, and (3) in-processing. Pre-processing aims to correct potentially biased data before any model fitting procedures (Zemel et al., 2013; Feldman et al., 2015), while post-processing modifies the classifier’s output at the test phase, leaving the model unchanged (Hardt et al., 2016; Kim et al., 2018; Hebert-Johnson et al., 2018).

FairICP is an in-processing method that encourages equalized-odds fairness for multiple complex sensitive attributes during model training. Several in-processing methods have been previously introduced to address the violation of equalized odds. For example, Agarwal et al. (2018) describes a procedure for handling categorical sensitive attributes for binary classification. Mary et al. (2019) trains a model that penalizes the violation of equalized-odds, measured by the Hirschfeld-Gebelein-Rényi (HGR) Maximum Correlation Coefficient, and is designed to reduce equalized-odds violations in the presence of one sensitive attribute, whether categorical or continuous. Closely related to FairICP, another line of in-processing algorithms encourages fairness using an adversarial loss designed for different fairness metrics (Zhang et al., 2018). Particularly, Romano et al. (2020) proposes a novel adversarial learning loss that utilizes the resampled synthetic variable A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG from the conditional distribution of a potentially continuous A𝐴Aitalic_A conditional on Y𝑌Yitalic_Y. Although the joint consideration of multiple sensitive attributes has been explored for demographic parity under this framework (Creager et al., 2019), jointly modeling multiple sensitive attributes, especially continuous ones, remains an unresolved challenge. This challenge is largely due to the difficulty of estimating the conditional density of A|Yconditional𝐴𝑌A|Yitalic_A | italic_Y. Our approach shares similar loss designs with that of Romano et al. (2020) but employs a novel permutation technique capable of handling multiple and complex protected variables.

2 Method

We propose a general adversarial learning procedure to obtain models with improved equalized odds guarantee through utilizing a novel Inverse Conditional Permutation (ICP). The proposed procedure FairICP enables efficient fairness learning with multi-dimensional sensitive attributes with either categorical or continuous response Y𝑌Yitalic_Y. Before describing our proposal, we first define some notations used throughout this paper. We will also review the framework of model training with equalized odds penalty based on sensitive attribute re-sampling Romano et al. (2020) and the challenge in applying sensitive attribute re-sampling and existing methods for multidimensional attributes, which motivates our proposal.

Let (Xi,Ai,Yi)subscript𝑋𝑖subscript𝐴𝑖subscript𝑌𝑖\left(X_{i},A_{i},Y_{i}\right)( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) for i=1,,ntr𝑖1subscript𝑛tri=1,\ldots,n_{\mathrm{tr}}italic_i = 1 , … , italic_n start_POSTSUBSCRIPT roman_tr end_POSTSUBSCRIPT be i.i.d. generated triples of (feature, sensitive attribute, response). Let fθf(.)f_{\theta_{f}}(.)italic_f start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( . ) be a prediction function with model parameter θfsubscript𝜃𝑓\theta_{f}italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT. Although fθf(.)f_{\theta_{f}}(.)italic_f start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( . ) can be any prediction that is differentiable in θfsubscript𝜃𝑓\theta_{f}italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT, we will consider fθf(.)f_{\theta_{f}}(.)italic_f start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( . ) as the neural network throughout this work. Let Y^=fθf(X)^𝑌subscript𝑓subscript𝜃𝑓𝑋\hat{Y}=f_{\theta_{f}}(X)over^ start_ARG italic_Y end_ARG = italic_f start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_X ) be the prediction for Y𝑌Yitalic_Y given X𝑋Xitalic_X. For a regression problem, Y^^𝑌\hat{Y}over^ start_ARG italic_Y end_ARG is the predicted value of the continuous response Y𝑌Yitalic_Y; for a classification problem, the last layer of fθf(.)f_{\theta_{f}}(.)italic_f start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( . ) is a softmax layer and Y^^𝑌\hat{Y}over^ start_ARG italic_Y end_ARG is the predicted probability vector for being in each class. We also denote 𝐗=(X1,,Xntr),𝐀=(A1,,Antr)formulae-sequence𝐗subscript𝑋1subscript𝑋subscript𝑛tr𝐀subscript𝐴1subscript𝐴subscript𝑛tr{\mathbf{X}}=\left(X_{1},\ldots,X_{n_{\mathrm{tr}}}\right),{\mathbf{A}}=\left(% A_{1},\ldots,A_{n_{\mathrm{tr}}}\right)bold_X = ( italic_X start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_X start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_tr end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) , bold_A = ( italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_A start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_tr end_POSTSUBSCRIPT end_POSTSUBSCRIPT ), 𝐘=(Y1,,Yntr)𝐘subscript𝑌1subscript𝑌subscript𝑛tr{\mathbf{Y}}=\left(Y_{1},\ldots,Y_{n_{\mathrm{tr}}}\right)bold_Y = ( italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_tr end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) and 𝐘^=(Y^1,,Y^ntr)^𝐘subscript^𝑌1subscript^𝑌subscript𝑛tr\hat{\mathbf{Y}}=(\hat{Y}_{1},\ldots,\hat{Y}_{n_{\mathrm{tr}}})over^ start_ARG bold_Y end_ARG = ( over^ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over^ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT italic_n start_POSTSUBSCRIPT roman_tr end_POSTSUBSCRIPT end_POSTSUBSCRIPT ).

2.1 Fairness-learning via sensitive attribute re-sampling

We first present the framework (Romano et al., 2020) denoted by Fair Dummies Learning (FDL) to achieve equalized odds for one sensitive attribute. Our terminology will differ somewhat from the terminology used in this reference, to help us introduce the new perspectives and frameworks in this paper later on.

To evaluate the potential violation of equalized odds (1) in prediction 𝐘^^𝐘\hat{\mathbf{Y}}over^ start_ARG bold_Y end_ARG, FDL construct a resampled version of the original sensitive attribute as 𝐀~~𝐀\tilde{{\mathbf{A}}}over~ start_ARG bold_A end_ARG to be a contrast and sample A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG according to 𝐀~Qntr(𝐘)\tilde{{\mathbf{A}}}\sim Q^{n_{\mathrm{tr}}}(\cdot\mid\mathbf{Y})over~ start_ARG bold_A end_ARG ∼ italic_Q start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT roman_tr end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( ⋅ ∣ bold_Y ), where Qntr(𝐘):=1intrQ(Yi)Q^{n_{\mathrm{tr}}}(\cdot\mid\mathbf{Y}):=\prod_{1\leq i\leq n_{\mathrm{tr}}}Q% \left(\cdot\mid Y_{i}\right)italic_Q start_POSTSUPERSCRIPT italic_n start_POSTSUBSCRIPT roman_tr end_POSTSUBSCRIPT end_POSTSUPERSCRIPT ( ⋅ ∣ bold_Y ) := ∏ start_POSTSUBSCRIPT 1 ≤ italic_i ≤ italic_n start_POSTSUBSCRIPT roman_tr end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_Q ( ⋅ ∣ italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), and Q(y)Q(\cdot\mid y)italic_Q ( ⋅ ∣ italic_y ) denotes the conditional distribution of A𝐴Aitalic_A given Y=y𝑌𝑦Y=yitalic_Y = italic_y. Since we generate 𝐀~~𝐀\tilde{{\mathbf{A}}}over~ start_ARG bold_A end_ARG without looking at 𝐘^^𝐘\hat{{\mathbf{Y}}}over^ start_ARG bold_Y end_ARG, the following equalized odds property holds: Yi^Ai~Yiperpendicular-toabsentperpendicular-to^subscript𝑌𝑖conditional~subscript𝐴𝑖subscript𝑌𝑖\hat{Y_{i}}\mathchoice{\mathrel{\hbox to 0.0pt{$\displaystyle\perp$\hss}\mkern 2% .0mu{\displaystyle\perp}}}{\mathrel{\hbox to 0.0pt{$\textstyle\perp$\hss}% \mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptstyle\perp$% \hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to 0.0pt{$% \scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}\tilde{A_{% i}}\mid Y_{i}over^ start_ARG italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_RELOP ⟂ ⟂ end_RELOP over~ start_ARG italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ∣ italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. Hence, we can measure the degree of violation to the equalized odds condition by measuring the discrepancy between the distribution of (𝐘^,𝐀,𝐘)^𝐘𝐀𝐘(\hat{{\mathbf{Y}}},{\mathbf{A}},{\mathbf{Y}})( over^ start_ARG bold_Y end_ARG , bold_A , bold_Y ) and the distribution of (𝐘^,𝐀~,𝐘)^𝐘~𝐀𝐘(\hat{{\mathbf{Y}}},\tilde{{\mathbf{A}}},{\mathbf{Y}})( over^ start_ARG bold_Y end_ARG , over~ start_ARG bold_A end_ARG , bold_Y ). Following this intuition, FDL utilizes GAN (Goodfellow et al., 2014), to iteratively learn how to separate the two distributions and optimize a fairness-regularized prediction loss. More specifically, define

f(θf)=𝔼XY[logpθf(YX)]subscript𝑓subscript𝜃𝑓subscript𝔼𝑋𝑌delimited-[]subscript𝑝subscript𝜃𝑓conditional𝑌𝑋\displaystyle\mathcal{L}_{f}\left(\theta_{f}\right)=\mathbb{E}_{XY}\left[-\log p% _{\theta_{f}}(Y\mid X)\right]caligraphic_L start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ) = blackboard_E start_POSTSUBSCRIPT italic_X italic_Y end_POSTSUBSCRIPT [ - roman_log italic_p start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_Y ∣ italic_X ) ] (2)
d(θf,θd)=𝔼Y^AY[logDθd(Y^,A,Y)]+𝔼Y^A~Y[log(1Dθd(Y^,A~,Y))]subscript𝑑subscript𝜃𝑓subscript𝜃𝑑subscript𝔼^𝑌𝐴𝑌delimited-[]subscript𝐷subscript𝜃𝑑^𝑌𝐴𝑌subscript𝔼^𝑌~𝐴𝑌delimited-[]1subscript𝐷subscript𝜃𝑑^𝑌~𝐴𝑌\displaystyle\mathcal{L}_{d}\left(\theta_{f},\theta_{d}\right)=\mathbb{E}_{% \hat{Y}AY}[-\log{D}_{\theta_{d}}(\hat{Y},A,Y)]+\mathbb{E}_{\hat{Y}\tilde{A}Y}[% -\log(1-{D}_{\theta_{d}}(\hat{Y},\tilde{A},Y))]caligraphic_L start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) = blackboard_E start_POSTSUBSCRIPT over^ start_ARG italic_Y end_ARG italic_A italic_Y end_POSTSUBSCRIPT [ - roman_log italic_D start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( over^ start_ARG italic_Y end_ARG , italic_A , italic_Y ) ] + blackboard_E start_POSTSUBSCRIPT over^ start_ARG italic_Y end_ARG over~ start_ARG italic_A end_ARG italic_Y end_POSTSUBSCRIPT [ - roman_log ( 1 - italic_D start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( over^ start_ARG italic_Y end_ARG , over~ start_ARG italic_A end_ARG , italic_Y ) ) ] (3)
Vμ(θf,θd)=(1μ)f(θf)μd(θf,θd)subscript𝑉𝜇subscript𝜃𝑓subscript𝜃𝑑1𝜇subscript𝑓subscript𝜃𝑓𝜇subscript𝑑subscript𝜃𝑓subscript𝜃𝑑\displaystyle V_{\mu}(\theta_{f},\theta_{d})=(1-\mu)\mathcal{L}_{f}(\theta_{f}% )-\mu\mathcal{L}_{d}(\theta_{f},\theta_{d})italic_V start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) = ( 1 - italic_μ ) caligraphic_L start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ) - italic_μ caligraphic_L start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) (4)

as the expected negative log-likelihood loss, the discriminator loss, and value function respectively, where Dθd(.)D_{\theta_{d}}(.)italic_D start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( . ) is the classifier which separates (𝐘^,𝐀,𝐘)^𝐘𝐀𝐘(\hat{{\mathbf{Y}}},{\mathbf{A}},{\mathbf{Y}})( over^ start_ARG bold_Y end_ARG , bold_A , bold_Y ) and (𝐘^,𝐀~,𝐘)^𝐘~𝐀𝐘(\hat{{\mathbf{Y}}},\tilde{\mathbf{A}},{\mathbf{Y}})( over^ start_ARG bold_Y end_ARG , over~ start_ARG bold_A end_ARG , bold_Y ), and μ[0,1]𝜇01\mu\in[0,1]italic_μ ∈ [ 0 , 1 ] is a tuning parameter that controls the prediction-fairness trade-off. Then, FDL learns θf,θdsubscript𝜃𝑓subscript𝜃𝑑\theta_{f},\theta_{d}italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT by finding the minimax solution

θ^f,θ^d=argminθfmaxθdVμ(θf,θd).subscript^𝜃𝑓subscript^𝜃𝑑subscriptsubscript𝜃𝑓subscriptsubscript𝜃𝑑subscript𝑉𝜇subscript𝜃𝑓subscript𝜃𝑑\hat{\theta}_{f},\hat{\theta}_{d}=\arg\min_{\theta_{f}}\max_{\theta_{d}}V_{\mu% }(\theta_{f},\theta_{d}).over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT = roman_arg roman_min start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) . (5)

FDL generates A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG through Conditional Randomization (CR) (Candès et al., 2018), which is done by re-sampling it from its (estimated) conditional distribution given other variables that we want to control for. However, the effectiveness of conditional randomization requires estimation of Q(AY)𝑄conditional𝐴𝑌Q(A\mid Y)italic_Q ( italic_A ∣ italic_Y ), which is challenging when A𝐴Aitalic_A is multi-dimensional (Scott, 1991). This challenge is not unique to FDL and needs to be addressed for other non-resampling-based approaches such as Holdout Randomization Test (HRT) (Tansey et al., 2022) as well. In addition, the sensitive attributes A𝐴Aitalic_A can also potentially be both discrete and continuous, which adds another layer of the challenge of estimating Q(AY)𝑄conditional𝐴𝑌Q(A\mid Y)italic_Q ( italic_A ∣ italic_Y ). An approach allows A𝐴Aitalic_A to have flexible types and scales well with the dimension of A𝐴Aitalic_A to help the promotion of fairness learning in many social and medical applications.

2.2 Fairness learning via ICP

To circumvent the challenge in learning the conditional density of A𝐴Aitalic_A given Y𝑌Yitalic_Y, we pivot to estimate Y𝑌Yitalic_Y given A𝐴Aitalic_A and leverage Conditional Permutation (CP) (Berrett et al., 2020) to generate a permuted version of 𝐀~~𝐀\tilde{{\mathbf{A}}}over~ start_ARG bold_A end_ARG which also has the property of equalized odds (1) asymptotically.

CP in fairness learning.

To begin with, we first introduce the vanilla CP strategy to generate permutation copies in Berrett et al. (2020) in our setting.

Let 𝒮nsubscript𝒮𝑛\mathcal{S}_{n}caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT denote the set of permutations on the indices {1,,n}1𝑛\{1,\ldots,n\}{ 1 , … , italic_n }. Given any vector 𝐱=(x1,,xn)𝐱subscript𝑥1subscript𝑥𝑛\mathbf{x}=\left(x_{1},\ldots,x_{n}\right)bold_x = ( italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) and any permutation π𝒮n𝜋subscript𝒮𝑛\pi\in\mathcal{S}_{n}italic_π ∈ caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, define 𝐱π=(xπ(1),,xπ(n))subscript𝐱𝜋subscript𝑥𝜋1subscript𝑥𝜋𝑛\mathbf{x}_{\pi}=\left(x_{\pi(1)},\ldots,x_{\pi(n)}\right)bold_x start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT = ( italic_x start_POSTSUBSCRIPT italic_π ( 1 ) end_POSTSUBSCRIPT , … , italic_x start_POSTSUBSCRIPT italic_π ( italic_n ) end_POSTSUBSCRIPT ) as permuted version of 𝐱𝐱\mathbf{x}bold_x with its entries reordered according to the permutation π𝜋\piitalic_π. Instead of drawing a permutation ΠΠ\Piroman_Π uniformly at random, CP assigns unequal sampling probability to permutations based on the conditional probability of observing AΠsubscript𝐴ΠA_{\Pi}italic_A start_POSTSUBSCRIPT roman_Π end_POSTSUBSCRIPT given Y𝑌Yitalic_Y:

{Π=π𝐀,𝐘}=qn(𝐀π𝐘)π𝒮nqn(𝐀π𝐘).conditional-setΠ𝜋𝐀𝐘superscript𝑞𝑛conditionalsubscript𝐀𝜋𝐘subscriptsuperscript𝜋subscript𝒮𝑛superscript𝑞𝑛conditionalsubscript𝐀superscript𝜋𝐘\mathbb{P}\left\{\Pi=\pi\mid{\mathbf{A}},{\mathbf{Y}}\right\}=\frac{q^{n}\left% ({\mathbf{A}}_{\pi}\mid{\mathbf{Y}}\right)}{\sum_{\pi^{\prime}\in\mathcal{S}_{% n}}q^{n}\left({\mathbf{A}}_{\pi^{\prime}}\mid{\mathbf{Y}}\right)}.blackboard_P { roman_Π = italic_π ∣ bold_A , bold_Y } = divide start_ARG italic_q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_A start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT ∣ bold_Y ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_A start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∣ bold_Y ) end_ARG . (6)

Here we let q(y)q(\cdot\mid y)italic_q ( ⋅ ∣ italic_y ) be the density of the distribution Q(y)Q(\cdot\mid y)italic_Q ( ⋅ ∣ italic_y ) (i.e., q(y)q(\cdot\mid y)italic_q ( ⋅ ∣ italic_y ) is the conditional density of A𝐴Aitalic_A given Y=y𝑌𝑦Y=yitalic_Y = italic_y ). We write qn(𝐘):=q(Y1)q(Yn)q^{n}(\cdot\mid{\mathbf{Y}}):=q\left(\cdot\mid Y_{1}\right)\cdots q\left(\cdot% \mid Y_{n}\right)italic_q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( ⋅ ∣ bold_Y ) := italic_q ( ⋅ ∣ italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) ⋯ italic_q ( ⋅ ∣ italic_Y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) to denote the product density. This leads to the synthetic 𝐀~=𝐀Π~𝐀subscript𝐀Π\tilde{\mathbf{A}}={\mathbf{A}}_{\Pi}over~ start_ARG bold_A end_ARG = bold_A start_POSTSUBSCRIPT roman_Π end_POSTSUBSCRIPT, which, intuitively, should have low dependence on 𝐘^^𝐘\hat{\mathbf{Y}}over^ start_ARG bold_Y end_ARG given 𝐘𝐘{\mathbf{Y}}bold_Y, and can thus be utilized to encourage equalized odds as described in (1).

ICP circumvents density estimation of AYconditional𝐴𝑌A\mid Yitalic_A ∣ italic_Y.

Unfortunately, conducting conditional permutation with multivariate A𝐴Aitalic_A relies on conditional density estimation of A𝐴Aitalic_A given Y𝑌Yitalic_Y and does not alleviate the issue arising from multivariate density estimation as we mentioned earlier. To circumvent this problem, we propose a simple ICP (inverse conditional permutation) strategy which is indirect yet scales better with the dimensionality of A𝐴Aitalic_A and can adapt easily to various data types of A𝐴Aitalic_A.

ICP begins with the observation that the distribution of (𝐀Π,𝐘)subscript𝐀Π𝐘({\mathbf{A}}_{\Pi},{\mathbf{Y}})( bold_A start_POSTSUBSCRIPT roman_Π end_POSTSUBSCRIPT , bold_Y ) is identical as the distribution of (𝐀,𝐘Π1)𝐀subscript𝐘superscriptΠ1({\mathbf{A}},{\mathbf{Y}}_{{\Pi}^{-1}})( bold_A , bold_Y start_POSTSUBSCRIPT roman_Π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ). Hence, intuitively, instead of determining ΠΠ\Piroman_Π based on the conditional law of A𝐴Aitalic_A given Y𝑌Yitalic_Y, we first consider the conditional permutation of Y𝑌Yitalic_Y given A𝐴Aitalic_A, which is one dimensional and can be estimated conveniently using standard regression or generalized regression techniques regardless of the complexity in A𝐴Aitalic_A. We then generate ΠΠ\Piroman_Π by applying an inverse operator to the distribution of these permutations. Specifically, we generate 𝐀~=𝐀Π~𝐀subscript𝐀Π\tilde{\mathbf{A}}={\mathbf{A}}_{\Pi}over~ start_ARG bold_A end_ARG = bold_A start_POSTSUBSCRIPT roman_Π end_POSTSUBSCRIPT with the following probabilities:

{Π=π𝐀,𝐘}=qn(𝐘π1𝐀)π𝒮nqn(𝐘π1𝐀).conditional-setΠ𝜋𝐀𝐘superscript𝑞𝑛conditionalsubscript𝐘superscript𝜋1𝐀subscriptsuperscript𝜋subscript𝒮𝑛superscript𝑞𝑛conditionalsubscript𝐘superscriptsuperscript𝜋1𝐀\mathbb{P}\left\{\Pi=\pi\mid{\mathbf{A}},{\mathbf{Y}}\right\}=\frac{q^{n}\left% ({\mathbf{Y}}_{{\pi}^{-1}}\mid{\mathbf{A}}\right)}{\sum_{\pi^{\prime}\in% \mathcal{S}_{n}}q^{n}\left({\mathbf{Y}}_{{\pi^{\prime}}^{-1}}\mid{\mathbf{A}}% \right)}.blackboard_P { roman_Π = italic_π ∣ bold_A , bold_Y } = divide start_ARG italic_q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_Y start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∣ bold_A ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_Y start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∣ bold_A ) end_ARG . (7)

Indeed, this intuition helps us to A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG which can be used to monitor the violation of the equalized odds condition.

Theorem 2.1.

For any n𝑛nitalic_n i.i.dformulae-sequence𝑖𝑖𝑑i.i.ditalic_i . italic_i . italic_d observations (𝐗,𝐀,𝐘)𝐗𝐀𝐘({\mathbf{X}},{\mathbf{A}},{\mathbf{Y}})( bold_X , bold_A , bold_Y ), let 𝐀~normal-~𝐀\tilde{\mathbf{A}}over~ start_ARG bold_A end_ARG be generated by the ICP sampling scheme (7). Let S(𝐀)𝑆𝐀S({\mathbf{A}})italic_S ( bold_A ) denote the unordered set of rows in 𝐀𝐀{\mathbf{A}}bold_A, and let p𝑝pitalic_p be the dimension of A𝐴Aitalic_A. We have

(1) If Y^AYperpendicular-toabsentperpendicular-tonormal-^𝑌conditional𝐴𝑌\hat{Y}\mathchoice{\mathrel{\hbox to 0.0pt{$\displaystyle\perp$\hss}\mkern 2.0% mu{\displaystyle\perp}}}{\mathrel{\hbox to 0.0pt{$\textstyle\perp$\hss}\mkern 2% .0mu{\textstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptstyle\perp$\hss}\mkern 2% .0mu{\scriptstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptscriptstyle\perp$% \hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}A\mid Yover^ start_ARG italic_Y end_ARG start_RELOP ⟂ ⟂ end_RELOP italic_A ∣ italic_Y, then (𝐘^,𝐀,𝐘)=d(𝐘^,𝐀~,𝐘)superscript𝑑normal-^𝐘𝐀𝐘normal-^𝐘normal-~𝐀𝐘(\hat{\mathbf{Y}},{\mathbf{A}},{\mathbf{Y}})\,{\buildrel d\over{=}}\,(\hat{% \mathbf{Y}},\tilde{\mathbf{A}},{\mathbf{Y}})( over^ start_ARG bold_Y end_ARG , bold_A , bold_Y ) start_RELOP SUPERSCRIPTOP start_ARG = end_ARG start_ARG italic_d end_ARG end_RELOP ( over^ start_ARG bold_Y end_ARG , over~ start_ARG bold_A end_ARG , bold_Y ).

(2) If (𝐘^,𝐀,𝐘)=d(𝐘^,𝐀~,𝐘)superscript𝑑normal-^𝐘𝐀𝐘normal-^𝐘normal-~𝐀𝐘(\hat{\mathbf{Y}},{\mathbf{A}},{\mathbf{Y}})\,{\buildrel d\over{=}}\,(\hat{% \mathbf{Y}},\tilde{\mathbf{A}},{\mathbf{Y}})( over^ start_ARG bold_Y end_ARG , bold_A , bold_Y ) start_RELOP SUPERSCRIPTOP start_ARG = end_ARG start_ARG italic_d end_ARG end_RELOP ( over^ start_ARG bold_Y end_ARG , over~ start_ARG bold_A end_ARG , bold_Y ), then 𝐘^𝐀(𝐘 and S(𝐀))perpendicular-toabsentperpendicular-tonormal-^𝐘conditional𝐀𝐘 and 𝑆𝐀\hat{\mathbf{Y}}\mathchoice{\mathrel{\hbox to 0.0pt{$\displaystyle\perp$\hss}% \mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to 0.0pt{$\textstyle\perp$% \hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptstyle% \perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to 0.0pt{$% \scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}{\mathbf{A% }}\mid({\mathbf{Y}}\mbox{ and }S({\mathbf{A}}))over^ start_ARG bold_Y end_ARG start_RELOP ⟂ ⟂ end_RELOP bold_A ∣ ( bold_Y and italic_S ( bold_A ) ). Further, when logpn0normal-→𝑝𝑛0\frac{\log p}{n}\rightarrow 0divide start_ARG roman_log italic_p end_ARG start_ARG italic_n end_ARG → 0, the asymptotic equalized odds condition holds: for any constant vectors t1subscript𝑡1t_{1}italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and t2subscript𝑡2t_{2}italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT,

[Y^t1,At2|Y][Y^t1|Y][At2|Y]n0.delimited-[]formulae-sequence^𝑌subscript𝑡1𝐴conditionalsubscript𝑡2𝑌delimited-[]^𝑌conditionalsubscript𝑡1𝑌delimited-[]𝐴conditionalsubscript𝑡2𝑌𝑛0\mathbb{P}\left[\hat{Y}\leq t_{1},A\leq t_{2}|Y\right]-\mathbb{P}\left[\hat{Y}% \leq t_{1}|Y\right]\mathbb{P}\left[A\leq t_{2}|Y\right]\overset{n\rightarrow% \infty}{\rightarrow}0.blackboard_P [ over^ start_ARG italic_Y end_ARG ≤ italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_A ≤ italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | italic_Y ] - blackboard_P [ over^ start_ARG italic_Y end_ARG ≤ italic_t start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT | italic_Y ] blackboard_P [ italic_A ≤ italic_t start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT | italic_Y ] start_OVERACCENT italic_n → ∞ end_OVERACCENT start_ARG → end_ARG 0 .
Remark 2.2.

In FDL, the availability of accurate conditional density A|Yconditional𝐴𝑌A|Yitalic_A | italic_Y enables the equivalence between (𝐘^,𝐀,𝐘)=d(𝐘^,𝐀~,𝐘)superscript𝑑^𝐘𝐀𝐘^𝐘~𝐀𝐘(\hat{\mathbf{Y}},{\mathbf{A}},{\mathbf{Y}})\,{\buildrel d\over{=}}\,(\hat{% \mathbf{Y}},\tilde{\mathbf{A}},{\mathbf{Y}})( over^ start_ARG bold_Y end_ARG , bold_A , bold_Y ) start_RELOP SUPERSCRIPTOP start_ARG = end_ARG start_ARG italic_d end_ARG end_RELOP ( over^ start_ARG bold_Y end_ARG , over~ start_ARG bold_A end_ARG , bold_Y ) and 𝐘^𝐀𝐘perpendicular-toabsentperpendicular-to^𝐘conditional𝐀𝐘\hat{\mathbf{Y}}\mathchoice{\mathrel{\hbox to 0.0pt{$\displaystyle\perp$\hss}% \mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to 0.0pt{$\textstyle\perp$% \hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptstyle% \perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to 0.0pt{$% \scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}{\mathbf{A% }}\mid{\mathbf{Y}}over^ start_ARG bold_Y end_ARG start_RELOP ⟂ ⟂ end_RELOP bold_A ∣ bold_Y, ICP pays an almost negligible price and offers a fast-rate asymptotic equivalence but circumvents the density estimation of AYconditional𝐴𝑌A\mid Yitalic_A ∣ italic_Y.

Motivated by this, we propose an adversarial learning procedure utilizing the permuted sensitive attributes A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG from the ICP sampling scheme (7), which is built under the same formulation of the loss function shown previously in Section 2.1. Let ^f(θf)subscript^𝑓subscript𝜃𝑓\hat{\mathcal{L}}_{f}(\theta_{f})over^ start_ARG caligraphic_L end_ARG start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ) and ^d(θf,θd)subscript^𝑑subscript𝜃𝑓subscript𝜃𝑑\hat{\mathcal{L}}_{d}(\theta_{f},\theta_{d})over^ start_ARG caligraphic_L end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) be the empirical realizations of the losses f(θf)subscript𝑓subscript𝜃𝑓\mathcal{L}_{f}(\theta_{f})caligraphic_L start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ), and d(θf,θd)subscript𝑑subscript𝜃𝑓subscript𝜃𝑑\mathcal{L}_{d}(\theta_{f},\theta_{d})caligraphic_L start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) defined in (2) and (3) respectively. Algorithm 1 presents the details. We detail the permutation sampling algorithm, Parallelized pairwise sampler, in Appendix B for the sake of completeness, which is adapted from Berrett et al. (2020).

Theorem 2.3.

If there exists a minimax solution (θ^f,θ^d)subscriptnormal-^𝜃𝑓subscriptnormal-^𝜃𝑑(\hat{\theta}_{f},\hat{\theta}_{d})( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) for Vμ(.,.)V_{\mu}\left(.,.\right)italic_V start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( . , . ) defined in (5) such that Vμ(θ^f,θ^r)=subscript𝑉𝜇subscriptnormal-^𝜃𝑓subscriptnormal-^𝜃𝑟absentV_{\mu}(\hat{\theta}_{f},\hat{\theta}_{r})=italic_V start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ) = (1μ)H(YX)μlog(4)1𝜇𝐻conditional𝑌𝑋𝜇4(1-\mu)H(Y\mid X)-\mu\log(4)( 1 - italic_μ ) italic_H ( italic_Y ∣ italic_X ) - italic_μ roman_log ( 4 ), where H(YX)=𝔼XY[logp(YX)]𝐻conditional𝑌𝑋subscript𝔼𝑋𝑌delimited-[]𝑝conditional𝑌𝑋H(Y\mid X)=\mathbb{E}_{XY}\left[-\log p(Y\mid X)\right]italic_H ( italic_Y ∣ italic_X ) = blackboard_E start_POSTSUBSCRIPT italic_X italic_Y end_POSTSUBSCRIPT [ - roman_log italic_p ( italic_Y ∣ italic_X ) ] denotes the conditional entropy, then f^θ^f()subscriptnormal-^𝑓subscriptnormal-^𝜃𝑓normal-⋅\hat{f}_{\hat{\theta}_{f}}(\cdot)over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ⋅ ) is both an optimal and fair predictor, which simultaneously minimizes f(θf)subscript𝑓subscript𝜃𝑓\mathcal{L}_{f}\left(\theta_{f}\right)caligraphic_L start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ) and satisfies equalized odds simultaneously.

Input: Data (𝐗,𝐀,𝐘)={(Xi,Ai,Yi)}itr𝐗𝐀𝐘subscriptsubscript𝑋𝑖subscript𝐴𝑖subscript𝑌𝑖𝑖subscripttr({\mathbf{X}},{\mathbf{A}},{\mathbf{Y}})=\{(X_{i},A_{i},Y_{i})\}_{i\in\mathcal% {I}_{\mathrm{tr}}}( bold_X , bold_A , bold_Y ) = { ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) } start_POSTSUBSCRIPT italic_i ∈ caligraphic_I start_POSTSUBSCRIPT roman_tr end_POSTSUBSCRIPT end_POSTSUBSCRIPT

Parameters: penalty weight μ𝜇\muitalic_μ, step size α𝛼\alphaitalic_α, number of gradient steps Ngsubscript𝑁𝑔N_{g}italic_N start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT, and iterations T𝑇Titalic_T.

Output: predictive model f^θ^f()subscript^𝑓subscript^𝜃𝑓\hat{f}_{\hat{\theta}_{f}}(\cdot)over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ⋅ ) and discriminator D^θ^d()subscript^𝐷subscript^𝜃𝑑\hat{D}_{\hat{\theta}_{d}}(\cdot)over^ start_ARG italic_D end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ⋅ ).

1:for t=1,,T𝑡1𝑇t=1,\dots,Titalic_t = 1 , … , italic_T do
2:     Generate permuted copy 𝐀~normal-~𝐀\tilde{\mathbf{A}}over~ start_ARG bold_A end_ARG by (7) (using the sampler described in Appendix B)
3:     Update the discriminator parameters θdsubscript𝜃𝑑\theta_{d}italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT by repeating the following for Ngsubscript𝑁𝑔N_{g}italic_N start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT gradient steps:
θdθdαθd^d(θf,θd).subscript𝜃𝑑subscript𝜃𝑑𝛼subscriptsubscript𝜃𝑑subscript^𝑑subscript𝜃𝑓subscript𝜃𝑑\theta_{d}\leftarrow\theta_{d}-\alpha\nabla_{\theta_{d}}\hat{\mathcal{L}}_{d}(% \theta_{f},\theta_{d}).italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ← italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT - italic_α ∇ start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT over^ start_ARG caligraphic_L end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) .
4:     Update the predictive model parameters θfsubscript𝜃𝑓\theta_{f}italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT by repeating the following for Ngsubscript𝑁𝑔N_{g}italic_N start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT gradient steps:
θfθfαθf[(1μ)^f(θf)μ^d(θf,θd)].subscript𝜃𝑓subscript𝜃𝑓𝛼subscriptsubscript𝜃𝑓1𝜇subscript^𝑓subscript𝜃𝑓𝜇subscript^𝑑subscript𝜃𝑓subscript𝜃𝑑\theta_{f}\leftarrow\theta_{f}-\alpha\nabla_{\theta_{f}}\left[(1-\mu)\hat{% \mathcal{L}}_{f}(\theta_{f})-\mu\hat{\mathcal{L}}_{d}(\theta_{f},\theta_{d})% \right].italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ← italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT - italic_α ∇ start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ ( 1 - italic_μ ) over^ start_ARG caligraphic_L end_ARG start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ) - italic_μ over^ start_ARG caligraphic_L end_ARG start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) ] .
5:end for

Output: Predictive model f^θ^f()subscript^𝑓subscript^𝜃𝑓\hat{f}_{\hat{\theta}_{f}}(\cdot)over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ⋅ ).

Algorithm 1 Fairness learning via ICP

In practice, the assumption of the existence of an optimal and fair predictor (in terms of equalized odds) may not hold (Tang and Zhang, 2022). Setting μ𝜇\muitalic_μ to a large value will preferably enforce f𝑓fitalic_f to satisfy equalized odds while setting μ𝜇\muitalic_μ close to 0 will push f𝑓fitalic_f to be optimal: an increase in accuracy would often be accompanied by a decrease in fairness and vice-versa.

2.3 Density Estimation

The estimation of conditional densities is a crucial part of both our method and previous work (Romano et al., 2020; Mary et al., 2019). However, unlike the previous work which requires the estimation of AYconditional𝐴𝑌A\mid Yitalic_A ∣ italic_Y, our proposal looks into the inverse relationship of YAconditional𝑌𝐴Y\mid Aitalic_Y ∣ italic_A. In practice, our proposed method can easily leverage the state-of-the-art density estimator and is less disturbed by the increased complexity in A𝐴Aitalic_A, due to either dimension or data types.

In this manuscript, we applied Masked Autoregressive Flow (MAF) (Papamakarios et al., 2017) to estimate the conditional density of Y|Aconditional𝑌𝐴Y|Aitalic_Y | italic_A when Y𝑌Yitalic_Y is continuous and A1,,Aksubscript𝐴1subscript𝐴𝑘A_{1},\ldots,A_{k}italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT can take arbitrary data types (discrete or continuous) 111In MAF paper (Papamakarios et al., 2017), to estimate p(UV)𝑝conditional𝑈𝑉p(U\mid V)italic_p ( italic_U ∣ italic_V ), U𝑈Uitalic_U is assumed to be continuous while V𝑉Vitalic_V can take arbitrary form, but there’s no requirements about the dimensionality of U𝑈Uitalic_U and V𝑉Vitalic_V. In classification scenario when Y{0,1,,L}𝑌01𝐿Y\in\{0,1,\ldots,L\}italic_Y ∈ { 0 , 1 , … , italic_L }, one can always fit a classifier to model Y|Aconditional𝑌𝐴Y|Aitalic_Y | italic_A. To this end, FairICP is more feasible to handle more complex sensitive attributes and is suitable for both regression and classification tasks. To provide more theoretical and empirical insights into how the quality of density estimation affects CP and ICP, we have additional analysis in Appendix C.

3 Measuring the violation of equalized odds

To gain a reliable understanding of the potential violation of equalized odds using the trained model f^^𝑓\hat{f}over^ start_ARG italic_f end_ARG, we carry out a disciplined evaluation utilizing an untouched test set (𝐗te,𝐀te,𝐘te)={(Xi,Ai,Yi)}1tesuperscript𝐗𝑡𝑒superscript𝐀𝑡𝑒superscript𝐘𝑡𝑒subscriptsubscript𝑋𝑖subscript𝐴𝑖subscript𝑌𝑖1subscript𝑡𝑒({\mathbf{X}}^{te},{\mathbf{A}}^{te},{\mathbf{Y}}^{te})=\{(X_{i},A_{i},Y_{i})% \}_{1\in\mathcal{I}_{te}}( bold_X start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT , bold_A start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT , bold_Y start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT ) = { ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) } start_POSTSUBSCRIPT 1 ∈ caligraphic_I start_POSTSUBSCRIPT italic_t italic_e end_POSTSUBSCRIPT end_POSTSUBSCRIPT and a recently proposed conditional independence measure.

3.1 Measure of Conditional Dependence

From a statistical point of view, we note that equalized odds (1) is exactly the notion of conditional independence. Thus, measuring the violation of equalized odds is equivalent to measuring conditional independence, and there have been some works trying to bridge these two problems (Mary et al., 2019; Kamishima et al., 2011; Romano et al., 2020).

In Mary et al. (2019), Hirschfeld-Gebelein-Renyi Maximum Correlation Coefficient (HGR) is chosen to measure the conditional dependence for equalized odds and used as a penalty term to fit a fair model. However, the estimation of HGR, which is based on kernel density estimation of A𝐴Aitalic_A, becomes difficult when A𝐴Aitalic_A is multivariate. Here, we take advantage of recent developments in conditional dependence measures and link them to our problem by introducing a flexible measure proposed by Huang et al. (2022).

Definition 3.1.

Kernel Partial Correlation (KPC) coefficient ρ2ρ2(U,VW)superscript𝜌2superscript𝜌2𝑈conditional𝑉𝑊\rho^{2}\equiv\rho^{2}(U,V\mid W)italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≡ italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_U , italic_V ∣ italic_W ) is defined as:

ρ2(U,VW):=𝔼[MMD2(PUWV,PUW)]𝔼[MMD2(δU,PUW)],assignsuperscript𝜌2𝑈conditional𝑉𝑊𝔼delimited-[]superscriptMMD2subscript𝑃conditional𝑈𝑊𝑉subscript𝑃conditional𝑈𝑊𝔼delimited-[]superscriptMMD2subscript𝛿𝑈subscript𝑃conditional𝑈𝑊\rho^{2}(U,V\mid W):=\frac{\mathbb{E}\left[\operatorname{MMD}^{2}\left(P_{U% \mid WV},P_{U\mid W}\right)\right]}{\mathbb{E}\left[\operatorname{MMD}^{2}% \left(\delta_{U},P_{U\mid W}\right)\right]},italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_U , italic_V ∣ italic_W ) := divide start_ARG blackboard_E [ roman_MMD start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_P start_POSTSUBSCRIPT italic_U ∣ italic_W italic_V end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_U ∣ italic_W end_POSTSUBSCRIPT ) ] end_ARG start_ARG blackboard_E [ roman_MMD start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( italic_δ start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT , italic_P start_POSTSUBSCRIPT italic_U ∣ italic_W end_POSTSUBSCRIPT ) ] end_ARG ,

where (U,V,W)Psimilar-to𝑈𝑉𝑊𝑃(U,V,W)\sim P( italic_U , italic_V , italic_W ) ∼ italic_P and P𝑃Pitalic_P is supported on a subset of some topological space 𝒰×𝒱×𝒲𝒰𝒱𝒲\mathcal{U}\times\mathcal{V}\times\mathcal{W}caligraphic_U × caligraphic_V × caligraphic_W, MMD is the maximum mean discrepancy - a distance metric between two probability distributions depending on the characteristic kernel k(,)𝑘k(\cdot,\cdot)italic_k ( ⋅ , ⋅ ) and δUsubscript𝛿𝑈\delta_{U}italic_δ start_POSTSUBSCRIPT italic_U end_POSTSUBSCRIPT denotes the Dirac measure at U𝑈Uitalic_U.

Under mild regularity conditions (see details in Huang et al. (2022)), ρ2superscript𝜌2\rho^{2}italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT satisfies several good properties for any joint distribution of (U,V,W)𝑈𝑉𝑊(U,V,W)( italic_U , italic_V , italic_W ) in Definition 3.1: (1) ρ2[0,1]superscript𝜌201\rho^{2}\in[0,1]italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∈ [ 0 , 1 ]; (2) ρ2=0superscript𝜌20\rho^{2}=0italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 0 if and only if UVWperpendicular-toabsentperpendicular-to𝑈conditional𝑉𝑊U\mathchoice{\mathrel{\hbox to 0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{% \displaystyle\perp}}}{\mathrel{\hbox to 0.0pt{$\textstyle\perp$\hss}\mkern 2.0% mu{\textstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptstyle\perp$\hss}\mkern 2% .0mu{\scriptstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptscriptstyle\perp$% \hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}V\mid Witalic_U start_RELOP ⟂ ⟂ end_RELOP italic_V ∣ italic_W; (3) ρ2=1superscript𝜌21\rho^{2}=1italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = 1 if and only if U𝑈Uitalic_U is a measurable function of V𝑉Vitalic_V given W𝑊Witalic_W. A consistent estimator calculated by geometric graph-based methods ρ2^^superscript𝜌2\hat{\rho^{2}}over^ start_ARG italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG (Section 3 in Huang et al. (2022)) is also provided in R Package KPC.

With the aid of KPC, we can rigorously quantify the violation of equalized odds by estimating ρ2(Y^,AY)superscript𝜌2^𝑌conditional𝐴𝑌\rho^{2}(\hat{Y},A\mid Y)italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( over^ start_ARG italic_Y end_ARG , italic_A ∣ italic_Y ), where A𝐴Aitalic_A can take arbitrary form and response Y𝑌Yitalic_Y can be continuous (regression) or categorical (classification).

3.2 Hypothesis test for equalized odds

To this end, we provide a formal hypothesis test with a statistical guarantee to detect any violation of equalized odds. Our hypothesis test once again uses the permuted version of 𝐀~~𝐀\tilde{\mathbf{A}}over~ start_ARG bold_A end_ARG and implements a conditional independence test. The idea is that we keep generating fake copies 𝐀~~𝐀\tilde{\mathbf{A}}over~ start_ARG bold_A end_ARG by (7), and by Theorem 2.1, (𝐘^,𝐀~,𝐘)^𝐘~𝐀𝐘(\hat{\mathbf{Y}},\tilde{\mathbf{A}},{\mathbf{Y}})( over^ start_ARG bold_Y end_ARG , over~ start_ARG bold_A end_ARG , bold_Y ) will have the same distribution as (𝐘^,𝐀,𝐘)^𝐘𝐀𝐘(\hat{\mathbf{Y}},{\mathbf{A}},{\mathbf{Y}})( over^ start_ARG bold_Y end_ARG , bold_A , bold_Y ) under the assumption of equalized odds (1). Therefore, we can use any test statistic T𝑇Titalic_T to obtain a valid hypothesis test since any test statistic T(𝐘^,𝐀~,𝐘)𝑇^𝐘~𝐀𝐘T(\hat{\mathbf{Y}},\tilde{\mathbf{A}},{\mathbf{Y}})italic_T ( over^ start_ARG bold_Y end_ARG , over~ start_ARG bold_A end_ARG , bold_Y ) will also have the same distribution as T(𝐘^,𝐀,𝐘)𝑇^𝐘𝐀𝐘T(\hat{\mathbf{Y}},{\mathbf{A}},{\mathbf{Y}})italic_T ( over^ start_ARG bold_Y end_ARG , bold_A , bold_Y ) under the assumption of equalized odds. The procedure of our proposed hypothesis test is in Algorithm 2.

Proposition 3.2.

Suppose the test observations (𝐗te,𝐀te,𝐘te)={(Xi,Yi,Ai)({\mathbf{X}}^{te},{\mathbf{A}}^{te},{\mathbf{Y}}^{te})=\{(X_{i},Y_{i},A_{i})( bold_X start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT , bold_A start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT , bold_Y start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT ) = { ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) for 1inte}1\leq i\leq n_{\mathrm{te}}\}1 ≤ italic_i ≤ italic_n start_POSTSUBSCRIPT roman_te end_POSTSUBSCRIPT } are i.i.d.. 𝐘^te={f^(Xi)\hat{\mathbf{Y}}^{te}=\{\hat{f}(X_{i})over^ start_ARG bold_Y end_ARG start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT = { over^ start_ARG italic_f end_ARG ( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) for 1inte}1\leq i\leq n_{\mathrm{te}}\}1 ≤ italic_i ≤ italic_n start_POSTSUBSCRIPT roman_te end_POSTSUBSCRIPT } for a learned model f^normal-^𝑓\hat{f}over^ start_ARG italic_f end_ARG (not necessarily trained as our proposed method). If H0:𝐘^te𝐀te𝐘tenormal-:subscript𝐻0perpendicular-toabsentperpendicular-tosuperscriptnormal-^𝐘𝑡𝑒conditionalsuperscript𝐀𝑡𝑒superscript𝐘𝑡𝑒H_{0}:\hat{\mathbf{Y}}^{te}\mathchoice{\mathrel{\hbox to 0.0pt{$\displaystyle% \perp$\hss}\mkern 2.0mu{\displaystyle\perp}}}{\mathrel{\hbox to 0.0pt{$% \textstyle\perp$\hss}\mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to 0.0pt{$% \scriptstyle\perp$\hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to 0.0% pt{$\scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}{% \mathbf{A}}^{te}\mid{\mathbf{Y}}^{te}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT : over^ start_ARG bold_Y end_ARG start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT start_RELOP ⟂ ⟂ end_RELOP bold_A start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT ∣ bold_Y start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT, i.e., equalized odds (1) holds, then the output p-value pvsubscript𝑝𝑣p_{v}italic_p start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT of Algorithm 2 is valid, satisfying {pvα}αsubscript𝑝𝑣𝛼𝛼\mathbb{P}\{p_{v}\leq\alpha\}\leq\alphablackboard_P { italic_p start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT ≤ italic_α } ≤ italic_α for any desired Type I𝐼Iitalic_I error rate α[0,1]𝛼01\alpha\in[0,1]italic_α ∈ [ 0 , 1 ] when H0subscript𝐻0H_{0}italic_H start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is true.

Input: Data (𝐗te,𝐀te,𝐘te)={(Y^i,Ai,Yi)}superscript𝐗𝑡𝑒superscript𝐀𝑡𝑒superscript𝐘𝑡𝑒subscript^𝑌𝑖subscript𝐴𝑖subscript𝑌𝑖({\mathbf{X}}^{te},{\mathbf{A}}^{te},{\mathbf{Y}}^{te})=\{(\hat{Y}_{i},A_{i},Y% _{i})\}( bold_X start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT , bold_A start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT , bold_Y start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT ) = { ( over^ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) }, 1intest1𝑖subscript𝑛test1\leq i\leq n_{\mathrm{test}}1 ≤ italic_i ≤ italic_n start_POSTSUBSCRIPT roman_test end_POSTSUBSCRIPT

Parameter: the number of synthetic copies K𝐾Kitalic_K.

1:Compute the test statistic T𝑇Titalic_T on the test set: t*=T(𝐘^te,𝐀te,𝐘te)superscript𝑡𝑇superscript^𝐘𝑡𝑒superscript𝐀𝑡𝑒superscript𝐘𝑡𝑒t^{*}=T(\hat{\mathbf{Y}}^{te},{\mathbf{A}}^{te},{\mathbf{Y}}^{te})italic_t start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = italic_T ( over^ start_ARG bold_Y end_ARG start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT , bold_A start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT , bold_Y start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT ).
2:for k=1,,K𝑘1𝐾k=1,\dots,Kitalic_k = 1 , … , italic_K do
3:     Generate permuted copy 𝐀~ksubscript~𝐀𝑘\tilde{\mathbf{A}}_{k}over~ start_ARG bold_A end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT of 𝐀tesuperscript𝐀𝑡𝑒{\mathbf{A}}^{te}bold_A start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT by (7) (using the sampler described in Appendix B)
4:     Compute the test statistic T𝑇Titalic_T using fake copy on the test set: t(k)=T(𝐘^te,𝐀~k,𝐘te)superscript𝑡𝑘𝑇superscript^𝐘𝑡𝑒subscript~𝐀𝑘superscript𝐘𝑡𝑒t^{(k)}=T(\hat{\mathbf{Y}}^{te},\tilde{\mathbf{A}}_{k},{\mathbf{Y}}^{te})italic_t start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT = italic_T ( over^ start_ARG bold_Y end_ARG start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT , over~ start_ARG bold_A end_ARG start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT , bold_Y start_POSTSUPERSCRIPT italic_t italic_e end_POSTSUPERSCRIPT ).
5:end for
6:Compute the p𝑝pitalic_p-value:
pv=1K+1(1+k=1K𝕀[t*t(k)])subscript𝑝𝑣1𝐾11superscriptsubscript𝑘1𝐾𝕀delimited-[]superscript𝑡superscript𝑡𝑘p_{v}=\frac{1}{K+1}\left(1+\sum_{k=1}^{K}\mathbb{I}\left[t^{*}\geq{t}^{(k)}% \right]\right)italic_p start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT = divide start_ARG 1 end_ARG start_ARG italic_K + 1 end_ARG ( 1 + ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT blackboard_I [ italic_t start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ≥ italic_t start_POSTSUPERSCRIPT ( italic_k ) end_POSTSUPERSCRIPT ] )

Output: A p𝑝pitalic_p-value pvsubscript𝑝𝑣p_{v}italic_p start_POSTSUBSCRIPT italic_v end_POSTSUBSCRIPT for the hypothesis that equalized odds (1) holds.

Algorithm 2 ICP Test for Equalized Odds

We note that a similar hypothesis test for equalized odds is proposed in Romano et al. (2020) which is done by using a resampled version of A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG and choosing T𝑇Titalic_T in Algorithm 2 as described in Holdout Permutation Test (Tansey et al., 2022), which is based on a predictor r^(A,Y)^𝑟𝐴𝑌\hat{r}(A,Y)over^ start_ARG italic_r end_ARG ( italic_A , italic_Y ) aiming to predict Y^^𝑌\hat{Y}over^ start_ARG italic_Y end_ARG and is formulated as the empirical risk (e.g., mean squared error). However, such T(Y^,A,Y)𝑇^𝑌𝐴𝑌T(\hat{Y},A,Y)italic_T ( over^ start_ARG italic_Y end_ARG , italic_A , italic_Y ) chosen in Tansey et al. (2022) itself cannot serve as an accurate dependence measure as KPC does.

4 Experiments

In this section, we conduct numerical experiments to examine the effectiveness of the proposed approach on both synthetic datasets and real datasets.222The code is available at https://github.com/yuhenglai/FairICP All the details are included in Appendix D.

4.1 Experiments on synthetic datasets

4.1.1 Synthetic data generation

In this section, we explore the performance of FairICP in simulations with a continuous response Y𝑌Y\in{\mathbb{R}}italic_Y ∈ blackboard_R, and potentially multiple sensitive attributes are differently involved by two mechanisms:

  • Simulation 1: The response Y𝑌Yitalic_Y depends on two set of features X*Ksuperscript𝑋superscript𝐾X^{*}\in{\mathbb{R}}^{K}italic_X start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT and XKsuperscript𝑋superscript𝐾X^{\prime}\in{\mathbb{R}}^{K}italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT:

    Y𝒩(Σk=1KXk*+Σk=1KXk,σ2),similar-to𝑌𝒩superscriptsubscriptΣ𝑘1𝐾superscriptsubscript𝑋𝑘superscriptsubscriptΣ𝑘1𝐾superscriptsubscript𝑋𝑘superscript𝜎2\displaystyle Y\sim\mathcal{N}\left(\Sigma_{k=1}^{K}X_{k}^{*}+\Sigma_{k=1}^{K}% X_{k}^{\prime},\sigma^{2}\right),italic_Y ∼ caligraphic_N ( roman_Σ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_X start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT + roman_Σ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K end_POSTSUPERSCRIPT italic_X start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ,
    X1:K*𝒩(wA1:K,(1w)𝐈K),similar-tosuperscriptsubscript𝑋:1𝐾𝒩𝑤subscript𝐴:1𝐾1𝑤subscript𝐈𝐾\displaystyle X_{1:K}^{*}\sim\mathcal{N}(\sqrt{w}A_{1:K},(1-w)\mathbf{I}_{K}),italic_X start_POSTSUBSCRIPT 1 : italic_K end_POSTSUBSCRIPT start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∼ caligraphic_N ( square-root start_ARG italic_w end_ARG italic_A start_POSTSUBSCRIPT 1 : italic_K end_POSTSUBSCRIPT , ( 1 - italic_w ) bold_I start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT ) , (Sim1)
    X1:K𝒩(𝟎K,𝐈K),similar-tosuperscriptsubscript𝑋:1𝐾𝒩subscript0𝐾subscript𝐈𝐾\displaystyle X_{1:K}^{\prime}\sim\mathcal{N}(\mathbf{0}_{K},\mathbf{I}_{K}),italic_X start_POSTSUBSCRIPT 1 : italic_K end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∼ caligraphic_N ( bold_0 start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT , bold_I start_POSTSUBSCRIPT italic_K end_POSTSUBSCRIPT ) ,
  • Simulation 2: The response Y𝑌Yitalic_Y depends on two features X*superscript𝑋X^{*}\in{\mathbb{R}}italic_X start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∈ blackboard_R and Xsuperscript𝑋X^{\prime}\in{\mathbb{R}}italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ blackboard_R:

    Y𝒩(X*+X,σ2),similar-to𝑌𝒩superscript𝑋superscript𝑋superscript𝜎2\displaystyle Y\sim\mathcal{N}\left(X^{*}+X^{\prime},\sigma^{2}\right),italic_Y ∼ caligraphic_N ( italic_X start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT + italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_σ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) ,
    X*𝒩(wA1,1w),similar-tosuperscript𝑋𝒩𝑤subscript𝐴11𝑤\displaystyle X^{*}\sim\mathcal{N}(\sqrt{w}A_{1},1-w),italic_X start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT ∼ caligraphic_N ( square-root start_ARG italic_w end_ARG italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , 1 - italic_w ) , (Sim2)
    X𝒩(0,1),similar-tosuperscript𝑋𝒩01\displaystyle X^{\prime}\sim\mathcal{N}(0,1),italic_X start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∼ caligraphic_N ( 0 , 1 ) ,
  • Y𝑌Yitalic_Y is influenced by multiple sensitive attributes A1:Ksubscript𝐴:1𝐾A_{1:K}italic_A start_POSTSUBSCRIPT 1 : italic_K end_POSTSUBSCRIPT in the setting Sim1 and influenced by a sole sensitive attribute A1subscript𝐴1A_{1}italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT in the setting Sim2. The parameter w[0,1]𝑤01w\in[0,1]italic_w ∈ [ 0 , 1 ] controls the dependence of the predictive feature on A𝐴Aitalic_A, and we consider w=0.9𝑤0.9w=0.9italic_w = 0.9 as a high dependence scenario and w=0.6𝑤0.6w=0.6italic_w = 0.6 as a low dependence scenario in our experiments.

  • In both settings, all sensitive attributes are generated independently from a mixture of Gamma distributions to increase the difficulty of estimating A𝐴Aitalic_A:

    Ak12Gamma(1,1)+12Gamma(1,10),similar-tosubscript𝐴𝑘12𝐺𝑎𝑚𝑚𝑎1112𝐺𝑎𝑚𝑚𝑎110\displaystyle A_{k}\sim\frac{1}{2}Gamma(1,1)+\frac{1}{2}Gamma(1,10),italic_A start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT ∼ divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_G italic_a italic_m italic_m italic_a ( 1 , 1 ) + divide start_ARG 1 end_ARG start_ARG 2 end_ARG italic_G italic_a italic_m italic_m italic_a ( 1 , 10 ) ,

    where k=1,,K𝑘1𝐾k=1,\ldots,Kitalic_k = 1 , … , italic_K for setting Sim1 and k=1,,K+1𝑘1𝐾1k=1,\ldots,K+1italic_k = 1 , … , italic_K + 1 for setting Sim2.

We compare the proposed method FairICP to FDL and an oracle version of FairICP where Y|Aconditional𝑌𝐴Y|Aitalic_Y | italic_A is given as the true conditional density. These synthetic experiments are where we can reliably evaluate the violation of the equalized odds condition of different methods. We are interested in 1) investigating if FairICP is more effective than FDL as the number of noisy attributes increases (increased K𝐾Kitalic_K) by considering the easier problem of estimating the density of Y|Aconditional𝑌𝐴Y|Aitalic_Y | italic_A rather than A|Yconditional𝐴𝑌A|Yitalic_A | italic_Y; and 2) evaluating if KPC is a good measure for conditional dependence in the sense that it can capture the relative degree of violation of equalized odds when applying different methods to the same data sets.

4.1.2 Results on synthetic datasets

We compare FairICP with P(YA)𝑃conditional𝑌𝐴P(Y\mid A)italic_P ( italic_Y ∣ italic_A ) estimated by MAF (Papamakarios et al., 2017)), FDL with P(AY)𝑃conditional𝐴𝑌P(A\mid Y)italic_P ( italic_A ∣ italic_Y ) estimated by MAF, and the oracle version of FairICP with true density. For the measure of the violation of equalized odds, we calculate the empirical KPC =ρ^2(Y^,AY)absentsuperscript^𝜌2^𝑌conditional𝐴𝑌=\hat{\rho}^{2}(\hat{Y},A\mid Y)= over^ start_ARG italic_ρ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( over^ start_ARG italic_Y end_ARG , italic_A ∣ italic_Y ) as R Package KPC with Gaussian kernel and default parameters (Huang et al., 2022). Apart from the KPC measure itself, we also consider a second evaluation metric using a hypothesis test as outlined by Algorithm 2 with T=KPC𝑇𝐾𝑃𝐶T=KPCitalic_T = italic_K italic_P italic_C, where we consider the power of rejecting the null hypothesis at level 0.050.050.050.05 as a measure of conditional dependence when utilizing the underlying true conditional density. The greater ρ^2superscript^𝜌2\hat{\rho}^{2}over^ start_ARG italic_ρ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT or rejection power indicates stronger conditional dependence between A𝐴Aitalic_A and Y^^𝑌\hat{Y}over^ start_ARG italic_Y end_ARG given Y𝑌Yitalic_Y. Note that, in Sim2 only A1subscript𝐴1A_{1}italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT influences the Y𝑌Yitalic_Y, so the test will be based on ρ^2(Y^,A1Y)superscript^𝜌2^𝑌conditionalsubscript𝐴1𝑌\hat{\rho}^{2}(\hat{Y},A_{1}\mid Y)over^ start_ARG italic_ρ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ( over^ start_ARG italic_Y end_ARG , italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∣ italic_Y ) to exclude the effects of noise (though the training is based on A1:K+1subscript𝐴:1𝐾1A_{1:K+1}italic_A start_POSTSUBSCRIPT 1 : italic_K + 1 end_POSTSUBSCRIPT for all methods to demonstrate the performance under noise).

Figure 2 and 3 show the trade-off curves between prediction loss and degree of fairness violations measured by KPC or its associated fairness testing power by Algorithm 2 with T=KPC𝑇𝐾𝑃𝐶T=KPCitalic_T = italic_K italic_P italic_C under settings Sim1 and Sim2 respectively, with K{1,5,10}𝐾1510K\in\{1,5,10\}italic_K ∈ { 1 , 5 , 10 } under the high-dependence scenario w=0.9𝑤0.9w=0.9italic_w = 0.9 (Results with low dependence on A are shown Appendix D.1). We implemented f𝑓fitalic_f as linear model and d𝑑ditalic_d as neural network, and all methods being compared are trained with different penalty parameter μ[0,1]𝜇01\mu\in[0,1]italic_μ ∈ [ 0 , 1 ] to show the trade-off. In both simulations, the trade-off by Pareto fronts is based on 100 independent runs with a sample size of 500 for the training set and 400 for the test set.

Figure 2 shows the results from the setting Sim1. Models from all three methods reduce to a plain linear regression without regard to fairness when μ=0𝜇0\mu=0italic_μ = 0, resulting in low prediction loss but a severe violation of equalized odds (evidenced by large KPC and statistical power); as μ𝜇\muitalic_μ goes larger, models pay more attention to fairness (lower KPC and power) by sacrificing more prediction loss. FairICP (proposed) performs very closely to the oracle model while outperforming FDL as the dimension of K𝐾Kitalic_K gets larger using both the KPC measure and the power measure, which fits our expectation and follows from the increased difficulty of estimating the conditional density of A|Yconditional𝐴𝑌A|Yitalic_A | italic_Y. FairICP shows a noticeable but still less performance reduction compared to the oracle model measured by KPC when the dimension of A𝐴Aitalic_A is 10, which is already large compared to what is examined in the current literature. Of note, this slight difference does not show up when measured by the power, likely due to an information loss when dichotomies the continuous KPC measure into the 0-1 decision given the p𝑝pitalic_p-value cutoff.

Figure 3 shows the results from setting Sim2 and delivers a similar message as Figure 2. The gaps between FairICP and FDL are wider compared to the results in Figure 2 as K𝐾Kitalic_K increases, which echos less percent of information about A𝐴Aitalic_A needed for estimating Y|Aconditional𝑌𝐴Y|Aitalic_Y | italic_A in setting Sim2.

Refer to caption
Figure 2: Prediction loss and metrics of fairness in simulation over 100 independent runs under setting Sim1 and w=0.9𝑤0.9w=0.9italic_w = 0.9. KPC estimate ρ^2superscript^𝜌2\hat{\rho}^{2}over^ start_ARG italic_ρ end_ARG start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and statistical power {p-value<0.05}𝑝-value0.05{\mathbb{P}}\{p\text{-value}<0.05\}blackboard_P { italic_p -value < 0.05 } are shown in left column and right column respectively. From top to bottom shows the results on different choices of noisy sensitive attribute dimension of K𝐾Kitalic_K. The X-axis represents the metrics of fairness and the Y-axis is the prediction loss. Each graph shows the proposed method, FDL, and oracle model with different hyperparameters μ𝜇\muitalic_μ.

Note that the power measure depends on how the permutation/sampling is conducted in practice, and its reliability hinges on the correctness of the sampling scheme, and thus, the accuracy of density estimation. In contrast, the direct KPC (Kernel-based Pearson Correlation) measure is independent of density estimation. Therefore, we can trust the power evaluation in our synthetic experiments, as we have utilized true conditional density estimation. The consistency between KPC measures and the power measures in our synthetic experiments suggests that KPC is a reasonable and density-estimation-free measure in real applications for comparing different learning methods

Results for when X𝑋Xitalic_X has a weaker dependence on A𝐴Aitalic_A with w=0.6𝑤0.6w=0.6italic_w = 0.6 are in Appendix D.1, which demonstrated the same message as from Figure 3 and Figure 2 with larger discrepancies across methods.

Refer to caption
Figure 3: Prediction loss and metrics of fairness in simulation over 100 independent runs under setting Sim2 and w=0.9𝑤0.9w=0.9italic_w = 0.9. Conditional dependence measure ρ2^^superscript𝜌2\hat{\rho^{2}}over^ start_ARG italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG and statistical power {p-value<0.05}𝑝-value0.05{\mathbb{P}}\{p\text{-value}<0.05\}blackboard_P { italic_p -value < 0.05 } are shown in the left column and right column respectively. From top to bottom shows the results on different choices of noisy sensitive attribute dimension of K𝐾Kitalic_K. The X-axis represents the metrics of fairness and the Y-axis is the prediction loss. Each graph shows the proposed method, FDL, and oracle model with different hyperparameters μ𝜇\muitalic_μ.

4.2 Real-data experiments

We consider real-world cases where we may need to protect more than one sensitive attribute. For all the experiments, we split the data into a training set (60%) and a test set (40%), and all the results shown are based on the test set.

4.2.1 Fair regression

In the Communities and Crime dataset 333Available at the UC Irvine Data Repository http://archive.ics.uci.edu/ml/datasets/Communities+and+Crime, each record describes the aggregate demographic properties of a different U.S. community; the data combines socio-economic data from the 1990 US Census, law enforcement data from the 1990 US LEMAS survey, and crime data from the 1995 FBI UCR. The total number of records is 1994, and the number of features is 122. Our task here is to predict the number of violent crimes per population for US cities while protecting all race information to avoid biasing police checks depending on the ethnic characteristics of the community. Specifically, we take three minority race information in this dataset into account (African American, Hispanic, Asian) as sensitive attributes instead of only one kind of race as done in the previous literature. We also consider the case where A𝐴Aitalic_A only includes one race (African American) as in Romano et al. (2020); Mary et al. (2019) for better comparison. All the sensitive attributes used here are continuous, representing the percentage of the population of certain races.

We compare our proposed methods with FDL (Romano et al., 2020) and HGR (Mary et al., 2019)444In Mary et al. 2019, since their implementation doesn’t directly apply to multiple sensitive attributes, we set the mean of three HGR coefficient of each attribute as penalty.. Note that we don’t include sensitive attributes as features in our experiments as in Romano et al. (2020); Mary et al. (2019). We consider neural networks as predictor f𝑓fitalic_f in all methods555We also consider f𝑓fitalic_f as a linear model in Appendix D.2, and we tune the hyperparameters as in (Romano et al., 2020) (see details in Appendix D).

We present our results as Pareto front in Fig 4 to show the trade-off curves of prediction and fairness given by our method and the state-of-the-art methods where the fairness is measured by both KPC and the power from the statistical test for fairness as outlined by Algorithm 2 with T𝑇Titalic_T chosen as KPC. We see that both metrics give similar trends: although there are some small discrepancies between using KPC and the fairness test, we observe that FairICP outperforms FDL and HGR especially when both three sensitive attributes are considered. Although the conditional density is now estimated and the fairness test might suffer from it, KPC is a robust measure regardless of the sampling scheme for A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG.

Refer to caption
Figure 4: Prediction loss and violation of equalized odds (ρ2^^superscript𝜌2\hat{\rho^{2}}over^ start_ARG italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG and power {p-value<0.05}𝑝-value0.05{\mathbb{P}}\{p\text{-value}<0.05\}blackboard_P { italic_p -value < 0.05 } by ρ2^^superscript𝜌2\hat{\rho^{2}}over^ start_ARG italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG) obtained by 3 different training methods in Crimes data over 100 random splits. The mean value and one standard deviation of loss are reported in the bar graph. Each graph shows the results of using different A𝐴Aitalic_A: 1 dim === (African American) and 3 dim === (African American, Hispanic, Asian). The Pareto front for each algorithm is obtained by varying the fairness trade-off parameter μ𝜇\muitalic_μ.

4.2.2 Fair classification

We then turn to a binary classification case that has been well-studied and considers two categorical sensitive attributes. The dataset we consider is ProPublica’s COMPAS recidivism data (5278 examples) 666Although it’s widely used in fairness-related literature, recently there have been critiques about the limitations of this dataset (Bao et al., 2022).. The task is to predict recidivism from someone’s criminal history, jail and prison time, demographics, and COMPAS risk scores. We choose two binary protected attributes A𝐴Aitalic_A: race (white vs. non-white) and sex. For this special task (binary classification against multiple binary sensitive attributes), we compare FairICP to two baselines HGR (Mary et al., 2019) and Exponentiated-gradient reduction (Agarwal et al., 2018), with the later developed for this particular kind of task. We aim to use this example to demonstrate the ability of FairICP to handle categorical observations and provide comparable performance with regard to the more tailored approach.

Refer to caption
Figure 5: Prediction loss and violation of equalized odds (DEO (8), ρ2^^superscript𝜌2\hat{\rho^{2}}over^ start_ARG italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG and statistical power {p-value<0.05}𝑝-value0.05{\mathbb{P}}\{p\text{-value}<0.05\}blackboard_P { italic_p -value < 0.05 } by ρ2^^superscript𝜌2\hat{\rho^{2}}over^ start_ARG italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG) obtained by 3 different training methods in COMPAS data over 100 random splits. The Pareto front for each algorithm is obtained by varying the hyper-parameter controlling the level of fairness.

In addition, apart from KPC and the corresponding fairness test, we also consider another fairness metric based on confusion matrix (Hardt et al., 2016; Cho et al., 2020) designed for such a binary classification task with categorical sensitive attributes to quantify equalized odds:

DEO:=assignDEOabsent\displaystyle\mathrm{DEO}:=roman_DEO := y{0,1}z𝒵|Pr(Y^=1Z=z,Y=y)Pr(Y^=1Y=y)|,subscript𝑦01subscript𝑧𝒵Pr^𝑌1𝑍𝑧𝑌𝑦Pr^𝑌1𝑌𝑦\displaystyle\sum_{y\in\{0,1\}}\sum_{z\in\mathcal{Z}}|\operatorname{Pr}(\hat{Y% }=1\mid Z=z,Y=y)-\operatorname{Pr}(\hat{Y}=1\mid Y=y)|,∑ start_POSTSUBSCRIPT italic_y ∈ { 0 , 1 } end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_z ∈ caligraphic_Z end_POSTSUBSCRIPT | roman_Pr ( over^ start_ARG italic_Y end_ARG = 1 ∣ italic_Z = italic_z , italic_Y = italic_y ) - roman_Pr ( over^ start_ARG italic_Y end_ARG = 1 ∣ italic_Y = italic_y ) | , (8)

where Y^^𝑌\hat{Y}over^ start_ARG italic_Y end_ARG is the predicted class label.

Similar to the regression case, we train neural network models as classifiers and discriminators 777We also consider f𝑓fitalic_f as a linear model in Appendix D.2 (see details in Appendix D).

Figure 5 shows that all three methods behave similarly overall in this classification example regarding their prediction-fairness trade-offs, with FairICP closely matching the performance of the exponential-gradient reduction (referred to as Reduction) using all three fairness evaluation metrics, and HGR slightly worse than FairICP and Reduction when measured by DEO.

5 Discussion

We introduced a flexible fairness learning approach, FairICP, to address the challenge of achieving equalized-odds fairness with complex sensitive attributes. FairICP combines adversarial learning with a novel inverse conditional permutation (ICP) strategy and offers a flexible and effective solution for handling sensitive attributes that may be of mixed data types and multidimensional in nature. We provided theoretical insights into the proposed method, elucidating the underpinning concepts and the rationale behind integrating ICP with adversarial learning. Furthermore, we conducted numerical experiments on both synthetic and real data to support our theoretical insights and demonstrate the efficacy and flexibility of our proposed method. We also noted that the majority of the computational burden for FairICP lies in training the adversarial prediction model, based on our experience (as also mentioned in Zhang et al. (2018); Romano et al. (2020)), with that from the density estimation and ICP sampling being negligible in comparison. The scalability challenge of the adversarial techniques should be more carefully addressed by implementing more efficient methods, which we view as a future direction for improving FairICP

References

  • Agarwal et al. (2018) Agarwal, A., A. Beygelzimer, M. Dudík, J. Langford, and H. Wallach (2018). A reductions approach to fair classification. In International conference on machine learning, pp.  60–69. PMLR.
  • Bao et al. (2022) Bao, M., A. Zhou, S. Zottola, B. Brubach, S. Desmarais, A. Horowitz, K. Lum, and S. Venkatasubramanian (2022). It’s compaslicated: The messy relationship between rai datasets and algorithmic fairness benchmarks.
  • Berrett et al. (2020) Berrett, T. B., Y. Wang, R. F. Barber, and R. J. Samworth (2020). The conditional permutation test for independence while controlling for confounders. Journal of the Royal Statistical Society Series B: Statistical Methodology 82(1), 175–197.
  • Candès et al. (2018) Candès, E., Y. Fan, L. Janson, and J. Lv (2018). Panning for gold: Model-X knockoffs for high-dimensional controlled variable selection. Journal of the Royal Statistical Society: Series B 80(3), 551–577.
  • Castelnovo et al. (2022) Castelnovo, A., R. Crupi, G. Greco, D. Regoli, I. G. Penco, and A. C. Cosentini (2022). A clarification of the nuances in the fairness metrics landscape. Scientific Reports 12(1), 4209.
  • Cho et al. (2020) Cho, J., G. Hwang, and C. Suh (2020). A fair classifier using kernel density estimation. Advances in neural information processing systems 33, 15088–15099.
  • Creager et al. (2019) Creager, E., D. Madras, J.-H. Jacobsen, M. Weis, K. Swersky, T. Pitassi, and R. Zemel (2019, 09–15 Jun). Flexibly fair representation learning by disentanglement. In K. Chaudhuri and R. Salakhutdinov (Eds.), Proceedings of the 36th International Conference on Machine Learning, Volume 97 of Proceedings of Machine Learning Research, pp.  1436–1445. PMLR.
  • Dwork et al. (2012) Dwork, C., M. Hardt, T. Pitassi, O. Reingold, and R. Zemel (2012). Fairness through awareness. In Proceedings of the 3rd Innovations in Theoretical Computer Science Conference, pp.  214––226.
  • Feldman et al. (2015) Feldman, M., S. A. Friedler, J. Moeller, C. Scheidegger, and S. Venkatasubramanian (2015). Certifying and removing disparate impact. In Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp.  259–268.
  • Goodfellow et al. (2014) Goodfellow, I., J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio (2014). Generative adversarial nets. In Advances in Neural Information Processing Systems 27, pp. 2672–2680.
  • Hardt et al. (2016) Hardt, M., E. Price, and N. Srebro (2016). Equality of opportunity in supervised learning. Advances in neural information processing systems 29.
  • Hebert-Johnson et al. (2018) Hebert-Johnson, U., M. Kim, O. Reingold, and G. Rothblum (2018, 10–15 Jul). Multicalibration: Calibration for the (Computationally-identifiable) masses. In J. Dy and A. Krause (Eds.), Proceedings of the 35th International Conference on Machine Learning, Volume 80 of Proceedings of Machine Learning Research, pp.  1939–1948. PMLR.
  • Huang et al. (2022) Huang, Z., N. Deb, and B. Sen (2022). Kernel partial correlation coefficient — a measure of conditional dependence. Journal of Machine Learning Research 23(216), 1–58.
  • Kamishima et al. (2011) Kamishima, T., S. Akaho, and J. Sakuma (2011). Fairness-aware learning through regularization approach. In 2011 IEEE 11th International Conference on Data Mining Workshops, pp.  643–650. IEEE.
  • Kearns et al. (2018) Kearns, M., S. Neel, A. Roth, and Z. S. Wu (2018, 10–15 Jul). Preventing fairness gerrymandering: Auditing and learning for subgroup fairness. In J. Dy and A. Krause (Eds.), Proceedings of the 35th International Conference on Machine Learning, Volume 80 of Proceedings of Machine Learning Research, pp.  2564–2572. PMLR.
  • Keener (2010) Keener, R. W. (2010). Theoretical statistics: Topics for a core course. Springer.
  • Kim et al. (2018) Kim, M. P., A. Ghorbani, and J. Zou (2018). Multiaccuracy: Black-box post-processing for fairness in classification.
  • Kusner et al. (2017) Kusner, M. J., J. Loftus, C. Russell, and R. Silva (2017). Counterfactual fairness. In Advances in Neural Information Processing Systems 30, pp. 4066–4076.
  • Mary et al. (2019) Mary, J., C. Calauzenes, and N. El Karoui (2019). Fairness-aware learning for continuous attributes and treatments. In International Conference on Machine Learning, pp. 4382–4391.
  • Mehrabi et al. (2021) Mehrabi, N., F. Morstatter, N. Saxena, K. Lerman, and A. Galstyan (2021). A survey on bias and fairness in machine learning. ACM computing surveys (CSUR) 54(6), 1–35.
  • Naaman (2021) Naaman, M. (2021). On the tight constant in the multivariate dvoretzky–kiefer–wolfowitz inequality. Statistics & Probability Letters 173, 109088.
  • Papamakarios et al. (2017) Papamakarios, G., T. Pavlakou, and I. Murray (2017). Masked autoregressive flow for density estimation. Advances in neural information processing systems 30.
  • Romano et al. (2020) Romano, Y., S. Bates, and E. Candes (2020). Achieving equalized odds by resampling sensitive attributes. Advances in neural information processing systems 33, 361–371.
  • Scott (1991) Scott, D. W. (1991). Feasibility of multivariate density estimates. Biometrika 78(1), 197–205.
  • Tang and Zhang (2022) Tang, Z. and K. Zhang (2022). Attainability and optimality: The equalized odds fairness revisited. In Conference on Causal Learning and Reasoning, pp.  754–786. PMLR.
  • Tansey et al. (2022) Tansey, W., V. Veitch, H. Zhang, R. Rabadan, and D. M. Blei (2022). The holdout randomization test for feature selection in black box models. Journal of Computational and Graphical Statistics 31(1), 151–162.
  • Yang et al. (2022) Yang, J., A. A. S. Soltan, Y. Yang, and D. A. Clifton (2022). Algorithmic fairness and bias mitigation for clinical machine learning: Insights from rapid covid-19 diagnosis by adversarial learning. medRxiv.
  • Zafar et al. (2017) Zafar, M. B., I. Valera, M. Gomez Rodriguez, and K. P. Gummadi (2017). Fairness beyond disparate treatment & disparate impact: Learning classification without disparate mistreatment. In Proceedings of the 26th International Conference on World Wide Web, pp.  1171––1180.
  • Zafar et al. (2019) Zafar, M. B., I. Valera, M. Gomez-Rodriguez, and K. P. Gummadi (2019). Fairness constraints: A flexible approach for fair classification. Journal of Machine Learning Research 20(75), 1–42.
  • Zemel et al. (2013) Zemel, R., Y. Wu, K. Swersky, T. Pitassi, and C. Dwork (2013). Learning fair representations. In International conference on machine learning, pp. 325–333. PMLR.
  • Zhang et al. (2018) Zhang, B. H., B. Lemoine, and M. Mitchell (2018). Mitigating unwanted biases with adversarial learning. In Proceedings of the 2018 AAAI/ACM Conference on AI, Ethics, and Society, pp.  335–340.

Appendix A Proofs

Proof of Theorem 2.1.

Let S(𝐀)={A1,,An}𝑆𝐀subscript𝐴1subscript𝐴𝑛S({\mathbf{A}})=\{A_{1},\ldots,A_{n}\}italic_S ( bold_A ) = { italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT } denote the row set of the observed n𝑛nitalic_n realizations of sensitive attributes (unordered and duplicates are allowed). Let 𝐗𝐗{\mathbf{X}}bold_X, 𝐘^f(𝐗)^𝐘𝑓𝐗\hat{\mathbf{Y}}\coloneqq f({\mathbf{X}})over^ start_ARG bold_Y end_ARG ≔ italic_f ( bold_X ) and 𝐘𝐘{\mathbf{Y}}bold_Y be the associated n𝑛nitalic_n feature, prediction, and response observations.

  1. 1.

    Taks 1: Show that (𝐘^,𝐀,𝐘)=𝑑(𝐘^,𝐀~,𝐘)^𝐘𝐀𝐘𝑑^𝐘~𝐀𝐘(\hat{\mathbf{Y}},{\mathbf{A}},{\mathbf{Y}})\overset{d}{=}(\hat{\mathbf{Y}},% \tilde{\mathbf{A}},{\mathbf{Y}})( over^ start_ARG bold_Y end_ARG , bold_A , bold_Y ) overitalic_d start_ARG = end_ARG ( over^ start_ARG bold_Y end_ARG , over~ start_ARG bold_A end_ARG , bold_Y ) given conditional independence Y^A|Yperpendicular-toabsentperpendicular-to^𝑌conditional𝐴𝑌\hat{Y}\mathchoice{\mathrel{\hbox to 0.0pt{$\displaystyle\perp$\hss}\mkern 2.0% mu{\displaystyle\perp}}}{\mathrel{\hbox to 0.0pt{$\textstyle\perp$\hss}\mkern 2% .0mu{\textstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptstyle\perp$\hss}\mkern 2% .0mu{\scriptstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptscriptstyle\perp$% \hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}A|Yover^ start_ARG italic_Y end_ARG start_RELOP ⟂ ⟂ end_RELOP italic_A | italic_Y.

    Proof of Task 1. Recall that conditional on S(𝐀)=S𝑆𝐀𝑆S({\mathbf{A}})=Sitalic_S ( bold_A ) = italic_S for some S={a1,,an}𝑆subscript𝑎1subscript𝑎𝑛S=\{a_{1},...,a_{n}\}italic_S = { italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT }, we have (Berrett et al., 2020):

    {𝐀=𝐚π|S(𝐀)=S,𝐘}=qA|Yn(𝐚π𝐘)π𝒮nqA|Yn(𝐚π𝐘),conditional-set𝐀subscript𝐚𝜋𝑆𝐀𝑆𝐘subscriptsuperscript𝑞𝑛conditional𝐴𝑌conditionalsubscript𝐚𝜋𝐘subscriptsuperscript𝜋subscript𝒮𝑛superscriptsubscript𝑞conditional𝐴𝑌𝑛conditionalsubscript𝐚superscript𝜋𝐘\mathbb{P}\left\{{\mathbf{A}}={\mathbf{a}}_{\pi}\right|S({\mathbf{A}})=S,{% \mathbf{Y}}\}=\frac{q^{n}_{A|Y}\left({\mathbf{a}}_{\pi}\mid{\mathbf{Y}}\right)% }{\sum_{\pi^{\prime}\in\mathcal{S}_{n}}q_{A|Y}^{n}\left({\mathbf{a}}_{\pi^{% \prime}}\mid{\mathbf{Y}}\right)},blackboard_P { bold_A = bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT | italic_S ( bold_A ) = italic_S , bold_Y } = divide start_ARG italic_q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT ( bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT ∣ bold_Y ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_a start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∣ bold_Y ) end_ARG , (9)

    where 𝐚=(a1,,an)𝐚subscript𝑎1subscript𝑎𝑛{\mathbf{a}}=(a_{1},...,a_{n})bold_a = ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) is the stacked a𝑎aitalic_a values in S𝑆Sitalic_S. On the other hand, conditional on S(𝐀~)=S𝑆~𝐀𝑆S(\tilde{\mathbf{A}})=Sitalic_S ( over~ start_ARG bold_A end_ARG ) = italic_S, by construction:

    {𝐀~=𝐚π|S(𝐀)=S,𝐘}=qY|An(𝐘π1𝐚)πqY|An(𝐘π1|𝐚)=qA|Yn(𝐚π𝐘)πqA|Yn(𝐚π𝐘)conditional-set~𝐀subscript𝐚𝜋𝑆𝐀𝑆𝐘superscriptsubscript𝑞conditional𝑌𝐴𝑛conditionalsubscript𝐘superscript𝜋1𝐚subscriptsuperscript𝜋subscriptsuperscript𝑞𝑛conditional𝑌𝐴conditionalsubscript𝐘superscriptsuperscript𝜋1𝐚superscriptsubscript𝑞conditional𝐴𝑌𝑛conditionalsubscript𝐚𝜋𝐘subscriptsuperscript𝜋superscriptsubscript𝑞conditional𝐴𝑌𝑛conditionalsubscript𝐚𝜋𝐘\displaystyle\mathbb{P}\left\{\tilde{\mathbf{A}}={\mathbf{a}}_{\pi}|S({\mathbf% {A}})=S,{\mathbf{Y}}\right\}=\frac{q_{Y|A}^{n}\left({\mathbf{Y}}_{\pi^{-1}}% \mid{\mathbf{a}}\right)}{\sum_{\pi^{\prime}}q^{n}_{Y|A}\left({\mathbf{Y}}_{{% \pi^{\prime}}^{-1}}|{\mathbf{a}}\right)}=\frac{q_{A|Y}^{n}\left({\mathbf{a}}_{% \pi}\mid{\mathbf{Y}}\right)}{\sum_{\pi^{\prime}}q_{A|Y}^{n}\left({\mathbf{a}}_% {\pi}\mid{\mathbf{Y}}\right)}blackboard_P { over~ start_ARG bold_A end_ARG = bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT | italic_S ( bold_A ) = italic_S , bold_Y } = divide start_ARG italic_q start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_Y start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∣ bold_a ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT ( bold_Y start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | bold_a ) end_ARG = divide start_ARG italic_q start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT ∣ bold_Y ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT ∣ bold_Y ) end_ARG (10)

    where the last equality utilizes the following fact,

    qY|An(𝐘π1𝐚)πqY|An(𝐘π1|𝐚)=qY,An(𝐘π1,𝐚)π𝒮nqY,An(𝐘π1,𝐚)=qY,An(𝐘,𝐚π)π𝒮nqY,An(𝐘,𝐚π)=qA|Yn(𝐚π𝐘)πqA|Yn(𝐚π𝐘).superscriptsubscript𝑞conditional𝑌𝐴𝑛conditionalsubscript𝐘superscript𝜋1𝐚subscriptsuperscript𝜋subscriptsuperscript𝑞𝑛conditional𝑌𝐴conditionalsubscript𝐘superscriptsuperscript𝜋1𝐚subscriptsuperscript𝑞𝑛𝑌𝐴subscript𝐘superscript𝜋1𝐚subscriptsuperscript𝜋subscript𝒮𝑛subscriptsuperscript𝑞𝑛𝑌𝐴subscript𝐘superscriptsuperscript𝜋1𝐚subscriptsuperscript𝑞𝑛𝑌𝐴𝐘subscript𝐚𝜋subscriptsuperscript𝜋subscript𝒮𝑛subscriptsuperscript𝑞𝑛𝑌𝐴𝐘subscript𝐚superscript𝜋superscriptsubscript𝑞conditional𝐴𝑌𝑛conditionalsubscript𝐚𝜋𝐘subscriptsuperscript𝜋superscriptsubscript𝑞conditional𝐴𝑌𝑛conditionalsubscript𝐚superscript𝜋𝐘\displaystyle\frac{q_{Y|A}^{n}\left({\mathbf{Y}}_{\pi^{-1}}\mid{\mathbf{a}}% \right)}{\sum_{\pi^{\prime}}q^{n}_{Y|A}\left({\mathbf{Y}}_{{\pi^{\prime}}^{-1}% }|{\mathbf{a}}\right)}=\frac{q^{n}_{Y,A}\left({\mathbf{Y}}_{\pi^{-1}},{\mathbf% {a}}\right)}{\sum_{\pi^{\prime}\in\mathcal{S}_{n}}q^{n}_{Y,A}\left({\mathbf{Y}% }_{{\pi^{\prime}}^{-1}},{\mathbf{a}}\right)}=\frac{q^{n}_{Y,A}\left({\mathbf{Y% }},{\mathbf{a}}_{\pi}\right)}{\sum_{\pi^{\prime}\in\mathcal{S}_{n}}q^{n}_{Y,A}% \left({\mathbf{Y}},{\mathbf{a}}_{\pi^{\prime}}\right)}=\frac{q_{A|Y}^{n}\left(% {\mathbf{a}}_{\pi}\mid{\mathbf{Y}}\right)}{\sum_{\pi^{\prime}}q_{A|Y}^{n}\left% ({\mathbf{a}}_{\pi^{\prime}}\mid{\mathbf{Y}}\right)}.divide start_ARG italic_q start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_Y start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∣ bold_a ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT ( bold_Y start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | bold_a ) end_ARG = divide start_ARG italic_q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_Y , italic_A end_POSTSUBSCRIPT ( bold_Y start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , bold_a ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_Y , italic_A end_POSTSUBSCRIPT ( bold_Y start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , bold_a ) end_ARG = divide start_ARG italic_q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_Y , italic_A end_POSTSUBSCRIPT ( bold_Y , bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_Y , italic_A end_POSTSUBSCRIPT ( bold_Y , bold_a start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) end_ARG = divide start_ARG italic_q start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT ∣ bold_Y ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_a start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∣ bold_Y ) end_ARG .

    Consequently, under the conditional independence assumption, we can write the joint distribution of (𝐘^,𝐀,𝐘)^𝐘𝐀𝐘(\hat{\mathbf{Y}},{\mathbf{A}},{\mathbf{Y}})( over^ start_ARG bold_Y end_ARG , bold_A , bold_Y ) as the following (𝐲^^𝐲\hat{{\mathbf{y}}}over^ start_ARG bold_y end_ARG, 𝐲𝐲{\mathbf{y}}bold_y are some stacked observation values y^1,,y^nsubscript^𝑦1subscript^𝑦𝑛\hat{y}_{1},\ldots,\hat{y}_{n}over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , over^ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT and y1,,ynsubscript𝑦1subscript𝑦𝑛y_{1},\ldots,y_{n}italic_y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT for 𝐘^^𝐘\hat{\mathbf{Y}}over^ start_ARG bold_Y end_ARG and 𝐘𝐘{\mathbf{Y}}bold_Y respectively):

    (𝐘^=𝐲^,𝐀=𝐚π,𝐘=𝐲)formulae-sequence^𝐘^𝐲formulae-sequence𝐀subscript𝐚𝜋𝐘𝐲\displaystyle\mathbb{P}(\hat{\mathbf{Y}}=\hat{\mathbf{y}},{\mathbf{A}}={% \mathbf{a}}_{\pi},{\mathbf{Y}}={\mathbf{y}})blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG bold_y end_ARG , bold_A = bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT , bold_Y = bold_y ) =(𝐘^=𝐲^,𝐀=𝐚π𝐘=𝐲)(𝐘=𝐲)absentformulae-sequence^𝐘^𝐲𝐀conditionalsubscript𝐚𝜋𝐘𝐲𝐘𝐲\displaystyle=\mathbb{P}(\hat{\mathbf{Y}}=\hat{\mathbf{y}},{\mathbf{A}}={% \mathbf{a}}_{\pi}\mid{\mathbf{Y}}={\mathbf{y}})\cdot\mathbb{P}({\mathbf{Y}}={% \mathbf{y}})= blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG bold_y end_ARG , bold_A = bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT ∣ bold_Y = bold_y ) ⋅ blackboard_P ( bold_Y = bold_y )
    =(b1)(𝐘^=y^𝐘=𝐲)(𝐀=𝐚π𝐘=𝐲)(𝐘=𝐲)subscript𝑏1^𝐘conditional^𝑦𝐘𝐲𝐀conditionalsubscript𝐚𝜋𝐘𝐲𝐘𝐲\displaystyle\overset{(b_{1})}{=}\mathbb{P}(\hat{\mathbf{Y}}=\hat{y}\mid{% \mathbf{Y}}={\mathbf{y}})\cdot\mathbb{P}({\mathbf{A}}={\mathbf{a}}_{\pi}\mid{% \mathbf{Y}}={\mathbf{y}})\cdot\mathbb{P}({\mathbf{Y}}={\mathbf{y}})start_OVERACCENT ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_OVERACCENT start_ARG = end_ARG blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG italic_y end_ARG ∣ bold_Y = bold_y ) ⋅ blackboard_P ( bold_A = bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT ∣ bold_Y = bold_y ) ⋅ blackboard_P ( bold_Y = bold_y )
    =(𝐘^=y^𝐘=𝐲)𝔼S[(𝐀=𝐚π𝐘=𝐲,S(𝐀)=S)](𝐘=y)\displaystyle=\mathbb{P}(\hat{\mathbf{Y}}=\hat{y}\mid{\mathbf{Y}}={\mathbf{y}}% )\cdot\mathbb{E}_{S}\left[\mathbb{P}({\mathbf{A}}={\mathbf{a}}_{\pi}\mid{% \mathbf{Y}}={\mathbf{y}},S({\mathbf{A}})=S)\right]\cdot\mathbb{P}({\mathbf{Y}}% =y)= blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG italic_y end_ARG ∣ bold_Y = bold_y ) ⋅ blackboard_E start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT [ blackboard_P ( bold_A = bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT ∣ bold_Y = bold_y , italic_S ( bold_A ) = italic_S ) ] ⋅ blackboard_P ( bold_Y = italic_y )
    =(b2)(𝐘^=𝐲^𝐘=𝐲)𝔼S[(𝐀~=𝐚π𝐘=𝐲,S(𝐀~)=S)](𝐘=𝐲)\displaystyle\overset{(b_{2})}{=}\mathbb{P}(\hat{\mathbf{Y}}=\hat{\mathbf{y}}% \mid{\mathbf{Y}}={\mathbf{y}})\cdot\mathbb{E}_{S}\left[\mathbb{P}(\tilde{% \mathbf{A}}={\mathbf{a}}_{\pi}\mid{\mathbf{Y}}={\mathbf{y}},S(\tilde{\mathbf{A% }})=S)\right]\cdot\mathbb{P}({\mathbf{Y}}={\mathbf{y}})start_OVERACCENT ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_OVERACCENT start_ARG = end_ARG blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG bold_y end_ARG ∣ bold_Y = bold_y ) ⋅ blackboard_E start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT [ blackboard_P ( over~ start_ARG bold_A end_ARG = bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT ∣ bold_Y = bold_y , italic_S ( over~ start_ARG bold_A end_ARG ) = italic_S ) ] ⋅ blackboard_P ( bold_Y = bold_y )
    =𝔼S[(𝐘^=𝐲^𝐘=𝐲)(𝐀~=𝐚π𝐘=𝐲,S(𝐀~)=S)](𝐘=𝐲)\displaystyle=\mathbb{E}_{S}\left[\mathbb{P}(\hat{\mathbf{Y}}=\hat{\mathbf{y}}% \mid{\mathbf{Y}}={\mathbf{y}})\cdot\mathbb{P}(\tilde{\mathbf{A}}={\mathbf{a}}_% {\pi}\mid{\mathbf{Y}}={\mathbf{y}},S(\tilde{\mathbf{A}})=S)\right]\cdot\mathbb% {P}({\mathbf{Y}}={\mathbf{y}})= blackboard_E start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT [ blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG bold_y end_ARG ∣ bold_Y = bold_y ) ⋅ blackboard_P ( over~ start_ARG bold_A end_ARG = bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT ∣ bold_Y = bold_y , italic_S ( over~ start_ARG bold_A end_ARG ) = italic_S ) ] ⋅ blackboard_P ( bold_Y = bold_y )
    =(b3)𝔼S[(𝐘^=𝐲^,𝐀~=𝐚π𝐘=𝐲,S(𝐀~)=S)](𝐘=𝐲)\displaystyle\overset{(b_{3})}{=}\mathbb{E}_{S}\left[\mathbb{P}(\hat{\mathbf{Y% }}=\hat{\mathbf{y}},\tilde{\mathbf{A}}={\mathbf{a}}_{\pi}\mid{\mathbf{Y}}={% \mathbf{y}},S(\tilde{\mathbf{A}})=S)\right]\cdot\mathbb{P}({\mathbf{Y}}={% \mathbf{y}})start_OVERACCENT ( italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) end_OVERACCENT start_ARG = end_ARG blackboard_E start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT [ blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG bold_y end_ARG , over~ start_ARG bold_A end_ARG = bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT ∣ bold_Y = bold_y , italic_S ( over~ start_ARG bold_A end_ARG ) = italic_S ) ] ⋅ blackboard_P ( bold_Y = bold_y )
    =(𝐘^=𝐲^,𝐀~=𝐚π𝐘=𝐲)(𝐘=𝐲)absentformulae-sequence^𝐘^𝐲~𝐀conditionalsubscript𝐚𝜋𝐘𝐲𝐘𝐲\displaystyle=\mathbb{P}(\hat{\mathbf{Y}}=\hat{{\mathbf{y}}},\tilde{\mathbf{A}% }={\mathbf{a}}_{\pi}\mid{\mathbf{Y}}={\mathbf{y}})\cdot\mathbb{P}({\mathbf{Y}}% ={\mathbf{y}})= blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG bold_y end_ARG , over~ start_ARG bold_A end_ARG = bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT ∣ bold_Y = bold_y ) ⋅ blackboard_P ( bold_Y = bold_y )
    =(𝐘^=𝐲^,𝐀~=𝐚π,𝐘=𝐲).absentformulae-sequence^𝐘^𝐲formulae-sequence~𝐀subscript𝐚𝜋𝐘𝐲\displaystyle=\mathbb{P}(\hat{\mathbf{Y}}=\hat{{\mathbf{y}}},\tilde{\mathbf{A}% }={\mathbf{a}}_{\pi},{\mathbf{Y}}={\mathbf{y}}).= blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG bold_y end_ARG , over~ start_ARG bold_A end_ARG = bold_a start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT , bold_Y = bold_y ) . (11)

    Here, step (b2)subscript𝑏2(b_{2})( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) has used eq. (9) and eq. (10), which establishes the equivalence between the condition law of A𝐴Aitalic_A and A~~𝐴\tilde{A}over~ start_ARG italic_A end_ARG; steps (b1)subscript𝑏1(b_{1})( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) and (b3)subscript𝑏3(b_{3})( italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) relies on the conditional independence relationships AY^|Yperpendicular-toabsentperpendicular-to𝐴conditional^𝑌𝑌A\mathchoice{\mathrel{\hbox to 0.0pt{$\displaystyle\perp$\hss}\mkern 2.0mu{% \displaystyle\perp}}}{\mathrel{\hbox to 0.0pt{$\textstyle\perp$\hss}\mkern 2.0% mu{\textstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptstyle\perp$\hss}\mkern 2% .0mu{\scriptstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptscriptstyle\perp$% \hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}\hat{Y}|Yitalic_A start_RELOP ⟂ ⟂ end_RELOP over^ start_ARG italic_Y end_ARG | italic_Y and A~Y^|Yperpendicular-toabsentperpendicular-to~𝐴conditional^𝑌𝑌\tilde{A}\mathchoice{\mathrel{\hbox to 0.0pt{$\displaystyle\perp$\hss}\mkern 2% .0mu{\displaystyle\perp}}}{\mathrel{\hbox to 0.0pt{$\textstyle\perp$\hss}% \mkern 2.0mu{\textstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptstyle\perp$% \hss}\mkern 2.0mu{\scriptstyle\perp}}}{\mathrel{\hbox to 0.0pt{$% \scriptscriptstyle\perp$\hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}\hat{Y}|Yover~ start_ARG italic_A end_ARG start_RELOP ⟂ ⟂ end_RELOP over^ start_ARG italic_Y end_ARG | italic_Y. Hence, conditional independence indicate the distributional equivalence (𝐘^,𝐀,𝐘)=𝑑(𝐘^,𝐀~,𝐘)^𝐘𝐀𝐘𝑑^𝐘~𝐀𝐘(\hat{\mathbf{Y}},{\mathbf{A}},{\mathbf{Y}})\overset{d}{=}(\hat{\mathbf{Y}},% \tilde{\mathbf{A}},{\mathbf{Y}})( over^ start_ARG bold_Y end_ARG , bold_A , bold_Y ) overitalic_d start_ARG = end_ARG ( over^ start_ARG bold_Y end_ARG , over~ start_ARG bold_A end_ARG , bold_Y ).

  2. 2.

    Taks 2: Show the further conditioned conditional independence Y^A|Y,S(𝐀)perpendicular-toabsentperpendicular-to^𝑌conditional𝐴𝑌𝑆𝐀\hat{Y}\mathchoice{\mathrel{\hbox to 0.0pt{$\displaystyle\perp$\hss}\mkern 2.0% mu{\displaystyle\perp}}}{\mathrel{\hbox to 0.0pt{$\textstyle\perp$\hss}\mkern 2% .0mu{\textstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptstyle\perp$\hss}\mkern 2% .0mu{\scriptstyle\perp}}}{\mathrel{\hbox to 0.0pt{$\scriptscriptstyle\perp$% \hss}\mkern 2.0mu{\scriptscriptstyle\perp}}}A|Y,S({\mathbf{A}})over^ start_ARG italic_Y end_ARG start_RELOP ⟂ ⟂ end_RELOP italic_A | italic_Y , italic_S ( bold_A ) given (𝐘^,𝐀,𝐘)=𝑑(𝐘^,𝐀~,𝐘)^𝐘𝐀𝐘𝑑^𝐘~𝐀𝐘(\hat{\mathbf{Y}},{\mathbf{A}},{\mathbf{Y}})\overset{d}{=}(\hat{\mathbf{Y}},% \tilde{\mathbf{A}},{\mathbf{Y}})( over^ start_ARG bold_Y end_ARG , bold_A , bold_Y ) overitalic_d start_ARG = end_ARG ( over^ start_ARG bold_Y end_ARG , over~ start_ARG bold_A end_ARG , bold_Y ).

    Proof of Task 2. When (𝐘^=𝐲^,𝐀=𝐚,𝐘=𝐲)=(𝐘^=𝐲^,𝐀~=𝐚,𝐘=y)formulae-sequence^𝐘^𝐲formulae-sequence𝐀𝐚𝐘𝐲formulae-sequence^𝐘^𝐲formulae-sequence~𝐀𝐚𝐘𝑦\mathbb{P}(\hat{\mathbf{Y}}=\hat{\mathbf{y}},{\mathbf{A}}={\mathbf{a}},{% \mathbf{Y}}={\mathbf{y}})=\mathbb{P}(\hat{\mathbf{Y}}=\hat{\mathbf{y}},\tilde{% \mathbf{A}}={\mathbf{a}},{\mathbf{Y}}=y)blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG bold_y end_ARG , bold_A = bold_a , bold_Y = bold_y ) = blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG bold_y end_ARG , over~ start_ARG bold_A end_ARG = bold_a , bold_Y = italic_y ), we have

    (𝐘^=𝐲^,𝐀=𝐚𝐘=𝐲,S(𝐀)=S)\displaystyle\mathbb{P}(\hat{\mathbf{Y}}=\hat{\mathbf{y}},{\mathbf{A}}={% \mathbf{a}}\mid{\mathbf{Y}}={\mathbf{y}},S({\mathbf{A}})=S)blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG bold_y end_ARG , bold_A = bold_a ∣ bold_Y = bold_y , italic_S ( bold_A ) = italic_S ) =(𝐘^=𝐲^,𝐀=𝐚,𝐘=𝐲)(𝐘=𝐲,S(𝐀)=S)absentformulae-sequence^𝐘^𝐲formulae-sequence𝐀𝐚𝐘𝐲formulae-sequence𝐘𝐲𝑆𝐀𝑆\displaystyle=\frac{\mathbb{P}(\hat{\mathbf{Y}}=\hat{\mathbf{y}},{\mathbf{A}}=% {\mathbf{a}},{\mathbf{Y}}={\mathbf{y}})}{\mathbb{P}({\mathbf{Y}}={\mathbf{y}},% S({\mathbf{A}})=S)}= divide start_ARG blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG bold_y end_ARG , bold_A = bold_a , bold_Y = bold_y ) end_ARG start_ARG blackboard_P ( bold_Y = bold_y , italic_S ( bold_A ) = italic_S ) end_ARG
    =(𝐘^=𝐲^,𝐀~=𝐚,𝐘=𝐲)(𝐘=𝐲,S(𝐀)=S)absentformulae-sequence^𝐘^𝐲formulae-sequence~𝐀𝐚𝐘𝐲formulae-sequence𝐘𝐲𝑆𝐀𝑆\displaystyle=\frac{\mathbb{P}(\hat{\mathbf{Y}}=\hat{\mathbf{y}},\tilde{% \mathbf{A}}={\mathbf{a}},{\mathbf{Y}}={\mathbf{y}})}{\mathbb{P}({\mathbf{Y}}={% \mathbf{y}},S({\mathbf{A}})=S)}= divide start_ARG blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG bold_y end_ARG , over~ start_ARG bold_A end_ARG = bold_a , bold_Y = bold_y ) end_ARG start_ARG blackboard_P ( bold_Y = bold_y , italic_S ( bold_A ) = italic_S ) end_ARG
    =(𝐘^=𝐲^,𝐀~=𝐚𝐘=𝐲,S(𝐀)=S)\displaystyle=\mathbb{P}(\hat{\mathbf{Y}}=\hat{\mathbf{y}},\tilde{\mathbf{A}}=% {\mathbf{a}}\mid{\mathbf{Y}}={\mathbf{y}},S({\mathbf{A}})=S)= blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG bold_y end_ARG , over~ start_ARG bold_A end_ARG = bold_a ∣ bold_Y = bold_y , italic_S ( bold_A ) = italic_S )
    =(b1)(𝐘^=𝐲^𝐘=𝐲,S(𝐀)=S)(𝐀~=𝐚𝐘=𝐲,S(𝐀)=S)\displaystyle\overset{(b_{1})}{=}\mathbb{P}(\hat{\mathbf{Y}}=\hat{\mathbf{y}}% \mid{\mathbf{Y}}={\mathbf{y}},S({\mathbf{A}})=S)\mathbb{P}(\tilde{\mathbf{A}}=% {\mathbf{a}}\mid{\mathbf{Y}}={\mathbf{y}},S({\mathbf{A}})=S)start_OVERACCENT ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_OVERACCENT start_ARG = end_ARG blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG bold_y end_ARG ∣ bold_Y = bold_y , italic_S ( bold_A ) = italic_S ) blackboard_P ( over~ start_ARG bold_A end_ARG = bold_a ∣ bold_Y = bold_y , italic_S ( bold_A ) = italic_S )
    =(b2)(𝐘^=y^𝐘=𝐲,S(𝐀)=S)(𝐀=𝐚𝐘=𝐲,S(𝐀)=S)\displaystyle\overset{(b_{2})}{=}\mathbb{P}(\hat{\mathbf{Y}}=\hat{y}\mid{% \mathbf{Y}}={\mathbf{y}},S({\mathbf{A}})=S)\mathbb{P}({\mathbf{A}}={\mathbf{a}% }\mid{\mathbf{Y}}={\mathbf{y}},S({\mathbf{A}})=S)start_OVERACCENT ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_OVERACCENT start_ARG = end_ARG blackboard_P ( over^ start_ARG bold_Y end_ARG = over^ start_ARG italic_y end_ARG ∣ bold_Y = bold_y , italic_S ( bold_A ) = italic_S ) blackboard_P ( bold_A = bold_a ∣ bold_Y = bold_y , italic_S ( bold_A ) = italic_S ) (12)

    Here, step (b1)subscript𝑏1(b_{1})( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) holds by the construction of 𝐀~~𝐀\tilde{\mathbf{A}}over~ start_ARG bold_A end_ARG, while step (b2)subscript𝑏2(b_{2})( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) holds as a result of eq. (9) and eq. (10).

  3. 3.

    Taks 3: Show the asymptotic equalized odds given (𝐘^,𝐀,𝐘)=𝑑(𝐘^,𝐀~,𝐘)^𝐘𝐀𝐘𝑑^𝐘~𝐀𝐘(\hat{\mathbf{Y}},{\mathbf{A}},{\mathbf{Y}})\overset{d}{=}(\hat{\mathbf{Y}},% \tilde{\mathbf{A}},{\mathbf{Y}})( over^ start_ARG bold_Y end_ARG , bold_A , bold_Y ) overitalic_d start_ARG = end_ARG ( over^ start_ARG bold_Y end_ARG , over~ start_ARG bold_A end_ARG , bold_Y ).

    Proof of Task 3. We prove this statement utilizing the previous statement and known multi-dimensional c.d.f (cumulative distribution function) estimation bound (Naaman, 2021). Let t1superscript𝑡1t^{1}italic_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT and t2superscript𝑡2t^{2}italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT be constant vectors of the same dimensions as Y^^𝑌\hat{Y}over^ start_ARG italic_Y end_ARG and A𝐴Aitalic_A, and t3superscript𝑡3t^{3}italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT be a constant vector of the same dimension as Y𝑌Yitalic_Y. Construct augmented matrix 𝐭1superscript𝐭1{\mathbf{t}}^{1}bold_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT, 𝐭2superscript𝐭2{\mathbf{t}}^{2}bold_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT, 𝐭3superscript𝐭3{\mathbf{t}}^{3}bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT where 𝐭1.1=t1superscriptsubscript𝐭11superscript𝑡1{\mathbf{t}}_{1.}^{1}=t^{1}bold_t start_POSTSUBSCRIPT 1 . end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT = italic_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT, 𝐭1.2=t2superscriptsubscript𝐭12superscript𝑡2{\mathbf{t}}_{1.}^{2}=t^{2}bold_t start_POSTSUBSCRIPT 1 . end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and ti.1=superscriptsubscript𝑡𝑖1t_{i.}^{1}=\inftyitalic_t start_POSTSUBSCRIPT italic_i . end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT = ∞, ti.2=superscriptsubscript𝑡𝑖2t_{i.}^{2}=\inftyitalic_t start_POSTSUBSCRIPT italic_i . end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT = ∞ for i=2,,n𝑖2𝑛i=2,\ldots,nitalic_i = 2 , … , italic_n, and 𝐭i.3=t3subscriptsuperscript𝐭3𝑖subscript𝑡3{\mathbf{t}}^{3}_{i.}=t_{3}bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i . end_POSTSUBSCRIPT = italic_t start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT the same for all i=1,,n𝑖1𝑛i=1,\ldots,nitalic_i = 1 , … , italic_n. Let (Y^1,A1,Y1)subscript^𝑌1subscript𝐴1subscript𝑌1(\hat{Y}_{1},A_{1},Y_{1})( over^ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) be a from the same distribution as (Y^,A,Y)^𝑌𝐴𝑌(\hat{Y},A,Y)( over^ start_ARG italic_Y end_ARG , italic_A , italic_Y ). Then,

    (Y^1t1,A1t2|Y1=t3)formulae-sequencesubscript^𝑌1superscript𝑡1subscript𝐴1conditionalsuperscript𝑡2subscript𝑌1superscript𝑡3\displaystyle\mathbb{P}\left(\hat{Y}_{1}\leq t^{1},A_{1}\leq t^{2}|Y_{1}=t^{3}\right)blackboard_P ( over^ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) =(b1)(Y^1t1,A1t2|𝐘=𝐭3)subscript𝑏1formulae-sequencesubscript^𝑌1superscript𝑡1subscript𝐴1conditionalsuperscript𝑡2𝐘superscript𝐭3\displaystyle\overset{(b_{1})}{=}\mathbb{P}\left(\hat{Y}_{1}\leq t^{1},A_{1}% \leq t^{2}|{\mathbf{Y}}={\mathbf{t}}^{3}\right)start_OVERACCENT ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_OVERACCENT start_ARG = end_ARG blackboard_P ( over^ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT )
    =(Y^1t1,A1t2,𝐘=𝐭3)(𝐘=𝐭3)absentformulae-sequencesubscript^𝑌1superscript𝑡1formulae-sequencesubscript𝐴1superscript𝑡2𝐘superscript𝐭3𝐘superscript𝐭3\displaystyle=\frac{\mathbb{P}\left(\hat{Y}_{1}\leq t^{1},A_{1}\leq t^{2},{% \mathbf{Y}}={\mathbf{t}}^{3}\right)}{\mathbb{P}({\mathbf{Y}}={\mathbf{t}}^{3})}= divide start_ARG blackboard_P ( over^ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) end_ARG start_ARG blackboard_P ( bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) end_ARG
    =(b2)(𝐘^𝐭1,𝐀𝐭2,𝐘=𝐭3)(𝐘=𝐭3),subscript𝑏2formulae-sequence^𝐘superscript𝐭1formulae-sequence𝐀superscript𝐭2𝐘superscript𝐭3𝐘superscript𝐭3\displaystyle\overset{(b_{2})}{=}\frac{\mathbb{P}\left(\hat{\mathbf{Y}}\leq{% \mathbf{t}}^{1},{\mathbf{A}}\leq{\mathbf{t}}^{2},{\mathbf{Y}}={\mathbf{t}}^{3}% \right)}{\mathbb{P}({\mathbf{Y}}={\mathbf{t}}^{3})},start_OVERACCENT ( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_OVERACCENT start_ARG = end_ARG divide start_ARG blackboard_P ( over^ start_ARG bold_Y end_ARG ≤ bold_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , bold_A ≤ bold_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) end_ARG start_ARG blackboard_P ( bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) end_ARG ,

    where step (b1)subscript𝑏1(b_{1})( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) has used the fact that (Xi,Ai,Yi)subscript𝑋𝑖subscript𝐴𝑖subscript𝑌𝑖(X_{i},A_{i},Y_{i})( italic_X start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_Y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ), for i=1,,n𝑖1𝑛i=1,\ldots,nitalic_i = 1 , … , italic_n are independently generated, thus, conditioning on additional independent Y2,,Ynsubscript𝑌2subscript𝑌𝑛Y_{2},\ldots,Y_{n}italic_Y start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_Y start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT does not change the probability; step (b2)subscript𝑏2(b_{2})( italic_b start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) holds because 𝐭i.1subscriptsuperscript𝐭1𝑖{\mathbf{t}}^{1}_{i.}bold_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i . end_POSTSUBSCRIPT and 𝐭i.2subscriptsuperscript𝐭2𝑖{\mathbf{t}}^{2}_{i.}bold_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i . end_POSTSUBSCRIPT, for i=2,,n𝑖2𝑛i=2,\ldots,nitalic_i = 2 , … , italic_n, take infinite values and do not modify the event considered. Utilizing eq. (12), have further have

    (𝐘^𝐭1,𝐀𝐭2,𝐘=𝐭3)(𝐘=𝐭3)formulae-sequence^𝐘superscript𝐭1formulae-sequence𝐀superscript𝐭2𝐘superscript𝐭3𝐘superscript𝐭3\displaystyle\frac{\mathbb{P}\left(\hat{\mathbf{Y}}\leq{\mathbf{t}}^{1},{% \mathbf{A}}\leq{\mathbf{t}}^{2},{\mathbf{Y}}={\mathbf{t}}^{3}\right)}{\mathbb{% P}({\mathbf{Y}}={\mathbf{t}}^{3})}divide start_ARG blackboard_P ( over^ start_ARG bold_Y end_ARG ≤ bold_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , bold_A ≤ bold_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) end_ARG start_ARG blackboard_P ( bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) end_ARG
    =\displaystyle== 𝔼S|𝐘[(𝐘^𝐭1,𝐀𝐭2|𝐘=𝐭3,S(𝐀)=S)]\displaystyle\mathbb{E}_{S|{\mathbf{Y}}}\left[\mathbb{P}\left(\hat{\mathbf{Y}}% \leq{\mathbf{t}}^{1},{\mathbf{A}}\leq{\mathbf{t}}^{2}|{\mathbf{Y}}={\mathbf{t}% }^{3},S({\mathbf{A}})=S\right)\right]blackboard_E start_POSTSUBSCRIPT italic_S | bold_Y end_POSTSUBSCRIPT [ blackboard_P ( over^ start_ARG bold_Y end_ARG ≤ bold_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , bold_A ≤ bold_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , italic_S ( bold_A ) = italic_S ) ]
    =\displaystyle== 𝔼S|𝐘[(𝐘^𝐭1|𝐘=𝐭3,S(𝐀~)=S)(𝐀~𝐭2|𝐘=𝐭3,S(𝐀~)=S)]\displaystyle\mathbb{E}_{S|{\mathbf{Y}}}\left[\mathbb{P}\left(\hat{\mathbf{Y}}% \leq{\mathbf{t}}^{1}|{\mathbf{Y}}={\mathbf{t}}^{3},S(\tilde{\mathbf{A}})=S% \right)\mathbb{P}\left(\tilde{\mathbf{A}}\leq{\mathbf{t}}^{2}|{\mathbf{Y}}={% \mathbf{t}}^{3},S(\tilde{\mathbf{A}})=S\right)\right]blackboard_E start_POSTSUBSCRIPT italic_S | bold_Y end_POSTSUBSCRIPT [ blackboard_P ( over^ start_ARG bold_Y end_ARG ≤ bold_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , italic_S ( over~ start_ARG bold_A end_ARG ) = italic_S ) blackboard_P ( over~ start_ARG bold_A end_ARG ≤ bold_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , italic_S ( over~ start_ARG bold_A end_ARG ) = italic_S ) ]
    =(b3)subscript𝑏3\displaystyle\overset{(b_{3})}{=}start_OVERACCENT ( italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) end_OVERACCENT start_ARG = end_ARG 𝔼S|𝐘[(Y^1t1|𝐘=𝐭3,S(𝐀~)=S)(A~1t2|𝐘=𝐭3,S(𝐀~)=S)]\displaystyle\mathbb{E}_{S|{\mathbf{Y}}}\left[\mathbb{P}\left(\hat{Y}_{1}\leq t% ^{1}|{\mathbf{Y}}={\mathbf{t}}^{3},S(\tilde{\mathbf{A}})=S\right)\mathbb{P}% \left(\tilde{A}_{1}\leq t^{2}|{\mathbf{Y}}={\mathbf{t}}^{3},S(\tilde{\mathbf{A% }})=S\right)\right]blackboard_E start_POSTSUBSCRIPT italic_S | bold_Y end_POSTSUBSCRIPT [ blackboard_P ( over^ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , italic_S ( over~ start_ARG bold_A end_ARG ) = italic_S ) blackboard_P ( over~ start_ARG italic_A end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , italic_S ( over~ start_ARG bold_A end_ARG ) = italic_S ) ]
    =\displaystyle== 𝔼S|𝐘[(Y^1t1|Y1=𝐭3,S(𝐀)=S)(A1t2|𝐘=𝐭3,S(𝐀)=S)]\displaystyle\mathbb{E}_{S|{\mathbf{Y}}}\left[\mathbb{P}\left(\hat{Y}_{1}\leq t% ^{1}|Y_{1}={\mathbf{t}}^{3},S({\mathbf{A}})=S\right)\mathbb{P}\left(A_{1}\leq t% ^{2}|{\mathbf{Y}}={\mathbf{t}}^{3},S({\mathbf{A}})=S\right)\right]blackboard_E start_POSTSUBSCRIPT italic_S | bold_Y end_POSTSUBSCRIPT [ blackboard_P ( over^ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , italic_S ( bold_A ) = italic_S ) blackboard_P ( italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , italic_S ( bold_A ) = italic_S ) ]
    =\displaystyle== (Y^1t1|Y1=t3)(A1t2|Y1=t3)+Δsubscript^𝑌1conditionalsuperscript𝑡1subscript𝑌1superscript𝑡3subscript𝐴1conditionalsuperscript𝑡2subscript𝑌1superscript𝑡3Δ\displaystyle\mathbb{P}\left(\hat{Y}_{1}\leq t^{1}|Y_{1}=t^{3}\right)\mathbb{P% }\left(A_{1}\leq t^{2}|Y_{1}=t^{3}\right)+\Deltablackboard_P ( over^ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) blackboard_P ( italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) + roman_Δ

    where step (b3)subscript𝑏3(b_{3})( italic_b start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) has used again the fact that 𝐭i.1=subscriptsuperscript𝐭1𝑖{\mathbf{t}}^{1}_{i.}=\inftybold_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i . end_POSTSUBSCRIPT = ∞ and 𝐭i.2=subscriptsuperscript𝐭2𝑖{\mathbf{t}}^{2}_{i.}=\inftybold_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_i . end_POSTSUBSCRIPT = ∞, for i=2,,n𝑖2𝑛i=2,\ldots,nitalic_i = 2 , … , italic_n, and ΔΔ\Deltaroman_Δ is defined as

    ΔΔ\displaystyle\Deltaroman_Δ =𝔼S|𝐘[(Y^1t1|𝐘=𝐭3,S(𝐀)=S)((A1t2|𝐘=𝐭3,S(𝐀)=S)(A1t2|Y1=t3))],\displaystyle=\mathbb{E}_{S|{\mathbf{Y}}}\left[\mathbb{P}\left(\hat{Y}_{1}\leq t% ^{1}|{\mathbf{Y}}={\mathbf{t}}^{3},S({\mathbf{A}})=S\right)\left(\mathbb{P}% \left(A_{1}\leq t^{2}|{\mathbf{Y}}={\mathbf{t}}^{3},S({\mathbf{A}})=S\right)-% \mathbb{P}\left(A_{1}\leq t^{2}|Y_{1}=t^{3}\right)\right)\right],= blackboard_E start_POSTSUBSCRIPT italic_S | bold_Y end_POSTSUBSCRIPT [ blackboard_P ( over^ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , italic_S ( bold_A ) = italic_S ) ( blackboard_P ( italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , italic_S ( bold_A ) = italic_S ) - blackboard_P ( italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) ) ] ,
    =𝔼S|𝐘[(Y^1t1|𝐘=𝐭3,S(𝐀)=S)((A1t2|𝐘=𝐭3,S(𝐀)=S)(A1t2|Y1=t3))]\displaystyle=\mathbb{E}_{S|{\mathbf{Y}}}\left[\mathbb{P}\left(\hat{Y}_{1}\leq t% ^{1}|{\mathbf{Y}}={\mathbf{t}}^{3},S({\mathbf{A}})=S\right)\left(\mathbb{P}% \left(A_{1}\leq t^{2}|{\mathbf{Y}}={\mathbf{t}}^{3},S({\mathbf{A}})=S\right)-% \mathbb{P}\left(A_{1}\leq t^{2}|Y_{1}=t^{3}\right)\right)\right]= blackboard_E start_POSTSUBSCRIPT italic_S | bold_Y end_POSTSUBSCRIPT [ blackboard_P ( over^ start_ARG italic_Y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , italic_S ( bold_A ) = italic_S ) ( blackboard_P ( italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , italic_S ( bold_A ) = italic_S ) - blackboard_P ( italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) ) ]

    Our goal is equivalent to bound |Δ|Δ|\Delta|| roman_Δ |. Notice that since 𝐭1.3==𝐭n.3=t3superscriptsubscript𝐭13superscriptsubscript𝐭𝑛3superscript𝑡3{\mathbf{t}}_{1.}^{3}=\ldots={\mathbf{t}}_{n.}^{3}=t^{3}bold_t start_POSTSUBSCRIPT 1 . end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT = … = bold_t start_POSTSUBSCRIPT italic_n . end_POSTSUBSCRIPT start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT = italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT are the same for all n𝑛nitalic_n samples, A1subscript𝐴1A_{1}italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, \ldots, Ansubscript𝐴𝑛A_{n}italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT are exchangeable given S(𝐀)=S𝑆𝐀𝑆S({\mathbf{A}})=Sitalic_S ( bold_A ) = italic_S. Consequently, we obtain that

    |Δ|Δ\displaystyle|\Delta|| roman_Δ | 𝔼S|𝐘[|(A1t2|𝐘=𝐭3,S(𝐀)=S)(A1t2|Y1=t3)|]\displaystyle\leq\mathbb{E}_{S|{\mathbf{Y}}}\left[|\mathbb{P}\left(A_{1}\leq t% ^{2}|{\mathbf{Y}}={\mathbf{t}}^{3},S({\mathbf{A}})=S\right)-\mathbb{P}\left(A_% {1}\leq t^{2}|Y_{1}=t^{3}\right)|\right]≤ blackboard_E start_POSTSUBSCRIPT italic_S | bold_Y end_POSTSUBSCRIPT [ | blackboard_P ( italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , italic_S ( bold_A ) = italic_S ) - blackboard_P ( italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) | ]
    =S|(A1t2𝐘=𝐭3,S(𝐀)=S)(S(𝐀)=S|𝐘=𝐭3)(A1t2|Y1=t3)(S(𝐀)=S|𝐘=𝐭3)|\displaystyle=\sum_{S}|\mathbb{P}\left(A_{1}\leq t^{2}\mid{\mathbf{Y}}={% \mathbf{t}}^{3},S({\mathbf{A}})=S\right)\mathbb{P}\left(S({\mathbf{A}})=S|{% \mathbf{Y}}={\mathbf{t}}^{3}\right)-\mathbb{P}\left(A_{1}\leq t^{2}|Y_{1}=t^{3% }\right)\mathbb{P}\left(S({\mathbf{A}})=S|{\mathbf{Y}}={\mathbf{t}}^{3}\right)|= ∑ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT | blackboard_P ( italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∣ bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , italic_S ( bold_A ) = italic_S ) blackboard_P ( italic_S ( bold_A ) = italic_S | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) - blackboard_P ( italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_Y start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) blackboard_P ( italic_S ( bold_A ) = italic_S | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) |
    =(b4)S|F^yS(t2)Fy(t2)|(S(𝐀)=S|𝐘=𝐭3),subscript𝑏4subscript𝑆superscriptsubscript^𝐹𝑦𝑆superscript𝑡2subscript𝐹𝑦superscript𝑡2𝑆𝐀conditional𝑆𝐘superscript𝐭3\displaystyle\overset{(b_{4})}{=}\sum_{S}|\hat{F}_{y}^{S}(t^{2})-F_{y}(t^{2})|% \mathbb{P}\left(S({\mathbf{A}})=S|{\mathbf{Y}}={\mathbf{t}}^{3}\right),start_OVERACCENT ( italic_b start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ) end_OVERACCENT start_ARG = end_ARG ∑ start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT | over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT ( italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) - italic_F start_POSTSUBSCRIPT italic_y end_POSTSUBSCRIPT ( italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) | blackboard_P ( italic_S ( bold_A ) = italic_S | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) ,

    where step (b4)subscript𝑏4(b_{4})( italic_b start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT ) has used the equivalence of A1,..,AnA_{1},..,A_{n}italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , . . , italic_A start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, which leads to (A1t2|𝐘=𝐭3,S(𝐀)=S)\mathbb{P}\left(A_{1}\leq t^{2}|{\mathbf{Y}}={\mathbf{t}}^{3},S({\mathbf{A}})=% S\right)blackboard_P ( italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | bold_Y = bold_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT , italic_S ( bold_A ) = italic_S ) the S𝑆Sitalic_S-induced empirical c.d.f evaluated at t2superscript𝑡2t^{2}italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT. Also, S𝑆Sitalic_S is a set n𝑛nitalic_n samples A𝐴Aitalic_A generated conditional on Y=t3𝑌superscript𝑡3Y=t^{3}italic_Y = italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT, and F^t3S(.)\hat{F}_{t^{3}}^{S}(.)over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT ( . ) denotes the empirical c.d.f induced by S𝑆Sitalic_S and Ft3(.)F_{t^{3}}(.)italic_F start_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( . ) denote the theoretical c.d.f of A|Y=t3conditional𝐴𝑌superscript𝑡3A|Y=t^{3}italic_A | italic_Y = italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT. From Lemma 4.1 in (Naaman, 2021), which generalizes Dvoretzky–Kiefer–Wolfowitz inequality to multi-dimensional empirical c.d.f to we know

    (supt2|F^t3S(t2)Ft3(t2)|>δ)p(n+1)exp(2nt2).subscriptsupremumsuperscript𝑡2superscriptsubscript^𝐹superscript𝑡3𝑆superscript𝑡2subscript𝐹superscript𝑡3superscript𝑡2𝛿𝑝𝑛12𝑛superscript𝑡2\mathbb{P}(\sup_{t^{2}}|\hat{F}_{t^{3}}^{S}(t^{2})-F_{t^{3}}(t^{2})|>\delta)% \leq p(n+1)\exp(-2nt^{2}).blackboard_P ( roman_sup start_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT | over^ start_ARG italic_F end_ARG start_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_S end_POSTSUPERSCRIPT ( italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) - italic_F start_POSTSUBSCRIPT italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) | > italic_δ ) ≤ italic_p ( italic_n + 1 ) roman_exp ( - 2 italic_n italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) .

    Combine this equality with the bound for |Δ|Δ|\Delta|| roman_Δ |, we have

    (|Δ|>Clogp+lognn)0,Δ𝐶𝑝𝑛𝑛0\mathbb{P}(|\Delta|>C\frac{\log p+\log n}{n})\rightarrow 0,blackboard_P ( | roman_Δ | > italic_C divide start_ARG roman_log italic_p + roman_log italic_n end_ARG start_ARG italic_n end_ARG ) → 0 ,

    for a sufficiently large C𝐶Citalic_C as n𝑛n\rightarrow\inftyitalic_n → ∞. We thus reached our conclusion that

    limn[(Y^t1,A1t2|Y=t3)(Y^t1|Y=t3)(At2|Y=t3)]0,subscript𝑛delimited-[]formulae-sequence^𝑌superscript𝑡1subscript𝐴1conditionalsuperscript𝑡2𝑌superscript𝑡3^𝑌conditionalsuperscript𝑡1𝑌superscript𝑡3𝐴conditionalsuperscript𝑡2𝑌superscript𝑡30\lim_{n\rightarrow\infty}\left[\mathbb{P}\left(\hat{Y}\leq t^{1},A_{1}\leq t^{% 2}|Y=t^{3}\right)-\mathbb{P}\left(\hat{Y}\leq t^{1}|Y=t^{3}\right)\mathbb{P}% \left(A\leq t^{2}|Y=t^{3}\right)\right]\rightarrow 0,roman_lim start_POSTSUBSCRIPT italic_n → ∞ end_POSTSUBSCRIPT [ blackboard_P ( over^ start_ARG italic_Y end_ARG ≤ italic_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT , italic_A start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_Y = italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) - blackboard_P ( over^ start_ARG italic_Y end_ARG ≤ italic_t start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT | italic_Y = italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) blackboard_P ( italic_A ≤ italic_t start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT | italic_Y = italic_t start_POSTSUPERSCRIPT 3 end_POSTSUPERSCRIPT ) ] → 0 ,

Proof of Theorem 2.3.

For fixed f𝑓fitalic_f, the optimal discriminator D*superscript𝐷D^{*}italic_D start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT is reached at

θ^d*=argminθdd(θf,θd),subscriptsuperscript^𝜃𝑑subscriptsubscript𝜃𝑑subscript𝑑subscript𝜃𝑓subscript𝜃𝑑\hat{\theta}^{*}_{d}=\arg\min_{\theta_{d}}\mathcal{L}_{d}\left(\theta_{f},% \theta_{d}\right),over^ start_ARG italic_θ end_ARG start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT = roman_arg roman_min start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT caligraphic_L start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) ,

in which case, the discriminating classifier is Dθd*()=pY^AY()pY^AY()+pY^A~Y()subscript𝐷subscriptsuperscript𝜃𝑑subscript𝑝^𝑌𝐴𝑌subscript𝑝^𝑌𝐴𝑌subscript𝑝^𝑌~𝐴𝑌D_{\theta^{*}_{d}}(\cdot)=\dfrac{p_{\hat{Y}AY}(\cdot)}{p_{\hat{Y}AY}(\cdot)+p_% {\hat{Y}\tilde{A}Y}(\cdot)}italic_D start_POSTSUBSCRIPT italic_θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( ⋅ ) = divide start_ARG italic_p start_POSTSUBSCRIPT over^ start_ARG italic_Y end_ARG italic_A italic_Y end_POSTSUBSCRIPT ( ⋅ ) end_ARG start_ARG italic_p start_POSTSUBSCRIPT over^ start_ARG italic_Y end_ARG italic_A italic_Y end_POSTSUBSCRIPT ( ⋅ ) + italic_p start_POSTSUBSCRIPT over^ start_ARG italic_Y end_ARG over~ start_ARG italic_A end_ARG italic_Y end_POSTSUBSCRIPT ( ⋅ ) end_ARG (See Proposition 1 in (Goodfellow et al., 2014)), and dsubscript𝑑\mathcal{L}_{d}caligraphic_L start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT reduces to

d(θf,θd)subscript𝑑subscript𝜃𝑓subscript𝜃𝑑\displaystyle\mathcal{L}_{d}\left(\theta_{f},\theta_{d}\right)caligraphic_L start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) =log(4)2JSD(pY^AYpY^A~Y)absent42𝐽𝑆𝐷conditionalsubscript𝑝^𝑌𝐴𝑌subscript𝑝^𝑌~𝐴𝑌\displaystyle=\log(4)-2\cdot JSD\left(p_{\hat{Y}AY}\|~{}p_{\hat{Y}\tilde{A}Y}\right)= roman_log ( 4 ) - 2 ⋅ italic_J italic_S italic_D ( italic_p start_POSTSUBSCRIPT over^ start_ARG italic_Y end_ARG italic_A italic_Y end_POSTSUBSCRIPT ∥ italic_p start_POSTSUBSCRIPT over^ start_ARG italic_Y end_ARG over~ start_ARG italic_A end_ARG italic_Y end_POSTSUBSCRIPT )

where JSD𝐽𝑆𝐷JSDitalic_J italic_S italic_D is the Jensen-Shannon divergence between the distributions of (Y^,A,Y)^𝑌𝐴𝑌(\hat{Y},A,Y)( over^ start_ARG italic_Y end_ARG , italic_A , italic_Y ) and (Y^,A~,Y)^𝑌~𝐴𝑌(\hat{Y},\tilde{A},Y)( over^ start_ARG italic_Y end_ARG , over~ start_ARG italic_A end_ARG , italic_Y ). Plug this this into Vμ(θf,θd)subscript𝑉𝜇subscript𝜃𝑓subscript𝜃𝑑V_{\mu}(\theta_{f},\theta_{d})italic_V start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ), we reach the single-parameter form of the original objective:

Vμ(θf)subscript𝑉𝜇subscript𝜃𝑓\displaystyle V_{\mu}\left(\theta_{f}\right)italic_V start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ) =minθdVμ(θf,θd)=(1μ)f(θf)+2μJSD(pY^AYpY^A~Y)μlog(4)(1μ)H(YX)μlog(4),absentsubscriptsubscript𝜃𝑑subscript𝑉𝜇subscript𝜃𝑓subscript𝜃𝑑1𝜇subscript𝑓subscript𝜃𝑓2𝜇𝐽𝑆𝐷conditionalsubscript𝑝^𝑌𝐴𝑌subscript𝑝^𝑌~𝐴𝑌𝜇41𝜇𝐻conditional𝑌𝑋𝜇4\displaystyle=\min_{\theta_{d}}V_{\mu}(\theta_{f},\theta_{d})=(1-\mu)\mathcal{% L}_{f}\left(\theta_{f}\right)+2\mu\cdot JSD\left(p_{\hat{Y}AY}\|~{}p_{\hat{Y}% \tilde{A}Y}\right)-\mu\log(4)\geq(1-\mu)H(Y\mid X)-\mu\log(4),= roman_min start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_V start_POSTSUBSCRIPT italic_μ end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_d end_POSTSUBSCRIPT ) = ( 1 - italic_μ ) caligraphic_L start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ) + 2 italic_μ ⋅ italic_J italic_S italic_D ( italic_p start_POSTSUBSCRIPT over^ start_ARG italic_Y end_ARG italic_A italic_Y end_POSTSUBSCRIPT ∥ italic_p start_POSTSUBSCRIPT over^ start_ARG italic_Y end_ARG over~ start_ARG italic_A end_ARG italic_Y end_POSTSUBSCRIPT ) - italic_μ roman_log ( 4 ) ≥ ( 1 - italic_μ ) italic_H ( italic_Y ∣ italic_X ) - italic_μ roman_log ( 4 ) ,

where the equality holds at θ*=argminθfV(θf)superscript𝜃subscriptsubscript𝜃𝑓𝑉subscript𝜃𝑓\theta^{*}=\arg\min_{\theta_{f}}V\left(\theta_{f}\right)italic_θ start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = roman_arg roman_min start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_V ( italic_θ start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ). In summary, the solution value (1μ)H(YX)μlog(4)1𝜇𝐻conditional𝑌𝑋𝜇4(1-\mu)H(Y\mid X)-\mu\log(4)( 1 - italic_μ ) italic_H ( italic_Y ∣ italic_X ) - italic_μ roman_log ( 4 ) is achieved when:

  • θ^fsubscript^𝜃𝑓\hat{\theta}_{f}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT minimizes the negative log\logroman_log-likelihood of YXconditional𝑌𝑋Y\mid Xitalic_Y ∣ italic_X under f𝑓fitalic_f, which happens when θ^fsubscript^𝜃𝑓\hat{\theta}_{f}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT are the solutions of an optimal predictor f𝑓fitalic_f. In this case, fsubscript𝑓\mathcal{L}_{f}caligraphic_L start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT reduces to its minimum value H(YX)𝐻conditional𝑌𝑋H(Y\mid X)italic_H ( italic_Y ∣ italic_X )

  • θ^fsubscript^𝜃𝑓\hat{\theta}_{f}over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT minimizes the Jensen-Shannon divergence JSD(pY^AYpY^A~Y)𝐽𝑆𝐷conditionalsubscript𝑝^𝑌𝐴𝑌subscript𝑝^𝑌~𝐴𝑌JSD\left(p_{\hat{Y}AY}\|~{}p_{\hat{Y}\tilde{A}Y}\right)italic_J italic_S italic_D ( italic_p start_POSTSUBSCRIPT over^ start_ARG italic_Y end_ARG italic_A italic_Y end_POSTSUBSCRIPT ∥ italic_p start_POSTSUBSCRIPT over^ start_ARG italic_Y end_ARG over~ start_ARG italic_A end_ARG italic_Y end_POSTSUBSCRIPT ), Since the Jensen–Shannon divergence between two distributions is always non-negative, and zero if and only if they are equal.

The second characterization is equivalent to the condition (Y^AY)=d(Y^A~Y)superscript𝑑^𝑌𝐴𝑌^𝑌~𝐴𝑌(\hat{Y}AY)\,{\buildrel d\over{=}}\,(\hat{Y}\tilde{A}Y)( over^ start_ARG italic_Y end_ARG italic_A italic_Y ) start_RELOP SUPERSCRIPTOP start_ARG = end_ARG start_ARG italic_d end_ARG end_RELOP ( over^ start_ARG italic_Y end_ARG over~ start_ARG italic_A end_ARG italic_Y ). Note that this is a population level characterization with 𝔼𝔼\mathbb{E}blackboard_E corresponding to the case where n𝑛n\rightarrow\inftyitalic_n → ∞. As a result, by the asymptotic equalized odds statement in Theorem 2.1, we have that f^θ^fsubscript^𝑓subscript^𝜃𝑓\hat{f}_{\hat{\theta}_{f}}over^ start_ARG italic_f end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_θ end_ARG start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT end_POSTSUBSCRIPT also satisfies equalized odds. ∎

Proof of Proposition 3.2.

The proposed test is a special case of the Conditional Permutation Test (Berrett et al., 2020), so the proof is a direct result from Theorem 2.1 in our paper and Theorem 1 in (Berrett et al., 2020) .

Appendix B Sampling Algorithm

To sample the permutation ΠΠ\Piroman_Π from the probabilities:

{Π=π𝐀,𝐘}=qn(𝐘π1𝐀)π𝒮nqn(𝐘π1𝐀),conditional-setΠ𝜋𝐀𝐘superscript𝑞𝑛conditionalsubscript𝐘superscript𝜋1𝐀subscriptsuperscript𝜋subscript𝒮𝑛superscript𝑞𝑛conditionalsubscript𝐘superscriptsuperscript𝜋1𝐀\mathbb{P}\left\{\Pi=\pi\mid{\mathbf{A}},{\mathbf{Y}}\right\}=\frac{q^{n}\left% ({\mathbf{Y}}_{{\pi}^{-1}}\mid{\mathbf{A}}\right)}{\sum_{\pi^{\prime}\in% \mathcal{S}_{n}}q^{n}\left({\mathbf{Y}}_{{\pi^{\prime}}^{-1}}\mid{\mathbf{A}}% \right)},blackboard_P { roman_Π = italic_π ∣ bold_A , bold_Y } = divide start_ARG italic_q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_Y start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∣ bold_A ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ caligraphic_S start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_q start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ( bold_Y start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ∣ bold_A ) end_ARG ,

we use the Parallelized pairwise sampler for the CPT proposed in Berrett et al. (2020), which is detailed as follows:

Input: Data (𝐀,𝐘)𝐀𝐘({\mathbf{A}},{\mathbf{Y}})( bold_A , bold_Y ), Initial permutation Π[0]superscriptΠdelimited-[]0\Pi^{[0]}roman_Π start_POSTSUPERSCRIPT [ 0 ] end_POSTSUPERSCRIPT, integer S1𝑆1S\geq 1italic_S ≥ 1.

1:for s=1,,S𝑠1𝑆s=1,\dots,Sitalic_s = 1 , … , italic_S do
2:     Sample uniformly without replacement from {1,,n}1𝑛\{1,\ldots,n\}{ 1 , … , italic_n } to obtain disjoint pairs
(is,1,js,1),,(is,n/2,js,n/2).subscript𝑖𝑠1subscript𝑗𝑠1subscript𝑖𝑠𝑛2subscript𝑗𝑠𝑛2\left(i_{s,1},j_{s,1}\right),\ldots,\left(i_{s,\lfloor n/2\rfloor},j_{s,% \lfloor n/2\rfloor}\right).( italic_i start_POSTSUBSCRIPT italic_s , 1 end_POSTSUBSCRIPT , italic_j start_POSTSUBSCRIPT italic_s , 1 end_POSTSUBSCRIPT ) , … , ( italic_i start_POSTSUBSCRIPT italic_s , ⌊ italic_n / 2 ⌋ end_POSTSUBSCRIPT , italic_j start_POSTSUBSCRIPT italic_s , ⌊ italic_n / 2 ⌋ end_POSTSUBSCRIPT ) .
3:     Draw independent Bernoulli variables Bs,1,,Bs,n/2subscript𝐵𝑠1subscript𝐵𝑠𝑛2B_{s,1},\ldots,B_{s,\lfloor n/2\rfloor}italic_B start_POSTSUBSCRIPT italic_s , 1 end_POSTSUBSCRIPT , … , italic_B start_POSTSUBSCRIPT italic_s , ⌊ italic_n / 2 ⌋ end_POSTSUBSCRIPT with odds ratios
{Bs,k=1}{Bs,k=0}=q(Y(Π[s1](js,k))Ais,k)q(Y(Π[s1](is,k)Ajs,k)q(Y(Π[s1](is,k))Ais,k)q(Y(Π[s1](js,k))Ajs,k).\frac{\mathbb{P}\left\{B_{s,k}=1\right\}}{\mathbb{P}\left\{B_{s,k}=0\right\}}=% \frac{q\left(Y_{\left(\Pi^{[s-1]}\left(j_{s,k}\right)\right)}\mid A_{i_{s,k}}% \right)\cdot q\left(Y_{\left(\Pi^{[s-1]}\left(i_{s,k}\right)\right.}\mid A_{j_% {s,k}}\right)}{q\left(Y_{\left(\Pi^{[s-1]}\left(i_{s,k}\right)\right)}\mid A_{% i_{s,k}}\right)\cdot q\left(Y_{\left(\Pi^{[s-1]}\left(j_{s,k}\right)\right)}% \mid A_{j_{s,k}}\right)}.divide start_ARG blackboard_P { italic_B start_POSTSUBSCRIPT italic_s , italic_k end_POSTSUBSCRIPT = 1 } end_ARG start_ARG blackboard_P { italic_B start_POSTSUBSCRIPT italic_s , italic_k end_POSTSUBSCRIPT = 0 } end_ARG = divide start_ARG italic_q ( italic_Y start_POSTSUBSCRIPT ( roman_Π start_POSTSUPERSCRIPT [ italic_s - 1 ] end_POSTSUPERSCRIPT ( italic_j start_POSTSUBSCRIPT italic_s , italic_k end_POSTSUBSCRIPT ) ) end_POSTSUBSCRIPT ∣ italic_A start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_s , italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ⋅ italic_q ( italic_Y start_POSTSUBSCRIPT ( roman_Π start_POSTSUPERSCRIPT [ italic_s - 1 ] end_POSTSUPERSCRIPT ( italic_i start_POSTSUBSCRIPT italic_s , italic_k end_POSTSUBSCRIPT ) end_POSTSUBSCRIPT ∣ italic_A start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_s , italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG start_ARG italic_q ( italic_Y start_POSTSUBSCRIPT ( roman_Π start_POSTSUPERSCRIPT [ italic_s - 1 ] end_POSTSUPERSCRIPT ( italic_i start_POSTSUBSCRIPT italic_s , italic_k end_POSTSUBSCRIPT ) ) end_POSTSUBSCRIPT ∣ italic_A start_POSTSUBSCRIPT italic_i start_POSTSUBSCRIPT italic_s , italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ⋅ italic_q ( italic_Y start_POSTSUBSCRIPT ( roman_Π start_POSTSUPERSCRIPT [ italic_s - 1 ] end_POSTSUPERSCRIPT ( italic_j start_POSTSUBSCRIPT italic_s , italic_k end_POSTSUBSCRIPT ) ) end_POSTSUBSCRIPT ∣ italic_A start_POSTSUBSCRIPT italic_j start_POSTSUBSCRIPT italic_s , italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) end_ARG .
Define Π[s]superscriptΠdelimited-[]𝑠\Pi^{[s]}roman_Π start_POSTSUPERSCRIPT [ italic_s ] end_POSTSUPERSCRIPT by swap** Π[s1](is,k)superscriptΠdelimited-[]𝑠1subscript𝑖𝑠𝑘\Pi^{[s-1]}\left(i_{s,k}\right)roman_Π start_POSTSUPERSCRIPT [ italic_s - 1 ] end_POSTSUPERSCRIPT ( italic_i start_POSTSUBSCRIPT italic_s , italic_k end_POSTSUBSCRIPT ) and Π[s1](js,k)superscriptΠdelimited-[]𝑠1subscript𝑗𝑠𝑘\Pi^{[s-1]}\left(j_{s,k}\right)roman_Π start_POSTSUPERSCRIPT [ italic_s - 1 ] end_POSTSUPERSCRIPT ( italic_j start_POSTSUBSCRIPT italic_s , italic_k end_POSTSUBSCRIPT ) for each k𝑘kitalic_k with Bs,k=1subscript𝐵𝑠𝑘1B_{s,k}=1italic_B start_POSTSUBSCRIPT italic_s , italic_k end_POSTSUBSCRIPT = 1.
4:end for

Output: Permuted copy 𝐀~=𝐀Π[S]1~𝐀subscript𝐀superscriptsuperscriptΠdelimited-[]𝑆1\tilde{\mathbf{A}}={\mathbf{A}}_{{\Pi^{[S]}}^{-1}}over~ start_ARG bold_A end_ARG = bold_A start_POSTSUBSCRIPT roman_Π start_POSTSUPERSCRIPT [ italic_S ] end_POSTSUPERSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT.

Algorithm 3 Parallelized pairwise sampler for the ICP

Appendix C Additional comparisons of CP/ICP

When we know the true conditional laws qY|A(.)q_{Y|A}(.)italic_q start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT ( . ) (conditional density Y𝑌Yitalic_Y given A𝐴Aitalic_A) and qA|Y(.)q_{A|Y}(.)italic_q start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT ( . ) (conditional density A𝐴Aitalic_A given Y𝑌Yitalic_Y), both CP and ICP show provide accurate conditional permutation copies. However, both densities are estimated in practice, and the estimated densities are denoted as qˇY|A(.)\check{q}_{Y|A}(.)overroman_ˇ start_ARG italic_q end_ARG start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT ( . ) and qˇA|Y(.)\check{q}_{A|Y}(.)overroman_ˇ start_ARG italic_q end_ARG start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT ( . ) respectively. The density estimation quality will depend on both the density estimation algorithm and the data distribution. While a deep dive into this aspect, especially from the theoretical aspects, is beyond the scope, we provide some additional heuristic insights to assist our understanding of the potential gain of ICP over CP.

When ICP might improve over CP?

According to proof argument of Theorem 4 in Berrett et al. (2020), let 𝐀πmsubscript𝐀subscript𝜋𝑚{\mathbf{A}}_{\pi_{m}}bold_A start_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT end_POSTSUBSCRIPT be some permuted copies of A𝐴Aitalic_A according to the estimated conditional law qˇA|Y()subscriptˇ𝑞conditional𝐴𝑌\check{q}_{A|Y}()overroman_ˇ start_ARG italic_q end_ARG start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT ( ), an upper bound of exchangeability violation for 𝐀𝐀{\mathbf{A}}bold_A and 𝐀πsubscript𝐀𝜋{\mathbf{A}}_{\pi}bold_A start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT is related to the total variation between the estimated density qˇA|Y(.)\check{q}_{A|Y}(.)overroman_ˇ start_ARG italic_q end_ARG start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT ( . ) and qA|Y(.)q_{A|Y}(.)italic_q start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT ( . ) (Theorem 4 in Berrett et al. (2020)):

dTV{((𝐘,𝐀),(𝐘,𝐀π))|𝐘),((𝐘,𝐀ˇ),(𝐘,𝐀π))|𝐘)}\displaystyle d_{TV}\{(({\mathbf{Y}},{\mathbf{A}}),({\mathbf{Y}},{\mathbf{A}}_% {\pi}))|{\mathbf{Y}}),(({\mathbf{Y}},\check{{\mathbf{A}}}),({\mathbf{Y}},{% \mathbf{A}}_{\pi}))|{\mathbf{Y}})\}italic_d start_POSTSUBSCRIPT italic_T italic_V end_POSTSUBSCRIPT { ( ( bold_Y , bold_A ) , ( bold_Y , bold_A start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT ) ) | bold_Y ) , ( ( bold_Y , overroman_ˇ start_ARG bold_A end_ARG ) , ( bold_Y , bold_A start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT ) ) | bold_Y ) }
\displaystyle\leq dTV(i=1nqˇA|Y(.|yi),i=1nqA|Y(.|yi))(b1)i=1ndTV(qˇA|Y(.|yi),qA|Y(.|yi)),\displaystyle d_{TV}(\prod_{i=1}^{n}\check{q}_{A|Y}(.|y_{i}),\prod_{i=1}^{n}q_% {A|Y}(.|y_{i}))\overset{(b_{1})}{\leq}\sum_{i=1}^{n}d_{TV}(\check{q}_{A|Y}(.|y% _{i}),q_{A|Y}(.|y_{i})),italic_d start_POSTSUBSCRIPT italic_T italic_V end_POSTSUBSCRIPT ( ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT overroman_ˇ start_ARG italic_q end_ARG start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT ( . | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , ∏ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_q start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT ( . | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) start_OVERACCENT ( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_OVERACCENT start_ARG ≤ end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_T italic_V end_POSTSUBSCRIPT ( overroman_ˇ start_ARG italic_q end_ARG start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT ( . | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , italic_q start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT ( . | italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) , (13)

where step (b1)subscript𝑏1(b_{1})( italic_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) is from Lemma (B.8) from ghosal2017fundamentals. We adapt the proof arguments of Theorem 4 in Berrett et al. (2020) to the ICP procedure.

Specifically, let 𝐘πsubscript𝐘𝜋{\mathbf{Y}}_{\pi}bold_Y start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT be the conditional permutation of 𝐘𝐘{\mathbf{Y}}bold_Y according to qˇY|A(.)\check{q}_{Y|A}(.)overroman_ˇ start_ARG italic_q end_ARG start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT ( . ) and 𝐘ˇˇ𝐘\check{{\mathbf{Y}}}overroman_ˇ start_ARG bold_Y end_ARG be a new copy sampled according to qˇY|A(.)\check{q}_{Y|A}(.)overroman_ˇ start_ARG italic_q end_ARG start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT ( . ). We will have

dTV{((𝐘,𝐀),(𝐘π,𝐀)|𝐀)}i=1ndTV(qˇY|A(.|Ai),qY|A(.|Ai)).\displaystyle d_{TV}\{(({\mathbf{Y}},{\mathbf{A}}),({\mathbf{Y}}_{\pi},{% \mathbf{A}})|{\mathbf{A}})\}\leq\sum_{i=1}^{n}d_{TV}(\check{q}_{Y|A}(.|A_{i}),% q_{Y|A}(.|A_{i})).italic_d start_POSTSUBSCRIPT italic_T italic_V end_POSTSUBSCRIPT { ( ( bold_Y , bold_A ) , ( bold_Y start_POSTSUBSCRIPT italic_π end_POSTSUBSCRIPT , bold_A ) | bold_A ) } ≤ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_d start_POSTSUBSCRIPT italic_T italic_V end_POSTSUBSCRIPT ( overroman_ˇ start_ARG italic_q end_ARG start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT ( . | italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) , italic_q start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT ( . | italic_A start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) . (14)

There is one issue before we can compare the two CP and ICP upper bounds for exchangeability violations: the two bounds consider different variables and conditioning events. Notice that we care only about the distributional level comparisons, hence, we can apply permutation π1superscript𝜋1\pi^{-1}italic_π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT to (𝐘,𝐀)𝐘𝐀({\mathbf{Y}},{\mathbf{A}})( bold_Y , bold_A ) and (𝐘,𝐀π1)𝐘subscript𝐀superscript𝜋1({\mathbf{Y}},{\mathbf{A}}_{\pi^{-1}})( bold_Y , bold_A start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ). The resulting (𝐘π1,𝐀π1)subscript𝐘superscript𝜋1subscript𝐀superscript𝜋1({\mathbf{Y}}_{\pi^{-1}},{\mathbf{A}}_{\pi^{-1}})( bold_Y start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT , bold_A start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) is equivalent to (𝐘,𝐀)𝐘𝐀({\mathbf{Y}},{\mathbf{A}})( bold_Y , bold_A ) and the resulting (𝐘,𝐀π1)𝐘subscript𝐀superscript𝜋1({\mathbf{Y}},{\mathbf{A}}_{\pi^{-1}})( bold_Y , bold_A start_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ) is exactly the ICP conditionally permuted version. Next we can remove the conditioning event by marginalizing out 𝐘𝐘{\mathbf{Y}}bold_Y and 𝐀𝐀{\mathbf{A}}bold_A in (C) and (14) respectively. Hence, we obtain upper bounds for violation of exchangeability using CP and ICP permutation copies, which is smaller for ICP if qˇY|A(.)\check{q}_{Y|A}(.)overroman_ˇ start_ARG italic_q end_ARG start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT ( . ) is more accurate on average:

𝔼A[dTV(qˇY|A(.|A),qY|A(.|A))]<𝔼Y[dTV(qˇA|Y(.|Y),qA|Y(.|Y))].\mathbb{E}_{A}\left[d_{TV}(\check{q}_{Y|A}(.|A),q_{Y|A}(.|A))\right]<\mathbb{E% }_{Y}\left[d_{TV}(\check{q}_{A|Y}(.|Y),q_{A|Y}(.|Y))\right].blackboard_E start_POSTSUBSCRIPT italic_A end_POSTSUBSCRIPT [ italic_d start_POSTSUBSCRIPT italic_T italic_V end_POSTSUBSCRIPT ( overroman_ˇ start_ARG italic_q end_ARG start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT ( . | italic_A ) , italic_q start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT ( . | italic_A ) ) ] < blackboard_E start_POSTSUBSCRIPT italic_Y end_POSTSUBSCRIPT [ italic_d start_POSTSUBSCRIPT italic_T italic_V end_POSTSUBSCRIPT ( overroman_ˇ start_ARG italic_q end_ARG start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT ( . | italic_Y ) , italic_q start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT ( . | italic_Y ) ) ] .

ICP achieved higher quality empirically

To illustrate that ICP can provide resampling distribution closer to that of the oracle conditional permutation compared to CP, both utilizing off-the-shelf tools for density estimation with varying dimensions, we consider the following examples:

(1)let A=(U1,,UK0,UK0+1,,UK0+K)×Θ12𝐴subscript𝑈1subscript𝑈subscript𝐾0subscript𝑈subscript𝐾01subscript𝑈subscript𝐾0𝐾superscriptΘ12A=(U_{1},\ldots,U_{K_{0}},U_{K_{0}+1},\ldots,U_{K_{0}+K})\times\Theta^{\frac{1% }{2}}italic_A = ( italic_U start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , … , italic_U start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_U start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + 1 end_POSTSUBSCRIPT , … , italic_U start_POSTSUBSCRIPT italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + italic_K end_POSTSUBSCRIPT ) × roman_Θ start_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG 2 end_ARG end_POSTSUPERSCRIPT. Here Ujsubscript𝑈𝑗U_{j}italic_U start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT be independently generated from either the standard normal 𝒩(0,1)𝒩01\mathcal{N}(0,1)caligraphic_N ( 0 , 1 ) or a mixed Gamma distribution 12Γ(1,1)+12Γ(1,10)12Γ1112Γ110\frac{1}{2}\Gamma(1,1)+\frac{1}{2}\Gamma(1,10)divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_Γ ( 1 , 1 ) + divide start_ARG 1 end_ARG start_ARG 2 end_ARG roman_Γ ( 1 , 10 ); ΘΘ\Thetaroman_Θ is a randomly generated covariance matrix with eigenvalues equally spaced on [1,5]15[1,5][ 1 , 5 ].

(2) let YN(.5k=1K0Aj,1.5)similar-to𝑌𝑁.5superscriptsubscript𝑘1subscript𝐾0subscript𝐴𝑗1.5Y\sim N(\sqrt{.5}\sum_{k=1}^{K_{0}}A_{j},1.5)italic_Y ∼ italic_N ( square-root start_ARG .5 end_ARG ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUPERSCRIPT italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT , 1.5 ). That is, Y𝑌Yitalic_Y only influenced by first K0subscript𝐾0K_{0}italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT columns of Ajsubscript𝐴𝑗A_{j}italic_A start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT, with the next K𝐾Kitalic_K columns of A𝐴Aitalic_A be noise.

(3) We estimate qY|Asubscript𝑞conditional𝑌𝐴q_{Y|A}italic_q start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT/ qA|Ysubscript𝑞conditional𝐴𝑌q_{A|Y}italic_q start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT using (1) lasso regression/graphical lasso, where we estimate the linear dependence of Y𝑌Yitalic_Y on A𝐴Aitalic_A and variance empirically for qY|Asubscript𝑞conditional𝑌𝐴q_{Y|A}italic_q start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT and estimate qA|Ysubscript𝑞conditional𝐴𝑌q_{A|Y}italic_q start_POSTSUBSCRIPT italic_A | italic_Y end_POSTSUBSCRIPT assuming joint normality of (Y,A)𝑌𝐴(Y,A)( italic_Y , italic_A ). For K0=1subscript𝐾01K_{0}=1italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 1 and K=0𝐾0K=0italic_K = 0, OLS was used for both estimations, and (2) MAF, which was default in our paper.

We set K0{1,5}subscript𝐾015K_{0}\in\{1,5\}italic_K start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ { 1 , 5 }, K{0,5,10,20}𝐾051020K\in\{0,5,10,20\}italic_K ∈ { 0 , 5 , 10 , 20 }, and the sample size for density estimation and evaluating the conditional permutation distribution to both be 200. We are interested in the total variation difference between permutations using ICP and CP using the estimated densities to that using the true density, which is explicitly known in this example up to a normalization constant.

Due to the large permutation space, the calculation of the actual total variation distance is difficult. To circumvent this challenge, we restrict the permutation space to swap** actions: we consider the TV distance (log1010\log 10roman_log 10 transformed) restricted to permutations π𝜋\piitalic_π that swaps i𝑖iitalic_i and j𝑗jitalic_j for ij,i,j=1,,nformulae-sequence𝑖𝑗𝑖𝑗1𝑛i\neq j,i,j=1,\ldots,nitalic_i ≠ italic_j , italic_i , italic_j = 1 , … , italic_n and the original order, and compare ICP and CP to the oracle conditional permutations on such n(n+1)2𝑛𝑛12\frac{n(n+1)}{2}divide start_ARG italic_n ( italic_n + 1 ) end_ARG start_ARG 2 end_ARG permutations only.

Figure 6 and Figure 7 show results using (1) MAF and (2) cross-validated lasso regression or graphical lasso, respectively (repeated 20 times for each setting). We see that the TV distances between ICP and the oracle are smaller than the corresponding ones for CP using both density estimation approaches. MAF is a default density estimation approach for general purposes. By design, lasso regression/OLS is favored over MAF for estimating qY|Asubscript𝑞conditional𝑌𝐴q_{Y|A}italic_q start_POSTSUBSCRIPT italic_Y | italic_A end_POSTSUBSCRIPT in this particular example. There may be better density estimation choices in other applications, but overall, estimating Y|Aconditional𝑌𝐴Y|Aitalic_Y | italic_A can be simpler and allows us to utilize existing tools, e.g., those designed for supervised learning.

Refer to caption
Figure 6: Total variation distances between CP/ICP and the true conditional permutations on the restricted swap** operation space when empirical density estimation is obtained via MAF, and data generated according to the normal distribution (upper panel) and mixed Gamma distribution (lower panel)
Refer to caption
Figure 7: Total variation distances between CP/ICP and the true conditional permutations on the restricted swap** operation space when empirical density estimation is obtained via lasso (ICP) and graphical lasso (CP), and data generated according to the normal distribution (upper panel) and mixed Gamma distribution (lower panel)

Appendix D Experiments

In both simulation studies and real-data experiments, we implement the algorithms with the hyperparameters chosen by the tuning procedure as in Romano et al. (2020). In practice, we tune the hyperparameters only once using 10-fold cross-validation on the entire data set and then treat the chosen set as fixed for the rest of the experiments. Then we compare the performance metrics of the different algorithms on 100 data splits that are different than the ones used to tune the parameters. This same tuning scheme is used for all methods, ensuring that the comparisons are meaningful.

D.1 Experiments on synthetic datasets

For all the models evaluated (FairICP, Oracle, FDL), we set the hyperparameters as follows:

  • We set f𝑓fitalic_f as a linear model and use the Adam optimizer with a mini-batch size in {16, 32, 64}, learning rate in {1e-4, 1e-3, 1e-2}, and the number of epochs in {20, 40, 60, 80, 100, 120, 140, 160, 180, 200}. The discriminator is implemented as a four-layer neural network with a hidden layer of size 64 and ReLU non-linearities. We use the Adam optimizer, with a fixed learning rate of 1e-4.

D.1.1 Low sensitive attribute dependence for Sim1

We report the results with A-dependence w=0.6𝑤0.6w=0.6italic_w = 0.6 here:

Refer to caption
Figure 8: Prediction loss and metrics of fairness in simulation over 100 independent runs for Sim1 under low A𝐴Aitalic_A-dependence. Conditional dependence measure ρ2^^superscript𝜌2\hat{\rho^{2}}over^ start_ARG italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG and statistical power {p-value<0.05}𝑝-value0.05{\mathbb{P}}\{p\text{-value}<0.05\}blackboard_P { italic_p -value < 0.05 } are shown in column 1 and 2 resp. From top to bottom shows the results on different choices of dimension of A𝐴Aitalic_A. The X-axis represents the metrics of fairness and the Y-axis is the prediction loss. Each graph shows the proposed method, FDL and oracle model with different hyperparameters μ𝜇\muitalic_μ.

D.1.2 Low sensitive attribute dependence for Sim2

We report the results with A-dependence w=0.6𝑤0.6w=0.6italic_w = 0.6 here:

Refer to caption
Figure 9: Prediction loss and metrics of fairness in simulation over 100 independent runs for Sim2 under low A𝐴Aitalic_A-dependence. Conditional dependence measure ρ2^^superscript𝜌2\hat{\rho^{2}}over^ start_ARG italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG and statistical power {p-value<0.05}𝑝-value0.05{\mathbb{P}}\{p\text{-value}<0.05\}blackboard_P { italic_p -value < 0.05 } are shown in column 1 and 2 resp. From top to bottom shows the results on different choices of the dimension of noisy attributes Asuperscript𝐴A^{\prime}italic_A start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. The X-axis represents the metrics of fairness and the Y-axis is the prediction loss. Each graph shows the proposed method, FDL and oracle model with different hyperparameters μ𝜇\muitalic_μ.

D.2 Real-data experiments

D.2.1 Regression

For FairICP and FDL, the hyperparameters used for linear model and neural network are as follows:

  • Linear: we set f𝑓fitalic_f as a linear model and use the Adam optimizer with a mini-batch size in {16, 32, 64}, learning rate in {1e-4, 1e-3, 1e-2}, and the number of epochs in {20, 40, 60, 80, 100}. The discriminator is implemented as a four-layer neural network with a hidden layer of size 64 and ReLU non-linearities. We use the Adam optimizer, with a fixed learning rate of 1e-4. The penalty parameter μ𝜇\muitalic_μ is set as {0,0.3,0.5,0.7,0.8,0.9}00.30.50.70.80.9\{0,0.3,0.5,0.7,0.8,0.9\}{ 0 , 0.3 , 0.5 , 0.7 , 0.8 , 0.9 }.

  • Neural network: we set f𝑓fitalic_f as a two-layer neural network with a 64-dimensional hidden layer and ReLU activation function. The hyperparameters are the same as the linear ones.

For HGR, the hyperparameters used for the linear model and neural network are as follows:

  • Linear: we set f𝑓fitalic_f as a linear model and use the Adam optimizer with a mini-batch size in {16, 32, 64}, learning rate in {1e-4, 1e-3, 1e-2}, and the number of epochs in {20, 40, 60, 80, 100}. The penalty parameter λ𝜆\lambdaitalic_λ is set as {0,0.25,0.5,0.75,1}00.250.50.751\{0,0.25,0.5,0.75,1\}{ 0 , 0.25 , 0.5 , 0.75 , 1 }.

  • Neural network: we set f𝑓fitalic_f as a two-layer neural network with a 64-dimensional hidden layer and ReLU activation function. The hyperparameters are the same as the linear ones.

We report the results with f𝑓fitalic_f as a linear model here, which is similar to NN version:

Refer to caption
Figure 10: Prediction loss and violation of equalized odds (ρ2^^superscript𝜌2\hat{\rho^{2}}over^ start_ARG italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG and statistical power {p-value<0.05}𝑝-value0.05{\mathbb{P}}\{p\text{-value}<0.05\}blackboard_P { italic_p -value < 0.05 } by ρ2^^superscript𝜌2\hat{\rho^{2}}over^ start_ARG italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG) obtained by 3 different training methods in Crimes data over 100 random splits using linear models. The mean value and one standard deviation of loss are reported in the bar graph. Each graph shows the results of using different A𝐴Aitalic_A. The Pareto front for each algorithm is obtained by varying the hyper-parameter controlling the level of fairness.

D.2.2 Classification

For FairICP, the hyperparameters used for linear model and neural network are as follows:

  • Linear: we set f𝑓fitalic_f as a linear model and use the Adam optimizer with a mini-batch size in {64, 128, 256}, learning rate in {1e-4, 1e-3, 1e-2}, and the number of epochs in {50, 100, 150, 200, 250, 300}. The discriminator is implemented as a four-layer neural network with a hidden layer of size 64 and ReLU non-linearities. We use the Adam optimizer, with a fixed learning rate in {1e-4, 1e-3}. The penalty parameter μ𝜇\muitalic_μ is set as {0,0.3,0.5,0.7,0.8,0.9}00.30.50.70.80.9\{0,0.3,0.5,0.7,0.8,0.9\}{ 0 , 0.3 , 0.5 , 0.7 , 0.8 , 0.9 }.

  • Neural network: we set f𝑓fitalic_f as a two-layer neural network with a 64-dimensional hidden layer and ReLU activation function. The hyperparameters are the same as the linear ones.

For HGR, the hyperparameters used for the linear model and neural network are as follows:

  • Linear: we set f𝑓fitalic_f as a linear model and use the Adam optimizer with a mini-batch size in {64, 128, 256}, learning rate in {1e-4, 1e-3, 1e-2}, and the number of epochs in {20, 40, 60, 80, 100}. The penalty parameter λ𝜆\lambdaitalic_λ is set as {0,0.075,0.125,0.25,0.5,1}00.0750.1250.250.51\{0,0.075,0.125,0.25,0.5,1\}{ 0 , 0.075 , 0.125 , 0.25 , 0.5 , 1 }.

  • Neural network: we set f𝑓fitalic_f as a two-layer neural network with a 64-dimensional hidden layer and ReLU activation function. The hyperparameters are the same as the linear ones.

We report the results with f𝑓fitalic_f as a linear model here:

Refer to caption
Figure 11: Prediction loss and violation of equalized odds (DEO (8), ρ2^^superscript𝜌2\hat{\rho^{2}}over^ start_ARG italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG and statistical power {p-value<0.05}𝑝-value0.05{\mathbb{P}}\{p\text{-value}<0.05\}blackboard_P { italic_p -value < 0.05 } by ρ2^^superscript𝜌2\hat{\rho^{2}}over^ start_ARG italic_ρ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG) obtained by 3 different training methods in COMPAS data over 100 random splits using linear models. The Pareto front for each algorithm is obtained by varying the hyper-parameter controlling the level of fairness.