Trustworthy Actionable Perturbations

Jesse Friedbaum    Sudarshan Adiga    Ravi Tandon
Abstract

Counterfactuals, or modified inputs that lead to a different outcome, are an important tool for understanding the logic used by machine learning classifiers and how to change an undesirable classification. Even if a counterfactual changes a classifier’s decision, however, it may not affect the true underlying class probabilities, i.e. the counterfactual may act like an adversarial attack and “fool” the classifier. We propose a new framework for creating modified inputs that change the true underlying probabilities in a beneficial way which we call Trustworthy Actionable Perturbations (TAP). This includes a novel verification procedure to ensure that TAP change the true class probabilities instead of acting adversarially. Our framework also includes new cost, reward, and goal definitions that are better suited to effectuating change in the real world. We present PAC-learnability results for our verification procedure and theoretically analyze our new method for measuring reward. We also develop a methodology for creating TAP and compare our results to those achieved by previous counterfactual methods.

Machine Learning, ICML

1 Introduction

As machine learning (ML) classifiers have experienced widespread adoption in applications that have an out-sized impact on individuals’ lives (such as credit lending (Leo et al., 2019), college admissions (Martinez Neda et al., 2021) and healthcare (Sauer et al., 2022)), the need to understand classifiers’ decision making and how to avoid undesirable classifications has become increasingly important. One of the most important tools for filling this need is the counterfactual: a counterfactual for a given input and classifier is a similar input that results in a different classification. Suppose a classifier is designed to determine whether a loan application represents a good or bad credit risk. If the classifier determines a loan to be a bad credit risk, a counterfactual would be a modified loan application that is classified as a good credit risk, e.g. the individual in a loan application is a bad credit risk, but an otherwise identical applicant who is 5555 years younger with a $500500500500 higher monthly income would be a good credit risk. (Wachter et al., 2017) first suggested the use of Counterfactuals Explanations (CE) to help understand classifiers’ decisions making. Subsequent works explored the use of counterfactuals to help individuals change an undesirable classification (Ustun et al., 2019; Karimi et al., 2021; Poyiadzi et al., 2020). Returning to the example of an individual turned down for a loan, this type of counterfactual would not suggest an individual decrease their age (clearly impossible), but rather make practical changes such as pay off all credit card debt and request a 10%percent1010\%10 % smaller loan. These counterfactuals came to be known as Actionable Counterfactuals (AC) or Algorithmic Recourses (AR). Although, these counterfactuals change a classifier’s decision, it can not be assumed they will have the same affect on the real world (Freiesleben, 2022), e.g. a change that causes a classifier to determine someone is a good credit risk may not increase the person’s odds of paying off the loan in reality. (König et al., 2023) point out that a counterfactual could change a classifiers decision without changing the real world if the modifications are not causally linked to the output. For example, having a mailing address in an affluent neighborhood may correlate to higher odds of paying off a loan and changing the address could affect a classifiers decision, but there is no causal link. Accordingly, telling an applicant to change their mailing address to a P.O. box in a wealthy neighborhood would not improve their chances of paying off a loan. (König et al., 2023) proposed a framework to ensure modifications are causally linked to the output which they called Improvement-Focused Causal Recourse (ICR).

Refer to caption
Figure 1: a) Overview of the framework for creating Trustworthy Actionable Perturbations (TAP). b) Comparison of objectives and features of TAP and Counterfactual Explanations (CE) (Wachter et al., 2017), Actionable Counterfactuals/Algorithmic Recourse (AC/AR) (Ustun et al., 2019; Karimi et al., 2021; Poyiadzi et al., 2020), Improvement-Focused Causal Recourse (ICR) (König et al., 2023).

In this paper, we focus on tackling new challenges for this problem, which have not been addressed in prior work. Trustworthy Actionable Perturbations (TAP) focus on three novel improvements for affecting real world outcomes.

Trustworthiness Against Adversarial Examples: (Szegedy et al., 2013) showed that ML classifiers are brittle and small modifications to an input can cause misclassifications in otherwise accurate classifiers. Among the various definitions of adversarial behavior, we use the definition: modifications to a data point are adversarial if they cause a classifier to be far less accurate on modified data points than the original data (Diochnos et al., 2018). These modified inputs are called adversarial examples and the algorithms that create them are called adversarial attacks. The algorithms that create counterfactuals are very similar to adversarial attacks and (Pawelczyk et al., 2022) showed they produce similar outputs, which leads to the troubling conclusion that many counterfactuals may act as adversarial examples and change the classifier decision (individual is now offered a loan) without changing the true underlying class probabilities (individual is still likely to default on the loan). The adversarial vulnerability of classifiers is separate from causality concerns. For this reason, we introduce a novel two step procedure where (1) we generate a suggested change and (2) we use an independently trained verifier to certify that this change is not acting as an adversarial example. We present a methodology for training this verifier and provide analytical results showing that it is PAC-learnable (Theorem 2.3).

Flexible Goal Definition: AC/AR focus solely on the final classification of a data point, but this may not always be sufficient or feasible. For instance a valid AC/AR may lead to a 51%percent5151\%51 % likelihood of paying off a loan, but this may not satisfy the individual. Additionally, a change that improves a cancer patient’s odds of survival form 15%percent1515\%15 % to 40%percent4040\%40 % would not constitute a valid AC/AR even though it would be very useful. Accordingly, our framework defines goals through a target set of acceptable outcomes that can be tailored to an individual’s needs, and we demonstrate how these target sets can be designed. We note that ICR (König et al., 2023) and one of the AC/AR methods (Dandl et al., 2020) propose the use of goals other than final classification, but our formulation is more flexible and applies to multi-class scenarios. We develop a principled measure of reward by defining a distance to the target set using statistical divergence. We analyze this distance theoretically in Theorem 2.2.

Real World Efficiency: Previous works on CE and AC/AR reduce the amount of changes made by a counterfactual by minimizing a weighted \ellroman_ℓ-norm of the changes (with the exception of (Ramakrishnan et al., 2020)), however these norms often fail to represent the real world cost of making changes. Alternatively, we minimize a cost measure built specifically to reflect real world costs of a change. By using this measure of real world cost and principled measure of rewards (distance to target set), TAP can suggest more efficient advice. We present a few examples of the utility of producing efficient advice through TAP: (a) Suggest the course of treatment that would double a patient’s odds of survival while requiring the least staff hours. (b) List the skills an job applicant could acquire in the least amount of time that would lead to a high probability of receiving an interview. (c) Find the cheapest modifications to a product that would bring it into a more premium price range and enhance marketability. We illustrate through experiments on real world data how the use of application specific cost functions leads to more efficient advice.

Figure 1(a) illustrates our framework of Trustworthy Actionable Perturbations (TAP) for using feasible actions, true cost and an individualized goal to create an efficient change, which is then verified to ensure that the change affects the true class probabilities instead of acting adversarially. Our goal to change the true class probabilities (real world outcomes) differs from previous CE and AC/AR works that seek only to change the classifier’s decision. We share our goal with ICR which is focused on ensuring that only features causally related to the class are modified. Our framework, on the other hand, focuses on ensuring that the changes do not exploit the brittleness of ML classifiers and cause misclassifications. This can occur regardless of whether modified features are causally related to the output. Figure 1(b) provides a summary of the objectives and features of various existing approaches alongside TAP.

2 Trustworthy Actionable Perturbations

Problem Setting and Goals: Suppose there is an unknown distribution (𝐱,C)𝒟similar-to𝐱𝐶𝒟(\mathbf{x},C)\sim\mathcal{D}( bold_x , italic_C ) ∼ caligraphic_D. Here 𝐱𝐱\mathbf{x}bold_x is a member of the input space 𝒳d𝒳superscript𝑑\mathcal{X}\subset\mathbb{R}^{d}caligraphic_X ⊂ blackboard_R start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT and C{1,,k}𝐶1𝑘C\in\{1,...,k\}italic_C ∈ { 1 , … , italic_k } is the class of 𝐱𝐱\mathbf{x}bold_x. We define the true class probabilities 𝐲(𝐱):=((C=1|𝐱),,(C=k|𝐱))assign𝐲𝐱𝐶conditional1𝐱𝐶conditional𝑘𝐱\mathbf{y}(\mathbf{x}):=\left(\mathbb{P}(C=1|\mathbf{x}),...,\mathbb{P}(C=k|% \mathbf{x})\right)bold_y ( bold_x ) := ( blackboard_P ( italic_C = 1 | bold_x ) , … , blackboard_P ( italic_C = italic_k | bold_x ) ). We let 𝒴𝒴\mathcal{Y}caligraphic_Y denote the k𝑘kitalic_k-simplex and use a classifier M:𝒳𝒴:𝑀𝒳𝒴M:\mathcal{X}\rightarrow\mathcal{Y}italic_M : caligraphic_X → caligraphic_Y to estimate 𝐲(𝐱)𝐲𝐱\mathbf{y}(\mathbf{x})bold_y ( bold_x ). Our goal in designing TAP is as follows: Given an input 𝐱𝐱\mathbf{x}bold_x with an undesirable classification M(𝐱)𝑀𝐱M(\mathbf{x})italic_M ( bold_x ), find the most efficient real world actions to create a modified input 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG such that the corresponding true probabilities 𝐲(𝐱~)𝐲~𝐱\mathbf{y}(\tilde{\mathbf{x}})bold_y ( over~ start_ARG bold_x end_ARG ) (and not just M(𝐱~)𝑀~𝐱M(\mathbf{\tilde{x}})italic_M ( over~ start_ARG bold_x end_ARG )) are more desirable.

Real World Actionability: TAP should only suggest modifications that are feasible in the real world (e.g., not decreasing an individual’s age). To this end, we introduce: the Actionable Set 𝒜(𝐱)𝒜𝐱\mathcal{A}(\mathbf{x})caligraphic_A ( bold_x ) of a data point 𝐱𝐱\mathbf{x}bold_x as the set of all perturbations of 𝐱𝐱\mathbf{x}bold_x that are feasible in the real world. For example, if 𝒳𝒳\mathcal{X}caligraphic_X represents loan applications with x1subscript𝑥1x_{1}italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT the age of the applicant, x2subscript𝑥2x_{2}italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT the applicant’s credit score, x3subscript𝑥3x_{3}italic_x start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT the amount of credit and x4subscript𝑥4x_{4}italic_x start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT the loan duration, the actionable set could be 𝒜(𝐱)={𝐱~𝒳|x~1=x1,x~2=x2}𝒜𝐱conditional-set~𝐱𝒳formulae-sequencesubscript~𝑥1subscript𝑥1subscript~𝑥2subscript𝑥2\mathcal{A}(\mathbf{x})=\{\tilde{\mathbf{x}}\in\mathcal{X}|\tilde{x}_{1}=x_{1}% ,\tilde{x}_{2}=x_{2}\}caligraphic_A ( bold_x ) = { over~ start_ARG bold_x end_ARG ∈ caligraphic_X | over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , over~ start_ARG italic_x end_ARG start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = italic_x start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT }, i.e. the applicant can change the size and duration of the loan they request, but not their age or credit score. Previous works have examined the complexities of actionability including causal relations between inputs, e.g. one can’t increase their education without an increase in age (Mahajan et al., 2019; Karimi et al., 2020b). All of these considerations, as well as a limiting changes to attributes which are believed to have a causal link to the output, can be incorporated into 𝒜(𝐱)𝒜𝐱\mathcal{A}(\mathbf{x})caligraphic_A ( bold_x ).

Efficiency: The definition of the most efficient change depends on the context of the problem and could involve a well defined value such as “cost in dollars” or more nebulous value such as “amount of effort required.” We characterize this value with a function d𝒳:𝒳×𝒳:subscript𝑑𝒳𝒳𝒳d_{\mathcal{X}}:\mathcal{X}\times\mathcal{X}\rightarrow\mathbb{R}italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT : caligraphic_X × caligraphic_X → blackboard_R, where d𝒳(𝐱,𝐱~)subscript𝑑𝒳𝐱~𝐱d_{\mathcal{X}}(\mathbf{x},\tilde{\mathbf{x}})italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ( bold_x , over~ start_ARG bold_x end_ARG ) is the cost of changing 𝐱𝐱\mathbf{x}bold_x to 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG. For example, if 𝐱𝐱\mathbf{x}bold_x and 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG represent resumes, then d𝒳(𝐱,𝐱~)subscript𝑑𝒳𝐱~𝐱d_{\mathcal{X}}(\mathbf{x},\tilde{\mathbf{x}})italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ( bold_x , over~ start_ARG bold_x end_ARG ) could represent the time it would take to acquire the attributes listed on resume 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG, but not on 𝐱𝐱\mathbf{x}bold_x. We note this function may not be a true distance measure. For example, if d𝒳subscript𝑑𝒳d_{\mathcal{X}}italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT represents the difference in financial cost between two courses of medical treatment, then d𝒳(𝐱,𝐱~)subscript𝑑𝒳𝐱~𝐱d_{\mathcal{X}}(\mathbf{x},\tilde{\mathbf{x}})italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ( bold_x , over~ start_ARG bold_x end_ARG ) should be negative when 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG is more affordable than 𝐱𝐱\mathbf{x}bold_x.

Desirability: We now define what we mean by a desirable outcome—the goal of a TAP. The Target Set T𝑇Titalic_T is the set of all elements of 𝒴𝒴\mathcal{Y}caligraphic_Y that would be an acceptable result of a TAP. If we wish to belong to a desirable class w𝑤witalic_w with probability no less than p𝑝pitalic_p, the target set would have the form T={𝐳𝒴|zwp}.𝑇conditional-set𝐳𝒴subscript𝑧𝑤𝑝T=\{{\mathbf{z}}\in\mathcal{Y}|z_{w}\geq p\}.italic_T = { bold_z ∈ caligraphic_Y | italic_z start_POSTSUBSCRIPT italic_w end_POSTSUBSCRIPT ≥ italic_p } . If our goal is rather to avoid some undesirable class u𝑢uitalic_u, T𝑇Titalic_T could be of the form T={𝐳𝒴|zuq}𝑇conditional-set𝐳𝒴subscript𝑧𝑢𝑞T=\{{\mathbf{z}}\in\mathcal{Y}|z_{u}\leq q\}italic_T = { bold_z ∈ caligraphic_Y | italic_z start_POSTSUBSCRIPT italic_u end_POSTSUBSCRIPT ≤ italic_q } for a fixed q𝑞qitalic_q. More generally, if we wish to belong to a set of desirable classes 𝒲𝒲\mathcal{W}caligraphic_W with probability at least p𝑝pitalic_p and we wish to belong to a set of undesirable classes 𝒰𝒰\mathcal{U}caligraphic_U with probability no greater than q𝑞qitalic_q, we would use

T={𝐳𝒴|i𝒲zip,i𝒰ziq}.𝑇conditional-set𝐳𝒴formulae-sequencesubscript𝑖𝒲subscript𝑧𝑖𝑝subscript𝑖𝒰subscript𝑧𝑖𝑞T=\left\{\mathbf{z}\in\mathcal{Y}\bigg{|}\sum_{i\in\mathcal{W}}z_{i}\geq p,% \sum_{i\in\mathcal{U}}z_{i}\leq q\right\}.italic_T = { bold_z ∈ caligraphic_Y | ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_W end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ italic_p , ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_U end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_q } . (1)

We must quantify how close an TAP comes to achieving its goal in a principled manner. To do this, we first choose a measure of statistical distance D(𝐲||𝐳)D({\mathbf{y}}||\mathbf{z})italic_D ( bold_y | | bold_z ) (we use Kullback-Leibler (KL) Divergence). We then denote d𝒴(𝐲,T)subscript𝑑𝒴𝐲𝑇d_{\mathcal{Y}}({\mathbf{y}},T)italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( bold_y , italic_T ) as the distance of 𝐲𝐲{\mathbf{y}}bold_y to the target set T𝑇Titalic_T, defined as follows:

d𝒴(𝐲,T):=inf𝐳TD(𝐲||𝐳).\displaystyle d_{\mathcal{Y}}({\mathbf{y}},T):=\inf_{\mathbf{z}\in T}D({% \mathbf{y}}||\mathbf{z}).italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( bold_y , italic_T ) := roman_inf start_POSTSUBSCRIPT bold_z ∈ italic_T end_POSTSUBSCRIPT italic_D ( bold_y | | bold_z ) . (2)

We may now formally define Trustworthy Actionable Perturbations. Let ϵitalic-ϵ\epsilonitalic_ϵ represent budget —the amount of work we are willing to perform, and δ𝛿\deltaitalic_δ represent tolerance —how close the final result is to our target set T𝑇Titalic_T.

Definition 2.1 ((ϵ,δ)italic-ϵ𝛿(\epsilon,\delta)( italic_ϵ , italic_δ )-Trustworthy Actionable Perturbation).

𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG is an (ϵ,δ)italic-ϵ𝛿(\epsilon,\delta)( italic_ϵ , italic_δ )-trustworthy actionable perturbation for 𝐱𝐱\mathbf{x}bold_x and a target set T𝑇Titalic_T if

  1. 1.

    d𝒳(𝐱,𝐱~)ϵsubscript𝑑𝒳𝐱~𝐱italic-ϵd_{\mathcal{X}}(\mathbf{x},\tilde{\mathbf{x}})\leq\epsilonitalic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ( bold_x , over~ start_ARG bold_x end_ARG ) ≤ italic_ϵ

  2. 2.

    d𝒴(𝐲(𝐱~),T)δsubscript𝑑𝒴𝐲~𝐱𝑇𝛿d_{\mathcal{Y}}({\mathbf{y}}(\tilde{\mathbf{x}}),T)\leq\deltaitalic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( bold_y ( over~ start_ARG bold_x end_ARG ) , italic_T ) ≤ italic_δ

  3. 3.

    𝐱~𝒜(𝐱).~𝐱𝒜𝐱\tilde{\mathbf{x}}\in\mathcal{A}(\mathbf{x}).over~ start_ARG bold_x end_ARG ∈ caligraphic_A ( bold_x ) .

In order to verify the second condition we must be able to calculate d𝒴subscript𝑑𝒴d_{\mathcal{Y}}italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT. Fortunately, the optimization problem in (2) has a differentiable closed form solution when D(𝐲||𝐳)D({\mathbf{y}}||\mathbf{z})italic_D ( bold_y | | bold_z ) is an f𝑓fitalic_f-divergence: a broad class of measures including KL-divergence, total-variation (TV) and other commonly used statistical distances. An f𝑓fitalic_f-divergence is defined as D(𝐲||𝐳)=i=1k𝐳if(𝐲i𝐳i)D({\mathbf{y}}||\mathbf{z})=\sum_{i=1}^{k}\mathbf{z}_{i}f\left(\frac{{\mathbf{% y}}_{i}}{\mathbf{z}_{i}}\right)italic_D ( bold_y | | bold_z ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT bold_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_f ( divide start_ARG bold_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG bold_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ), where f𝑓fitalic_f is a convex function satisfying f(1)=0𝑓10f(1)=0italic_f ( 1 ) = 0 and f(0)=limx0+f(x)𝑓0subscript𝑥superscript0𝑓𝑥f(0)=\lim_{x\to 0^{+}}f(x)italic_f ( 0 ) = roman_lim start_POSTSUBSCRIPT italic_x → 0 start_POSTSUPERSCRIPT + end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_f ( italic_x ) (Polyanskiy & Wu, 2024). Theorem 2.2 describes the solution to (2).

Refer to caption
Figure 2: Illustration of the partition on 𝒴𝒴\mathcal{Y}caligraphic_Y used to calculate the distance from the target set T𝑇Titalic_T in Theorem 2.2. Although the cost function takes different functional form(s) in the four regions, it is continuously differentiable in the entire space.
Theorem 2.2.

If D(𝐲||𝐳)D({\mathbf{y}}||\mathbf{z})italic_D ( bold_y | | bold_z ) is an f𝑓fitalic_f-Divergence with twice differentiable f𝑓fitalic_f and T𝑇Titalic_T is of form (1), then

d𝒴(𝐲,T)={0if 𝐲Apf(𝒮𝒲p)+(1p)f(1𝒮𝒲1p)if 𝐲Bqf(𝒮𝒰q)+(1q)f(1𝒮𝒰1q)if 𝐲Cpf(𝒮𝒲p)+qf(𝒮𝒰q)if 𝐲D+(1pq)f(1𝒮𝒲𝒮𝒰1pq)subscript𝑑𝒴𝐲𝑇cases0if 𝐲𝐴𝑝𝑓subscript𝒮𝒲𝑝1𝑝𝑓1subscript𝒮𝒲1𝑝if 𝐲𝐵𝑞𝑓subscript𝒮𝒰𝑞1𝑞𝑓1subscript𝒮𝒰1𝑞if 𝐲𝐶𝑝𝑓subscript𝒮𝒲𝑝𝑞𝑓subscript𝒮𝒰𝑞if 𝐲𝐷1𝑝𝑞𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞otherwised_{\mathcal{Y}}({\mathbf{y}},T)=\begin{cases}0&\text{if }{\mathbf{y}}\in A\\ pf\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+(1-p)f\left(\frac{1-% \mathcal{S}_{\mathcal{W}}}{1-p}\right)&\text{if }{\mathbf{y}}\in B\\ qf\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)+(1-q)f\left(\frac{1-% \mathcal{S}_{\mathcal{U}}}{1-q}\right)&\text{if }{\mathbf{y}}\in C\\ pf\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+qf\left(\frac{\mathcal{S}_{% \mathcal{U}}}{q}\right)&\text{if }{\mathbf{y}}\in D\\ +(1-p-q)f\left(\frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-% p-q}\right)\end{cases}italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( bold_y , italic_T ) = { start_ROW start_CELL 0 end_CELL start_CELL if bold_y ∈ italic_A end_CELL end_ROW start_ROW start_CELL italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + ( 1 - italic_p ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL start_CELL if bold_y ∈ italic_B end_CELL end_ROW start_ROW start_CELL italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ( 1 - italic_q ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL start_CELL if bold_y ∈ italic_C end_CELL end_ROW start_ROW start_CELL italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) end_CELL start_CELL if bold_y ∈ italic_D end_CELL end_ROW start_ROW start_CELL + ( 1 - italic_p - italic_q ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) end_CELL start_CELL end_CELL end_ROW (3)

where 𝒮𝒲=i𝒲yisubscript𝒮𝒲subscript𝑖𝒲subscript𝑦𝑖\mathcal{S}_{\mathcal{W}}=\sum_{i\in\mathcal{W}}{y}_{i}caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_W end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, 𝒮𝒰=i𝒰yisubscript𝒮𝒰subscript𝑖𝒰subscript𝑦𝑖\mathcal{S}_{\mathcal{U}}=\sum_{i\in\mathcal{U}}{y}_{i}caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_U end_POSTSUBSCRIPT italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and the sets A,B,C𝐴𝐵𝐶A,B,Citalic_A , italic_B , italic_C and D𝐷Ditalic_D are a partition of 𝒴𝒴\mathcal{Y}caligraphic_Y defined and visualized in Figure 2. Furthermore, d𝒴(𝐲,T)subscript𝑑𝒴𝐲𝑇d_{\mathcal{Y}}({\mathbf{y}},T)italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( bold_y , italic_T ) is continuously differentiable in 𝐲𝐲\mathbf{y}bold_y over its entire domain.

Equation (3) in Theorem 2.2 is easily calculable and continuously differentiable despite its piece-wise form, which will be significant when creating TAP (see Section 3). The proof of Theorem 2.2 involves showing that optimization problem (2) is convex and finding a value 𝐳𝐳\mathbf{z}bold_z that satisfies the KKT conditions. This proof and additional results about d𝒴subscript𝑑𝒴d_{\mathcal{Y}}italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT are found in the Appendix A.1.

Real-world Verifiability of TAP: Note that TAP are defined with respect to the true class probabilities 𝐲(𝐱~)𝐲~𝐱{\mathbf{y}}(\tilde{\mathbf{x}})bold_y ( over~ start_ARG bold_x end_ARG ) because TAP should have an effect in the real world. Notwithstanding, 𝐲(𝐱~)𝐲~𝐱{\mathbf{y}}(\tilde{\mathbf{x}})bold_y ( over~ start_ARG bold_x end_ARG ) is unknown and we must use M(𝐱~)𝑀~𝐱M(\tilde{\mathbf{x}})italic_M ( over~ start_ARG bold_x end_ARG ) to create our TAP (more details in Section 3), which introduces the risk that we might produce an 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG that has the desired effect on M(𝐱~)𝑀~𝐱M(\tilde{\mathbf{x}})italic_M ( over~ start_ARG bold_x end_ARG ) but not 𝐲(𝐱~)𝐲~𝐱{\mathbf{y}}(\tilde{\mathbf{x}})bold_y ( over~ start_ARG bold_x end_ARG ) (like an adversarial example). This is of particular concern because TAP and all other counterfactuals are created by solving an optimization problem of the form

𝐱~=argmin𝐱~loss(𝐱~,w)+λdist(𝐱~,𝐱),~𝐱subscript~𝐱𝑙𝑜𝑠𝑠~𝐱𝑤𝜆𝑑𝑖𝑠𝑡~𝐱𝐱\tilde{\mathbf{x}}=\arg\min_{\tilde{\mathbf{x}}}\hskip 2.84544ptloss(\tilde{% \mathbf{x}},w)+\lambda\cdot dist(\tilde{\mathbf{x}},\mathbf{x}),over~ start_ARG bold_x end_ARG = roman_arg roman_min start_POSTSUBSCRIPT over~ start_ARG bold_x end_ARG end_POSTSUBSCRIPT italic_l italic_o italic_s italic_s ( over~ start_ARG bold_x end_ARG , italic_w ) + italic_λ ⋅ italic_d italic_i italic_s italic_t ( over~ start_ARG bold_x end_ARG , bold_x ) , (4)

which is precisely how most adversarial examples are created (Pawelczyk et al., 2022). When counterfactuals were first introduced to ML (Wachter et al., 2017), the concern that counterfactuals would act as adversarial examples was dismissed because the adversarial attacks of the time 1) modified many more features than counterfactuals and 2) were targeted almost exclusively at image data whereas counterfactuals were proposed for use on tabular data. Since that time, Gourdeau et al. (2021); Su et al. (2019) demonstrated that adversarial attacks can be effective when changing a very small number of features, and several works (Ballet et al., 2019; Mathov et al., 2020; Cartella et al., 2021; Kumar et al., 2021) have shown that adversarial examples exist on tabular data sets. This implies that verification is necessary to achieve results that can be trusted to change the true class probabilities.

Verifying 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG may appear similar to detecting adversarial examples, which has been the object of significant research (Yang et al., 2020; Roth et al., 2019; Fidel et al., 2020; Carlini & Wagner, 2017a) with no satisfactory solution. Fortunately, we have an important advantage over detecting adversarial examples: we know the original data point 𝐱𝐱\mathbf{x}bold_x and exactly how it was modified, i.e., 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG. To capitalize on this knowledge, we propose a novel verification procedure using a classifier V:𝒳×𝒳[0,1]:𝑉𝒳𝒳01V:\mathcal{X}\times\mathcal{X}\rightarrow[0,1]italic_V : caligraphic_X × caligraphic_X → [ 0 , 1 ] which compares two inputs simultaneously and predicts the probability of the inputs belonging to the same class: the value of V(𝐱,𝐱~)𝑉𝐱~𝐱V(\mathbf{x},\tilde{\mathbf{x}})italic_V ( bold_x , over~ start_ARG bold_x end_ARG ) estimates (C=C~|𝐱,𝐱~)𝐶conditional~𝐶𝐱~𝐱\mathbb{P}(C=\tilde{C}|\mathbf{x},\tilde{\mathbf{x}})blackboard_P ( italic_C = over~ start_ARG italic_C end_ARG | bold_x , over~ start_ARG bold_x end_ARG ). Because V𝑉Vitalic_V has a different classification task from M𝑀Mitalic_M, attacks targeted against M𝑀Mitalic_M should not be effective against V𝑉Vitalic_V, and we can use the discrepancy between estimates of M𝑀Mitalic_M and V𝑉Vitalic_V to determine if an 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG acts adversarially on M𝑀Mitalic_M. In order to make this comparison, we use the fact that (C=C~|𝐱,𝐱~)𝐶conditional~𝐶𝐱~𝐱\mathbb{P}(C=\tilde{C}|\mathbf{x},\tilde{\mathbf{x}})blackboard_P ( italic_C = over~ start_ARG italic_C end_ARG | bold_x , over~ start_ARG bold_x end_ARG ) can also be estimated using M𝑀Mitalic_M by calculating i=1kMi(𝐱)Mi(𝐱~)superscriptsubscript𝑖1𝑘subscript𝑀𝑖𝐱subscript𝑀𝑖~𝐱\sum_{i=1}^{k}M_{i}(\mathbf{x})M_{i}(\tilde{\mathbf{x}})∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_x ) italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( over~ start_ARG bold_x end_ARG ). If 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG acts adversarially we would expect i=1kMi(𝐱)Mi(𝐱~)superscriptsubscript𝑖1𝑘subscript𝑀𝑖𝐱subscript𝑀𝑖~𝐱\sum_{i=1}^{k}M_{i}(\mathbf{x})M_{i}(\tilde{\mathbf{x}})∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_x ) italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( over~ start_ARG bold_x end_ARG ) to be very small while V(𝐱,𝐱~)𝑉𝐱~𝐱V(\mathbf{x},\tilde{\mathbf{x}})italic_V ( bold_x , over~ start_ARG bold_x end_ARG ) is large. If 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG is not adversarial we would expect similar values from both i=1kMi(𝐱)Mi(𝐱~)superscriptsubscript𝑖1𝑘subscript𝑀𝑖𝐱subscript𝑀𝑖~𝐱\sum_{i=1}^{k}M_{i}(\mathbf{x})M_{i}(\tilde{\mathbf{x}})∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_x ) italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( over~ start_ARG bold_x end_ARG ) and V(𝐱,𝐱~)𝑉𝐱~𝐱V(\mathbf{x},\tilde{\mathbf{x}})italic_V ( bold_x , over~ start_ARG bold_x end_ARG ). Accordingly, we define

Δ(𝐱,𝐱~):=|V(𝐱,𝐱~)i=1kMi(𝐱)Mi(𝐱~)|,assignΔ𝐱~𝐱𝑉𝐱~𝐱superscriptsubscript𝑖1𝑘subscript𝑀𝑖𝐱subscript𝑀𝑖~𝐱\Delta(\mathbf{x},\tilde{\mathbf{x}}):=\left|V(\mathbf{x},\tilde{\mathbf{x}})-% \sum_{i=1}^{k}M_{i}(\mathbf{x})M_{i}(\tilde{\mathbf{x}})\right|,roman_Δ ( bold_x , over~ start_ARG bold_x end_ARG ) := | italic_V ( bold_x , over~ start_ARG bold_x end_ARG ) - ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_x ) italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( over~ start_ARG bold_x end_ARG ) | , (5)

and verify that an 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG is trustworthy only if Δ(𝐱,𝐱~)<γΔ𝐱~𝐱𝛾\Delta(\mathbf{x},\tilde{\mathbf{x}})<\gammaroman_Δ ( bold_x , over~ start_ARG bold_x end_ARG ) < italic_γ. In Section 3, we describe how we selected the threshold γ𝛾\gammaitalic_γ.

Training a Verifier & PAC Learnability: In order to create V𝑉Vitalic_V, we must have data on which it can be trained. We build this difference training data by creating all possible pairs of elements from our original training data and labeling the pairs by whether they belong to the same class (1111 for the same class, 00 for different classes). If the original training data is {𝐱(i),C(i)}i=1nsuperscriptsubscriptsuperscript𝐱𝑖superscript𝐶𝑖𝑖1𝑛\{\mathbf{x}^{(i)},C^{(i)}\}_{i=1}^{n}{ bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , italic_C start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT, the difference training data is {(𝐱(i),𝐱(j)),z(i,j)}1i,jnsubscriptsuperscript𝐱𝑖superscript𝐱𝑗superscript𝑧𝑖𝑗formulae-sequence1𝑖𝑗𝑛\{(\mathbf{x}^{(i)},\mathbf{x}^{(j)}),z^{(i,j)}\}_{1\leq i,j\leq n}{ ( bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , bold_x start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ) , italic_z start_POSTSUPERSCRIPT ( italic_i , italic_j ) end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT 1 ≤ italic_i , italic_j ≤ italic_n end_POSTSUBSCRIPT, where z(i,j)=𝟙[C(i)=C(j)]superscript𝑧𝑖𝑗1delimited-[]superscript𝐶𝑖superscript𝐶𝑗z^{(i,j)}=\mathbbm{1}[C^{(i)}=C^{(j)}]italic_z start_POSTSUPERSCRIPT ( italic_i , italic_j ) end_POSTSUPERSCRIPT = blackboard_1 [ italic_C start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT = italic_C start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ]. We use the same architecture for V𝑉Vitalic_V as M𝑀Mitalic_M (only changing the number of inputs and outputs), but differing architectures could also be used. Now that we have a method for training V𝑉Vitalic_V, we show that training in this way leads to a generalizable verifier. To this end, we next present a probably approximately correct (PAC) bound on V𝑉Vitalic_V’s generalization gap which depends on n𝑛nitalic_n (number of training samples), k𝑘kitalic_k (number of classes), and d𝑑ditalic_d (data dimensionality).

Theorem 2.3.

Let R(V)R𝑉\mathrm{R}(V)roman_R ( italic_V ) be the true risk of a verifier V𝑉Vitalic_V over data drawn from 𝒟𝒟\mathcal{D}caligraphic_D and R^𝒮(V)subscript^R𝒮𝑉\hat{\mathrm{R}}_{\mathcal{S}}(V)over^ start_ARG roman_R end_ARG start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( italic_V ) be the empirical risk over a sample 𝒮𝒮\mathcal{S}caligraphic_S of labelled point pairs drawn i.i.d. from 𝒟𝒟\mathcal{D}caligraphic_D. Both risks are defined using a bounded loss function (,)Bsubscript𝐵\ell(\cdot,\cdot)\leq B_{\ell}roman_ℓ ( ⋅ , ⋅ ) ≤ italic_B start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT. Also let V𝑉Vitalic_V be selected from a function class 𝒱𝒱\mathcal{V}caligraphic_V. Then for any δ(0,1)𝛿01\delta\in(0,1)italic_δ ∈ ( 0 , 1 ), with probability (1δ)1𝛿(1-\delta)( 1 - italic_δ ), the following bound on the generalization gap holds.

supV𝒱|R(V)R^S(V)|=𝒪((kn2k2n)1/d)subscriptsupremum𝑉𝒱R𝑉subscript^R𝑆𝑉𝒪superscript𝑘superscript𝑛2superscript𝑘2𝑛1𝑑\sup_{V\in\mathcal{V}}\left|\mathrm{R}(V)-\hat{\mathrm{R}}_{S}(V)\right|=% \mathcal{O}\left(\left(\frac{k}{\sqrt{n^{2}-k^{2}n}}\right)^{1/d}\right)roman_sup start_POSTSUBSCRIPT italic_V ∈ caligraphic_V end_POSTSUBSCRIPT | roman_R ( italic_V ) - over^ start_ARG roman_R end_ARG start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ( italic_V ) | = caligraphic_O ( ( divide start_ARG italic_k end_ARG start_ARG square-root start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n end_ARG end_ARG ) start_POSTSUPERSCRIPT 1 / italic_d end_POSTSUPERSCRIPT ) (6)

Here the terms with explicit dependence on δ𝛿\deltaitalic_δ have been suppressed because they are dominated by the term in (6). The precise generalization bound is presented in (43) in the Appendix.

To prove Theorem 2.3, we construct a definition of risk that fits this new learning scenario (i.e., learning if two samples are from the same class or not, as opposed to conventional classification). This risk takes into account that we expect large imbalances between the number of point pairs from the same class and from different classes. In order to obtain the bounds on the generalization gap, we expand this risk into a sum of terms which can be bounded with existing Rademacher complexity PAC-methods. Finally, we bound the growth of these Rademacher complexity terms as a function of n,k𝑛𝑘n,kitalic_n , italic_k and d𝑑ditalic_d to arrive at (6). The complete proof, including detailed definitions of R(V)R𝑉\mathrm{R}(V)roman_R ( italic_V ) and R^S(V)subscript^R𝑆𝑉\hat{\mathrm{R}}_{S}(V)over^ start_ARG roman_R end_ARG start_POSTSUBSCRIPT italic_S end_POSTSUBSCRIPT ( italic_V ) as well as additional discussion, is presented in the Appendix A.2.

Remark 2.4.

The bound in Theorem 2.3 is small as long as nk2much-greater-than𝑛superscript𝑘2n\gg k^{2}italic_n ≫ italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT and n𝑛nitalic_n is exponentially larger than d𝑑ditalic_d. The relation between n𝑛nitalic_n and k𝑘kitalic_k is crucial because it implies that the denominator n2k2nn2ksuperscript𝑛2superscript𝑘2𝑛superscript𝑛2much-greater-than𝑘\sqrt{n^{2}-k^{2}n}\approx n^{2}\gg ksquare-root start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n end_ARG ≈ italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ≫ italic_k. This differs from typical PAC bounds where the primary requirement is n𝑛nitalic_n be exponentially larger than d𝑑ditalic_d (Theorem 4.3 in (Gottlieb et al., 2016)) and have mild dependence on the number of classes k𝑘kitalic_k. The key implication of this result is: when using a verifier as described in this paper, as the data sets used increase in number of classes k𝑘kitalic_k, it is essential that the amount of training data increases at a rate of k𝑘\sqrt{k}square-root start_ARG italic_k end_ARG.

3 Generating TAP

Two Step Creation Method: We now present and discuss the general optimization framework for creating TAP. Ideally, we would like to solve the following optimization problem: argmin𝐱~𝒜(𝐱)d𝒴(𝐲~,T)+λd𝒳(𝐱~,𝐱)subscript~𝐱𝒜𝐱subscript𝑑𝒴~𝐲𝑇𝜆subscript𝑑𝒳~𝐱𝐱\arg\min_{\tilde{\mathbf{x}}\in\mathcal{A}(\mathbf{x})}d_{\mathcal{Y}}(\tilde{% \mathbf{y}},T)+\lambda d_{\mathcal{X}}(\tilde{\mathbf{x}},\mathbf{x})roman_arg roman_min start_POSTSUBSCRIPT over~ start_ARG bold_x end_ARG ∈ caligraphic_A ( bold_x ) end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) + italic_λ italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ( over~ start_ARG bold_x end_ARG , bold_x ), where the scalar parameter λ𝜆\lambdaitalic_λ balances the effort(ϵitalic-ϵ\epsilonitalic_ϵ)-reward(δ𝛿\deltaitalic_δ) trade-off. Solving this optimization would be guaranteed to create an effective TAP; unfortunately 𝐲(𝐱~)𝐲~𝐱{\mathbf{y}}(\tilde{\mathbf{x}})bold_y ( over~ start_ARG bold_x end_ARG ) is unknown and we cannot solve this problem directly. Instead propose the following two-step procedure where: in Step 1111, we treat M(𝐱~)𝑀~𝐱M(\tilde{\mathbf{x}})italic_M ( over~ start_ARG bold_x end_ARG ) as a surrogate for 𝐲(𝐱~)𝐲~𝐱{\mathbf{y}}(\tilde{\mathbf{x}})bold_y ( over~ start_ARG bold_x end_ARG ), and in Step 2222, we use a verification algorithm to ensure that 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG is not just fooling the classifier.

Step 1 :argmin𝐱~𝒜(𝐱)d𝒴(M(𝐱~),T)+λd𝒳(𝐱~,𝐱)Step 1 :subscript~𝐱𝒜𝐱subscript𝑑𝒴𝑀~𝐱𝑇𝜆subscript𝑑𝒳~𝐱𝐱\displaystyle\textbf{Step 1 :}\arg\min_{\tilde{\mathbf{x}}\in\mathcal{A}(% \mathbf{x})}d_{\mathcal{Y}}(M(\tilde{\mathbf{x}}),T)+\lambda d_{\mathcal{X}}(% \tilde{\mathbf{x}},\mathbf{x})Step 1 : roman_arg roman_min start_POSTSUBSCRIPT over~ start_ARG bold_x end_ARG ∈ caligraphic_A ( bold_x ) end_POSTSUBSCRIPT italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( italic_M ( over~ start_ARG bold_x end_ARG ) , italic_T ) + italic_λ italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ( over~ start_ARG bold_x end_ARG , bold_x ) (7)
\displaystyle\downarrow
Step 2: Verify M(𝐱~)𝐲(𝐱~) i.e. Δ(𝐱,𝐱~)γStep 2: Verify 𝑀~𝐱𝐲~𝐱 i.e. Δ𝐱~𝐱𝛾\displaystyle\textbf{Step 2:}\text{ Verify }M(\tilde{\mathbf{x}})\approx{% \mathbf{y}}(\tilde{\mathbf{x}})\text{ i.e. }\Delta(\mathbf{x},\tilde{\mathbf{x% }})\leq\gammabold_Step bold_2: Verify italic_M ( over~ start_ARG bold_x end_ARG ) ≈ bold_y ( over~ start_ARG bold_x end_ARG ) i.e. roman_Δ ( bold_x , over~ start_ARG bold_x end_ARG ) ≤ italic_γ (8)
\displaystyle\downarrow
TAP

Solving Step 1: We solve (7) using gradient descent which requires us to use differentiable models M𝑀Mitalic_M and formulate d𝒳subscript𝑑𝒳d_{\mathcal{X}}italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT in a differentiable manner (d𝒴subscript𝑑𝒴d_{\mathcal{Y}}italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT is differentiable according to Theorem 2.2). We modify our gradient descent to address two challenges. (1) We must insure that our solution is actionable: 𝐱~𝒜(𝐱)~𝐱𝒜𝐱\tilde{\mathbf{x}}\in\mathcal{A}(\mathbf{x})over~ start_ARG bold_x end_ARG ∈ caligraphic_A ( bold_x ). (2) Our solution 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG must follow any formatting rules associated with the data set (for instance, Boolean variables must be either 0 or 1, categorical features must respect one-hot encoding, etc.). A perturbation that follows these formatting rules is called coherent. To solve these two difficulties, we first assume 𝒜(𝐱)={𝐱~|li𝐱~iui,1id}𝒜𝐱conditional-set~𝐱formulae-sequencesubscript𝑙𝑖subscript~𝐱𝑖subscript𝑢𝑖1𝑖𝑑\mathcal{A}(\mathbf{x})=\{\tilde{\mathbf{x}}|l_{i}\leq\tilde{\mathbf{x}}_{i}% \leq u_{i},1\leq i\leq d\}caligraphic_A ( bold_x ) = { over~ start_ARG bold_x end_ARG | italic_l start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ over~ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , 1 ≤ italic_i ≤ italic_d } for some set of lower bounds {li}i=1dsuperscriptsubscriptsubscript𝑙𝑖𝑖1𝑑\{l_{i}\}_{i=1}^{d}{ italic_l start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT and upper bounds {ui}i=1dsuperscriptsubscriptsubscript𝑢𝑖𝑖1𝑑\{u_{i}\}_{i=1}^{d}{ italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_d end_POSTSUPERSCRIPT. An attribute is immutable if li=uisubscript𝑙𝑖subscript𝑢𝑖l_{i}=u_{i}italic_l start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT. We ensure actionability by setting all elements of the gradient corresponding to immutable features to zero and adding a large penalty b(𝐱~)𝑏~𝐱b(\tilde{\mathbf{x}})italic_b ( over~ start_ARG bold_x end_ARG ) term to the objective function which punishes points for leaving the actionable set. To ensure coherence, we project the result of our gradient descent onto the coherent space by using a function cond:m𝒳:𝑐𝑜𝑛𝑑superscript𝑚𝒳cond:\mathbb{R}^{m}\rightarrow\mathcal{X}italic_c italic_o italic_n italic_d : blackboard_R start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT → caligraphic_X which performs the appropriate value rounding to make an input coherent. We found it useful to introduce a second penalty term p(𝐱~)𝑝~𝐱p(\tilde{\mathbf{x}})italic_p ( over~ start_ARG bold_x end_ARG ) which requires that any one-hot encoded features sum to 1111. This ensures our answers never stray too far from a coherent point and improves robustness. Details on b𝑏bitalic_b, p𝑝pitalic_p and cond𝑐𝑜𝑛𝑑conditalic_c italic_o italic_n italic_d are found in the Appendix A.3.3. In practice we also found it useful to replace regular gradient descent with the ADAM algorithm (Kingma & Ba, 2014).

Algorithm 1 Generating TAP
  Input: Classifiers M𝑀Mitalic_M & V𝑉Vitalic_V, point 𝐱𝐱\mathbf{x}bold_x, target family T𝑇Titalic_T, learning rate α𝛼\alphaitalic_α, verification-cut off γ𝛾\gammaitalic_γ
  𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG \leftarrow 𝐱𝐱\mathbf{x}bold_x
  while 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG not converged do
     𝐠𝐱~(d𝒴(M(𝐱~),T)+λd𝒳(𝐱~,𝐱)+b(𝐱~)+p(𝐱~))𝐠subscript~𝐱subscript𝑑𝒴𝑀~𝐱𝑇𝜆subscript𝑑𝒳~𝐱𝐱𝑏~𝐱𝑝~𝐱\mathbf{g}\leftarrow\nabla_{\tilde{\mathbf{x}}}\left(d_{\mathcal{Y}}(M(\tilde{% \mathbf{x}}),T)+\lambda d_{\mathcal{X}}(\tilde{\mathbf{x}},\mathbf{x})+b(% \tilde{\mathbf{x}})+p(\tilde{\mathbf{x}})\right)bold_g ← ∇ start_POSTSUBSCRIPT over~ start_ARG bold_x end_ARG end_POSTSUBSCRIPT ( italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( italic_M ( over~ start_ARG bold_x end_ARG ) , italic_T ) + italic_λ italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ( over~ start_ARG bold_x end_ARG , bold_x ) + italic_b ( over~ start_ARG bold_x end_ARG ) + italic_p ( over~ start_ARG bold_x end_ARG ) )
     𝐠j0subscript𝐠𝑗0\mathbf{g}_{j}\leftarrow 0bold_g start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ← 0 for all immutable features j𝑗jitalic_j.
     𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG \leftarrow 𝐱~αg~𝐱𝛼𝑔\tilde{\mathbf{x}}-\alpha gover~ start_ARG bold_x end_ARG - italic_α italic_g
  end while
  𝐱~=cond(𝐱~)~𝐱𝑐𝑜𝑛𝑑~𝐱\tilde{\mathbf{x}}=cond(\tilde{\mathbf{x}})over~ start_ARG bold_x end_ARG = italic_c italic_o italic_n italic_d ( over~ start_ARG bold_x end_ARG ) (project onto the coherent space)
  ϵ,δ=d𝒳(𝐱~,𝐱),d𝒴(M(𝐱~),T)formulae-sequenceitalic-ϵ𝛿subscript𝑑𝒳~𝐱𝐱subscript𝑑𝒴𝑀~𝐱𝑇\epsilon,\delta=d_{\mathcal{X}}(\tilde{\mathbf{x}},\mathbf{x}),d_{\mathcal{Y}}% (M(\tilde{\mathbf{x}}),T)italic_ϵ , italic_δ = italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT ( over~ start_ARG bold_x end_ARG , bold_x ) , italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( italic_M ( over~ start_ARG bold_x end_ARG ) , italic_T )
  if ϵitalic-ϵ\epsilonitalic_ϵ and δ𝛿\deltaitalic_δ requirements NOT met then
     Adjust λ𝜆\lambdaitalic_λ (see text for explanation)
     Return to while loop
  end if
  if |V(𝐱,𝐱~)i=1kMi(𝐱)Mi(𝐱~)|γ𝑉𝐱~𝐱superscriptsubscript𝑖1𝑘subscript𝑀𝑖𝐱subscript𝑀𝑖~𝐱𝛾\left|V(\mathbf{x},\tilde{\mathbf{x}})-\sum_{i=1}^{k}M_{i}(\mathbf{x})M_{i}(% \tilde{\mathbf{x}})\right|\geq\gamma| italic_V ( bold_x , over~ start_ARG bold_x end_ARG ) - ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_x ) italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( over~ start_ARG bold_x end_ARG ) | ≥ italic_γ then
     Adjust problem parameters (see text for explanation)
     Restart algorithm
  end if
  return 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG

Refer to caption
Figure 3: Table containing details on data sets used for testing.

Solving Step 2: In Section 2, we discussed the necessity of verification and suggested that an TAP can be trusted if Δ(𝐱,𝐱~)=|V(𝐱,𝐱~)i=1kMi(𝐱)Mi(𝐱~)|Δ𝐱~𝐱𝑉𝐱~𝐱superscriptsubscript𝑖1𝑘subscript𝑀𝑖𝐱subscript𝑀𝑖~𝐱\Delta(\mathbf{x},\tilde{\mathbf{x}})=\left|V(\mathbf{x},\tilde{\mathbf{x}})-% \sum_{i=1}^{k}M_{i}(\mathbf{x})M_{i}(\tilde{\mathbf{x}})\right|roman_Δ ( bold_x , over~ start_ARG bold_x end_ARG ) = | italic_V ( bold_x , over~ start_ARG bold_x end_ARG ) - ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( bold_x ) italic_M start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ( over~ start_ARG bold_x end_ARG ) | is smaller than a threshold γ𝛾\gammaitalic_γ. Our process for choosing γ𝛾\gammaitalic_γ starts with deciding on an acceptable risk of eliminating a truly effective TAP (we use 10%). To find the γ𝛾\gammaitalic_γ corresponding to this risk, we calculate Δ(𝐱(i),𝐱(j))Δsuperscript𝐱𝑖superscript𝐱𝑗\Delta(\mathbf{x}^{(i)},\mathbf{x}^{(j)})roman_Δ ( bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , bold_x start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ) for a sufficiently large number of pairs (𝐱(i),𝐱(j))superscript𝐱𝑖superscript𝐱𝑗(\mathbf{x}^{(i)},\mathbf{x}^{(j)})( bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , bold_x start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ) from the testing data such that C(i)C(j)superscript𝐶𝑖superscript𝐶𝑗C^{(i)}\neq C^{(j)}italic_C start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ≠ italic_C start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT. Finally, we pick γ𝛾\gammaitalic_γ such that only the desired percentage of Δ(𝐱(i),𝐱(j))Δsuperscript𝐱𝑖superscript𝐱𝑗\Delta(\mathbf{x}^{(i)},\mathbf{x}^{(j)})roman_Δ ( bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , bold_x start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ) values (e.g. 10%) are above γ𝛾\gammaitalic_γ. The verification procedure is now reduced to eliminating any 𝐱~~𝐱\tilde{\mathbf{x}}over~ start_ARG bold_x end_ARG that results in Δ(𝐱,𝐱~)>γΔ𝐱~𝐱𝛾\Delta(\mathbf{x},\tilde{\mathbf{x}})>\gammaroman_Δ ( bold_x , over~ start_ARG bold_x end_ARG ) > italic_γ.

Adjusting for Suitability and Verifiability: When creating TAP we will often have a particular budget (ϵitalic-ϵ\epsilonitalic_ϵ) or tolerance (δ𝛿\deltaitalic_δ) bound we need to satisfy. To find a suitable TAP we repeat Step 1 of our process adjusting λ𝜆\lambdaitalic_λ until the desired budget or tolerance is met: increasing λ𝜆\lambdaitalic_λ to decrease ϵitalic-ϵ\epsilonitalic_ϵ and decreasing λ𝜆\lambdaitalic_λ to decrease δ𝛿\deltaitalic_δ. It may also be appropriate to use a variety of λ𝜆\lambdaitalic_λ values and plot the ϵitalic-ϵ\epsilonitalic_ϵ and δ𝛿\deltaitalic_δ values of each resulting TAP (see Figure 4). The user may then select a TAP they see as offering particularly good value. When a TAP fails the verification step, there are a few recourses. (1) Sometimes it is sufficient to decrease λ𝜆\lambdaitalic_λ, putting greater emphasis on reaching the target set. (2) “Shrink” the target set (increase the value of p𝑝pitalic_p and decrease the value of q𝑞qitalic_q) in order to force the algorithm to find more effective changes. (3) Add a random perturbation to 𝐱𝐱\mathbf{x}bold_x in order to move the starting point away from the adversarial example. The entire procedure is described in Algorithm 1.

4 Experimental Results

Data Sets: We compare TAP, counterfactuals and adversarial attacks on four data sets from different fields; data set details are found in Figure 3 and the Appendix A.3.1.

Refer to caption
Figure 4: Cost-Benefit plots of TAP and counterfactuals for an individually from the Law School data set with grades measured in standard deviations from the mean (a) and an individual in the Adult Income data set (b).

Adult Income (Kohavi & Becker, 1996): This data set contains demographic information on Americans labelled by whether they had a high income. The actionable set 𝒜(𝐱)𝒜𝐱\mathcal{A}(\mathbf{x)}caligraphic_A ( bold_x ) allows individuals to increase their education, change jobs and adjust their weekly work hours. The cost function d𝒳subscript𝑑𝒳d_{\mathcal{X}}italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT sums the expected number of years to improve education, a one-year cost to change jobs and the square of the change in hours worked (weighted so an additional 3 hours of work per week is equal to a year spent on education).
Law School Success (Wightman, 1998): This data set contains information on law school students labelled by whether they passed the BAR exam. 𝒜(𝐱)𝒜𝐱\mathcal{A}(\mathbf{x)}caligraphic_A ( bold_x ) allows changes to law school grades (through more studying) and the region where the exam is taken. The cost function d𝒳subscript𝑑𝒳d_{\mathcal{X}}italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT sums the increase in grades and the physical distance travelled to take the BAR. Moving to an adjacent region (Far West to North West) is weighted equal to increasing grades one standard deviation.
Diabetes Prediction (for Disease Control &, CDC): The individuals in this data set are labelled by whether they have diabetes. We define 𝒜(𝐱)𝒜𝐱\mathcal{A}(\mathbf{x})caligraphic_A ( bold_x ) to allow changes in health habits, BMI, education and income. We use a weighted 2-norm for d𝒳subscript𝑑𝒳d_{\mathcal{X}}italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT to represent the relative difficulty of making changes. For example, starting to get regular physical activity is weighted the same as drop** one BMI.
German Credit (Hofmann, 1994): This data set contains loan applications. In 𝒜(𝐱)𝒜𝐱\mathcal{A}(\mathbf{x})caligraphic_A ( bold_x ), we allow for changes to the loan duration and size and funds in the checking and savings accounts. We use d𝒳subscript𝑑𝒳d_{\mathcal{X}}italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT to measure the total difference in Deutsche Marks (DM) over all elements of the application.

Other Methods: We compare our results against counterfactuals created using the original method proposed to create counterfactuals (Wachter et al., 2017) and the diverse counterfactuals (DICE) method in (Mothilal et al., 2020), the most cited methods in the literature. These methods use an psubscript𝑝\ell_{p}roman_ℓ start_POSTSUBSCRIPT italic_p end_POSTSUBSCRIPT norm based cost function that often fails to reflect real world costs (see examples on the next page). We also compare TAP against the Carlini & Wagner (2017b) 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT adversarial attack, one of the most well known and effective adversarial attacks. The counterfactuals belong to the same actionable set as the TAP, but the adversarial examples are not limited to an actionable set and may not be coherent.

Models: Gradient boosted tree algorithms (Friedman, 2001) are considered state of the art architectures for tabular data classification (Shwartz-Ziv & Armon, 2022). Unfortunately, these models are not differentiable and cannot be used with our framework. Instead we use neural networks which we tuned until they provide accuracy on par with gradient boosted tree models on the same data set. Details on our models’ structure, training are given in Appendix A.3.2.

Refer to caption
Figure 5: a) & b) show average success rate for moving individuals within a variety of distances (δ𝛿\deltaitalic_δ) to the target set. The y-axis shows the percentage of individuals within the goal distance, and the x-axis, represents different costs (ϵitalic-ϵ\epsilonitalic_ϵ values). c) Summarizes success values for all data sets. The upper (red) value for each row is the success rate before the verification procedure and the lower (green) value is the success rate after verification with a 10%percent1010\%10 % chance of rejecting valid examples.

Representative Examples of TAP and Trade-off between cost/desirability: We first examine two representative examples of how TAP behave differently than counterfactuals for specific individuals. Figure 4 shows a plot of the ϵitalic-ϵ\epsilonitalic_ϵ/δ𝛿\deltaitalic_δ values of TAP and counterfactuals for one individual in the Law School data set and one individual in the Adult Income data set. We examine the results from the Law School data set: The TAP labelled TAP-1 suggests only a mild (0.20.20.20.2 standard deviation) increase in grades and the relatively short move from the Far West to the Great Lakes region resulting in a small 11%percent1111\%11 % increase in the chance of passing the BAR. On the other hand, TAP-2 suggest a larger increase in grades and a longer move which results in a much larger 34%percent3434\%34 % increase to the odds of success. Finally the counterfactual CF-1 suggest an enormous increase in grades and massive cross country move to achieve 51%percent5151\%51 % increase in the odds of success. Turning our attention to the Adult Income example: TAP-3 suggests a relatively simple increase in education to the masters level resulting in a 20%percent2020\%20 % increase to the odds of a high income. Alternatively, TAP-4 achieves an 71%percent7171\%71 % increase by suggesting far more changes including a professional degree and becoming self-employed. The counterfactual CF-2 does not suggest becoming self-employed and produces a smaller 67%percent6767\%67 % increase in the odds of high income despite also suggesting a professional degree and a drastic 16161616 hour increase in the hours worked per week.

These examples illustrates two trends: 1) TAP offer both low-cost/low-reward (large-δ𝛿\deltaitalic_δ/small-ϵitalic-ϵ\epsilonitalic_ϵ) and high-cost/high-reward options, whereas counterfactual methods (Wachter et al., 2017; Mothilal et al., 2020) only offer high-cost options. This is because TAP are defined by distance to the target set, but counterfactuals are defined as belonging to the desirable class. That rules out any advice that doesn’t result in the desirable class being the most likely class. 2) Counterfactuals are prone to suggesting very high-cost outliers. This has two main causes: (a) The 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT norm used to create the counterfactuals does not accurately represent real world effort. For example this norm considers any move in region to cost the same regardless of actual distance. (b) Because counterfactuals do not use a target set, they are prone to “overshooting” the desired goal. For example CF1𝐶subscript𝐹1CF_{1}italic_C italic_F start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT resulted in a 95%percent9595\%95 % chance of passing the BAR when our goal was only 85%percent8585\%85 %.

Comparison of TAP vs. Other Approaches: We now compare TAP, counterfactuals (Wachter et al., 2017; Mothilal et al., 2020) and CW attacks (Carlini & Wagner, 2017b) over the entire data sets. In Figure 5: Each bar chart refers to a particular data set and desired distance δ𝛿\deltaitalic_δ to the target set T𝑇Titalic_T. Each bar shows the percentage of individuals that a method was able to move inside the goal δ𝛿\deltaitalic_δ at a variety of costs ϵitalic-ϵ\epsilonitalic_ϵ. (Bar charts for all data sets are found in the Appendix A.3.4.) The table summarizes this information for all data sets with the upper (red) value in each cell representing the data before the verification procedure and the lower (green) value the success rate after the verification procedure. Consider the bar chart on the top middle which refers to the German Credit data and a goal of δ=0.5𝛿0.5\delta=0.5italic_δ = 0.5 from the target (the same information as the last three columns of the table). At a ϵ=0italic-ϵ0\epsilon=0italic_ϵ = 0 Deutsche Marks (DM) cost, TAP are able to move 73%percent7373\%73 % of individuals within the goal range by closing empty accounts. Counterfactuals do not match this success until the cost ϵ=7,000DMitalic-ϵ7000DM\epsilon=7,000\text{DM}italic_ϵ = 7 , 000 DM, and CW attacks never achieve more than a 31%percent3131\%31 % success rate. TAP outperform counterfactuals in all of the test scenarios.

Impact and Effectiveness of Verifier: The first important take away from the success rates after verification is that the verifier was 100% effective at eliminating Carlini Wagner adversarial examples (visible in the bottom row of the table in Figure 5 c), implying that the verification method does indeed eliminate inputs that fool the classifier. Importantly, the verification procedure also removes a significant number of TAP and counterfactuals. Consider the second column of Figure 5 c: Out of all TAP generated 14%percent1414\%14 % appeared effective but were eliminated by the verification procedure. Counterfactual methods fared even worse with 20%percent2020\%20 % to 27%percent2727\%27 % of counterfactuals eliminated. This reinforces the necessity of a verification procedure.

Concluding Remarks & Future Work: In this work, we proposed Trustworthy Actionable Perturbations (TAP) which leverage ML classifiers to find efficient actions to achieve real world results. Our proposed framework introduces a novel verification procedure, flexible definition of goals, and principled reward measure for use in generating counterfactuals. We demonstrated their effectiveness when compared to other methods on data sets from multiple fields. Finally we note that our framework is flexible enough to incorporate contributions from previous works on counterfactuals such as individualized cost measures (De Toni et al., 2023), causal relations between inputs (Mahajan et al., 2019; Karimi et al., 2020b), causal relationships to the output (König et al., 2023), and advanced optimization methods (Guidotti et al., 2018; Karimi et al., 2020a).

Impact Statement

As the use of AI and ML expands into critical applications such as healthcare, criminal justice, and hiring, the importance of explaining decisions deemed unfavorable and providing recourse to such users has grown significantly. In this context, our paper introduces a novel contribution aimed at making recourse mechanisms more trustworthy. We present a flexible framework, Trustworthy Actionable Perturbations (TAP), designed to generate cost-effective recourse which can ensure that the recourse being provided to users results in real-world changes. TAP can be useful to both end-users and institutions that suggest the recourse. The technical tools and the analytical results developed in the paper (including a flexible target set, and a novel pair-wise verification procedure) can also find use and lead to new insights for other problems such as cost-sensitive learning and adversarial defense.

Acknowledgements

We thank the anonymous ICML reviewers and the area chairs for their insightful suggestions. This work was supported by NSF grants CAREER 1651492, CCF-2100013, CNS-2209951, CNS-1822071, CNS-2317192, and by the U.S. Department of Energy, Office of Science, Office of Advanced Scientific Computing under Award Number DE-SC-ERKJ422, and NIH Award R01-CA261457-01A1.

References

  • Ballet et al. (2019) Ballet, V., Renard, X., Aigrain, J., Laugel, T., Frossard, P., and Detyniecki, M. Imperceptible adversarial attacks on tabular data. arXiv preprint arXiv:1911.03274, 2019.
  • Bartlett & Mendelson (2002) Bartlett, P. L. and Mendelson, S. Rademacher and Gaussian Complexities: Risk Bounds and Structural Results. Journal of Machine Learning Research, 3(Nov):463–482, 2002.
  • Carlini & Wagner (2017a) Carlini, N. and Wagner, D. Adversarial examples are not easily detected: Bypassing ten detection methods. In Proceedings of the 10th ACM workshop on artificial intelligence and security, pp.  3–14, 2017a.
  • Carlini & Wagner (2017b) Carlini, N. and Wagner, D. Towards evaluating the robustness of neural networks. In 2017 IEEE Symposium on Security and Privacy (SP), pp.  39–57, 2017b.
  • Cartella et al. (2021) Cartella, F., Anunciacao, O., Funabiki, Y., Yamaguchi, D., Akishita, T., and Elshocht, O. Adversarial attacks for tabular data: Application to fraud detection and imbalanced data. arXiv preprint arXiv:2101.08030, 2021.
  • Dandl et al. (2020) Dandl, S., Molnar, C., Binder, M., and Bischl, B. Multi-objective counterfactual explanations. In International Conference on Parallel Problem Solving from Nature, pp.  448–469. Springer, 2020.
  • De Toni et al. (2023) De Toni, G., Viappiani, P., Teso, S., Lepri, B., and Passerini, A. Personalized algorithmic recourse with preference elicitation. Transactions on Machine Learning Research, 2023.
  • Diochnos et al. (2018) Diochnos, D., Mahloujifar, S., and Mahmoody, M. Adversarial risk and robustness: General definitions and implications for the uniform distribution. Advances in Neural Information Processing Systems, 31, 2018.
  • Fidel et al. (2020) Fidel, G., Bitton, R., and Shabtai, A. When explainability meets adversarial learning: Detecting adversarial examples using shap signatures. In 2020 international joint conference on neural networks (IJCNN), pp.  1–8. IEEE, 2020.
  • for Disease Control & (CDC) for Disease Control, C. and (CDC), P. Behavioral risk factor surveillance system survey data (brfss), 2015. URL https://www.cdc.gov/brfss/index.html.
  • Freiesleben (2022) Freiesleben, T. The intriguing relation between counterfactual explanations and adversarial examples. Minds and Machines, 32(1):77–109, 2022.
  • Friedman (2001) Friedman, J. H. Greedy function approximation: a gradient boosting machine. Annals of statistics, pp.  1189–1232, 2001.
  • Gottlieb et al. (2016) Gottlieb, L.-A., Kontorovich, A., and Krauthgamer, R. Adaptive metric dimensionality reduction. Theoretical Computer Science, 620:105–118, 2016.
  • Gourdeau et al. (2021) Gourdeau, P., Kanade, V., Kwiatkowska, M., and Worrell, J. On the hardness of robust classification. Journal of Machine Learning Research, 22(273):1–29, 2021.
  • Guidotti et al. (2018) Guidotti, R., Monreale, A., Ruggieri, S., Pedreschi, D., Turini, F., and Giannotti, F. Local rule-based explanations of black box decision systems. arXiv preprint arXiv:1805.10820, 2018.
  • Hofmann (1994) Hofmann, H. Statlog (German Credit Data). UCI Machine Learning Repository, 1994. DOI: https://doi.org/10.24432/C5NC77.
  • Karimi et al. (2020a) Karimi, A.-H., Barthe, G., Balle, B., and Valera, I. Model-agnostic counterfactual explanations for consequential decisions. In International Conference on Artificial Intelligence and Statistics, pp.  895–905. PMLR, 2020a.
  • Karimi et al. (2020b) Karimi, A.-H., Von Kügelgen, J., Schölkopf, B., and Valera, I. Algorithmic recourse under imperfect causal knowledge: a probabilistic approach. Advances in neural information processing systems, 33:265–277, 2020b.
  • Karimi et al. (2021) Karimi, A.-H., Schölkopf, B., and Valera, I. Algorithmic recourse: from counterfactual explanations to interventions. In Proceedings of the 2021 ACM conference on fairness, accountability, and transparency, pp.  353–362, 2021.
  • Kingma & Ba (2014) Kingma, D. P. and Ba, J. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
  • Kohavi & Becker (1996) Kohavi, R. and Becker, B. Uci adult dataset. UCI machine learning repository, 1996.
  • König et al. (2023) König, G., Freiesleben, T., and Grosse-Wentrup, M. Improvement-focused causal recourse (icr). In Proceedings of the AAAI Conference on Artificial Intelligence, volume 37, pp.  11847–11855, 2023.
  • Kumar et al. (2021) Kumar, N., Vimal, S., Kayathwal, K., and Dhama, G. Evolutionary adversarial attacks on payment systems. In 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA), pp.  813–818. IEEE, 2021.
  • Leo et al. (2019) Leo, M., Sharma, S., and Maddulety, K. Machine learning in banking risk management: A literature review. Risks, 7(1):29, 2019.
  • Mahajan et al. (2019) Mahajan, D., Tan, C., and Sharma, A. Preserving causal constraints in counterfactual explanations for machine learning classifiers. arXiv preprint arXiv:1912.03277, 2019.
  • Martinez Neda et al. (2021) Martinez Neda, B., Zeng, Y., and Gago-Masague, S. Using machine learning in admissions: Reducing human and algorithmic bias in the selection process. In Proceedings of the 52nd ACM Technical Symposium on Computer Science Education, pp.  1323–1323, 2021.
  • Mathov et al. (2020) Mathov, Y., Levy, E., Katzir, Z., Shabtai, A., and Elovici, Y. Not all datasets are born equal: On heterogeneous data and adversarial examples. arXiv preprint arXiv:2010.03180, 2020.
  • Mohri et al. (2018) Mohri, M., Rostamizadeh, A., and Talwalkar, A. Foundations of Machine Learning. MIT press, 2018.
  • Mothilal et al. (2020) Mothilal, R. K., Sharma, A., and Tan, C. Explaining machine learning classifiers through diverse counterfactual explanations. In Proceedings of the 2020 conference on fairness, accountability, and transparency, pp.  607–617, 2020.
  • Naeini et al. (2015) Naeini, M. P., Cooper, G., and Hauskrecht, M. Obtaining well calibrated probabilities using bayesian binning. In Proceedings of the AAAI conference on artificial intelligence, volume 29, 2015.
  • Pawelczyk et al. (2022) Pawelczyk, M., Agarwal, C., Joshi, S., Upadhyay, S., and Lakkaraju, H. Exploring counterfactual explanations through the lens of adversarial examples: A theoretical and empirical analysis. In International Conference on Artificial Intelligence and Statistics, pp.  4574–4594. PMLR, 2022.
  • Polyanskiy & Wu (2024) Polyanskiy, Y. and Wu, Y. Information theory: From coding to learning. 2024.
  • Poyiadzi et al. (2020) Poyiadzi, R., Sokol, K., Santos-Rodriguez, R., De Bie, T., and Flach, P. Face: feasible and actionable counterfactual explanations. In Proceedings of the AAAI/ACM Conference on AI, Ethics, and Society, pp.  344–350, 2020.
  • Ramakrishnan et al. (2020) Ramakrishnan, G., Lee, Y. C., and Albarghouthi, A. Synthesizing action sequences for modifying model decisions. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, pp.  5462–5469, 2020.
  • Roth et al. (2019) Roth, K., Kilcher, Y., and Hofmann, T. The odds are odd: A statistical test for detecting adversarial examples. In International Conference on Machine Learning, pp.  5498–5507. PMLR, 2019.
  • Sauer et al. (2022) Sauer, C. M., Dam, T. A., Celi, L. A., Faltys, M., de la Hoz, M. A., Adhikari, L., Ziesemer, K. A., Girbes, A., Thoral, P. J., and Elbers, P. Systematic review and comparison of publicly available icu data sets—a decision guide for clinicians and data scientists. Critical care medicine, 50(6):e581–e588, 2022.
  • Shwartz-Ziv & Armon (2022) Shwartz-Ziv, R. and Armon, A. Tabular data: Deep learning is not all you need. Information Fusion, 81:84–90, 2022.
  • Su et al. (2019) Su, J., Vargas, D. V., and Sakurai, K. One pixel attack for fooling deep neural networks. IEEE Transactions on Evolutionary Computation, 23(5):828–841, 2019.
  • Szegedy et al. (2013) Szegedy, C., Zaremba, W., Sutskever, I., Bruna, J., Erhan, D., Goodfellow, I., and Fergus, R. Intriguing properties of neural networks. arXiv preprint arXiv:1312.6199, 2013.
  • Ustun et al. (2019) Ustun, B., Spangher, A., and Liu, Y. Actionable recourse in linear classification. In Proceedings of the conference on fairness, accountability, and transparency, pp.  10–19, 2019.
  • Wachter et al. (2017) Wachter, S., Mittelstadt, B. D., and Russell, C. Counterfactual explanations without opening the black box: Automated decisions and the gdpr. Cybersecurity, 2017.
  • Wightman (1998) Wightman, L. F. Lsac national longitudinal bar passage study. lsac research report series. 1998.
  • Yang et al. (2020) Yang, P., Chen, J., Hsieh, C.-J., Wang, J.-L., and Jordan, M. Ml-loo: Detecting adversarial examples with feature attribution. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, pp.  6639–6647, 2020.

Appendix A Appendix

The Appendix is organized as follows:

A.1 Proof of Theorem 2.2 (Analysis of statistical distance d𝒴subscript𝑑𝒴d_{\mathcal{Y}}italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT to the target set T𝑇Titalic_T)
A.2 Proofs of Theorem 2.3 (PAC generalization bounds for Verifier)
A.3 Additional details about the implementation of experiments
A.3.1 Details about data sets and their corresponding cost functions
A.3.2 Details about the models used
A.3.3 Details about the objective function used for optimization
A.3.4 Additional experimental results showing the comparative performance of TAP vs. other methods.

A.1 Proof of Theorem 2.2 (Analysis of statistical distance d𝒴subscript𝑑𝒴d_{\mathcal{Y}}italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT to the target set T𝑇Titalic_T)

Recall that our target sets have the form

T={𝐳𝒴|i𝒲zip,i𝒰ziq},𝑇conditional-set𝐳𝒴formulae-sequencesubscript𝑖𝒲subscript𝑧𝑖𝑝subscript𝑖𝒰subscript𝑧𝑖𝑞T=\left\{\mathbf{z}\in\mathcal{Y}\left|\sum_{i\in\mathcal{W}}z_{i}\geq p\right% .,\sum_{i\in\mathcal{U}}z_{i}\leq q\right\},italic_T = { bold_z ∈ caligraphic_Y | ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_W end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≥ italic_p , ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_U end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ italic_q } ,

where either 𝒲𝒲\mathcal{W}caligraphic_W or 𝒰𝒰\mathcal{U}caligraphic_U could be empty. Also recall

d𝒴(𝐲~,T)subscript𝑑𝒴~𝐲𝑇\displaystyle d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) =min𝐳TDf(𝐲~||𝐳)=min𝐳Ti=1kzif(y~izi).\displaystyle=\min_{\mathbf{z}\in T}D_{f}(\tilde{\mathbf{y}}||\mathbf{z})=\min% _{\mathbf{z}\in T}\sum_{i=1}^{k}{z}_{i}f\left(\frac{\tilde{y}_{i}}{z_{i}}% \right).= roman_min start_POSTSUBSCRIPT bold_z ∈ italic_T end_POSTSUBSCRIPT italic_D start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG | | bold_z ) = roman_min start_POSTSUBSCRIPT bold_z ∈ italic_T end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_f ( divide start_ARG over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ) . (9)

We must prove three facts: 1) d𝒴(𝐲~,T)subscript𝑑𝒴~𝐲𝑇d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) has the closed form found in equation (3), 2) This function is continuous, 3) the derivative of the function is continuous. We begin by proving the closed form equation.

Our proof will be made easier by introducing notation 𝒩=(𝒲𝒰)C𝒩superscript𝒲𝒰𝐶\mathcal{N}=(\mathcal{W}\cup\mathcal{U})^{C}caligraphic_N = ( caligraphic_W ∪ caligraphic_U ) start_POSTSUPERSCRIPT italic_C end_POSTSUPERSCRIPT as the neutral classes that are neither desirable nor undesirable. We will use the fact that 1=𝒮𝒲+𝒮𝒰+𝒮𝒩1subscript𝒮𝒲subscript𝒮𝒰subscript𝒮𝒩1=\mathcal{S}_{\mathcal{W}}+\mathcal{S}_{\mathcal{U}}+\mathcal{S}_{\mathcal{N}}1 = caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT to rewrite (3) as

d𝒴(𝐲~,T)={0if 𝒮𝒲p and 𝒮𝒰qpf(𝒮𝒲p)+(1p)f(𝒮𝒰+𝒮𝒩1p)if 𝒮𝒲<p and 𝒮𝒰(1𝒮𝒲)(q1p)qf(𝒮𝒰q)+(1q)f(𝒮𝒲+𝒮𝒩1q)if 𝒮𝒰>q and 𝒮𝒲(1𝒮𝒰)(p1q)pf(𝒮𝒲p)+qf(𝒮𝒰q)+(1pq)f(𝒮𝒩1pq)if 𝒮𝒰>(1𝒮𝒲)(q1p) and 𝒮𝒲<(1𝒮𝒰)(p1q),subscript𝑑𝒴~𝐲𝑇cases0if subscript𝒮𝒲𝑝 and subscript𝒮𝒰𝑞𝑝𝑓subscript𝒮𝒲𝑝1𝑝𝑓subscript𝒮𝒰subscript𝒮𝒩1𝑝if subscript𝒮𝒲𝑝 and subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝𝑞𝑓subscript𝒮𝒰𝑞1𝑞𝑓subscript𝒮𝒲subscript𝒮𝒩1𝑞if subscript𝒮𝒰𝑞 and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞𝑝𝑓subscript𝒮𝒲𝑝𝑞𝑓subscript𝒮𝒰𝑞1𝑝𝑞𝑓subscript𝒮𝒩1𝑝𝑞if subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝otherwise and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)=\begin{cases}0&\text{if }\mathcal{S}_{% \mathcal{W}}\geq p\text{ and }\mathcal{S}_{\mathcal{U}}\leq q\\ pf\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+(1-p)f\left(\frac{\mathcal{% S}_{\mathcal{U}}+\mathcal{S}_{\mathcal{N}}}{1-p}\right)&\text{if }\mathcal{S}_% {\mathcal{W}}<p\text{ and }\mathcal{S}_{\mathcal{U}}\leq(1-\mathcal{S}_{% \mathcal{W}})\left(\frac{q}{1-p}\right)\\ qf\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)+(1-q)f\left(\frac{\mathcal{% S}_{\mathcal{W}}+\mathcal{S}_{\mathcal{N}}}{1-q}\right)&\text{if }\mathcal{S}_% {\mathcal{U}}>q\text{ and }\mathcal{S}_{\mathcal{W}}\geq(1-\mathcal{S}_{% \mathcal{U}})\left(\frac{p}{1-q}\right)\\ pf\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+qf\left(\frac{\mathcal{S}_{% \mathcal{U}}}{q}\right)+(1-p-q)f\left(\frac{\mathcal{S}_{\mathcal{N}}}{1-p-q}% \right)&\text{if }\mathcal{S}_{\mathcal{U}}>(1-\mathcal{S}_{\mathcal{W}})\left% (\frac{q}{1-p}\right)\\ &\text{ and }\mathcal{S}_{\mathcal{W}}<(1-\mathcal{S}_{\mathcal{U}})\left(% \frac{p}{1-q}\right)\end{cases},italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) = { start_ROW start_CELL 0 end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ≥ italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ≤ italic_q end_CELL end_ROW start_ROW start_CELL italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + ( 1 - italic_p ) italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ≤ ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ( 1 - italic_q ) italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > italic_q and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ≥ ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW start_ROW start_CELL italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ( 1 - italic_p - italic_q ) italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW ,

where 𝒮𝒲=i𝒲y~isubscript𝒮𝒲subscript𝑖𝒲subscript~𝑦𝑖\mathcal{S}_{\mathcal{W}}=\sum_{i\in\mathcal{W}}\tilde{y}_{i}caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_W end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, 𝒮𝒰=i𝒰y~isubscript𝒮𝒰subscript𝑖𝒰subscript~𝑦𝑖\mathcal{S}_{\mathcal{U}}=\sum_{i\in\mathcal{U}}\tilde{y}_{i}caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_U end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and 𝒮𝒩=i𝒩y~isubscript𝒮𝒩subscript𝑖𝒩subscript~𝑦𝑖\mathcal{S}_{\mathcal{N}}=\sum_{i\in\mathcal{N}}\tilde{y}_{i}caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_N end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

The case where 𝐲~T~𝐲𝑇\tilde{\mathbf{y}}\in Tover~ start_ARG bold_y end_ARG ∈ italic_T is obvious so we consider only the case where 𝐲~T~𝐲𝑇\tilde{\mathbf{y}}\notin Tover~ start_ARG bold_y end_ARG ∉ italic_T, First note that f𝑓fitalic_f-divergence Df(𝐲~||𝐳)D_{f}(\tilde{\mathbf{y}}||\mathbf{z})italic_D start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG | | bold_z ) is convex in 𝐳𝐳\mathbf{z}bold_z. Furthermore T𝑇Titalic_T is a convex set. Therefore any 𝐳𝐳\mathbf{z}bold_z satisfying the KKT conditions is a minimizer. The KKT conditions for this problem can be written as

(𝐳)=0𝐳0\displaystyle\nabla\mathcal{L}(\mathbf{z})=\vec{0}∇ caligraphic_L ( bold_z ) = over→ start_ARG 0 end_ARG (10)
i=1kzi=1superscriptsubscript𝑖1𝑘subscript𝑧𝑖1\displaystyle\sum_{i=1}^{k}z_{i}=1∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 (11)
pi𝒲zi0𝑝subscript𝑖𝒲subscript𝑧𝑖0\displaystyle p-\sum_{i\in\mathcal{W}}{z}_{i}\leq 0italic_p - ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_W end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ 0 (12)
i𝒰ziq0subscript𝑖𝒰subscript𝑧𝑖𝑞0\displaystyle\sum_{i\in\mathcal{U}}{z}_{i}-q\leq 0∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_U end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_q ≤ 0 (13)
μ1,μ20subscript𝜇1subscript𝜇20\displaystyle\mu_{1},\mu_{2}\geq 0italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≥ 0 (14)
μ1(pi𝒲zi)=0subscript𝜇1𝑝subscript𝑖𝒲subscript𝑧𝑖0\displaystyle\mu_{1}\left(p-\sum_{i\in\mathcal{W}}{z}_{i}\right)=0italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_p - ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_W end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = 0 (15)
μ2(qi𝒰zi)=0,subscript𝜇2𝑞subscript𝑖𝒰subscript𝑧𝑖0\displaystyle\mu_{2}\left(q-\sum_{i\in\mathcal{U}}{z}_{i}\right)=0,italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( italic_q - ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_U end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = 0 , (16)

where the Lagrangian is defined by

(𝐳)=i=1kzif(y~izi)+λi=1kzi+μ1(pi𝒲zi)+μ2(i𝒰ziq).𝐳superscriptsubscript𝑖1𝑘subscript𝑧𝑖𝑓subscript~𝑦𝑖subscript𝑧𝑖𝜆superscriptsubscript𝑖1𝑘subscript𝑧𝑖subscript𝜇1𝑝subscript𝑖𝒲subscript𝑧𝑖subscript𝜇2subscript𝑖𝒰subscript𝑧𝑖𝑞\mathcal{L}(\mathbf{z})=\sum_{i=1}^{k}{z}_{i}f\left(\frac{\tilde{y}_{i}}{z_{i}% }\right)+\lambda\sum_{i=1}^{k}z_{i}+\mu_{1}\left(p-\sum_{i\in\mathcal{W}}{z}_{% i}\right)+\mu_{2}\left(\sum_{i\in\mathcal{U}}{z}_{i}-q\right).caligraphic_L ( bold_z ) = ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_f ( divide start_ARG over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ) + italic_λ ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ( italic_p - ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_W end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) + italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ( ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_U end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_q ) .

Note that we have neglected to explicitly state the requirement that 0zi10subscript𝑧𝑖10\leq z_{i}\leq 10 ≤ italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ≤ 1 for all i𝑖iitalic_i. This is because our eventual solution will satisfy these bounds anyways, and omitting these bounds will drastically simplify our calculations. We now rewrite (10) as

f(y~izi)y~izif(y~izi)+λμ1𝑓subscript~𝑦𝑖subscript𝑧𝑖subscript~𝑦𝑖subscript𝑧𝑖superscript𝑓subscript~𝑦𝑖subscript𝑧𝑖𝜆subscript𝜇1\displaystyle f\left(\frac{\tilde{y}_{i}}{z_{i}}\right)-\frac{\tilde{y}_{i}}{z% _{i}}f^{\prime}\left(\frac{\tilde{y}_{i}}{z_{i}}\right)+\lambda-\mu_{1}italic_f ( divide start_ARG over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ) - divide start_ARG over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ) + italic_λ - italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT =0absent0\displaystyle=0= 0 i𝒲𝑖𝒲\displaystyle i\in\mathcal{W}italic_i ∈ caligraphic_W (17)
f(y~izi)y~izif(y~izi)+λ+μ2𝑓subscript~𝑦𝑖subscript𝑧𝑖subscript~𝑦𝑖subscript𝑧𝑖superscript𝑓subscript~𝑦𝑖subscript𝑧𝑖𝜆subscript𝜇2\displaystyle f\left(\frac{\tilde{y}_{i}}{z_{i}}\right)-\frac{\tilde{y}_{i}}{z% _{i}}f^{\prime}\left(\frac{\tilde{y}_{i}}{z_{i}}\right)+\lambda+\mu_{2}italic_f ( divide start_ARG over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ) - divide start_ARG over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ) + italic_λ + italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT =0absent0\displaystyle=0= 0 i𝒰𝑖𝒰\displaystyle i\in\mathcal{U}italic_i ∈ caligraphic_U (18)
f(y~izi)y~izif(y~izi)+λ𝑓subscript~𝑦𝑖subscript𝑧𝑖subscript~𝑦𝑖subscript𝑧𝑖superscript𝑓subscript~𝑦𝑖subscript𝑧𝑖𝜆\displaystyle f\left(\frac{\tilde{y}_{i}}{z_{i}}\right)-\frac{\tilde{y}_{i}}{z% _{i}}f^{\prime}\left(\frac{\tilde{y}_{i}}{z_{i}}\right)+\lambdaitalic_f ( divide start_ARG over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ) - divide start_ARG over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG ) + italic_λ =0absent0\displaystyle=0= 0 i𝒩𝑖𝒩\displaystyle i\in\mathcal{N}italic_i ∈ caligraphic_N (19)

We now propose a solution can be found where that the ratios y~izisubscript~𝑦𝑖subscript𝑧𝑖\frac{\tilde{y}_{i}}{z_{i}}divide start_ARG over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG are constant in each of the sets 𝒲𝒲\mathcal{W}caligraphic_W, 𝒰𝒰\mathcal{U}caligraphic_U, 𝒩𝒩\mathcal{N}caligraphic_N. That is

zisubscript𝑧𝑖\displaystyle z_{i}italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT =C𝒲y~iabsentsubscript𝐶𝒲subscript~𝑦𝑖\displaystyle=C_{\mathcal{W}}\tilde{y}_{i}= italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT i𝒲𝑖𝒲\displaystyle i\in\mathcal{W}italic_i ∈ caligraphic_W
zisubscript𝑧𝑖\displaystyle z_{i}italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT =C𝒰y~iabsentsubscript𝐶𝒰subscript~𝑦𝑖\displaystyle=C_{\mathcal{U}}\tilde{y}_{i}= italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT i𝒰𝑖𝒰\displaystyle i\in\mathcal{U}italic_i ∈ caligraphic_U
zisubscript𝑧𝑖\displaystyle z_{i}italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT =C𝒩y~iabsentsubscript𝐶𝒩subscript~𝑦𝑖\displaystyle=C_{\mathcal{N}}\tilde{y}_{i}= italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT i𝒩.𝑖𝒩\displaystyle i\in\mathcal{N}.italic_i ∈ caligraphic_N .

In that case we can satisfy conditions (17), (18) and (19) (originally (10)) by setting

λ𝜆\displaystyle\lambdaitalic_λ =C𝒩1f(C𝒩1)f(C𝒩1)absentsuperscriptsubscript𝐶𝒩1superscript𝑓superscriptsubscript𝐶𝒩1𝑓superscriptsubscript𝐶𝒩1\displaystyle=C_{\mathcal{N}}^{-1}f^{\prime}(C_{\mathcal{N}}^{-1})-f(C_{% \mathcal{N}}^{-1})= italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) - italic_f ( italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT )
μ1subscript𝜇1\displaystyle\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT =λ+f(C𝒲1)C𝒲1f(C𝒲1)absent𝜆𝑓superscriptsubscript𝐶𝒲1superscriptsubscript𝐶𝒲1superscript𝑓superscriptsubscript𝐶𝒲1\displaystyle=\lambda+f(C_{\mathcal{W}}^{-1})-C_{\mathcal{W}}^{-1}f^{\prime}(C% _{\mathcal{W}}^{-1})= italic_λ + italic_f ( italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) - italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT )
μ2subscript𝜇2\displaystyle\mu_{2}italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT =λf(C𝒰1)+C𝒰1f(C𝒰1).absent𝜆𝑓superscriptsubscript𝐶𝒰1superscriptsubscript𝐶𝒰1superscript𝑓superscriptsubscript𝐶𝒰1\displaystyle=-\lambda-f(C_{\mathcal{U}}^{-1})+C_{\mathcal{U}}^{-1}f^{\prime}(% C_{\mathcal{U}}^{-1}).= - italic_λ - italic_f ( italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) + italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) .

We can now reformulate (14) so that it is easier to analyze. We will first define h(x)=xf(x)f(x)𝑥𝑥superscript𝑓𝑥𝑓𝑥h(x)=xf^{\prime}(x)-f(x)italic_h ( italic_x ) = italic_x italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_x ) - italic_f ( italic_x ). Note that because f(x)𝑓𝑥f(x)italic_f ( italic_x ) is convex h(x)=xf′′(x)0superscript𝑥𝑥superscript𝑓′′𝑥0h^{\prime}(x)=xf^{\prime\prime}(x)\geq 0italic_h start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_x ) = italic_x italic_f start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT ( italic_x ) ≥ 0 for all x0𝑥0x\geq 0italic_x ≥ 0 and h(x)𝑥h(x)italic_h ( italic_x ) is increasing. We can then rewrite our formulas for λ𝜆\lambdaitalic_λ, μ1subscript𝜇1\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and μ2subscript𝜇2\mu_{2}italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT.

λ𝜆\displaystyle\lambdaitalic_λ =h(C𝒩1)absentsuperscriptsubscript𝐶𝒩1\displaystyle=h(C_{\mathcal{N}}^{-1})= italic_h ( italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT )
μ1subscript𝜇1\displaystyle\mu_{1}italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT =h(C𝒩1)h(C𝒲1)absentsuperscriptsubscript𝐶𝒩1superscriptsubscript𝐶𝒲1\displaystyle=h(C_{\mathcal{N}}^{-1})-h(C_{\mathcal{W}}^{-1})= italic_h ( italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) - italic_h ( italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT )
μ2subscript𝜇2\displaystyle\mu_{2}italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT =h(C𝒰1)h(C𝒩1)absentsuperscriptsubscript𝐶𝒰1superscriptsubscript𝐶𝒩1\displaystyle=h(C_{\mathcal{U}}^{-1})-h(C_{\mathcal{N}}^{-1})= italic_h ( italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) - italic_h ( italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT )

Then μ10subscript𝜇10\mu_{1}\geq 0italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≥ 0 becomes

h(C𝒩1)superscriptsubscript𝐶𝒩1\displaystyle h(C_{\mathcal{N}}^{-1})italic_h ( italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT ) h(C𝒲1)absentsuperscriptsubscript𝐶𝒲1\displaystyle\geq h(C_{\mathcal{W}}^{-1})≥ italic_h ( italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT )
C𝒩1superscriptsubscript𝐶𝒩1\displaystyle C_{\mathcal{N}}^{-1}italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT C𝒲1absentsuperscriptsubscript𝐶𝒲1\displaystyle\geq C_{\mathcal{W}}^{-1}≥ italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT start_POSTSUPERSCRIPT - 1 end_POSTSUPERSCRIPT
C𝒩subscript𝐶𝒩\displaystyle C_{\mathcal{N}}italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT C𝒲,absentsubscript𝐶𝒲\displaystyle\leq C_{\mathcal{W}},≤ italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ,

and μ20subscript𝜇20\mu_{2}\geq 0italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≥ 0 similarly becomes C𝒩C𝒰subscript𝐶𝒩subscript𝐶𝒰C_{\mathcal{N}}\geq C_{\mathcal{U}}italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT ≥ italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT. This means (14) is equivalent to

C𝒰C𝒩C𝒲subscript𝐶𝒰subscript𝐶𝒩subscript𝐶𝒲C_{\mathcal{U}}\leq C_{\mathcal{N}}\leq C_{\mathcal{W}}italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ≤ italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT ≤ italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT (20)

We must now find values of C𝒲subscript𝐶𝒲C_{\mathcal{W}}italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT, C𝒰subscript𝐶𝒰C_{\mathcal{U}}italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT and C𝒩subscript𝐶𝒩C_{\mathcal{N}}italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT that satisfy (11) through (16). We will consider 3 cases illustrated in Figure 6.

Refer to caption
Figure 6: The three cases visualized in probability space.

Case: 1 Suppose 𝒮𝒲<psubscript𝒮𝒲𝑝\mathcal{S}_{\mathcal{W}}<pcaligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < italic_p and 𝒮𝒰(1𝒮𝒲)(q1p)subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝\mathcal{S}_{\mathcal{U}}\leq(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q}{1-p}\right)caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ≤ ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ).

Let C𝒲=p𝒮𝒲subscript𝐶𝒲𝑝subscript𝒮𝒲C_{\mathcal{W}}=\frac{p}{\mathcal{S}_{\mathcal{W}}}italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT = divide start_ARG italic_p end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG and C𝒰=C𝒩=1p𝒮𝒰+𝒮𝒩subscript𝐶𝒰subscript𝐶𝒩1𝑝subscript𝒮𝒰subscript𝒮𝒩C_{\mathcal{U}}=C_{\mathcal{N}}=\frac{1-p}{\mathcal{S}_{\mathcal{U}}+\mathcal{% S}_{\mathcal{N}}}italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT = italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT = divide start_ARG 1 - italic_p end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG. This implies μ2=0subscript𝜇20\mu_{2}=0italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT = 0 which satisfies (16) and half of (14). This also implies i=𝒲zi=psubscript𝑖absent𝒲subscript𝑧𝑖𝑝\sum_{i=\in\mathcal{W}}z_{i}=p∑ start_POSTSUBSCRIPT italic_i = ∈ caligraphic_W end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_p satisfying (12) and (15). We will use the fact 𝒮𝒰+𝒮𝒩=1𝒮𝒲subscript𝒮𝒰subscript𝒮𝒩1subscript𝒮𝒲\mathcal{S}_{\mathcal{U}}+\mathcal{S}_{\mathcal{N}}=1-\mathcal{S}_{\mathcal{W}}caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT = 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT in our proof of condition (13).

i𝒮𝒰zisubscript𝑖subscript𝒮𝒰subscript𝑧𝑖\displaystyle\sum_{i\in\mathcal{S}_{\mathcal{U}}}z_{i}∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT =i𝒮𝒰C𝒰y~i=1p𝒮𝒰+𝒮𝒩𝒮𝒰absentsubscript𝑖subscript𝒮𝒰subscript𝐶𝒰subscript~𝑦𝑖1𝑝subscript𝒮𝒰subscript𝒮𝒩subscript𝒮𝒰\displaystyle=\sum_{i\in\mathcal{S}_{\mathcal{U}}}C_{\mathcal{U}}\tilde{y}_{i}% =\frac{1-p}{\mathcal{S}_{\mathcal{U}}+\mathcal{S}_{\mathcal{N}}}\mathcal{S}_{% \mathcal{U}}= ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = divide start_ARG 1 - italic_p end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT
1p𝒮𝒰+𝒮𝒩(1𝒮𝒲)(q1p)=qabsent1𝑝subscript𝒮𝒰subscript𝒮𝒩1subscript𝒮𝒲𝑞1𝑝𝑞\displaystyle\leq\frac{1-p}{\mathcal{S}_{\mathcal{U}}+\mathcal{S}_{\mathcal{N}% }}(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q}{1-p}\right)=q≤ divide start_ARG 1 - italic_p end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) = italic_q

This proves (13) is satisfied.

Because 𝒮𝒲<psubscript𝒮𝒲𝑝\mathcal{S}_{\mathcal{W}}<pcaligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < italic_p we have

C𝒲subscript𝐶𝒲\displaystyle C_{\mathcal{W}}italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT =p𝒮𝒲>1>1p1𝒮𝒲=1p𝒮𝒰+𝒮𝒩=C𝒩absent𝑝subscript𝒮𝒲11𝑝1subscript𝒮𝒲1𝑝subscript𝒮𝒰subscript𝒮𝒩subscript𝐶𝒩\displaystyle=\frac{p}{\mathcal{S}_{\mathcal{W}}}>1>\frac{1-p}{1-\mathcal{S}_{% \mathcal{W}}}=\frac{1-p}{\mathcal{S}_{\mathcal{U}}+\mathcal{S}_{\mathcal{N}}}=% C_{\mathcal{N}}= divide start_ARG italic_p end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG > 1 > divide start_ARG 1 - italic_p end_ARG start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG = divide start_ARG 1 - italic_p end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG = italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT

This implies μ1>0subscript𝜇10\mu_{1}>0italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > 0 and satisfies the other half of (14).

We have now shown all the KKT conditions are satisfied and we have found a minimizer. We now plug these values into (9) to find a closed form for the distance.

d𝒴(𝐲~,T)subscript𝑑𝒴~𝐲𝑇\displaystyle d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) =min𝐳Ti=1kzif(y~izi)absentsubscript𝐳𝑇superscriptsubscript𝑖1𝑘subscript𝑧𝑖𝑓subscript~𝑦𝑖subscript𝑧𝑖\displaystyle=\min_{\mathbf{z}\in T}\sum_{i=1}^{k}{z}_{i}f\left(\frac{\tilde{y% }_{i}}{z_{i}}\right)= roman_min start_POSTSUBSCRIPT bold_z ∈ italic_T end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_f ( divide start_ARG over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG )
=i𝒲py~i𝒮𝒲f(𝒮𝒲p)+i𝒲(1p)y~i𝒮𝒰+𝒮𝒩f(𝒮𝒰+𝒮𝒩1p)absentsubscript𝑖𝒲𝑝subscript~𝑦𝑖subscript𝒮𝒲𝑓subscript𝒮𝒲𝑝subscript𝑖𝒲1𝑝subscript~𝑦𝑖subscript𝒮𝒰subscript𝒮𝒩𝑓subscript𝒮𝒰subscript𝒮𝒩1𝑝\displaystyle=\sum_{i\in\mathcal{W}}\frac{p\tilde{y}_{i}}{\mathcal{S}_{% \mathcal{W}}}f\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+\sum_{i\notin% \mathcal{W}}\frac{(1-p)\tilde{y}_{i}}{\mathcal{S}_{\mathcal{U}}+\mathcal{S}_{% \mathcal{N}}}f\left(\frac{\mathcal{S}_{\mathcal{U}}+\mathcal{S}_{\mathcal{N}}}% {1-p}\right)= ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_W end_POSTSUBSCRIPT divide start_ARG italic_p over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + ∑ start_POSTSUBSCRIPT italic_i ∉ caligraphic_W end_POSTSUBSCRIPT divide start_ARG ( 1 - italic_p ) over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG )
=pf(𝒮𝒲p)+(1p)f(𝒮𝒰+𝒮𝒩1p).absent𝑝𝑓subscript𝒮𝒲𝑝1𝑝𝑓subscript𝒮𝒰subscript𝒮𝒩1𝑝\displaystyle=pf\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+(1-p)f\left(% \frac{\mathcal{S}_{\mathcal{U}}+\mathcal{S}_{\mathcal{N}}}{1-p}\right).= italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + ( 1 - italic_p ) italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ) .

Case: 2 Suppose 𝒮𝒰>qsubscript𝒮𝒰𝑞\mathcal{S}_{\mathcal{U}}>qcaligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > italic_q and 𝒮𝒲(1𝒮𝒰)(p1q)subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞\mathcal{S}_{\mathcal{W}}\geq(1-\mathcal{S}_{\mathcal{U}})\left(\frac{p}{1-q}\right)caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ≥ ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ).

Let C𝒰=q𝒮𝒰subscript𝐶𝒰𝑞subscript𝒮𝒰C_{\mathcal{U}}=\frac{q}{\mathcal{S}_{\mathcal{U}}}italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT = divide start_ARG italic_q end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG and C𝒲=C𝒩=1q𝒮𝒲+𝒮𝒩subscript𝐶𝒲subscript𝐶𝒩1𝑞subscript𝒮𝒲subscript𝒮𝒩C_{\mathcal{W}}=C_{\mathcal{N}}=\frac{1-q}{\mathcal{S}_{\mathcal{W}}+\mathcal{% S}_{\mathcal{N}}}italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT = italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT = divide start_ARG 1 - italic_q end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG. This implies μ1=0subscript𝜇10\mu_{1}=0italic_μ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = 0 which satisfies (15) and half of (14). We also have i=𝒰zi=qsubscript𝑖absent𝒰subscript𝑧𝑖𝑞\sum_{i=\in\mathcal{U}}z_{i}=q∑ start_POSTSUBSCRIPT italic_i = ∈ caligraphic_U end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_q satisfying (13) and (16). We now prove condition (12) is satisfied.

i𝒮𝒲zisubscript𝑖subscript𝒮𝒲subscript𝑧𝑖\displaystyle\sum_{i\in\mathcal{S}_{\mathcal{W}}}z_{i}∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT =i𝒮𝒲C𝒲y~i=1q𝒮𝒲+𝒮𝒩𝒮𝒲absentsubscript𝑖subscript𝒮𝒲subscript𝐶𝒲subscript~𝑦𝑖1𝑞subscript𝒮𝒲subscript𝒮𝒩subscript𝒮𝒲\displaystyle=\sum_{i\in\mathcal{S}_{\mathcal{W}}}C_{\mathcal{W}}\tilde{y}_{i}% =\frac{1-q}{\mathcal{S}_{\mathcal{W}}+\mathcal{S}_{\mathcal{N}}}\mathcal{S}_{% \mathcal{W}}= ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = divide start_ARG 1 - italic_q end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT
1q𝒮𝒲+𝒮𝒩(1𝒮𝒰)(p1q)absent1𝑞subscript𝒮𝒲subscript𝒮𝒩1subscript𝒮𝒰𝑝1𝑞\displaystyle\geq\frac{1-q}{\mathcal{S}_{\mathcal{W}}+\mathcal{S}_{\mathcal{N}% }}(1-\mathcal{S}_{\mathcal{U}})\left(\frac{p}{1-q}\right)≥ divide start_ARG 1 - italic_q end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) =pabsent𝑝\displaystyle=p= italic_p

Finally we prove C𝒩C𝒰subscript𝐶𝒩subscript𝐶𝒰C_{\mathcal{N}}\geq C_{\mathcal{U}}italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT ≥ italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT implying μ20subscript𝜇20\mu_{2}\geq 0italic_μ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ≥ 0 which satisfies the other half of (14)

C𝒰subscript𝐶𝒰\displaystyle C_{\mathcal{U}}italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT =q𝒮𝒰<1<1q1𝒮𝒰=1q𝒮𝒲+𝒮𝒩=C𝒩absent𝑞subscript𝒮𝒰11𝑞1subscript𝒮𝒰1𝑞subscript𝒮𝒲subscript𝒮𝒩subscript𝐶𝒩\displaystyle=\frac{q}{\mathcal{S}_{\mathcal{U}}}<1<\frac{1-q}{1-\mathcal{S}_{% \mathcal{U}}}=\frac{1-q}{\mathcal{S}_{\mathcal{W}}+\mathcal{S}_{\mathcal{N}}}=% C_{\mathcal{N}}= divide start_ARG italic_q end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG < 1 < divide start_ARG 1 - italic_q end_ARG start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG = divide start_ARG 1 - italic_q end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG = italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT

Now that we have proven that this is a minimizer we will again plug solution into (9) to find the distance value.

d𝒴(𝐲~,T)subscript𝑑𝒴~𝐲𝑇\displaystyle d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) =min𝐳Ti=1kzif(y~izi)absentsubscript𝐳𝑇superscriptsubscript𝑖1𝑘subscript𝑧𝑖𝑓subscript~𝑦𝑖subscript𝑧𝑖\displaystyle=\min_{\mathbf{z}\in T}\sum_{i=1}^{k}{z}_{i}f\left(\frac{\tilde{y% }_{i}}{z_{i}}\right)= roman_min start_POSTSUBSCRIPT bold_z ∈ italic_T end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_f ( divide start_ARG over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG )
=i𝒰qy~i𝒮𝒰f(𝒮𝒰q)+i𝒰(1q)y~i𝒮𝒲+𝒮𝒩f(𝒮𝒲+𝒮𝒩1p)absentsubscript𝑖𝒰𝑞subscript~𝑦𝑖subscript𝒮𝒰𝑓subscript𝒮𝒰𝑞subscript𝑖𝒰1𝑞subscript~𝑦𝑖subscript𝒮𝒲subscript𝒮𝒩𝑓subscript𝒮𝒲subscript𝒮𝒩1𝑝\displaystyle=\sum_{i\in\mathcal{U}}\frac{q\tilde{y}_{i}}{\mathcal{S}_{% \mathcal{U}}}f\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)+\sum_{i\notin% \mathcal{U}}\frac{(1-q)\tilde{y}_{i}}{\mathcal{S}_{\mathcal{W}}+\mathcal{S}_{% \mathcal{N}}}f\left(\frac{\mathcal{S}_{\mathcal{W}}+\mathcal{S}_{\mathcal{N}}}% {1-p}\right)= ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_U end_POSTSUBSCRIPT divide start_ARG italic_q over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ∑ start_POSTSUBSCRIPT italic_i ∉ caligraphic_U end_POSTSUBSCRIPT divide start_ARG ( 1 - italic_q ) over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG )
=qf(𝒮𝒰q)+(1q)f(𝒮𝒲+𝒮𝒩1q).absent𝑞𝑓subscript𝒮𝒰𝑞1𝑞𝑓subscript𝒮𝒲subscript𝒮𝒩1𝑞\displaystyle=qf\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)+(1-q)f\left(% \frac{\mathcal{S}_{\mathcal{W}}+\mathcal{S}_{\mathcal{N}}}{1-q}\right).= italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ( 1 - italic_q ) italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT + caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ) .

Case: 3 Suppose 𝒮𝒰>(1𝒮𝒲)(q1p)subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝\mathcal{S}_{\mathcal{U}}>(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q}{1-p}\right)caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) and 𝒮𝒲<(1𝒮𝒰)(p1q)subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞\mathcal{S}_{\mathcal{W}}<(1-\mathcal{S}_{\mathcal{U}})\left(\frac{p}{1-q}\right)caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ).

Let C𝒲=p𝒮𝒲subscript𝐶𝒲𝑝subscript𝒮𝒲C_{\mathcal{W}}=\frac{p}{\mathcal{S}_{\mathcal{W}}}italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT = divide start_ARG italic_p end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG, C𝒰=q𝒮𝒰subscript𝐶𝒰𝑞subscript𝒮𝒰C_{\mathcal{U}}=\frac{q}{\mathcal{S}_{\mathcal{U}}}italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT = divide start_ARG italic_q end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG and C𝒩=1pq𝒮𝒩subscript𝐶𝒩1𝑝𝑞subscript𝒮𝒩C_{\mathcal{N}}=\frac{1-p-q}{\mathcal{S}_{\mathcal{N}}}italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT = divide start_ARG 1 - italic_p - italic_q end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG in which case i=𝒲zi=psubscript𝑖absent𝒲subscript𝑧𝑖𝑝\sum_{i=\in\mathcal{W}}z_{i}=p∑ start_POSTSUBSCRIPT italic_i = ∈ caligraphic_W end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_p (satisfying (12) and (15)), i=𝒰zi=qsubscript𝑖absent𝒰subscript𝑧𝑖𝑞\sum_{i=\in\mathcal{U}}z_{i}=q∑ start_POSTSUBSCRIPT italic_i = ∈ caligraphic_U end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_q (satisfying (13) and (16)). The choice of C𝒩subscript𝐶𝒩C_{\mathcal{N}}italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT ensures that (11) is satisfied:

i=1Mzisuperscriptsubscript𝑖1𝑀subscript𝑧𝑖\displaystyle\sum_{i=1}^{M}z_{i}∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_M end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT =iC𝒲zi+iC𝒰zi+iC𝒩ziabsentsubscript𝑖subscript𝐶𝒲subscript𝑧𝑖subscript𝑖subscript𝐶𝒰subscript𝑧𝑖subscript𝑖subscript𝐶𝒩subscript𝑧𝑖\displaystyle=\sum_{i\in C_{\mathcal{W}}}z_{i}+\sum_{i\in C_{\mathcal{U}}}z_{i% }+\sum_{i\in C_{\mathcal{N}}}z_{i}= ∑ start_POSTSUBSCRIPT italic_i ∈ italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i ∈ italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i ∈ italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT
=iC𝒲C𝒲y~i+iC𝒰C𝒰y~i+iC𝒩C𝒩y~iabsentsubscript𝑖subscript𝐶𝒲subscript𝐶𝒲subscript~𝑦𝑖subscript𝑖subscript𝐶𝒰subscript𝐶𝒰subscript~𝑦𝑖subscript𝑖subscript𝐶𝒩subscript𝐶𝒩subscript~𝑦𝑖\displaystyle=\sum_{i\in C_{\mathcal{W}}}C_{\mathcal{W}}\tilde{y}_{i}+\sum_{i% \in C_{\mathcal{U}}}C_{\mathcal{U}}\tilde{y}_{i}+\sum_{i\in C_{\mathcal{N}}}C_% {\mathcal{N}}\tilde{y}_{i}= ∑ start_POSTSUBSCRIPT italic_i ∈ italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i ∈ italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_i ∈ italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT
=C𝒲𝒮𝒲+C𝒰𝒮𝒰+C𝒩𝒮𝒩absentsubscript𝐶𝒲subscript𝒮𝒲subscript𝐶𝒰subscript𝒮𝒰subscript𝐶𝒩subscript𝒮𝒩\displaystyle=C_{\mathcal{W}}\mathcal{S}_{\mathcal{W}}+C_{\mathcal{U}}\mathcal% {S}_{\mathcal{U}}+C_{\mathcal{N}}\mathcal{S}_{\mathcal{N}}= italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT + italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT + italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT
=1absent1\displaystyle=1= 1

To show that (14) is satisfied. We note 𝒮𝒰>(1𝒮𝒲)(q1p)subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝\mathcal{S}_{\mathcal{U}}>(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q}{1-p}\right)caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) implies C𝒩>C𝒰subscript𝐶𝒩subscript𝐶𝒰C_{\mathcal{N}}>C_{\mathcal{U}}italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT > italic_C start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT and 𝒮𝒲<(1𝒮𝒰)(p1q)psubscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞𝑝\mathcal{S}_{\mathcal{W}}<(1-\mathcal{S}_{\mathcal{U}})\left(\frac{p}{1-q}% \right)pcaligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) italic_p implies C𝒩<C𝒲subscript𝐶𝒩subscript𝐶𝒲C_{\mathcal{N}}<C_{\mathcal{W}}italic_C start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT < italic_C start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT. this proves (20) which is equivalent to (14) Plugging these minimizing values of 𝐳𝐳\mathbf{z}bold_z into (9) yields

d𝒴(𝐲~,T)subscript𝑑𝒴~𝐲𝑇\displaystyle d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) =min𝐳Ti=1kzif(y~izi)absentsubscript𝐳𝑇superscriptsubscript𝑖1𝑘subscript𝑧𝑖𝑓subscript~𝑦𝑖subscript𝑧𝑖\displaystyle=\min_{\mathbf{z}\in T}\sum_{i=1}^{k}{z}_{i}f\left(\frac{\tilde{y% }_{i}}{z_{i}}\right)= roman_min start_POSTSUBSCRIPT bold_z ∈ italic_T end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_f ( divide start_ARG over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG italic_z start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG )
=i𝒲py~i𝒮𝒲f(𝒮𝒲p)+i𝒰qy~i𝒮𝒰f(𝒮𝒰q)+i𝒲(1pq)y~i𝒮𝒩f(𝒮𝒩1pq)absentsubscript𝑖𝒲𝑝subscript~𝑦𝑖subscript𝒮𝒲𝑓subscript𝒮𝒲𝑝subscript𝑖𝒰𝑞subscript~𝑦𝑖subscript𝒮𝒰𝑓subscript𝒮𝒰𝑞subscript𝑖𝒲1𝑝𝑞subscript~𝑦𝑖subscript𝒮𝒩𝑓subscript𝒮𝒩1𝑝𝑞\displaystyle=\sum_{i\in\mathcal{W}}\frac{p\tilde{y}_{i}}{\mathcal{S}_{% \mathcal{W}}}f\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+\sum_{i\in% \mathcal{U}}\frac{q\tilde{y}_{i}}{\mathcal{S}_{\mathcal{U}}}f\left(\frac{% \mathcal{S}_{\mathcal{U}}}{q}\right)+\sum_{i\in\mathcal{W}}\frac{(1-p-q)\tilde% {y}_{i}}{\mathcal{S}_{\mathcal{N}}}f\left(\frac{\mathcal{S}_{\mathcal{N}}}{1-p% -q}\right)= ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_W end_POSTSUBSCRIPT divide start_ARG italic_p over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_U end_POSTSUBSCRIPT divide start_ARG italic_q over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ∑ start_POSTSUBSCRIPT italic_i ∈ caligraphic_W end_POSTSUBSCRIPT divide start_ARG ( 1 - italic_p - italic_q ) over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_ARG start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG )
=pf(𝒮𝒲p)+qf(𝒮𝒰q)+(1pq)f(𝒮𝒩1pq).absent𝑝𝑓subscript𝒮𝒲𝑝𝑞𝑓subscript𝒮𝒰𝑞1𝑝𝑞𝑓subscript𝒮𝒩1𝑝𝑞\displaystyle=pf\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+qf\left(\frac% {\mathcal{S}_{\mathcal{U}}}{q}\right)+(1-p-q)f\left(\frac{\mathcal{S}_{% \mathcal{N}}}{1-p-q}\right).= italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ( 1 - italic_p - italic_q ) italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) .

This proves the closed form in equation (3) and we may now proceed to show that this function is continuous. To prove continuity we need only show continuity the piece-wise boundaries which we will evaluate one at a time.

Boundary 1: 𝒮𝒲=psubscript𝒮𝒲𝑝\mathcal{S}_{\mathcal{W}}=pcaligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT = italic_p. The two functions that share this boundary are 00 and pf(𝒮𝒲p)+(1p)f(1𝒮𝒲1p)𝑝𝑓subscript𝒮𝒲𝑝1𝑝𝑓1subscript𝒮𝒲1𝑝pf\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+(1-p)f\left(\frac{1-% \mathcal{S}_{\mathcal{W}}}{1-p}\right)italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + ( 1 - italic_p ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ). Plugging the boundary into the latter function yields

pf(𝒮𝒲p)+(1p)f(1𝒮𝒲1p)𝑝𝑓subscript𝒮𝒲𝑝1𝑝𝑓1subscript𝒮𝒲1𝑝\displaystyle pf\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+(1-p)f\left(% \frac{1-\mathcal{S}_{\mathcal{W}}}{1-p}\right)italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + ( 1 - italic_p ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ) =pf(pp)+(1p)f(1p1p)absent𝑝𝑓𝑝𝑝1𝑝𝑓1𝑝1𝑝\displaystyle=pf\left(\frac{p}{p}\right)+(1-p)f\left(\frac{1-p}{1-p}\right)= italic_p italic_f ( divide start_ARG italic_p end_ARG start_ARG italic_p end_ARG ) + ( 1 - italic_p ) italic_f ( divide start_ARG 1 - italic_p end_ARG start_ARG 1 - italic_p end_ARG )
=0.absent0\displaystyle=0.= 0 .

The two functions are equal on the boundary and the boundary is continuous.

Boundary 2: 𝒮𝒰=qsubscript𝒮𝒰𝑞\mathcal{S}_{\mathcal{U}}=qcaligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT = italic_q. The two functions that share this boundary are 00 and qf(𝒮𝒰q)+(1q)f(1𝒮𝒰1q)𝑞𝑓subscript𝒮𝒰𝑞1𝑞𝑓1subscript𝒮𝒰1𝑞qf\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)+(1-q)f\left(\frac{1-% \mathcal{S}_{\mathcal{U}}}{1-q}\right)italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ( 1 - italic_q ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ). Plugging the boundary into the latter function yields

qf(𝒮𝒰q)+(1q)f(1𝒮𝒰1q)𝑞𝑓subscript𝒮𝒰𝑞1𝑞𝑓1subscript𝒮𝒰1𝑞\displaystyle qf\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)+(1-q)f\left(% \frac{1-\mathcal{S}_{\mathcal{U}}}{1-q}\right)italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ( 1 - italic_q ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ) =qf(qq)+(1q)f(1q1q)absent𝑞𝑓𝑞𝑞1𝑞𝑓1𝑞1𝑞\displaystyle=qf\left(\frac{q}{q}\right)+(1-q)f\left(\frac{1-q}{1-q}\right)= italic_q italic_f ( divide start_ARG italic_q end_ARG start_ARG italic_q end_ARG ) + ( 1 - italic_q ) italic_f ( divide start_ARG 1 - italic_q end_ARG start_ARG 1 - italic_q end_ARG )
=0.absent0\displaystyle=0.= 0 .

The two functions are equal on the boundary and the boundary is continuous.

Boundary 3: 𝒮𝒰=(1𝒮𝒲)(q1p)subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝\mathcal{S}_{\mathcal{U}}=(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q}{1-p}\right)caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT = ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ). The two functions that share this boundary are pf(𝒮𝒲p)+(1p)f(1𝒮𝒲1p)𝑝𝑓subscript𝒮𝒲𝑝1𝑝𝑓1subscript𝒮𝒲1𝑝pf\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+(1-p)f\left(\frac{1-% \mathcal{S}_{\mathcal{W}}}{1-p}\right)italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + ( 1 - italic_p ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ) and pf(𝒮𝒲p)+qf(𝒮𝒰q)+(1pq)f(𝒮𝒩1pq)𝑝𝑓subscript𝒮𝒲𝑝𝑞𝑓subscript𝒮𝒰𝑞1𝑝𝑞𝑓subscript𝒮𝒩1𝑝𝑞pf\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+qf\left(\frac{\mathcal{S}_{% \mathcal{U}}}{q}\right)+(1-p-q)f\left(\frac{\mathcal{S}_{\mathcal{N}}}{1-p-q}\right)italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ( 1 - italic_p - italic_q ) italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ). Plugging the boundary into the latter function yields

pf(𝒮𝒲p)+qf(𝒮𝒰q)+(1pq)f(1𝒮𝒲𝒮𝒰1pq)=𝑝𝑓subscript𝒮𝒲𝑝𝑞𝑓subscript𝒮𝒰𝑞1𝑝𝑞𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞absent\displaystyle pf\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+qf\left(\frac% {\mathcal{S}_{\mathcal{U}}}{q}\right)+(1-p-q)f\left(\frac{1-\mathcal{S}_{% \mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-q}\right)=italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ( 1 - italic_p - italic_q ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) = pf(𝒮𝒲p)+(1p)f(1𝒮𝒲1p).𝑝𝑓subscript𝒮𝒲𝑝1𝑝𝑓1subscript𝒮𝒲1𝑝\displaystyle pf\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+(1-p)f\left(% \frac{1-\mathcal{S}_{\mathcal{W}}}{1-p}\right).italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + ( 1 - italic_p ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ) .

The two functions are equal on the boundary and the boundary is continuous.

Boundary 4: 𝒮𝒲=(1𝒮𝒰)(p1q)subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞\mathcal{S}_{\mathcal{W}}=(1-\mathcal{S}_{\mathcal{U}})\left(\frac{p}{1-q}\right)caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT = ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ). The two functions that share this boundary are qf(𝒮𝒰q)+(1q)f(1𝒮𝒰1q)𝑞𝑓subscript𝒮𝒰𝑞1𝑞𝑓1subscript𝒮𝒰1𝑞qf\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)+(1-q)f\left(\frac{1-% \mathcal{S}_{\mathcal{U}}}{1-q}\right)italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ( 1 - italic_q ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ) and pf(𝒮𝒲p)+qf(𝒮𝒰q)+(1pq)f(𝒮𝒩1pq)𝑝𝑓subscript𝒮𝒲𝑝𝑞𝑓subscript𝒮𝒰𝑞1𝑝𝑞𝑓subscript𝒮𝒩1𝑝𝑞pf\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+qf\left(\frac{\mathcal{S}_{% \mathcal{U}}}{q}\right)+(1-p-q)f\left(\frac{\mathcal{S}_{\mathcal{N}}}{1-p-q}\right)italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ( 1 - italic_p - italic_q ) italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ). Plugging the boundary into the latter function yields

qf(𝒮𝒰q)+pf(𝒮𝒲p)+(1pq)f(1𝒮𝒰𝒮𝒲1pq)=𝑞𝑓subscript𝒮𝒰𝑞𝑝𝑓subscript𝒮𝒲𝑝1𝑝𝑞𝑓1subscript𝒮𝒰subscript𝒮𝒲1𝑝𝑞absent\displaystyle qf\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)+pf\left(\frac% {\mathcal{S}_{\mathcal{W}}}{p}\right)+(1-p-q)f\left(\frac{1-\mathcal{S}_{% \mathcal{U}}-\mathcal{S}_{\mathcal{W}}}{1-p-q}\right)=italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + ( 1 - italic_p - italic_q ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) = qf(𝒮𝒰q)+(1q)f(1𝒮𝒰1q).𝑞𝑓subscript𝒮𝒰𝑞1𝑞𝑓1subscript𝒮𝒰1𝑞\displaystyle qf\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)+(1-q)f\left(% \frac{1-\mathcal{S}_{\mathcal{U}}}{1-q}\right).italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ( 1 - italic_q ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ) .

The two functions are equal on the boundary and the boundary is continuous. We have now shown continuity on all boundaries and the function is continuous. Now to show that the derivative of the function is continuous we need only show the all partial derivatives exist and agree on the boundaries. We use the closed form equation (3) found in the body of the paper (which is equivalent to the one found in the beginning of the proof) but suppresses 𝒮𝒩subscript𝒮𝒩\mathcal{S}_{\mathcal{N}}caligraphic_S start_POSTSUBSCRIPT caligraphic_N end_POSTSUBSCRIPT. This makes it easier to differentiate with respect to y~isubscript~𝑦𝑖\tilde{y}_{i}over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, i𝒲𝒰𝑖𝒲𝒰i\in\mathcal{W}\cup\mathcal{U}italic_i ∈ caligraphic_W ∪ caligraphic_U.

d𝒴(𝐲~,T)={0if 𝒮𝒲p and 𝒮𝒰qpf(𝒮𝒲p)+(1p)f(1𝒮𝒲1p)if 𝒮𝒲<p and 𝒮𝒰(1𝒮𝒲)(q1p)qf(𝒮𝒰q)+(1q)f(1𝒮𝒰1q)if 𝒮𝒰>q and 𝒮𝒲(1𝒮𝒰)(p1q)pf(𝒮𝒲p)+qf(𝒮𝒰q)+(1pq)f(1𝒮𝒲𝒮𝒰1pq)if 𝒮𝒰>(1𝒮𝒲)(q1p) and 𝒮𝒲<(1𝒮𝒰)(p1q)subscript𝑑𝒴~𝐲𝑇cases0if subscript𝒮𝒲𝑝 and subscript𝒮𝒰𝑞𝑝𝑓subscript𝒮𝒲𝑝1𝑝𝑓1subscript𝒮𝒲1𝑝if subscript𝒮𝒲𝑝 and subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝𝑞𝑓subscript𝒮𝒰𝑞1𝑞𝑓1subscript𝒮𝒰1𝑞if subscript𝒮𝒰𝑞 and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞𝑝𝑓subscript𝒮𝒲𝑝𝑞𝑓subscript𝒮𝒰𝑞1𝑝𝑞𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞if subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝otherwise and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)=\begin{cases}0&\text{if }\mathcal{S}_{% \mathcal{W}}\geq p\text{ and }\mathcal{S}_{\mathcal{U}}\leq q\\ pf\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+(1-p)f\left(\frac{1-% \mathcal{S}_{\mathcal{W}}}{1-p}\right)&\text{if }\mathcal{S}_{\mathcal{W}}<p% \text{ and }\mathcal{S}_{\mathcal{U}}\leq(1-\mathcal{S}_{\mathcal{W}})\left(% \frac{q}{1-p}\right)\\ qf\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)+(1-q)f\left(\frac{1-% \mathcal{S}_{\mathcal{U}}}{1-q}\right)&\text{if }\mathcal{S}_{\mathcal{U}}>q% \text{ and }\mathcal{S}_{\mathcal{W}}\geq(1-\mathcal{S}_{\mathcal{U}})\left(% \frac{p}{1-q}\right)\\ pf\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)+qf\left(\frac{\mathcal{S}_{% \mathcal{U}}}{q}\right)+(1-p-q)f\left(\frac{1-\mathcal{S}_{\mathcal{W}}-% \mathcal{S}_{\mathcal{U}}}{1-p-q}\right)&\text{if }\mathcal{S}_{\mathcal{U}}>(% 1-\mathcal{S}_{\mathcal{W}})\left(\frac{q}{1-p}\right)\\ &\text{ and }\mathcal{S}_{\mathcal{W}}<(1-\mathcal{S}_{\mathcal{U}})\left(% \frac{p}{1-q}\right)\end{cases}italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) = { start_ROW start_CELL 0 end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ≥ italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ≤ italic_q end_CELL end_ROW start_ROW start_CELL italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + ( 1 - italic_p ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ≤ ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ( 1 - italic_q ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > italic_q and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ≥ ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW start_ROW start_CELL italic_p italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) + italic_q italic_f ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) + ( 1 - italic_p - italic_q ) italic_f ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW

We now take the derivative with respect to a desirable class (i𝒲𝑖𝒲i\in\mathcal{W}italic_i ∈ caligraphic_W).

y~i𝒲d𝒴(𝐲~,T)={0if 𝒮𝒲>p and 𝒮𝒰<qf(𝒮𝒲p)f(1𝒮𝒲1p)if 𝒮𝒲<p and 𝒮𝒰<(1𝒮𝒲)(q1p)0if 𝒮𝒰>q and 𝒮𝒲>(1𝒮𝒰)(p1q)f(𝒮𝒲p)f(1𝒮𝒲𝒮𝒰1pq)if 𝒮𝒰>(1𝒮𝒲)(q1p) and 𝒮𝒲<(1𝒮𝒰)(p1q)subscript~𝑦𝑖𝒲subscript𝑑𝒴~𝐲𝑇cases0if subscript𝒮𝒲𝑝 and subscript𝒮𝒰𝑞superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲1𝑝if subscript𝒮𝒲𝑝 and subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝0if subscript𝒮𝒰𝑞 and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞if subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝otherwise and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞\frac{\partial}{\partial\tilde{y}_{i\in\mathcal{W}}}d_{\mathcal{Y}}(\tilde{% \mathbf{y}},T)=\begin{cases}0&\text{if }\mathcal{S}_{\mathcal{W}}>p\text{ and % }\mathcal{S}_{\mathcal{U}}<q\\ f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}}{1-p}\right)&\text{if }\mathcal{S}_{\mathcal% {W}}<p\text{ and }\mathcal{S}_{\mathcal{U}}<(1-\mathcal{S}_{\mathcal{W}})\left% (\frac{q}{1-p}\right)\\ 0&\text{if }\mathcal{S}_{\mathcal{U}}>q\text{ and }\mathcal{S}_{\mathcal{W}}>(% 1-\mathcal{S}_{\mathcal{U}})\left(\frac{p}{1-q}\right)\\ f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-q}\right)&% \text{if }\mathcal{S}_{\mathcal{U}}>(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q% }{1-p}\right)\\ &\text{ and }\mathcal{S}_{\mathcal{W}}<(1-\mathcal{S}_{\mathcal{U}})\left(% \frac{p}{1-q}\right)\end{cases}divide start_ARG ∂ end_ARG start_ARG ∂ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i ∈ caligraphic_W end_POSTSUBSCRIPT end_ARG italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) = { start_ROW start_CELL 0 end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT > italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT < italic_q end_CELL end_ROW start_ROW start_CELL italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT < ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > italic_q and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT > ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW start_ROW start_CELL italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW

Now we need only ensure all pieces agree on the boundaries to show that the derivative exists and is continuous.

Boundary 1: 𝒮𝒲=psubscript𝒮𝒲𝑝\mathcal{S}_{\mathcal{W}}=pcaligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT = italic_p. The two functions that share this boundary are 00 and f(𝒮𝒲p)f(1𝒮𝒲1p)superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲1𝑝f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}}{1-p}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ). Plugging the boundary into the latter function yields

f(𝒮𝒲p)f(1𝒮𝒲1p)superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲1𝑝\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{% \prime}\left(\frac{1-\mathcal{S}_{\mathcal{W}}}{1-p}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ) =f(pp)f(1p1p)absentsuperscript𝑓𝑝𝑝superscript𝑓1𝑝1𝑝\displaystyle=f^{\prime}\left(\frac{p}{p}\right)-f^{\prime}\left(\frac{1-p}{1-% p}\right)= italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG italic_p end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - italic_p end_ARG start_ARG 1 - italic_p end_ARG )
=f(1)+f(1)absentsuperscript𝑓1𝑓superscript1\displaystyle=f^{\prime}(1)+f(1^{\prime})= italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( 1 ) + italic_f ( 1 start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT )
=0.absent0\displaystyle=0.= 0 .

Then setting the derivative at the boundary to 00 makes the derivative on this boundary continuous.

Boundary 2: 𝒮𝒰=qsubscript𝒮𝒰𝑞\mathcal{S}_{\mathcal{U}}=qcaligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT = italic_q. The two functions that share this boundary are both 00, and setting the derivative at the boundary to 00 makes the derivative on this boundary continuous.

Boundary 3: 𝒮𝒰=(1𝒮𝒲)(q1p)subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝\mathcal{S}_{\mathcal{U}}=(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q}{1-p}\right)caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT = ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ). The two functions that share this boundary are f(𝒮𝒲p)f(1𝒮𝒲1p)superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲1𝑝f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}}{1-p}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ) and f(𝒮𝒲p)f(1𝒮𝒲𝒮𝒰1pq)superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ). Plugging the boundary into the latter function yields

f(𝒮𝒲p)f(1𝒮𝒲𝒮𝒰1pq)=superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞absent\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{% \prime}\left(\frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-% q}\right)=italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) = f(𝒮𝒲p)f(1𝒮𝒲1p)superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲1𝑝\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{% \prime}\left(\frac{1-\mathcal{S}_{\mathcal{W}}}{1-p}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG )

Then setting the derivative at the boundary to f(𝒮𝒲p)f(1𝒮𝒲1p)superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲1𝑝f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}}{1-p}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ) makes the derivative on this boundary continuous.

Boundary 4: 𝒮𝒲=(1𝒮𝒰)(p1q)subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞\mathcal{S}_{\mathcal{W}}=(1-\mathcal{S}_{\mathcal{U}})\left(\frac{p}{1-q}\right)caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT = ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ). The two functions that share this boundary are 00 and f(𝒮𝒲p)f(1𝒮𝒲𝒮𝒰1pq)superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ). We rewrite the boundary as 𝒮𝒰=1qp𝒮𝒲+1subscript𝒮𝒰1𝑞𝑝subscript𝒮𝒲1\mathcal{S}_{\mathcal{U}}=\frac{1-q}{p}\mathcal{S}_{\mathcal{W}}+1caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT = divide start_ARG 1 - italic_q end_ARG start_ARG italic_p end_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT + 1 and plug it into the latter function.

f(𝒮𝒲p)f(1𝒮𝒲𝒮𝒰1pq)=superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞absent\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{% \prime}\left(\frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-% q}\right)=italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) = f(𝒮𝒲p)f(1𝒮𝒲(1qp𝒮𝒲+1)1pq)superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲1𝑞𝑝subscript𝒮𝒲11𝑝𝑞\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{% \prime}\left(\frac{1-\mathcal{S}_{\mathcal{W}}-\left(\frac{1-q}{p}\mathcal{S}_% {\mathcal{W}}+1\right)}{1-p-q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - ( divide start_ARG 1 - italic_q end_ARG start_ARG italic_p end_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT + 1 ) end_ARG start_ARG 1 - italic_p - italic_q end_ARG )
=\displaystyle== 00\displaystyle 0

Then setting the derivative at the boundary to 00 makes the derivative on this boundary continuous.

This yields the continuous partial derivative

y~i𝒲d𝒴(𝐲~,T)={0if 𝒮𝒲p and 𝒮𝒰qf(𝒮𝒲p)f(1𝒮𝒲1p)if 𝒮𝒲<p and 𝒮𝒰(1𝒮𝒲)(q1p)0if 𝒮𝒰>q and 𝒮𝒲(1𝒮𝒰)(p1q)f(𝒮𝒲p)f(1𝒮𝒲𝒮𝒰1pq)if 𝒮𝒰>(1𝒮𝒲)(q1p) and 𝒮𝒲<(1𝒮𝒰)(p1q).subscript~𝑦𝑖𝒲subscript𝑑𝒴~𝐲𝑇cases0if subscript𝒮𝒲𝑝 and subscript𝒮𝒰𝑞superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲1𝑝if subscript𝒮𝒲𝑝 and subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝0if subscript𝒮𝒰𝑞 and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞if subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝otherwise and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞\frac{\partial}{\partial\tilde{y}_{i\in\mathcal{W}}}d_{\mathcal{Y}}(\tilde{% \mathbf{y}},T)=\begin{cases}0&\text{if }\mathcal{S}_{\mathcal{W}}\geq p\text{ % and }\mathcal{S}_{\mathcal{U}}\leq q\\ f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}}{1-p}\right)&\text{if }\mathcal{S}_{\mathcal% {W}}<p\text{ and }\mathcal{S}_{\mathcal{U}}\leq(1-\mathcal{S}_{\mathcal{W}})% \left(\frac{q}{1-p}\right)\\ 0&\text{if }\mathcal{S}_{\mathcal{U}}>q\text{ and }\mathcal{S}_{\mathcal{W}}% \geq(1-\mathcal{S}_{\mathcal{U}})\left(\frac{p}{1-q}\right)\\ f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-q}\right)&% \text{if }\mathcal{S}_{\mathcal{U}}>(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q% }{1-p}\right)\\ &\text{ and }\mathcal{S}_{\mathcal{W}}<(1-\mathcal{S}_{\mathcal{U}})\left(% \frac{p}{1-q}\right)\end{cases}.divide start_ARG ∂ end_ARG start_ARG ∂ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i ∈ caligraphic_W end_POSTSUBSCRIPT end_ARG italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) = { start_ROW start_CELL 0 end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ≥ italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ≤ italic_q end_CELL end_ROW start_ROW start_CELL italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ≤ ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > italic_q and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ≥ ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW start_ROW start_CELL italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW . (21)

We now take the derivative with respect to a undesirable class (i𝒰𝑖𝒰i\in\mathcal{U}italic_i ∈ caligraphic_U).

y~i𝒰d𝒴(𝐲~,T)={0if 𝒮𝒲>p and 𝒮𝒰<q0if 𝒮𝒲<p and 𝒮𝒰<(1𝒮𝒲)(q1p)f(𝒮𝒰q)f(1𝒮𝒰1q)if 𝒮𝒰>q and 𝒮𝒲>(1𝒮𝒰)(p1q)f(𝒮𝒰q)f(1𝒮𝒲𝒮𝒰1pq)if 𝒮𝒰>(1𝒮𝒲)(q1p) and 𝒮𝒲<(1𝒮𝒰)(p1q)subscript~𝑦𝑖𝒰subscript𝑑𝒴~𝐲𝑇cases0if subscript𝒮𝒲𝑝 and subscript𝒮𝒰𝑞0if subscript𝒮𝒲𝑝 and subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒰1𝑞if subscript𝒮𝒰𝑞 and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞if subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝otherwise and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞\frac{\partial}{\partial\tilde{y}_{i\in\mathcal{U}}}d_{\mathcal{Y}}(\tilde{% \mathbf{y}},T)=\begin{cases}0&\text{if }\mathcal{S}_{\mathcal{W}}>p\text{ and % }\mathcal{S}_{\mathcal{U}}<q\\ 0&\text{if }\mathcal{S}_{\mathcal{W}}<p\text{ and }\mathcal{S}_{\mathcal{U}}<(% 1-\mathcal{S}_{\mathcal{W}})\left(\frac{q}{1-p}\right)\\ f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{U}}}{1-q}\right)&\text{if }\mathcal{S}_{\mathcal% {U}}>q\text{ and }\mathcal{S}_{\mathcal{W}}>(1-\mathcal{S}_{\mathcal{U}})\left% (\frac{p}{1-q}\right)\\ f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-q}\right)&% \text{if }\mathcal{S}_{\mathcal{U}}>(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q% }{1-p}\right)\\ &\text{ and }\mathcal{S}_{\mathcal{W}}<(1-\mathcal{S}_{\mathcal{U}})\left(% \frac{p}{1-q}\right)\end{cases}divide start_ARG ∂ end_ARG start_ARG ∂ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i ∈ caligraphic_U end_POSTSUBSCRIPT end_ARG italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) = { start_ROW start_CELL 0 end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT > italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT < italic_q end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT < ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > italic_q and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT > ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW start_ROW start_CELL italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW

Now we need only ensure that there is agreement on the boundaries.

Boundary 1: 𝒮𝒲=psubscript𝒮𝒲𝑝\mathcal{S}_{\mathcal{W}}=pcaligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT = italic_p. The two functions that share this boundary are both 00, and setting the derivative at the boundary to 00 makes the derivative on this boundary continuous.

Boundary 2: 𝒮𝒰=qsubscript𝒮𝒰𝑞\mathcal{S}_{\mathcal{U}}=qcaligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT = italic_q. The two functions that share this boundary are both 00 and f(𝒮𝒰q)f(1𝒮𝒰1q)superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒰1𝑞f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{U}}}{1-q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ). Plugging the boundary into the latter function yields

f(𝒮𝒰q)f(1𝒮𝒰1q)superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒰1𝑞\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{% \prime}\left(\frac{1-\mathcal{S}_{\mathcal{U}}}{1-q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ) =f(qq)f(1q1q)absentsuperscript𝑓𝑞𝑞superscript𝑓1𝑞1𝑞\displaystyle=f^{\prime}\left(\frac{q}{q}\right)-f^{\prime}\left(\frac{1-q}{1-% q}\right)= italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG italic_q end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - italic_q end_ARG start_ARG 1 - italic_q end_ARG )
=0.absent0\displaystyle=0.= 0 .

Then setting the derivative at the boundary to 00 makes the derivative on this boundary continuous.

Boundary 3: 𝒮𝒰=(1𝒮𝒲)(q1p)subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝\mathcal{S}_{\mathcal{U}}=(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q}{1-p}\right)caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT = ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ). The two functions that share this boundary are 00 and f(𝒮𝒰q)f(1𝒮𝒲𝒮𝒰1pq)superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ). We rewrite the boundary as 𝒮𝒲=11pq𝒮𝒰subscript𝒮𝒲11𝑝𝑞subscript𝒮𝒰\mathcal{S}_{\mathcal{W}}=1-\frac{1-p}{q}\mathcal{S}_{\mathcal{U}}caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT = 1 - divide start_ARG 1 - italic_p end_ARG start_ARG italic_q end_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT and plug it into the latter function.

f(𝒮𝒰q)f(1𝒮𝒲𝒮𝒰1pq)=superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞absent\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{% \prime}\left(\frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-% q}\right)=italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) = f(𝒮𝒰q)f(1𝒮𝒰(11pq𝒮𝒰)1pq)superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒰11𝑝𝑞subscript𝒮𝒰1𝑝𝑞\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{% \prime}\left(\frac{1-\mathcal{S}_{\mathcal{U}}-\left(1-\frac{1-p}{q}\mathcal{S% }_{\mathcal{U}}\right)}{1-p-q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT - ( 1 - divide start_ARG 1 - italic_p end_ARG start_ARG italic_q end_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) end_ARG start_ARG 1 - italic_p - italic_q end_ARG )
=\displaystyle== 00\displaystyle 0

Then setting the derivative at the boundary to 00 makes the derivative on this boundary continuous.

Boundary 4: 𝒮𝒲=(1𝒮𝒰)(p1q)subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞\mathcal{S}_{\mathcal{W}}=(1-\mathcal{S}_{\mathcal{U}})\left(\frac{p}{1-q}\right)caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT = ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ). The two functions that share this boundary are f(𝒮𝒰q)f(1𝒮𝒰1q)superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒰1𝑞f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{U}}}{1-q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ) and f(𝒮𝒰q)f(1𝒮𝒲𝒮𝒰1pq)superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ). Plugging the boundary into the latter function yields

f(𝒮𝒰q)f(1𝒮𝒲𝒮𝒰1pq)=superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞absent\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{% \prime}\left(\frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-% q}\right)=italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) = f(𝒮𝒰q)f(1𝒮𝒰(1𝒮𝒰)(p1q)1pq)superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒰1subscript𝒮𝒰𝑝1𝑞1𝑝𝑞\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{% \prime}\left(\frac{1-\mathcal{S}_{\mathcal{U}}-(1-\mathcal{S}_{\mathcal{U}})% \left(\frac{p}{1-q}\right)}{1-p-q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT - ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_ARG start_ARG 1 - italic_p - italic_q end_ARG )
=\displaystyle== f(𝒮𝒰q)f(1𝒮𝒰1q)superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒰1𝑞\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{% \prime}\left(\frac{1-\mathcal{S}_{\mathcal{U}}}{1-q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG )

Then setting the derivative at the boundary to f(𝒮𝒰q)f(1𝒮𝒰1q)superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒰1𝑞f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{U}}}{1-q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ) makes the derivative on this boundary continuous.

This yields the continuous partial derivative

y~i𝒰d𝒴(𝐲~,T)={0if 𝒮𝒲p and 𝒮𝒰q0if 𝒮𝒲<p and 𝒮𝒰(1𝒮𝒲)(q1p)f(𝒮𝒰q)f(1𝒮𝒰1q)if 𝒮𝒰>q and 𝒮𝒲(1𝒮𝒰)(p1q)f(𝒮𝒰q)f(1𝒮𝒲𝒮𝒰1pq)if 𝒮𝒰>(1𝒮𝒲)(q1p) and 𝒮𝒲<(1𝒮𝒰)(p1q).subscript~𝑦𝑖𝒰subscript𝑑𝒴~𝐲𝑇cases0if subscript𝒮𝒲𝑝 and subscript𝒮𝒰𝑞0if subscript𝒮𝒲𝑝 and subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒰1𝑞if subscript𝒮𝒰𝑞 and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞if subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝otherwise and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞\frac{\partial}{\partial\tilde{y}_{i\in\mathcal{U}}}d_{\mathcal{Y}}(\tilde{% \mathbf{y}},T)=\begin{cases}0&\text{if }\mathcal{S}_{\mathcal{W}}\geq p\text{ % and }\mathcal{S}_{\mathcal{U}}\leq q\\ 0&\text{if }\mathcal{S}_{\mathcal{W}}<p\text{ and }\mathcal{S}_{\mathcal{U}}% \leq(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q}{1-p}\right)\\ f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{U}}}{1-q}\right)&\text{if }\mathcal{S}_{\mathcal% {U}}>q\text{ and }\mathcal{S}_{\mathcal{W}}\geq(1-\mathcal{S}_{\mathcal{U}})% \left(\frac{p}{1-q}\right)\\ f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-q}\right)&% \text{if }\mathcal{S}_{\mathcal{U}}>(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q% }{1-p}\right)\\ &\text{ and }\mathcal{S}_{\mathcal{W}}<(1-\mathcal{S}_{\mathcal{U}})\left(% \frac{p}{1-q}\right)\end{cases}.divide start_ARG ∂ end_ARG start_ARG ∂ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i ∈ caligraphic_U end_POSTSUBSCRIPT end_ARG italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) = { start_ROW start_CELL 0 end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ≥ italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ≤ italic_q end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ≤ ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > italic_q and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ≥ ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW start_ROW start_CELL italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW . (22)

Additional Analysis on d𝒴subscript𝑑𝒴d_{\mathcal{Y}}italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT The following lemma shows that d𝒴subscript𝑑𝒴d_{\mathcal{Y}}italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT exhibits desirable behavior for any f𝑓fitalic_f-divergence if we restrict ourselves to the binary classification setting.

Lemma A.1.

In the binary classification setting, if T={𝐳𝒴|z1p},𝑇conditional-set𝐳𝒴subscript𝑧1𝑝T=\{{\mathbf{z}}\in\mathcal{Y}|z_{1}\geq p\},italic_T = { bold_z ∈ caligraphic_Y | italic_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≥ italic_p } , then d𝒴(𝐲~,T)subscript𝑑𝒴~𝐲𝑇d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) is decreasing (not necessarily strictly) in 𝐲~1subscript~𝐲1\tilde{\mathbf{y}}_{1}over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT for D(𝐲~||𝐳)D(\tilde{\mathbf{y}}||\mathbf{z})italic_D ( over~ start_ARG bold_y end_ARG | | bold_z ) any f𝑓fitalic_f-divergence.

We now present the proof of Lemma A.1. Recall d𝒴(𝐲~,T)=min𝐳TDf(𝐲~||𝐳)d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)=\min_{\mathbf{z}\in T}D_{f}(\tilde{% \mathbf{y}}||\mathbf{z})italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) = roman_min start_POSTSUBSCRIPT bold_z ∈ italic_T end_POSTSUBSCRIPT italic_D start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG | | bold_z ). For binary probability distributions 𝐚𝐚\mathbf{a}bold_a and 𝐛𝐛\mathbf{b}bold_b, the f𝑓fitalic_f-divergence has the simple form

Df(𝐛||𝐚)=𝐚1f(𝐛1𝐚1)+(1𝐚1)f(1𝐛11𝐚1)D_{f}(\mathbf{b}||\mathbf{a})=\mathbf{a}_{1}f\left(\frac{\mathbf{b}_{1}}{% \mathbf{a}_{1}}\right)+(1-\mathbf{a}_{1})f\left(\frac{1-\mathbf{b}_{1}}{1-% \mathbf{a}_{1}}\right)italic_D start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( bold_b | | bold_a ) = bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_f ( divide start_ARG bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ) + ( 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_f ( divide start_ARG 1 - bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ) (23)

for a convex function f𝑓fitalic_f with f(1)=0𝑓10f(1)=0italic_f ( 1 ) = 0. We show a relationship between this formula and a secant line. To refer to the secant line of a function g(x)𝑔𝑥g(x)italic_g ( italic_x ) from point x=α𝑥𝛼x=\alphaitalic_x = italic_α to x=β𝑥𝛽x=\betaitalic_x = italic_β evaluated at γ𝛾\gammaitalic_γ, we will use the notation Sg(α,β;γ)subscript𝑆𝑔𝛼𝛽𝛾S_{g}(\alpha,\beta;\gamma)italic_S start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ( italic_α , italic_β ; italic_γ ). When using this notation we will assume that αβ𝛼𝛽\alpha\leq\betaitalic_α ≤ italic_β.

We assume 𝐚1>𝐛1subscript𝐚1subscript𝐛1\mathbf{a}_{1}>\mathbf{b}_{1}bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT > bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT and show that Df(𝐛||𝐚)D_{f}(\mathbf{b}||\mathbf{a})italic_D start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( bold_b | | bold_a ) is equivalent to the secant line of f(x)𝑓𝑥f(x)italic_f ( italic_x ) from x=𝐛1𝐚1𝑥subscript𝐛1subscript𝐚1x=\frac{\mathbf{b}_{1}}{\mathbf{a}_{1}}italic_x = divide start_ARG bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG to 1𝐛11𝐚11subscript𝐛11subscript𝐚1\frac{1-\mathbf{b}_{1}}{1-\mathbf{a}_{1}}divide start_ARG 1 - bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG evaluated at 1111. (Note 𝐛1𝐚1<1<1𝐛11𝐚1subscript𝐛1subscript𝐚111subscript𝐛11subscript𝐚1\frac{\mathbf{b}_{1}}{\mathbf{a}_{1}}<1<\frac{1-\mathbf{b}_{1}}{1-\mathbf{a}_{% 1}}divide start_ARG bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG < 1 < divide start_ARG 1 - bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG.) We show this simply using the point slope form.

Sf(𝐛1𝐚1,1𝐛11𝐚1;x)subscript𝑆𝑓subscript𝐛1subscript𝐚11subscript𝐛11subscript𝐚1𝑥\displaystyle S_{f}\left(\frac{\mathbf{b}_{1}}{\mathbf{a}_{1}},\frac{1-\mathbf% {b}_{1}}{1-\mathbf{a}_{1}};x\right)italic_S start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( divide start_ARG bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG , divide start_ARG 1 - bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ; italic_x ) =(x1𝐛11𝐚1)f(1𝐛11𝐚1)f(𝐛1𝐚1)1𝐛11𝐚1𝐛1𝐚1+f(1𝐛11𝐚1)absent𝑥1subscript𝐛11subscript𝐚1𝑓1subscript𝐛11subscript𝐚1𝑓subscript𝐛1subscript𝐚11subscript𝐛11subscript𝐚1subscript𝐛1subscript𝐚1𝑓1subscript𝐛11subscript𝐚1\displaystyle=\left(x-\frac{1-\mathbf{b}_{1}}{1-\mathbf{a}_{1}}\right)\frac{f% \left(\frac{1-\mathbf{b}_{1}}{1-\mathbf{a}_{1}}\right)-f\left(\frac{\mathbf{b}% _{1}}{\mathbf{a}_{1}}\right)}{\frac{1-\mathbf{b}_{1}}{1-\mathbf{a}_{1}}-\frac{% \mathbf{b}_{1}}{\mathbf{a}_{1}}}+f\left(\frac{1-\mathbf{b}_{1}}{1-\mathbf{a}_{% 1}}\right)= ( italic_x - divide start_ARG 1 - bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ) divide start_ARG italic_f ( divide start_ARG 1 - bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ) - italic_f ( divide start_ARG bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ) end_ARG start_ARG divide start_ARG 1 - bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG - divide start_ARG bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG end_ARG + italic_f ( divide start_ARG 1 - bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG )
Sf(𝐛1𝐚1,1𝐛11𝐚1;1)subscript𝑆𝑓subscript𝐛1subscript𝐚11subscript𝐛11subscript𝐚11\displaystyle S_{f}\left(\frac{\mathbf{b}_{1}}{\mathbf{a}_{1}},\frac{1-\mathbf% {b}_{1}}{1-\mathbf{a}_{1}};1\right)italic_S start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( divide start_ARG bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG , divide start_ARG 1 - bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ; 1 ) =(11𝐛11𝐚1)f(1𝐛11𝐚1)f(𝐛1𝐚1)1𝐛11𝐚1𝐛1𝐚1+f(1𝐛11𝐚1)absent11subscript𝐛11subscript𝐚1𝑓1subscript𝐛11subscript𝐚1𝑓subscript𝐛1subscript𝐚11subscript𝐛11subscript𝐚1subscript𝐛1subscript𝐚1𝑓1subscript𝐛11subscript𝐚1\displaystyle=\left(1-\frac{1-\mathbf{b}_{1}}{1-\mathbf{a}_{1}}\right)\frac{f% \left(\frac{1-\mathbf{b}_{1}}{1-\mathbf{a}_{1}}\right)-f\left(\frac{\mathbf{b}% _{1}}{\mathbf{a}_{1}}\right)}{\frac{1-\mathbf{b}_{1}}{1-\mathbf{a}_{1}}-\frac{% \mathbf{b}_{1}}{\mathbf{a}_{1}}}+f\left(\frac{1-\mathbf{b}_{1}}{1-\mathbf{a}_{% 1}}\right)= ( 1 - divide start_ARG 1 - bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ) divide start_ARG italic_f ( divide start_ARG 1 - bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ) - italic_f ( divide start_ARG bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ) end_ARG start_ARG divide start_ARG 1 - bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG - divide start_ARG bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG end_ARG + italic_f ( divide start_ARG 1 - bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG )
=𝐚1f(𝐛1𝐚1)+(1𝐚1)f(1𝐛11𝐚1)absentsubscript𝐚1𝑓subscript𝐛1subscript𝐚11subscript𝐚1𝑓1subscript𝐛11subscript𝐚1\displaystyle=\mathbf{a}_{1}f\left(\frac{\mathbf{b}_{1}}{\mathbf{a}_{1}}\right% )+(1-\mathbf{a}_{1})f\left(\frac{1-\mathbf{b}_{1}}{1-\mathbf{a}_{1}}\right)= bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT italic_f ( divide start_ARG bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ) + ( 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_f ( divide start_ARG 1 - bold_b start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG )
=Df(𝐛||𝐚)\displaystyle=D_{f}(\mathbf{b}||\mathbf{a})= italic_D start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( bold_b | | bold_a )

Now that Df(𝐛||𝐚)D_{f}(\mathbf{b}||\mathbf{a})italic_D start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( bold_b | | bold_a ) is related to a secant line we prove a few facts about secant lines of convex functions. If g𝑔gitalic_g is convex, then Sg(α,β;γ)subscript𝑆𝑔𝛼𝛽𝛾S_{g}(\alpha,\beta;\gamma)italic_S start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ( italic_α , italic_β ; italic_γ ) is decreasing in α𝛼\alphaitalic_α and increasing in β𝛽\betaitalic_β whenever α<γ<β𝛼𝛾𝛽\alpha<\gamma<\betaitalic_α < italic_γ < italic_β. Recall that if g𝑔gitalic_g is convex, then by definition for any v1<v2<v3subscript𝑣1subscript𝑣2subscript𝑣3v_{1}<v_{2}<v_{3}italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT < italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, we have

g(v2)g(v1)v2v1g(v3)g(v1)v3v1g(v3)g(v2)v3v2.𝑔subscript𝑣2𝑔subscript𝑣1subscript𝑣2subscript𝑣1𝑔subscript𝑣3𝑔subscript𝑣1subscript𝑣3subscript𝑣1𝑔subscript𝑣3𝑔subscript𝑣2subscript𝑣3subscript𝑣2\frac{g(v_{2})-g(v_{1})}{v_{2}-v_{1}}\leq\frac{g(v_{3})-g(v_{1})}{v_{3}-v_{1}}% \leq\frac{g(v_{3})-g(v_{2})}{v_{3}-v_{2}}.divide start_ARG italic_g ( italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) - italic_g ( italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ≤ divide start_ARG italic_g ( italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) - italic_g ( italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT - italic_v start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ≤ divide start_ARG italic_g ( italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) - italic_g ( italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT ) end_ARG start_ARG italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT - italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT end_ARG . (24)

Then for any β<β~𝛽~𝛽\beta<\tilde{\beta}italic_β < over~ start_ARG italic_β end_ARG we have

Sg(α,β;γ)subscript𝑆𝑔𝛼𝛽𝛾\displaystyle S_{g}(\alpha,\beta;\gamma)italic_S start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ( italic_α , italic_β ; italic_γ ) =(γα)m+g(α)absent𝛾𝛼𝑚𝑔𝛼\displaystyle=(\gamma-\alpha)m+g(\alpha)= ( italic_γ - italic_α ) italic_m + italic_g ( italic_α ) (25)
Sg(α,β~;γ)subscript𝑆𝑔𝛼~𝛽𝛾\displaystyle S_{g}(\alpha,\tilde{\beta};\gamma)italic_S start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ( italic_α , over~ start_ARG italic_β end_ARG ; italic_γ ) =(γα)m~+g(α)absent𝛾𝛼~𝑚𝑔𝛼\displaystyle=(\gamma-\alpha)\tilde{m}+g(\alpha)= ( italic_γ - italic_α ) over~ start_ARG italic_m end_ARG + italic_g ( italic_α ) (26)

for m~m~𝑚𝑚\tilde{m}\geq mover~ start_ARG italic_m end_ARG ≥ italic_m. It follows that for any γα𝛾𝛼\gamma\geq\alphaitalic_γ ≥ italic_α

Sg(α,β;x)Sg(α,β~;x),subscript𝑆𝑔𝛼𝛽𝑥subscript𝑆𝑔𝛼~𝛽𝑥S_{g}(\alpha,\beta;x)\leq S_{g}(\alpha,\tilde{\beta};x),italic_S start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ( italic_α , italic_β ; italic_x ) ≤ italic_S start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ( italic_α , over~ start_ARG italic_β end_ARG ; italic_x ) , (27)

and Sg(α,β;x)subscript𝑆𝑔𝛼𝛽𝑥S_{g}(\alpha,\beta;x)italic_S start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ( italic_α , italic_β ; italic_x ) is increasing in β𝛽\betaitalic_β.

A similar argument shows that Sg(α,β;x)subscript𝑆𝑔𝛼𝛽𝑥S_{g}(\alpha,\beta;x)italic_S start_POSTSUBSCRIPT italic_g end_POSTSUBSCRIPT ( italic_α , italic_β ; italic_x ) is decreasing in α𝛼\alphaitalic_α when γβ𝛾𝛽\gamma\leq\betaitalic_γ ≤ italic_β.

We will use these facts to analyze d𝒴(𝐲~,T)=min𝐳TDf(𝐲~||𝐳)d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)=\min_{\mathbf{z}\in T}D_{f}(\tilde{% \mathbf{y}}||\mathbf{z})italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) = roman_min start_POSTSUBSCRIPT bold_z ∈ italic_T end_POSTSUBSCRIPT italic_D start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG | | bold_z ). The f𝑓fitalic_f-divergence between identical distributions is zero, so we have d𝒴(𝐲~,T)=0subscript𝑑𝒴~𝐲𝑇0d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)=0italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) = 0 whenever 𝐲~1psubscript~𝐲1𝑝\tilde{\mathbf{y}}_{1}\geq pover~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≥ italic_p. When 𝐲~1<psubscript~𝐲1𝑝\tilde{\mathbf{y}}_{1}<pover~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT < italic_p we have 𝐲~1𝐳1<1<1𝐲~11𝐳1subscript~𝐲1subscript𝐳111subscript~𝐲11subscript𝐳1\frac{\tilde{\mathbf{y}}_{1}}{\mathbf{z}_{1}}<1<\frac{1-\tilde{\mathbf{y}}_{1}% }{1-\mathbf{z}_{1}}divide start_ARG over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG bold_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG < 1 < divide start_ARG 1 - over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG and

d𝒴(𝐲~,T)subscript𝑑𝒴~𝐲𝑇\displaystyle d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) =min𝐳TDf(𝐲~||𝐳)\displaystyle=\min_{\mathbf{z}\in T}D_{f}(\tilde{\mathbf{y}}||\mathbf{z})= roman_min start_POSTSUBSCRIPT bold_z ∈ italic_T end_POSTSUBSCRIPT italic_D start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG | | bold_z )
=min𝐳TSf(𝐲~1𝐳1,1𝐲~11𝐳1;1),absentsubscript𝐳𝑇subscript𝑆𝑓subscript~𝐲1subscript𝐳11subscript~𝐲11subscript𝐳11\displaystyle=\min_{\mathbf{z}\in T}S_{f}\left(\frac{\tilde{\mathbf{y}}_{1}}{% \mathbf{z}_{1}},\frac{1-\tilde{\mathbf{y}}_{1}}{1-\mathbf{z}_{1}};1\right),= roman_min start_POSTSUBSCRIPT bold_z ∈ italic_T end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( divide start_ARG over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG bold_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG , divide start_ARG 1 - over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG ; 1 ) ,

which is decreasing in 𝐲~1𝐳1subscript~𝐲1subscript𝐳1\frac{\tilde{\mathbf{y}}_{1}}{\mathbf{z}_{1}}divide start_ARG over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG bold_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG and increasing in 1𝐲~11𝐳11subscript~𝐲11subscript𝐳1\frac{1-\tilde{\mathbf{y}}_{1}}{1-\mathbf{z}_{1}}divide start_ARG 1 - over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - bold_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG, so to achieve the minimum we use the smallest possible 𝐳1subscript𝐳1\mathbf{z}_{1}bold_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT, i.e. 𝐳1=psubscript𝐳1𝑝\mathbf{z}_{1}=pbold_z start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = italic_p. We may now simplify

d𝒴(𝐲~,T)={Sf(𝐲~1p,1𝐲~11p;1)if 𝐲~<p0if 𝐲~p.subscript𝑑𝒴~𝐲𝑇casessubscript𝑆𝑓subscript~𝐲1𝑝1subscript~𝐲11𝑝1if ~𝐲𝑝0if ~𝐲𝑝d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)=\begin{cases}S_{f}\left(\frac{\tilde{% \mathbf{y}}_{1}}{p},\frac{1-\tilde{\mathbf{y}}_{1}}{1-p};1\right)&\text{if }% \tilde{\mathbf{y}}<p\\ 0&\text{if }\tilde{\mathbf{y}}\geq p\end{cases}.italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) = { start_ROW start_CELL italic_S start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( divide start_ARG over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG , divide start_ARG 1 - over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ; 1 ) end_CELL start_CELL if over~ start_ARG bold_y end_ARG < italic_p end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL if over~ start_ARG bold_y end_ARG ≥ italic_p end_CELL end_ROW .

Note that this is continuous at 𝐲~=p~𝐲𝑝\tilde{\mathbf{y}}=pover~ start_ARG bold_y end_ARG = italic_p because Sf(1,1;1)=f(1)=0subscript𝑆𝑓111𝑓10S_{f}(1,1;1)=f(1)=0italic_S start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( 1 , 1 ; 1 ) = italic_f ( 1 ) = 0. With this closed form solution for d𝒴(𝐲~,T)subscript𝑑𝒴~𝐲𝑇d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) we may finish the proof.

We have already shown that Sf(𝐲~1p,1𝐲~11p;1)subscript𝑆𝑓subscript~𝐲1𝑝1subscript~𝐲11𝑝1S_{f}\left(\frac{\tilde{\mathbf{y}}_{1}}{p},\frac{1-\tilde{\mathbf{y}}_{1}}{1-% p};1\right)italic_S start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( divide start_ARG over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG , divide start_ARG 1 - over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ; 1 ) is decreasing in 𝐲~1psubscript~𝐲1𝑝\frac{\tilde{\mathbf{y}}_{1}}{p}divide start_ARG over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG and increasing in 1𝐲~11p1subscript~𝐲11𝑝\frac{1-\tilde{\mathbf{y}}_{1}}{1-p}divide start_ARG 1 - over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG, so increasing 𝐲~1subscript~𝐲1\tilde{\mathbf{y}}_{1}over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT will decrease Sf(𝐲~1p,1𝐲~11p;1)subscript𝑆𝑓subscript~𝐲1𝑝1subscript~𝐲11𝑝1S_{f}\left(\frac{\tilde{\mathbf{y}}_{1}}{p},\frac{1-\tilde{\mathbf{y}}_{1}}{1-% p};1\right)italic_S start_POSTSUBSCRIPT italic_f end_POSTSUBSCRIPT ( divide start_ARG over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG , divide start_ARG 1 - over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ; 1 ) and d𝒴(𝐲~,T)subscript𝑑𝒴~𝐲𝑇d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) is decreasing in 𝐲~1subscript~𝐲1\tilde{\mathbf{y}}_{1}over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT.

We now present a corollary to Theorem 2.2 that shows explicitly that d𝒴subscript𝑑𝒴d_{\mathcal{Y}}italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT decreases with added probability to the desirable classes and increases with added probability to the undesirable classes.

Corollary A.2.

If T𝑇Titalic_T is of form (1) and f𝑓fitalic_f is twice differentiable, then d𝒴(𝐲~,T)subscript𝑑𝒴~𝐲𝑇d_{\mathcal{Y}}(\tilde{\mathbf{y}},T)italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) is decreasing in 𝐲~isubscript~𝐲𝑖\tilde{\mathbf{y}}_{i}over~ start_ARG bold_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT if i𝒲𝑖𝒲i\in\mathcal{W}italic_i ∈ caligraphic_W and is increasing if i𝒰𝑖𝒰i\in\mathcal{U}italic_i ∈ caligraphic_U.

To prove Corollary A.2, we need only show equation (3) is decreasing in y~isubscript~𝑦𝑖\tilde{y}_{i}over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for i𝒲𝑖𝒲i\in\mathcal{W}italic_i ∈ caligraphic_W and increasing in y~isubscript~𝑦𝑖\tilde{y}_{i}over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for i𝒰𝑖𝒰i\in\mathcal{U}italic_i ∈ caligraphic_U, we need only prove that the partial derivative (21) is non-positive and the partial derivative (22) is non-negative. We will rely heavily on the fact thatfsuperscript𝑓f^{\prime}italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is increasing because f𝑓fitalic_f is convex.

We start with (21):

y~i𝒲d𝒴(𝐲~,T)={0if 𝒮𝒲p and 𝒮𝒰qf(𝒮𝒲p)f(1𝒮𝒲1p)if 𝒮𝒲<p and 𝒮𝒰(1𝒮𝒲)(q1p)0if 𝒮𝒰>q and 𝒮𝒲(1𝒮𝒰)(p1q)f(𝒮𝒲p)f(1𝒮𝒲𝒮𝒰1pq)if 𝒮𝒰>(1𝒮𝒲)(q1p) and 𝒮𝒲<(1𝒮𝒰)(p1q).subscript~𝑦𝑖𝒲subscript𝑑𝒴~𝐲𝑇cases0if subscript𝒮𝒲𝑝 and subscript𝒮𝒰𝑞superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲1𝑝if subscript𝒮𝒲𝑝 and subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝0if subscript𝒮𝒰𝑞 and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞if subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝otherwise and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞\frac{\partial}{\partial\tilde{y}_{i\in\mathcal{W}}}d_{\mathcal{Y}}(\tilde{% \mathbf{y}},T)=\begin{cases}0&\text{if }\mathcal{S}_{\mathcal{W}}\geq p\text{ % and }\mathcal{S}_{\mathcal{U}}\leq q\\ f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}}{1-p}\right)&\text{if }\mathcal{S}_{\mathcal% {W}}<p\text{ and }\mathcal{S}_{\mathcal{U}}\leq(1-\mathcal{S}_{\mathcal{W}})% \left(\frac{q}{1-p}\right)\\ 0&\text{if }\mathcal{S}_{\mathcal{U}}>q\text{ and }\mathcal{S}_{\mathcal{W}}% \geq(1-\mathcal{S}_{\mathcal{U}})\left(\frac{p}{1-q}\right)\\ f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-q}\right)&% \text{if }\mathcal{S}_{\mathcal{U}}>(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q% }{1-p}\right)\\ &\text{ and }\mathcal{S}_{\mathcal{W}}<(1-\mathcal{S}_{\mathcal{U}})\left(% \frac{p}{1-q}\right)\end{cases}.divide start_ARG ∂ end_ARG start_ARG ∂ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i ∈ caligraphic_W end_POSTSUBSCRIPT end_ARG italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) = { start_ROW start_CELL 0 end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ≥ italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ≤ italic_q end_CELL end_ROW start_ROW start_CELL italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ≤ ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > italic_q and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ≥ ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW start_ROW start_CELL italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW .

Clearly the first and third cases are non-positive, so we proceed to the second case.

Because 𝒮𝒲<psubscript𝒮𝒲𝑝\mathcal{S}_{\mathcal{W}}<pcaligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < italic_p, we have 𝒮𝒲p<1<1𝒮𝒲1psubscript𝒮𝒲𝑝11subscript𝒮𝒲1𝑝\frac{\mathcal{S}_{\mathcal{W}}}{p}<1<\frac{1-\mathcal{S}_{\mathcal{W}}}{1-p}divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG < 1 < divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG and

f(𝒮𝒲p)superscript𝑓subscript𝒮𝒲𝑝\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) <f(1𝒮𝒲1p)absentsuperscript𝑓1subscript𝒮𝒲1𝑝\displaystyle<f^{\prime}\left(\frac{1-\mathcal{S}_{\mathcal{W}}}{1-p}\right)< italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG )
f(𝒮𝒲p)f(1𝒮𝒲1p)superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒲1𝑝\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{% \prime}\left(\frac{1-\mathcal{S}_{\mathcal{W}}}{1-p}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p end_ARG ) <0.absent0\displaystyle<0.< 0 .

Next we prove the partial derivative is negative in the fourth case.

𝒮𝒲subscript𝒮𝒲\displaystyle\mathcal{S}_{\mathcal{W}}caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT <(1𝒮𝒰)(p1q)absent1subscript𝒮𝒰𝑝1𝑞\displaystyle<(1-\mathcal{S}_{\mathcal{U}})\left(\frac{p}{1-q}\right)< ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG )
𝒮𝒲q𝒮𝒲subscript𝒮𝒲𝑞subscript𝒮𝒲\displaystyle\mathcal{S}_{\mathcal{W}}-q\mathcal{S}_{\mathcal{W}}caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - italic_q caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT <pp𝒮𝒰absent𝑝𝑝subscript𝒮𝒰\displaystyle<p-p\mathcal{S}_{\mathcal{U}}< italic_p - italic_p caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT
𝒮𝒲q𝒮𝒲p𝒮𝒲subscript𝒮𝒲𝑞subscript𝒮𝒲𝑝subscript𝒮𝒲\displaystyle\mathcal{S}_{\mathcal{W}}-q\mathcal{S}_{\mathcal{W}}-p\mathcal{S}% _{\mathcal{W}}caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - italic_q caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - italic_p caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT <pp𝒮𝒰p𝒮𝒲absent𝑝𝑝subscript𝒮𝒰𝑝subscript𝒮𝒲\displaystyle<p-p\mathcal{S}_{\mathcal{U}}-p\mathcal{S}_{\mathcal{W}}< italic_p - italic_p caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT - italic_p caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT
𝒮𝒲psubscript𝒮𝒲𝑝\displaystyle\frac{\mathcal{S}_{\mathcal{W}}}{p}divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG <1𝒮𝒰𝒮𝒲1pqabsent1subscript𝒮𝒰subscript𝒮𝒲1𝑝𝑞\displaystyle<\frac{1-\mathcal{S}_{\mathcal{U}}-\mathcal{S}_{\mathcal{W}}}{1-p% -q}< divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG
f(𝒮𝒲p)superscript𝑓subscript𝒮𝒲𝑝\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) <f(1𝒮𝒰𝒮𝒲1pq)absentsuperscript𝑓1subscript𝒮𝒰subscript𝒮𝒲1𝑝𝑞\displaystyle<f^{\prime}\left(\frac{1-\mathcal{S}_{\mathcal{U}}-\mathcal{S}_{% \mathcal{W}}}{1-p-q}\right)< italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG )
f(𝒮𝒲p)f(1𝒮𝒰𝒮𝒲1pq)superscript𝑓subscript𝒮𝒲𝑝superscript𝑓1subscript𝒮𝒰subscript𝒮𝒲1𝑝𝑞\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{W}}}{p}\right)-f^{% \prime}\left(\frac{1-\mathcal{S}_{\mathcal{U}}-\mathcal{S}_{\mathcal{W}}}{1-p-% q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG italic_p end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) <0absent0\displaystyle<0< 0

This shows that (21) is non-positive and (3) is decreasing in y~isubscript~𝑦𝑖\tilde{y}_{i}over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for i𝒲𝑖𝒲i\in\mathcal{W}italic_i ∈ caligraphic_W.

We now consider (22):

y~i𝒰d𝒴(𝐲~,T)={0if 𝒮𝒲p and 𝒮𝒰q0if 𝒮𝒲<p and 𝒮𝒰(1𝒮𝒲)(q1p)f(𝒮𝒰q)f(1𝒮𝒰1q)if 𝒮𝒰>q and 𝒮𝒲(1𝒮𝒰)(p1q)f(𝒮𝒰q)f(1𝒮𝒲𝒮𝒰1pq)if 𝒮𝒰>(1𝒮𝒲)(q1p) and 𝒮𝒲<(1𝒮𝒰)(p1q).subscript~𝑦𝑖𝒰subscript𝑑𝒴~𝐲𝑇cases0if subscript𝒮𝒲𝑝 and subscript𝒮𝒰𝑞0if subscript𝒮𝒲𝑝 and subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒰1𝑞if subscript𝒮𝒰𝑞 and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞if subscript𝒮𝒰1subscript𝒮𝒲𝑞1𝑝otherwise and subscript𝒮𝒲1subscript𝒮𝒰𝑝1𝑞\frac{\partial}{\partial\tilde{y}_{i\in\mathcal{U}}}d_{\mathcal{Y}}(\tilde{% \mathbf{y}},T)=\begin{cases}0&\text{if }\mathcal{S}_{\mathcal{W}}\geq p\text{ % and }\mathcal{S}_{\mathcal{U}}\leq q\\ 0&\text{if }\mathcal{S}_{\mathcal{W}}<p\text{ and }\mathcal{S}_{\mathcal{U}}% \leq(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q}{1-p}\right)\\ f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{U}}}{1-q}\right)&\text{if }\mathcal{S}_{\mathcal% {U}}>q\text{ and }\mathcal{S}_{\mathcal{W}}\geq(1-\mathcal{S}_{\mathcal{U}})% \left(\frac{p}{1-q}\right)\\ f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{\prime}\left(% \frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-q}\right)&% \text{if }\mathcal{S}_{\mathcal{U}}>(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q% }{1-p}\right)\\ &\text{ and }\mathcal{S}_{\mathcal{W}}<(1-\mathcal{S}_{\mathcal{U}})\left(% \frac{p}{1-q}\right)\end{cases}.divide start_ARG ∂ end_ARG start_ARG ∂ over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i ∈ caligraphic_U end_POSTSUBSCRIPT end_ARG italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT ( over~ start_ARG bold_y end_ARG , italic_T ) = { start_ROW start_CELL 0 end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ≥ italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ≤ italic_q end_CELL end_ROW start_ROW start_CELL 0 end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < italic_p and caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ≤ ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > italic_q and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ≥ ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW start_ROW start_CELL italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) end_CELL start_CELL if caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG ) end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL and caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT < ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT ) ( divide start_ARG italic_p end_ARG start_ARG 1 - italic_q end_ARG ) end_CELL end_ROW .

Clearly the first two cases are non-negative, so we consider the third case.

Because 𝒮𝒰>qsubscript𝒮𝒰𝑞\mathcal{S}_{\mathcal{U}}>qcaligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT > italic_q, we have 𝒮𝒰q>1>1𝒮𝒰1qsubscript𝒮𝒰𝑞11subscript𝒮𝒰1𝑞\frac{\mathcal{S}_{\mathcal{U}}}{q}>1>\frac{1-\mathcal{S}_{\mathcal{U}}}{1-q}divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG > 1 > divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG and

f(𝒮𝒰q)superscript𝑓subscript𝒮𝒰𝑞\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) >f(1𝒮𝒰1q)absentsuperscript𝑓1subscript𝒮𝒰1𝑞\displaystyle>f^{\prime}\left(\frac{1-\mathcal{S}_{\mathcal{U}}}{1-q}\right)> italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG )
f(𝒮𝒰q)f(1𝒮𝒰1q)superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒰1𝑞\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{% \prime}\left(\frac{1-\mathcal{S}_{\mathcal{U}}}{1-q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_q end_ARG ) >0.absent0\displaystyle>0.> 0 .

We can no prove the fourth case is positive.

𝒮𝒰subscript𝒮𝒰\displaystyle\mathcal{S}_{\mathcal{U}}caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT >(1𝒮𝒲)(q1p)absent1subscript𝒮𝒲𝑞1𝑝\displaystyle>(1-\mathcal{S}_{\mathcal{W}})\left(\frac{q}{1-p}\right)> ( 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT ) ( divide start_ARG italic_q end_ARG start_ARG 1 - italic_p end_ARG )
𝒮𝒰p𝒮𝒲subscript𝒮𝒰𝑝subscript𝒮𝒲\displaystyle\mathcal{S}_{\mathcal{U}}-p\mathcal{S}_{\mathcal{W}}caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT - italic_p caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT >qq𝒮𝒲absent𝑞𝑞subscript𝒮𝒲\displaystyle>q-q\mathcal{S}_{\mathcal{W}}> italic_q - italic_q caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT
𝒮𝒰p𝒮𝒲q𝒮𝒰subscript𝒮𝒰𝑝subscript𝒮𝒲𝑞subscript𝒮𝒰\displaystyle\mathcal{S}_{\mathcal{U}}-p\mathcal{S}_{\mathcal{W}}-q\mathcal{S}% _{\mathcal{U}}caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT - italic_p caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - italic_q caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT >qp𝒮𝒲q𝒮𝒰absent𝑞𝑝subscript𝒮𝒲𝑞subscript𝒮𝒰\displaystyle>q-p\mathcal{S}_{\mathcal{W}}-q\mathcal{S}_{\mathcal{U}}> italic_q - italic_p caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - italic_q caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT
𝒮𝒰qsubscript𝒮𝒰𝑞\displaystyle\frac{\mathcal{S}_{\mathcal{U}}}{q}divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG >1𝒮𝒲𝒮𝒰1pqabsent1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞\displaystyle>\frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p% -q}> divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG
f(𝒮𝒰q)superscript𝑓subscript𝒮𝒰𝑞\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) >f(1𝒮𝒲𝒮𝒰1pq)absentsuperscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞\displaystyle>f^{\prime}\left(\frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{% \mathcal{U}}}{1-p-q}\right)> italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG )
f(𝒮𝒰q)f(1𝒮𝒲𝒮𝒰1pq)superscript𝑓subscript𝒮𝒰𝑞superscript𝑓1subscript𝒮𝒲subscript𝒮𝒰1𝑝𝑞\displaystyle f^{\prime}\left(\frac{\mathcal{S}_{\mathcal{U}}}{q}\right)-f^{% \prime}\left(\frac{1-\mathcal{S}_{\mathcal{W}}-\mathcal{S}_{\mathcal{U}}}{1-p-% q}\right)italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG italic_q end_ARG ) - italic_f start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( divide start_ARG 1 - caligraphic_S start_POSTSUBSCRIPT caligraphic_W end_POSTSUBSCRIPT - caligraphic_S start_POSTSUBSCRIPT caligraphic_U end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_p - italic_q end_ARG ) >0absent0\displaystyle>0> 0

This shows that (22) is non-negative and (3) is increasing in y~isubscript~𝑦𝑖\tilde{y}_{i}over~ start_ARG italic_y end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for i𝒰𝑖𝒰i\in\mathcal{U}italic_i ∈ caligraphic_U.

A.2 Proofs of Theorem 2.3 (PAC generalization bounds for Verifier)

Let us define 𝒟isubscript𝒟𝑖\mathcal{D}_{i}caligraphic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as the distribution of the data 𝐱𝐱\mathbf{x}bold_x conditioned on the event that it is drawn from class i𝑖iitalic_i. We define a loss function :{0,1}×[0,1]:0101\ell:\{0,1\}\times[0,1]\rightarrow\mathbb{R}roman_ℓ : { 0 , 1 } × [ 0 , 1 ] → blackboard_R as follows:

(z,v)=zl(v)+(1z)l(1v),𝑧𝑣𝑧𝑙𝑣1𝑧𝑙1𝑣\ell(z,v)=zl(v)+(1-z)l(1-v),roman_ℓ ( italic_z , italic_v ) = italic_z italic_l ( italic_v ) + ( 1 - italic_z ) italic_l ( 1 - italic_v ) , (28)

where l𝑙litalic_l is some differentiable function (e.g., log()\log()roman_log ( ) which would lead to the cross-entropy loss). Furthermore, we assume that the output of the loss \ellroman_ℓ is upper bounded by a constant Bsubscript𝐵B_{\ell}italic_B start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT and is Lipschitz. The verifier output of V(𝐱,𝐱˘)𝑉𝐱˘𝐱V(\mathbf{x},\breve{\mathbf{x}})italic_V ( bold_x , over˘ start_ARG bold_x end_ARG ) estimates probability that 𝐱𝐱\mathbf{x}bold_x and 𝐱˘˘𝐱\breve{\mathbf{x}}over˘ start_ARG bold_x end_ARG belong to the same class.

Using this loss, we now define the true risk R(V)𝑅𝑉R(V)italic_R ( italic_V ) of a verifier V𝑉Vitalic_V as

R(V)=1k(k1)ij𝔼𝐱(i)𝒟i,𝐱(j)𝒟j[(0,V(𝐱(i),𝐱(j)))]R(diff)(V)+1ki=1k𝔼(𝐱,𝐱˘)𝒟i[(1,V(𝐱,𝐱˘))]R(same)(V).𝑅𝑉subscript1𝑘𝑘1subscript𝑖𝑗subscript𝔼formulae-sequencesimilar-tosuperscript𝐱𝑖subscript𝒟𝑖similar-tosuperscript𝐱𝑗subscript𝒟𝑗delimited-[]0𝑉superscript𝐱𝑖superscript𝐱𝑗superscript𝑅diff𝑉subscript1𝑘superscriptsubscript𝑖1𝑘subscript𝔼similar-to𝐱˘𝐱subscript𝒟𝑖delimited-[]1𝑉𝐱˘𝐱superscript𝑅same𝑉R(V)=\underbrace{\frac{1}{k(k-1)}\sum_{i\neq j}\mathop{\mathbb{E}}_{\mathbf{x}% ^{(i)}\sim\mathcal{D}_{i},\mathbf{{x}}^{(j)}\sim\mathcal{D}_{j}}[\ell(0,V(% \mathbf{x}^{(i)},{\mathbf{x}}^{(j)}))]}_{R^{(\text{diff})}(V)}+\underbrace{% \frac{1}{k}\sum_{i=1}^{k}\mathop{\mathbb{E}}_{(\mathbf{x},\breve{\mathbf{x}})% \sim\mathcal{D}_{i}}[\ell(1,V(\mathbf{x},\breve{\mathbf{x}}))]}_{R^{(\text{% same})}(V)}.italic_R ( italic_V ) = under⏟ start_ARG divide start_ARG 1 end_ARG start_ARG italic_k ( italic_k - 1 ) end_ARG ∑ start_POSTSUBSCRIPT italic_i ≠ italic_j end_POSTSUBSCRIPT blackboard_E start_POSTSUBSCRIPT bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ∼ caligraphic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_x start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ∼ caligraphic_D start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ roman_ℓ ( 0 , italic_V ( bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , bold_x start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ) ) ] end_ARG start_POSTSUBSCRIPT italic_R start_POSTSUPERSCRIPT ( diff ) end_POSTSUPERSCRIPT ( italic_V ) end_POSTSUBSCRIPT + under⏟ start_ARG divide start_ARG 1 end_ARG start_ARG italic_k end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT blackboard_E start_POSTSUBSCRIPT ( bold_x , over˘ start_ARG bold_x end_ARG ) ∼ caligraphic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ roman_ℓ ( 1 , italic_V ( bold_x , over˘ start_ARG bold_x end_ARG ) ) ] end_ARG start_POSTSUBSCRIPT italic_R start_POSTSUPERSCRIPT ( same ) end_POSTSUPERSCRIPT ( italic_V ) end_POSTSUBSCRIPT . (29)

The verifier faces two types of inputs that it should be able to distinguish: (a) pairs of inputs that can come from the same class (i.e., 𝐱,𝐱˘𝒟isimilar-to𝐱˘𝐱subscript𝒟𝑖\mathbf{x},\breve{\mathbf{x}}\sim\mathcal{D}_{i}bold_x , over˘ start_ARG bold_x end_ARG ∼ caligraphic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT for some class i𝑖iitalic_i) and (b) pairs of inputs that can belong to different classes (i.e., 𝐱(i)𝒟isimilar-tosuperscript𝐱𝑖subscript𝒟𝑖\mathbf{x}^{(i)}\sim\mathcal{D}_{i}bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ∼ caligraphic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and 𝐱(j)𝒟jsimilar-tosuperscript𝐱𝑗subscript𝒟𝑗{\mathbf{x}^{(j)}}\sim\mathcal{D}_{j}bold_x start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ∼ caligraphic_D start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT for some pair of classes ij𝑖𝑗i\neq jitalic_i ≠ italic_j). This formulation of risk assigns equal value to identifying pairs from the same class and pairs form different classes because both of R(diff)(V)superscript𝑅diff𝑉R^{(\text{diff})}(V)italic_R start_POSTSUPERSCRIPT ( diff ) end_POSTSUPERSCRIPT ( italic_V ) (accuracy on pairs form different classes) and R(same)(V)superscript𝑅same𝑉R^{(\text{same})}(V)italic_R start_POSTSUPERSCRIPT ( same ) end_POSTSUPERSCRIPT ( italic_V ) (accuracy on pairs form the same class) are normalized by dividing by the total number of terms in the sum. Specifically, we normalize the total risk for misclassifying pairs from different classes by k(k1)𝑘𝑘1k(k-1)italic_k ( italic_k - 1 ), which is the number of distinct ordered pairs of classes we can form out of k𝑘kitalic_k classes. Similarly, we normalize the total risk of misclassifying pairs from same classes by k𝑘kitalic_k. Furthermore, both R(diff)(V)superscript𝑅diff𝑉R^{(\text{diff})}(V)italic_R start_POSTSUPERSCRIPT ( diff ) end_POSTSUPERSCRIPT ( italic_V ) and R(same)(V)superscript𝑅same𝑉R^{(\text{same})}(V)italic_R start_POSTSUPERSCRIPT ( same ) end_POSTSUPERSCRIPT ( italic_V ) assign equal importance to each possible type of combination (which class the first element of the pair comes form and which class the second element of the pair comes from).

To calculate our empirical risk we will assume we are given k𝑘kitalic_k sets 𝒮(i)superscript𝒮𝑖\mathcal{S}^{(i)}caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT 1ik1𝑖𝑘1\leq i\leq k1 ≤ italic_i ≤ italic_k, each containing n/k𝑛𝑘n/kitalic_n / italic_k samples drawn independently from the corresponding 𝒟isubscript𝒟𝑖\mathcal{D}_{i}caligraphic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT as defined above. We index these sets as follows:

𝒮(i)={𝐱(q)(i)}q=1n/k,i=1,2,,k.formulae-sequencesuperscript𝒮𝑖subscriptsuperscriptsubscriptsuperscript𝐱𝑖𝑞𝑛𝑘𝑞1𝑖12𝑘\displaystyle\mathcal{S}^{(i)}=\{\mathbf{x}^{(i)}_{(q)}\}^{n/k}_{q=1},\quad i=% 1,2,\ldots,k.caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT = { bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( italic_q ) end_POSTSUBSCRIPT } start_POSTSUPERSCRIPT italic_n / italic_k end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q = 1 end_POSTSUBSCRIPT , italic_i = 1 , 2 , … , italic_k . (30)

We define the entire dataset 𝒮𝒮\mathcal{S}caligraphic_S as

𝒮=i=1k𝒮(i)𝒮superscriptsubscript𝑖1𝑘superscript𝒮𝑖\displaystyle\mathcal{S}=\bigcup\limits_{i=1}^{k}\mathcal{S}^{(i)}caligraphic_S = ⋃ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT (31)

We define our empirical risk for training the verifier over the set 𝒮𝒮\mathcal{S}caligraphic_S as follows:

R^𝒮(V)=1k(k1)ij1(nk)2q=1nkr=1nk(0,V(𝐱q(i),𝐱r(j)))R^𝒮(diff)(V)+1ki=1k1(nk)2nq=1nkr=1,rqnk(1,V(𝐱q(i),𝐱r(i)))R^𝒮(same)(V).subscript^𝑅𝒮𝑉subscript1𝑘𝑘1subscript𝑖𝑗1superscript𝑛𝑘2superscriptsubscript𝑞1𝑛𝑘superscriptsubscript𝑟1𝑛𝑘0𝑉subscriptsuperscript𝐱𝑖𝑞subscriptsuperscript𝐱𝑗𝑟superscriptsubscript^𝑅𝒮diff𝑉subscript1𝑘superscriptsubscript𝑖1𝑘1superscript𝑛𝑘2𝑛superscriptsubscript𝑞1𝑛𝑘subscriptsuperscript𝑛𝑘formulae-sequence𝑟1𝑟𝑞1𝑉subscriptsuperscript𝐱𝑖𝑞subscriptsuperscript𝐱𝑖𝑟subscriptsuperscript^𝑅same𝒮𝑉\hat{R}_{\mathcal{S}}(V)=\underbrace{\frac{1}{k(k-1)}\sum_{i\neq j}\frac{1}{% \left(\frac{n}{k}\right)^{2}}\sum_{q=1}^{\frac{n}{k}}\sum_{r=1}^{\frac{n}{k}}% \ell(0,V(\mathbf{x}^{(i)}_{q},{\mathbf{x}}^{(j)}_{r}))}_{\hat{R}_{\mathcal{S}}% ^{(\text{diff})}(V)}+\underbrace{\frac{1}{k}\sum_{i=1}^{k}\frac{1}{\left(\frac% {n}{k}\right)^{2}-n}\sum_{q=1}^{\frac{n}{k}}\sum^{\frac{n}{k}}_{r=1,r\neq q}% \ell(1,V(\mathbf{x}^{(i)}_{q},{\mathbf{x}}^{(i)}_{r}))}_{\hat{R}^{(\text{same}% )}_{\mathcal{S}}(V)}.over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( italic_V ) = under⏟ start_ARG divide start_ARG 1 end_ARG start_ARG italic_k ( italic_k - 1 ) end_ARG ∑ start_POSTSUBSCRIPT italic_i ≠ italic_j end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG ( divide start_ARG italic_n end_ARG start_ARG italic_k end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG ∑ start_POSTSUBSCRIPT italic_q = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG italic_n end_ARG start_ARG italic_k end_ARG end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_r = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG italic_n end_ARG start_ARG italic_k end_ARG end_POSTSUPERSCRIPT roman_ℓ ( 0 , italic_V ( bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT , bold_x start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ) ) end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( diff ) end_POSTSUPERSCRIPT ( italic_V ) end_POSTSUBSCRIPT + under⏟ start_ARG divide start_ARG 1 end_ARG start_ARG italic_k end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT divide start_ARG 1 end_ARG start_ARG ( divide start_ARG italic_n end_ARG start_ARG italic_k end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_n end_ARG ∑ start_POSTSUBSCRIPT italic_q = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG italic_n end_ARG start_ARG italic_k end_ARG end_POSTSUPERSCRIPT ∑ start_POSTSUPERSCRIPT divide start_ARG italic_n end_ARG start_ARG italic_k end_ARG end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_r = 1 , italic_r ≠ italic_q end_POSTSUBSCRIPT roman_ℓ ( 1 , italic_V ( bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT , bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ) ) end_ARG start_POSTSUBSCRIPT over^ start_ARG italic_R end_ARG start_POSTSUPERSCRIPT ( same ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( italic_V ) end_POSTSUBSCRIPT . (32)

where R^𝒮(diff)(V)superscriptsubscript^𝑅𝒮diff𝑉\hat{R}_{\mathcal{S}}^{(\text{diff})}(V)over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( diff ) end_POSTSUPERSCRIPT ( italic_V ) denotes the empirical risk of the verifier on inputs from different classes; and R^𝒮(same)(V)superscriptsubscript^𝑅𝒮same𝑉\hat{R}_{\mathcal{S}}^{(\text{same})}(V)over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( same ) end_POSTSUPERSCRIPT ( italic_V ) denotes the empirical risk of the verifier on inputs from the same class. It is straightforward to verify that R^𝒮(V)subscript^𝑅𝒮𝑉\hat{R}_{\mathcal{S}}(V)over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( italic_V ) is an unbiased estimator of the true risk R(V)𝑅𝑉R(V)italic_R ( italic_V ), i.e., 𝔼(R^𝒮(V))=R(V)𝔼subscript^𝑅𝒮𝑉𝑅𝑉\mathbb{E}(\hat{R}_{\mathcal{S}}(V))={R}(V)blackboard_E ( over^ start_ARG italic_R end_ARG start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( italic_V ) ) = italic_R ( italic_V ).

Let us define worst case generalization gap for a given dataset 𝒮𝒮\mathcal{S}caligraphic_S as

ϕ(𝒮)=supV𝒱|R(V)R^𝒮(V)|,italic-ϕ𝒮subscriptsupremum𝑉𝒱R𝑉subscript^R𝒮𝑉\phi(\mathcal{{S}})=\sup_{V\in\mathcal{V}}\left|\mathrm{R}(V)-\hat{\mathrm{R}}% _{\mathcal{S}}(V)\right|,italic_ϕ ( caligraphic_S ) = roman_sup start_POSTSUBSCRIPT italic_V ∈ caligraphic_V end_POSTSUBSCRIPT | roman_R ( italic_V ) - over^ start_ARG roman_R end_ARG start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( italic_V ) | , (33)

where 𝒱𝒱\mathcal{V}caligraphic_V denotes the hypothesis class from which the verifier V()𝑉V(\cdot)italic_V ( ⋅ ) is selected. To bound this generalization gap, we will use the notion of Rademacher complexity which measures the correlation between the function class and the random labels to upper bound the generalization gap (Mohri et al., 2018). The Rademacher complexity of a hypothesis class over a particular data set is formally defined as:

Definition A.3.

The empirical Rademacher complexity of a function class \mathcal{F}caligraphic_F with respect to the set 𝒮ˇ={𝐚i}i=1nˇ𝒮superscriptsubscriptsubscript𝐚𝑖𝑖1𝑛\mathcal{\check{S}}=\{\mathbf{a}_{i}\}_{i=1}^{n}overroman_ˇ start_ARG caligraphic_S end_ARG = { bold_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT is given by the following equation:

𝒮ˇ()=1n𝔼𝜎[supfi=1nσif(𝐚i)],subscriptˇ𝒮1𝑛𝜎𝔼delimited-[]subscriptsupremum𝑓superscriptsubscript𝑖1𝑛subscript𝜎𝑖𝑓subscript𝐚𝑖\mathfrak{R}_{\mathcal{\check{S}}}(\mathcal{F})=\frac{1}{n}\underset{\sigma}{% \mathbb{E}}\left[\sup_{f\in\mathcal{F}}\sum_{i=1}^{n}\sigma_{i}f(\mathbf{a}_{i% })\right],fraktur_R start_POSTSUBSCRIPT overroman_ˇ start_ARG caligraphic_S end_ARG end_POSTSUBSCRIPT ( caligraphic_F ) = divide start_ARG 1 end_ARG start_ARG italic_n end_ARG underitalic_σ start_ARG blackboard_E end_ARG [ roman_sup start_POSTSUBSCRIPT italic_f ∈ caligraphic_F end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT italic_f ( bold_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ] , (34)

where σisubscript𝜎𝑖\sigma_{i}italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT’s are i.i.d. Rademacher random variables, i.e., Pr(σi=1)=Pr(σi=1)=12Prsubscript𝜎𝑖1Prsubscript𝜎𝑖112\Pr(\sigma_{i}=1)=\Pr(\sigma_{i}=-1)=\frac{1}{2}roman_Pr ( italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = 1 ) = roman_Pr ( italic_σ start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = - 1 ) = divide start_ARG 1 end_ARG start_ARG 2 end_ARG.

In the following steps, we upper bound the generalization gap in (33) as using Rademacher complexity. We first bound the generalization gap using triangle inequality as follows:

ϕ(𝒮)=italic-ϕ𝒮absent\displaystyle\phi(\mathcal{S})=italic_ϕ ( caligraphic_S ) = supV𝒱|R(diff)(V)+R(same)(V)R^𝒮(diff)(V)R^𝒮(same)(V)|subscriptsupremum𝑉𝒱superscript𝑅diff𝑉superscript𝑅same𝑉subscriptsuperscript^𝑅diff𝒮𝑉subscriptsuperscript^𝑅same𝒮𝑉\displaystyle\sup_{V\in\mathcal{V}}\left|R^{(\text{diff})}(V)+R^{(\text{same})% }(V)-\hat{R}^{(\text{diff})}_{\mathcal{S}}(V)-\hat{R}^{(\text{same})}_{% \mathcal{S}}(V)\right|roman_sup start_POSTSUBSCRIPT italic_V ∈ caligraphic_V end_POSTSUBSCRIPT | italic_R start_POSTSUPERSCRIPT ( diff ) end_POSTSUPERSCRIPT ( italic_V ) + italic_R start_POSTSUPERSCRIPT ( same ) end_POSTSUPERSCRIPT ( italic_V ) - over^ start_ARG italic_R end_ARG start_POSTSUPERSCRIPT ( diff ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( italic_V ) - over^ start_ARG italic_R end_ARG start_POSTSUPERSCRIPT ( same ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( italic_V ) | (35)
\displaystyle\leq supV𝒱|R(diff)(V)R^𝒮(diff)(V)|+supV𝒱|R(same)(V)R^𝒮(same)(V)|subscriptsupremum𝑉𝒱superscript𝑅diff𝑉subscriptsuperscript^𝑅diff𝒮𝑉subscriptsupremum𝑉𝒱superscript𝑅same𝑉subscriptsuperscript^𝑅same𝒮𝑉\displaystyle\sup_{V\in\mathcal{V}}\left|R^{(\text{diff})}(V)-\hat{R}^{(\text{% diff})}_{\mathcal{S}}(V)\right|+\sup_{V\in\mathcal{V}}\left|R^{(\text{same})}(% V)-\hat{R}^{(\text{same})}_{\mathcal{S}}(V)\right|roman_sup start_POSTSUBSCRIPT italic_V ∈ caligraphic_V end_POSTSUBSCRIPT | italic_R start_POSTSUPERSCRIPT ( diff ) end_POSTSUPERSCRIPT ( italic_V ) - over^ start_ARG italic_R end_ARG start_POSTSUPERSCRIPT ( diff ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( italic_V ) | + roman_sup start_POSTSUBSCRIPT italic_V ∈ caligraphic_V end_POSTSUBSCRIPT | italic_R start_POSTSUPERSCRIPT ( same ) end_POSTSUPERSCRIPT ( italic_V ) - over^ start_ARG italic_R end_ARG start_POSTSUPERSCRIPT ( same ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT caligraphic_S end_POSTSUBSCRIPT ( italic_V ) | (36)

The above bound first decomposes the generalization gap into the sum of two generalization gaps, the first over the pair of samples coming from different classes; and the second over the samples drawn from the same class. To proceed we will need a few additional definitions: we define 𝒟i×𝒟jsubscript𝒟𝑖subscript𝒟𝑗\mathcal{D}_{i}\times\mathcal{D}_{j}caligraphic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT × caligraphic_D start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT to represent the distribution over pairs (𝐱(i),𝐱(j))superscript𝐱𝑖superscript𝐱𝑗(\mathbf{x}^{(i)},\mathbf{x}^{(j)})( bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , bold_x start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ) where 𝐱(i)superscript𝐱𝑖\mathbf{x}^{(i)}bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT is drawn from 𝒟isubscript𝒟𝑖\mathcal{D}_{i}caligraphic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and 𝐱(j)superscript𝐱𝑗\mathbf{x}^{(j)}bold_x start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT is drawn from 𝒟jsubscript𝒟𝑗\mathcal{D}_{j}caligraphic_D start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT independently. We also define the sets

𝒮(i)×𝒮(j)={{(𝐱(q)(i),𝐱(r)(j))}1q,rkij{(𝐱(q)(i),𝐱(r)(j))}1q,rk,qri=j,superscript𝒮𝑖superscript𝒮𝑗casessubscriptsubscriptsuperscript𝐱𝑖𝑞subscriptsuperscript𝐱𝑗𝑟formulae-sequence1𝑞𝑟𝑘𝑖𝑗subscriptsubscriptsuperscript𝐱𝑖𝑞subscriptsuperscript𝐱𝑗𝑟formulae-sequence1𝑞formulae-sequence𝑟𝑘𝑞𝑟𝑖𝑗\mathcal{S}^{(i)}\times\mathcal{S}^{(j)}=\begin{cases}\{(\mathbf{x}^{(i)}_{(q)% },{\mathbf{x}}^{(j)}_{(r)})\}_{1\leq q,r\leq k}&i\neq j\\ \{(\mathbf{x}^{(i)}_{(q)},{\mathbf{x}}^{(j)}_{(r)})\}_{1\leq q,r\leq k,q\neq r% }&i=j\end{cases},caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT × caligraphic_S start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT = { start_ROW start_CELL { ( bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( italic_q ) end_POSTSUBSCRIPT , bold_x start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( italic_r ) end_POSTSUBSCRIPT ) } start_POSTSUBSCRIPT 1 ≤ italic_q , italic_r ≤ italic_k end_POSTSUBSCRIPT end_CELL start_CELL italic_i ≠ italic_j end_CELL end_ROW start_ROW start_CELL { ( bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( italic_q ) end_POSTSUBSCRIPT , bold_x start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT ( italic_r ) end_POSTSUBSCRIPT ) } start_POSTSUBSCRIPT 1 ≤ italic_q , italic_r ≤ italic_k , italic_q ≠ italic_r end_POSTSUBSCRIPT end_CELL start_CELL italic_i = italic_j end_CELL end_ROW , (37)

and enumerate the elements of each set by 𝒮(i)×𝒮(j)={uqi,j}q=1(n/k)2superscript𝒮𝑖superscript𝒮𝑗superscriptsubscriptsubscriptsuperscript𝑢𝑖𝑗𝑞𝑞1superscript𝑛𝑘2\mathcal{S}^{(i)}\times\mathcal{S}^{(j)}=\{u^{i,j}_{q}\}_{q=1}^{(n/k)^{2}}caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT × caligraphic_S start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT = { italic_u start_POSTSUPERSCRIPT italic_i , italic_j end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_q = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_n / italic_k ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT when ij𝑖𝑗i\neq jitalic_i ≠ italic_j. When i=j𝑖𝑗i=jitalic_i = italic_j the enumeration takes the form 𝒮(i)×𝒮(i)={uqi,i}q=1(n/k)2nsuperscript𝒮𝑖superscript𝒮𝑖superscriptsubscriptsubscriptsuperscript𝑢𝑖𝑖𝑞𝑞1superscript𝑛𝑘2𝑛\mathcal{S}^{(i)}\times\mathcal{S}^{(i)}=\{u^{i,i}_{q}\}_{q=1}^{(n/k)^{2}-n}caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT × caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT = { italic_u start_POSTSUPERSCRIPT italic_i , italic_i end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_q = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_n / italic_k ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_n end_POSTSUPERSCRIPT.

Using our definitions of true and empirical risk, we can now upper bound the above sum as follows,

ϕ(𝒮)italic-ϕ𝒮absent\displaystyle\phi(\mathcal{S})\leqitalic_ϕ ( caligraphic_S ) ≤ supV𝒱1k(k1)|ij[𝔼𝐱(i)𝒟i,𝐱(j)𝒟j[(V(𝐱(i),𝐱(j)),0)](kn)2q=1nkr=1nk(V(𝐱(q)(i),𝐱(r)(j)),0)]|subscriptsupremum𝑉𝒱1𝑘𝑘1subscript𝑖𝑗delimited-[]subscript𝔼formulae-sequencesuperscript𝐱𝑖subscript𝒟𝑖superscript𝐱𝑗subscript𝒟𝑗delimited-[]𝑉superscript𝐱𝑖superscript𝐱𝑗0superscript𝑘𝑛2superscriptsubscript𝑞1𝑛𝑘superscriptsubscript𝑟1𝑛𝑘𝑉superscriptsubscript𝐱𝑞𝑖superscriptsubscript𝐱𝑟𝑗0\displaystyle\sup_{V\in\mathcal{V}}\frac{1}{k(k-1)}\Biggl{|}\sum_{i\neq j}% \left[\mathbb{E}_{\mathbf{x}^{(i)}\in\mathcal{D}_{i},\mathbf{x}^{(j)}\in% \mathcal{D}_{j}}\left[\ell(V(\mathbf{x}^{(i)},\mathbf{x}^{(j)}),0)\right]-% \left(\frac{k}{n}\right)^{2}\sum_{q=1}^{\frac{n}{k}}\sum_{r=1}^{\frac{n}{k}}% \ell(V(\mathbf{x}_{(q)}^{(i)},\mathbf{x}_{(r)}^{(j)}),0)\right]\Biggr{|}roman_sup start_POSTSUBSCRIPT italic_V ∈ caligraphic_V end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_k ( italic_k - 1 ) end_ARG | ∑ start_POSTSUBSCRIPT italic_i ≠ italic_j end_POSTSUBSCRIPT [ blackboard_E start_POSTSUBSCRIPT bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ∈ caligraphic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , bold_x start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ∈ caligraphic_D start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ roman_ℓ ( italic_V ( bold_x start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , bold_x start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ) , 0 ) ] - ( divide start_ARG italic_k end_ARG start_ARG italic_n end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_q = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG italic_n end_ARG start_ARG italic_k end_ARG end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_r = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG italic_n end_ARG start_ARG italic_k end_ARG end_POSTSUPERSCRIPT roman_ℓ ( italic_V ( bold_x start_POSTSUBSCRIPT ( italic_q ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , bold_x start_POSTSUBSCRIPT ( italic_r ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT ) , 0 ) ] |
+supV𝒱1k|k=1j[𝔼𝐱,𝐱˘𝒟i[(V(𝐱,𝐱˘),1)]q=1nkr=1,rqnk(V(𝐱(q)(i),𝐱(r)(i)),1)]|subscriptsupremum𝑉𝒱1𝑘superscriptsubscript𝑘1𝑗delimited-[]subscript𝔼𝐱˘𝐱subscript𝒟𝑖delimited-[]𝑉𝐱˘𝐱1superscriptsubscript𝑞1𝑛𝑘superscriptsubscriptformulae-sequence𝑟1𝑟𝑞𝑛𝑘𝑉superscriptsubscript𝐱𝑞𝑖superscriptsubscript𝐱𝑟𝑖1\displaystyle+\sup_{V\in\mathcal{V}}\frac{1}{k}\Biggl{|}\sum_{k=1}^{j}\left[% \mathbb{E}_{\mathbf{x},\breve{\mathbf{x}}\in\mathcal{D}_{i}}\left[\ell(V(% \mathbf{x},\breve{\mathbf{x}}),1)\right]-\sum_{q=1}^{\frac{n}{k}}\sum_{r=1,r% \neq q}^{\frac{n}{k}}\ell(V(\mathbf{x}_{(q)}^{(i)},\mathbf{x}_{(r)}^{(i)}),1)% \right]\Biggr{|}+ roman_sup start_POSTSUBSCRIPT italic_V ∈ caligraphic_V end_POSTSUBSCRIPT divide start_ARG 1 end_ARG start_ARG italic_k end_ARG | ∑ start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_j end_POSTSUPERSCRIPT [ blackboard_E start_POSTSUBSCRIPT bold_x , over˘ start_ARG bold_x end_ARG ∈ caligraphic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ roman_ℓ ( italic_V ( bold_x , over˘ start_ARG bold_x end_ARG ) , 1 ) ] - ∑ start_POSTSUBSCRIPT italic_q = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG italic_n end_ARG start_ARG italic_k end_ARG end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_r = 1 , italic_r ≠ italic_q end_POSTSUBSCRIPT start_POSTSUPERSCRIPT divide start_ARG italic_n end_ARG start_ARG italic_k end_ARG end_POSTSUPERSCRIPT roman_ℓ ( italic_V ( bold_x start_POSTSUBSCRIPT ( italic_q ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT , bold_x start_POSTSUBSCRIPT ( italic_r ) end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT ) , 1 ) ] |
\displaystyle{\leq} 1k(k1)ijsupV𝒱|𝔼u𝒟i×𝒟j(V(u),1)(kn)2q=1(nk)2(V(uq(i,j)),1)|1𝑘𝑘1subscript𝑖𝑗subscriptsupremum𝑉𝒱subscript𝔼similar-to𝑢subscript𝒟𝑖subscript𝒟𝑗𝑉𝑢1superscript𝑘𝑛2superscriptsubscript𝑞1superscript𝑛𝑘2𝑉subscriptsuperscript𝑢𝑖𝑗𝑞1\displaystyle\frac{1}{k(k-1)}\sum_{i\neq j}\sup_{V\in\mathcal{V}}\Biggl{|}% \mathbb{E}_{u\sim\mathcal{D}_{i}\times\mathcal{D}_{j}}\ell(V(u),1)-\left(\frac% {k}{n}\right)^{2}\sum_{q=1}^{\left(\frac{n}{k}\right)^{2}}\ell(V(u^{(i,j)}_{q}% ),1)\Biggr{|}divide start_ARG 1 end_ARG start_ARG italic_k ( italic_k - 1 ) end_ARG ∑ start_POSTSUBSCRIPT italic_i ≠ italic_j end_POSTSUBSCRIPT roman_sup start_POSTSUBSCRIPT italic_V ∈ caligraphic_V end_POSTSUBSCRIPT | blackboard_E start_POSTSUBSCRIPT italic_u ∼ caligraphic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT × caligraphic_D start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_POSTSUBSCRIPT roman_ℓ ( italic_V ( italic_u ) , 1 ) - ( divide start_ARG italic_k end_ARG start_ARG italic_n end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_q = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( divide start_ARG italic_n end_ARG start_ARG italic_k end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT roman_ℓ ( italic_V ( italic_u start_POSTSUPERSCRIPT ( italic_i , italic_j ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ) , 1 ) |
+1ki=1ksupV𝒱|𝔼u𝒟i×𝒟i[(V(u),1)](kn)2q=1(nk)2n(V(uq(i,i)),1)|1𝑘superscriptsubscript𝑖1𝑘subscriptsupremum𝑉𝒱subscript𝔼similar-to𝑢subscript𝒟𝑖subscript𝒟𝑖delimited-[]𝑉𝑢1superscript𝑘𝑛2superscriptsubscript𝑞1superscript𝑛𝑘2𝑛𝑉subscriptsuperscript𝑢𝑖𝑖𝑞1\displaystyle+\frac{1}{k}\sum_{i=1}^{k}\sup_{V\in\mathcal{V}}\Biggl{|}\mathbb{% E}_{u\sim\mathcal{D}_{i}\times\mathcal{D}_{i}}\left[\ell(V(u),1)\right]-\left(% \frac{k}{n}\right)^{2}\sum_{q=1}^{\left(\frac{n}{k}\right)^{2}-n}\ell(V(u^{(i,% i)}_{q}),1)\Biggr{|}+ divide start_ARG 1 end_ARG start_ARG italic_k end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT roman_sup start_POSTSUBSCRIPT italic_V ∈ caligraphic_V end_POSTSUBSCRIPT | blackboard_E start_POSTSUBSCRIPT italic_u ∼ caligraphic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT × caligraphic_D start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ roman_ℓ ( italic_V ( italic_u ) , 1 ) ] - ( divide start_ARG italic_k end_ARG start_ARG italic_n end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ∑ start_POSTSUBSCRIPT italic_q = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ( divide start_ARG italic_n end_ARG start_ARG italic_k end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_n end_POSTSUPERSCRIPT roman_ℓ ( italic_V ( italic_u start_POSTSUPERSCRIPT ( italic_i , italic_i ) end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_q end_POSTSUBSCRIPT ) , 1 ) | (38)

where the second inequality follows by bounding the absolute value of a sum by the sum of the absolute values (across both the “diff” and “same” terms).

We now use the standard Rademacher complexity PAC-bound (Mohri et al., 2018; Bartlett & Mendelson, 2002) on each of the supremums in (38). The result is that for any δ(0,1)𝛿01\delta\in(0,1)italic_δ ∈ ( 0 , 1 ),formulation the following holds with probability 1δ1𝛿1-\delta1 - italic_δ over the choice of 𝒮𝒮\mathcal{S}caligraphic_S:

ϕ(𝒮)italic-ϕ𝒮absent\displaystyle\phi(\mathcal{S}){\leq}italic_ϕ ( caligraphic_S ) ≤ 1k(k1)ij[2𝒮(i)×𝒮(j)(𝒱)+6kBnlog(2/δ)2]+1ki=1k[2𝒮(i)×𝒮(i)(𝒱)+6kBn2k2nlog(2/δ)2]1𝑘𝑘1subscript𝑖𝑗delimited-[]2subscriptsuperscript𝒮𝑖superscript𝒮𝑗𝒱6𝑘subscript𝐵𝑛2𝛿21𝑘superscriptsubscript𝑖1𝑘delimited-[]2subscriptsuperscript𝒮𝑖superscript𝒮𝑖𝒱6𝑘subscript𝐵superscript𝑛2superscript𝑘2𝑛2𝛿2\displaystyle\frac{1}{k(k-1)}\sum_{i\neq j}\left[2\mathfrak{R}_{\mathcal{S}^{(% i)}\times\mathcal{S}^{(j)}}(\mathcal{V})+\frac{6kB_{\ell}}{n}\sqrt{\frac{\log(% 2/\delta)}{2}}\right]+\frac{1}{k}\sum_{i=1}^{k}\left[2\mathfrak{R}_{\mathcal{S% }^{(i)}\times\mathcal{S}^{(i)}}(\mathcal{V})+\frac{6kB_{\ell}}{\sqrt{n^{2}-k^{% 2}n}}\sqrt{\frac{\log(2/\delta)}{2}}\right]divide start_ARG 1 end_ARG start_ARG italic_k ( italic_k - 1 ) end_ARG ∑ start_POSTSUBSCRIPT italic_i ≠ italic_j end_POSTSUBSCRIPT [ 2 fraktur_R start_POSTSUBSCRIPT caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT × caligraphic_S start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( caligraphic_V ) + divide start_ARG 6 italic_k italic_B start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG square-root start_ARG divide start_ARG roman_log ( 2 / italic_δ ) end_ARG start_ARG 2 end_ARG end_ARG ] + divide start_ARG 1 end_ARG start_ARG italic_k end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT [ 2 fraktur_R start_POSTSUBSCRIPT caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT × caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( caligraphic_V ) + divide start_ARG 6 italic_k italic_B start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_ARG start_ARG square-root start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n end_ARG end_ARG square-root start_ARG divide start_ARG roman_log ( 2 / italic_δ ) end_ARG start_ARG 2 end_ARG end_ARG ]
=\displaystyle== 2k(k1)ij𝒮(i)×𝒮(j)(𝒱)+2ki=1k𝒮(i)×𝒮(i)(𝒱)+6kBnlog(2/δ)2+6kBn2k2nlog(2/δ)22𝑘𝑘1subscript𝑖𝑗subscriptsuperscript𝒮𝑖superscript𝒮𝑗𝒱2𝑘superscriptsubscript𝑖1𝑘subscriptsuperscript𝒮𝑖superscript𝒮𝑖𝒱6𝑘subscript𝐵𝑛2𝛿26𝑘subscript𝐵superscript𝑛2superscript𝑘2𝑛2𝛿2\displaystyle\frac{2}{k(k-1)}\sum_{i\neq j}\mathfrak{R}_{\mathcal{S}^{(i)}% \times\mathcal{S}^{(j)}}(\mathcal{V})+\frac{2}{k}\sum_{i=1}^{k}\mathfrak{R}_{% \mathcal{S}^{(i)}\times\mathcal{S}^{(i)}}(\mathcal{V})+\frac{6kB_{\ell}}{n}% \sqrt{\frac{\log(2/\delta)}{2}}+\frac{6kB_{\ell}}{\sqrt{n^{2}-k^{2}n}}\sqrt{% \frac{\log(2/\delta)}{2}}divide start_ARG 2 end_ARG start_ARG italic_k ( italic_k - 1 ) end_ARG ∑ start_POSTSUBSCRIPT italic_i ≠ italic_j end_POSTSUBSCRIPT fraktur_R start_POSTSUBSCRIPT caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT × caligraphic_S start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( caligraphic_V ) + divide start_ARG 2 end_ARG start_ARG italic_k end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT fraktur_R start_POSTSUBSCRIPT caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT × caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( caligraphic_V ) + divide start_ARG 6 italic_k italic_B start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG square-root start_ARG divide start_ARG roman_log ( 2 / italic_δ ) end_ARG start_ARG 2 end_ARG end_ARG + divide start_ARG 6 italic_k italic_B start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_ARG start_ARG square-root start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n end_ARG end_ARG square-root start_ARG divide start_ARG roman_log ( 2 / italic_δ ) end_ARG start_ARG 2 end_ARG end_ARG
\displaystyle\leq 2k(k1)ij𝒮(i)×𝒮(j)(𝒱)+2ki=1k𝒮(i)×𝒮(i)(𝒱)+12kBn2k2nlog(2/δ)22𝑘𝑘1subscript𝑖𝑗subscriptsuperscript𝒮𝑖superscript𝒮𝑗𝒱2𝑘superscriptsubscript𝑖1𝑘subscriptsuperscript𝒮𝑖superscript𝒮𝑖𝒱12𝑘subscript𝐵superscript𝑛2superscript𝑘2𝑛2𝛿2\displaystyle\frac{2}{k(k-1)}\sum_{i\neq j}\mathfrak{R}_{\mathcal{S}^{(i)}% \times\mathcal{S}^{(j)}}(\mathcal{V})+\frac{2}{k}\sum_{i=1}^{k}\mathfrak{R}_{% \mathcal{S}^{(i)}\times\mathcal{S}^{(i)}}(\mathcal{V})+\frac{12kB_{\ell}}{% \sqrt{n^{2}-k^{2}n}}\sqrt{\frac{\log(2/\delta)}{2}}divide start_ARG 2 end_ARG start_ARG italic_k ( italic_k - 1 ) end_ARG ∑ start_POSTSUBSCRIPT italic_i ≠ italic_j end_POSTSUBSCRIPT fraktur_R start_POSTSUBSCRIPT caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT × caligraphic_S start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( caligraphic_V ) + divide start_ARG 2 end_ARG start_ARG italic_k end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT fraktur_R start_POSTSUBSCRIPT caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT × caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ( caligraphic_V ) + divide start_ARG 12 italic_k italic_B start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_ARG start_ARG square-root start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n end_ARG end_ARG square-root start_ARG divide start_ARG roman_log ( 2 / italic_δ ) end_ARG start_ARG 2 end_ARG end_ARG (39)

where the final inequality comes form replacing 6kBn6𝑘subscript𝐵𝑛\frac{6kB_{\ell}}{n}divide start_ARG 6 italic_k italic_B start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_ARG start_ARG italic_n end_ARG with the larger 6kBn2k2n6𝑘subscript𝐵superscript𝑛2superscript𝑘2𝑛\frac{6kB_{\ell}}{\sqrt{n^{2}-k^{2}n}}divide start_ARG 6 italic_k italic_B start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_ARG start_ARG square-root start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n end_ARG end_ARG. Equation (39) can be interpreted as the sum of three terms: the first term is the average Rademacher complexity over the datasets corresponding to pairs which are drawn from different classes; the second term is the average Rademacher complexity over the datasets corresponding to pairs which are drawn from same classes; the third term is a standard term which shows the dependence on δ𝛿\deltaitalic_δ (as well as n,k𝑛𝑘n,kitalic_n , italic_k).

We now apply the bound on empirical Rademacher complexity

𝒬(V)=𝒪(|𝒬|1/d)subscript𝒬𝑉𝒪superscript𝒬1superscript𝑑\mathfrak{R}_{\mathcal{Q}}(V)=\mathcal{O}\left(|\mathcal{Q}|^{-1/d^{\prime}}\right)fraktur_R start_POSTSUBSCRIPT caligraphic_Q end_POSTSUBSCRIPT ( italic_V ) = caligraphic_O ( | caligraphic_Q | start_POSTSUPERSCRIPT - 1 / italic_d start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUPERSCRIPT ) (40)

with dsuperscript𝑑d^{\prime}italic_d start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT the dimension of the elements of 𝒬𝒬\mathcal{Q}caligraphic_Q (Gottlieb et al., 2016). To apply this we will recall the dimension of the elements of 𝒮(i)×𝒮(j)superscript𝒮𝑖superscript𝒮𝑗\mathcal{S}^{(i)}\times\mathcal{S}^{(j)}caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT × caligraphic_S start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT is 2d2𝑑2d2 italic_d, and |𝒮(i)×𝒮(j)|=(nk)2superscript𝒮𝑖superscript𝒮𝑗superscript𝑛𝑘2|\mathcal{S}^{(i)}\times\mathcal{S}^{(j)}|=\left(\frac{n}{k}\right)^{2}| caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT × caligraphic_S start_POSTSUPERSCRIPT ( italic_j ) end_POSTSUPERSCRIPT | = ( divide start_ARG italic_n end_ARG start_ARG italic_k end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT when ij𝑖𝑗i\neq jitalic_i ≠ italic_j, and |𝒮(i)×𝒮(i)|=(nk)2n=n2k2nk2superscript𝒮𝑖superscript𝒮𝑖superscript𝑛𝑘2𝑛superscript𝑛2superscript𝑘2𝑛superscript𝑘2|\mathcal{S}^{(i)}\times\mathcal{S}^{(i)}|=\left(\frac{n}{k}\right)^{2}-n=% \frac{n^{2}-k^{2}n}{k^{2}}| caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT × caligraphic_S start_POSTSUPERSCRIPT ( italic_i ) end_POSTSUPERSCRIPT | = ( divide start_ARG italic_n end_ARG start_ARG italic_k end_ARG ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_n = divide start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n end_ARG start_ARG italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG. Applying our Rademacher complexity bound yields

ϕ(𝒮)italic-ϕ𝒮absent\displaystyle\phi(\mathcal{S})\leqitalic_ϕ ( caligraphic_S ) ≤ 2k(k1)ij𝒪((kn)1/d)+2ki=1k𝒪((kn2k2n)1/d)+12kBn2k2nlog(2/δ)22𝑘𝑘1subscript𝑖𝑗𝒪superscript𝑘𝑛1𝑑2𝑘superscriptsubscript𝑖1𝑘𝒪superscript𝑘superscript𝑛2superscript𝑘2𝑛1𝑑12𝑘subscript𝐵superscript𝑛2superscript𝑘2𝑛2𝛿2\displaystyle\frac{2}{k(k-1)}\sum_{i\neq j}\mathcal{O}\left(\left(\frac{k}{n}% \right)^{1/d}\right)+\frac{2}{k}\sum_{i=1}^{k}\mathcal{O}\left(\left(\frac{k}{% \sqrt{n^{2}-k^{2}n}}\right)^{1/d}\right)+\frac{12kB_{\ell}}{\sqrt{n^{2}-k^{2}n% }}\sqrt{\frac{\log(2/\delta)}{2}}divide start_ARG 2 end_ARG start_ARG italic_k ( italic_k - 1 ) end_ARG ∑ start_POSTSUBSCRIPT italic_i ≠ italic_j end_POSTSUBSCRIPT caligraphic_O ( ( divide start_ARG italic_k end_ARG start_ARG italic_n end_ARG ) start_POSTSUPERSCRIPT 1 / italic_d end_POSTSUPERSCRIPT ) + divide start_ARG 2 end_ARG start_ARG italic_k end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_k end_POSTSUPERSCRIPT caligraphic_O ( ( divide start_ARG italic_k end_ARG start_ARG square-root start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n end_ARG end_ARG ) start_POSTSUPERSCRIPT 1 / italic_d end_POSTSUPERSCRIPT ) + divide start_ARG 12 italic_k italic_B start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_ARG start_ARG square-root start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n end_ARG end_ARG square-root start_ARG divide start_ARG roman_log ( 2 / italic_δ ) end_ARG start_ARG 2 end_ARG end_ARG (41)
=\displaystyle== 2𝒪((kn)1/d)+2𝒪((kn2k2n)1/d)+12kBn2k2nlog(2/δ)22𝒪superscript𝑘𝑛1𝑑2𝒪superscript𝑘superscript𝑛2superscript𝑘2𝑛1𝑑12𝑘subscript𝐵superscript𝑛2superscript𝑘2𝑛2𝛿2\displaystyle 2\mathcal{O}\left(\left(\frac{k}{n}\right)^{1/d}\right)+2% \mathcal{O}\left(\left(\frac{k}{\sqrt{n^{2}-k^{2}n}}\right)^{1/d}\right)+\frac% {12kB_{\ell}}{\sqrt{n^{2}-k^{2}n}}\sqrt{\frac{\log(2/\delta)}{2}}2 caligraphic_O ( ( divide start_ARG italic_k end_ARG start_ARG italic_n end_ARG ) start_POSTSUPERSCRIPT 1 / italic_d end_POSTSUPERSCRIPT ) + 2 caligraphic_O ( ( divide start_ARG italic_k end_ARG start_ARG square-root start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n end_ARG end_ARG ) start_POSTSUPERSCRIPT 1 / italic_d end_POSTSUPERSCRIPT ) + divide start_ARG 12 italic_k italic_B start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_ARG start_ARG square-root start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n end_ARG end_ARG square-root start_ARG divide start_ARG roman_log ( 2 / italic_δ ) end_ARG start_ARG 2 end_ARG end_ARG (42)
\displaystyle\leq 4𝒪((kn2k2n)1/d)+12kBn2k2nlog(2/δ)2.4𝒪superscript𝑘superscript𝑛2superscript𝑘2𝑛1𝑑12𝑘subscript𝐵superscript𝑛2superscript𝑘2𝑛2𝛿2\displaystyle 4\mathcal{O}\left(\left(\frac{k}{\sqrt{n^{2}-k^{2}n}}\right)^{1/% d}\right)+\frac{12kB_{\ell}}{\sqrt{n^{2}-k^{2}n}}\sqrt{\frac{\log(2/\delta)}{2% }}.4 caligraphic_O ( ( divide start_ARG italic_k end_ARG start_ARG square-root start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n end_ARG end_ARG ) start_POSTSUPERSCRIPT 1 / italic_d end_POSTSUPERSCRIPT ) + divide start_ARG 12 italic_k italic_B start_POSTSUBSCRIPT roman_ℓ end_POSTSUBSCRIPT end_ARG start_ARG square-root start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n end_ARG end_ARG square-root start_ARG divide start_ARG roman_log ( 2 / italic_δ ) end_ARG start_ARG 2 end_ARG end_ARG . (43)

The bound in (43) is our final PAC bound true with probability 1δ1𝛿1-\delta1 - italic_δ. However, we expect the δ𝛿\deltaitalic_δ containing term to be dominated by the other term because (kn2k2n)<1𝑘superscript𝑛2superscript𝑘2𝑛1(\frac{k}{\sqrt{n^{2}-k^{2}n}})<1( divide start_ARG italic_k end_ARG start_ARG square-root start_ARG italic_n start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT - italic_k start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT italic_n end_ARG end_ARG ) < 1 and d𝑑ditalic_d is expected to be much larger than 1111.

A.3 Additional Implementation Details

In this section we give additional details on how we implemented our methods to create the experimental results found in this paper.

A.3.1 Data Set and Cost Function Details

Here we give additional description of each data set and the corresponding the cost functions d𝒳subscript𝑑𝒳d_{\mathcal{X}}italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT used in our experiments. As noted in Section 3 we must ensure d𝒳subscript𝑑𝒳d_{\mathcal{X}}italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT is differentiable. When dealing with categorical features costs are by nature discrete (and not differentiable). We show how we were able to write these costs in a differentiable form. Suppose 𝐯𝐯superscript\mathbf{v}\in\mathbb{R}^{\ell}bold_v ∈ blackboard_R start_POSTSUPERSCRIPT roman_ℓ end_POSTSUPERSCRIPT is a one-hot encoding of a categorical feature and define the transition cost matrix A𝐴Aitalic_A such that Ai,jsubscript𝐴𝑖𝑗A_{i,j}italic_A start_POSTSUBSCRIPT italic_i , italic_j end_POSTSUBSCRIPT as the cost of changing from category i𝑖iitalic_i to category j𝑗jitalic_j. Then 𝐳TA𝐳~superscript𝐳𝑇𝐴~𝐳\mathbf{z}^{T}A\tilde{\mathbf{z}}bold_z start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT italic_A over~ start_ARG bold_z end_ARG represents the costs of changing this categorical feature and is differentiable in 𝐳𝐳\mathbf{z}bold_z.

Adult Income Prediction Dataset: (Kohavi & Becker, 1996) This widely used data set contains information from the 1994 U.S. census, with individuals labelled by whether their annual income was over $50,000 (similar-to\sim$100,000 in 2023 adjusted for inflation). We define our target set T𝑇Titalic_T as over 80% probability high income. Our actionable set allows changes in job type, education and number of hours worked with all other attributes immutable. The cost function d𝒴subscript𝑑𝒴d_{\mathcal{Y}}italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT includes the expected number of years to improve education (e.g. two years to go from associate’s degree to bachelors degree), a one-year cost to change employer type and the 2-norm of the change in hours worked per week (weighted so 3 hours per week is equivalent to a year spent on education). Here Trustworthy Actionable Perturbations suggest the best way to improve an individuals odds of making a large income with the least time and effort.

Specifically d𝒳subscript𝑑𝒳d_{\mathcal{X}}italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT is the sum cost from changes (1) hours worked per week (2) change in employment type (3) change in education and (4) change in field of work.

The cost from a change in hours is given by Δh210Δsuperscript210\frac{\Delta h^{2}}{10}divide start_ARG roman_Δ italic_h start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG 10 end_ARG where ΔhΔ\Delta hroman_Δ italic_h is the change in weekly hours worked. This will mean 3333 extra hours of work are approximately equivalent to one year of schooling.

The cost from a change in employer (the options are government, private, self-employed and other) is always 1111 (equivalent to a year spent on education).

The possible levels of education are (1) any schooling, (2) High School Degree, (3) Professional Degree, (4) some college, (5) Associate’s Degree, (6) Bachelors Degree, (7) Master’s Degree, (8) Doctorate Degree. The cost transition matrix associated with the level of education (as ordered above) is

AEducation=[0210346811L0812469LL0LLL25LL701358LL6L0247LL4LL025LL4LLL03LL4LLLL0],subscript𝐴Educationmatrix0210346811𝐿0812469𝐿𝐿0𝐿𝐿𝐿25𝐿𝐿701358𝐿𝐿6𝐿0247𝐿𝐿4𝐿𝐿025𝐿𝐿4𝐿𝐿𝐿03𝐿𝐿4𝐿𝐿𝐿𝐿0A_{\text{Education}}=\begin{bmatrix}0&2&10&3&4&6&8&11\\ L&0&8&1&2&4&6&9\\ L&L&0&L&L&L&2&5\\ L&L&7&0&1&3&5&8\\ L&L&6&L&0&2&4&7\\ L&L&4&L&L&0&2&5\\ L&L&4&L&L&L&0&3\\ L&L&4&L&L&L&L&0\end{bmatrix},italic_A start_POSTSUBSCRIPT Education end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL 0 end_CELL start_CELL 2 end_CELL start_CELL 10 end_CELL start_CELL 3 end_CELL start_CELL 4 end_CELL start_CELL 6 end_CELL start_CELL 8 end_CELL start_CELL 11 end_CELL end_ROW start_ROW start_CELL italic_L end_CELL start_CELL 0 end_CELL start_CELL 8 end_CELL start_CELL 1 end_CELL start_CELL 2 end_CELL start_CELL 4 end_CELL start_CELL 6 end_CELL start_CELL 9 end_CELL end_ROW start_ROW start_CELL italic_L end_CELL start_CELL italic_L end_CELL start_CELL 0 end_CELL start_CELL italic_L end_CELL start_CELL italic_L end_CELL start_CELL italic_L end_CELL start_CELL 2 end_CELL start_CELL 5 end_CELL end_ROW start_ROW start_CELL italic_L end_CELL start_CELL italic_L end_CELL start_CELL 7 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 3 end_CELL start_CELL 5 end_CELL start_CELL 8 end_CELL end_ROW start_ROW start_CELL italic_L end_CELL start_CELL italic_L end_CELL start_CELL 6 end_CELL start_CELL italic_L end_CELL start_CELL 0 end_CELL start_CELL 2 end_CELL start_CELL 4 end_CELL start_CELL 7 end_CELL end_ROW start_ROW start_CELL italic_L end_CELL start_CELL italic_L end_CELL start_CELL 4 end_CELL start_CELL italic_L end_CELL start_CELL italic_L end_CELL start_CELL 0 end_CELL start_CELL 2 end_CELL start_CELL 5 end_CELL end_ROW start_ROW start_CELL italic_L end_CELL start_CELL italic_L end_CELL start_CELL 4 end_CELL start_CELL italic_L end_CELL start_CELL italic_L end_CELL start_CELL italic_L end_CELL start_CELL 0 end_CELL start_CELL 3 end_CELL end_ROW start_ROW start_CELL italic_L end_CELL start_CELL italic_L end_CELL start_CELL 4 end_CELL start_CELL italic_L end_CELL start_CELL italic_L end_CELL start_CELL italic_L end_CELL start_CELL italic_L end_CELL start_CELL 0 end_CELL end_ROW end_ARG ] , (44)

where L𝐿Litalic_L is a large number meant to prevent suggestions that lead to a decrease in education, which is impossible (we use L=1,000𝐿1000L=1,000italic_L = 1 , 000). These numbers represent the expected number of years required to gain the specified degree (i.e. the cost of going from a high school degree to a bachelors degree is A2,6=4subscript𝐴264A_{2,6}=4italic_A start_POSTSUBSCRIPT 2 , 6 end_POSTSUBSCRIPT = 4).

Finally the options for fields of work are (1) Service, (2) Sales, (3) Blue-Collar (4) White Collar, (5) Professional, (6) Other. The cost transition matrix associated with the level of education (as ordered above) is

AProfession=[012341101231110121111011111101111110].subscript𝐴Professionmatrix012341101231110121111011111101111110A_{\text{Profession}}=\begin{bmatrix}0&1&2&3&4&1\\ 1&0&1&2&3&1\\ 1&1&0&1&2&1\\ 1&1&1&0&1&1\\ 1&1&1&1&0&1\\ 1&1&1&1&1&0\end{bmatrix}.italic_A start_POSTSUBSCRIPT Profession end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 2 end_CELL start_CELL 3 end_CELL start_CELL 4 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 2 end_CELL start_CELL 3 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 2 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ] . (45)

This represents a cost of 1111 for any change

Law School Success Prediction Dataset: (Wightman, 1998) This data set contains demographic information and academic records for over 20,000 law school students labelled by whether or not a student passed the BAR exam. Our target set is an 85% chance of passing the BAR. To create 𝒜(𝐱)𝒜𝐱\mathcal{A}(\mathbf{x)}caligraphic_A ( bold_x ), we suppose the law school performance is merely a projection that can be changed through more studying, allowing us to change the law school grades and the location where the students take the BAR. The cost function d𝒳subscript𝑑𝒳d_{\mathcal{X}}italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT sums the increase in grades and the physical distance travelled to take the BAR where moving to an adjacent region (e.g. Far West to North West) is weighted the same as increasing grades by one standard deviation.

Specifically d𝒴subscript𝑑𝒴d_{\mathcal{Y}}italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT sums the increase in grades and the physical distance travelled to take the BAR where moving to an adjacent region (e.g. Far West to North West) is weighted the same as increasing grades one standard deviation. This set up returns the optimal combination of studying harder and moving location to take the BAR.In this data set d𝒳subscript𝑑𝒳d_{\mathcal{X}}italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT is sum of the change in grades (in standard deviations from the mean) and distance traveled. The country was divided into eight regions: (1) Far West, (2) Great Lakes, (3) Mid-South, (4) Mountain West, (5) Mid-West, (6) North East, (7) New England, (8) North West. We use the transition cost matrix

ARegion=[0341265130121213410212151220143221110323622430155113210513523550]subscript𝐴Regionmatrix0341265130121213410212151220143221110323622430155113210513523550A_{\text{Region}}=\begin{bmatrix}0&3&4&1&2&6&5&1\\ 3&0&1&2&1&2&1&3\\ 4&1&0&2&1&2&1&5\\ 1&2&2&0&1&4&3&2\\ 2&1&1&1&0&3&2&3\\ 6&2&2&4&3&0&1&5\\ 5&1&1&3&2&1&0&5\\ 1&3&5&2&3&5&5&0\end{bmatrix}italic_A start_POSTSUBSCRIPT Region end_POSTSUBSCRIPT = [ start_ARG start_ROW start_CELL 0 end_CELL start_CELL 3 end_CELL start_CELL 4 end_CELL start_CELL 1 end_CELL start_CELL 2 end_CELL start_CELL 6 end_CELL start_CELL 5 end_CELL start_CELL 1 end_CELL end_ROW start_ROW start_CELL 3 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 2 end_CELL start_CELL 1 end_CELL start_CELL 2 end_CELL start_CELL 1 end_CELL start_CELL 3 end_CELL end_ROW start_ROW start_CELL 4 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 2 end_CELL start_CELL 1 end_CELL start_CELL 2 end_CELL start_CELL 1 end_CELL start_CELL 5 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 2 end_CELL start_CELL 2 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 4 end_CELL start_CELL 3 end_CELL start_CELL 2 end_CELL end_ROW start_ROW start_CELL 2 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 3 end_CELL start_CELL 2 end_CELL start_CELL 3 end_CELL end_ROW start_ROW start_CELL 6 end_CELL start_CELL 2 end_CELL start_CELL 2 end_CELL start_CELL 4 end_CELL start_CELL 3 end_CELL start_CELL 0 end_CELL start_CELL 1 end_CELL start_CELL 5 end_CELL end_ROW start_ROW start_CELL 5 end_CELL start_CELL 1 end_CELL start_CELL 1 end_CELL start_CELL 3 end_CELL start_CELL 2 end_CELL start_CELL 1 end_CELL start_CELL 0 end_CELL start_CELL 5 end_CELL end_ROW start_ROW start_CELL 1 end_CELL start_CELL 3 end_CELL start_CELL 5 end_CELL start_CELL 2 end_CELL start_CELL 3 end_CELL start_CELL 5 end_CELL start_CELL 5 end_CELL start_CELL 0 end_CELL end_ROW end_ARG ] (46)

Moves to adjacent regions result in a cost of 1111, while the highest cost of 6666 is incurred by moving from Far West to New England or back.

Diabetes Prediction Dataset: (for Disease Control &, CDC) This data set contains information on the demographics, health conditions and health habits of 250,000 individuals labelled by whether an individual is diabetic extracted from the Behavioral Risk Factor Surveillance System (BRFSS), a health-related telephone survey that is collected annually by the CDC.. We define 𝒜(𝐱)𝒜𝐱\mathcal{A}(\mathbf{x})caligraphic_A ( bold_x ) to allow changes in health habits, BMI, education and income. We use a weighted 2-norm for d𝒳subscript𝑑𝒳d_{\mathcal{X}}italic_d start_POSTSUBSCRIPT caligraphic_X end_POSTSUBSCRIPT to represent the relative difficulty of making changes. For example, starting to get regular physical activity is weighted the same as drop** one BMI point. Increasing education, income and health insurance were weighted as more difficult that simply adjusting health habits.

German Credit Dataset: (Hofmann, 1994) This commonly used data set contains information on 1,000 loan applications in Germany labelled by their credit risk. The actionable set allows for changes in the loan request (time and size) as well as the funds in the applicants checking and savings account and whither the applicant has a telephone. The target set T𝑇Titalic_T is a greater than 80% of being a good credit risk. The cost function d𝒴subscript𝑑𝒴d_{\mathcal{Y}}italic_d start_POSTSUBSCRIPT caligraphic_Y end_POSTSUBSCRIPT is the direct measuring the total difference in Deutsche Marks (DM) between all elements of the application. No cost was assigned to closing empty accounts. The change in length of loan is converted to DM through the individual’s monthly disposable income. Finally we set a flat cost of 50DM to acquire a telephone.

A.3.2 Model Details

We used fully connected feed forward neural networks. Each network used 3 hidden layers with ReLu activation functions between each layer. We tuned the parameters of the neural networks until we achieved accuracy on par with common tree based classifiers (random forests and histogram boosted trees). Accuracy results are presented in table A.3.2. For all data sets except the German Credit data set each hidden layer had 60606060 nodes. The German Credit data set required 120120120120 nodes per layer. Additionally, for the German Credit data set only, we used dropout regularization of 20%percent2020\%20 % on each hidden layer. We trained these models using the ADAM optimizer to minimize cross entropy loss. We used an 80101080101080-10-1080 - 10 - 10 train-validate-test data split and implemented early stop** with the validation data. All Trustworthy Actionable Perturbations, counterfactuals and adversarial examples were created for the testing data. We used identical architecture for V𝑉Vitalic_V as M𝑀Mitalic_M, except for doubling the input size. Accuracy data may be found in table 3.

Adult Income Law School Success Diabetes Prediction German Credit
Random Forest 73% 64% 62% 74%
Histogram Gradient Boosted Trees 81% 77% 75% 69%
Neural Network 80% 77% 75% 75%

We also tested the calibration of our networks by calculating the expected calibration error (ECE) (Naeini et al., 2015). We used 15 bins and record the results in table A.3.2

Adult Income Law School Success Diabetes Prediction German Credit
ECE (15 bins) 16% 15% 7% 21%

A.3.3 Objective Function Details

In our implementation we formulated the actionablility penalty term b𝑏bitalic_b as

b(𝐱~)=G(i=1mmax{0,𝐱~iui}+max{0,li𝐱~i})𝑏~𝐱𝐺superscriptsubscript𝑖1𝑚0subscript~𝐱𝑖subscript𝑢𝑖0subscript𝑙𝑖subscript~𝐱𝑖b(\tilde{\mathbf{x}})=G\left(\sum_{i=1}^{m}\max\{0,\tilde{\mathbf{x}}_{i}-u_{i% }\}+\max\{0,l_{i}-\tilde{\mathbf{x}}_{i}\}\right)italic_b ( over~ start_ARG bold_x end_ARG ) = italic_G ( ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_m end_POSTSUPERSCRIPT roman_max { 0 , over~ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - italic_u start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } + roman_max { 0 , italic_l start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT - over~ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT } ) (47)

with G𝐺Gitalic_G a sufficiently large constant. We formulated our coherence penalty term p𝑝pitalic_p as

p(𝐱~)=Pi=1C(1j𝒞i𝐱~j)2,𝑝~𝐱𝑃superscriptsubscript𝑖1𝐶superscript1subscript𝑗subscript𝒞𝑖subscript~𝐱𝑗2p(\tilde{\mathbf{x}})=P\sum_{i=1}^{C}\left(1-\sum_{j\in\mathcal{C}_{i}}\tilde{% \mathbf{x}}_{j}\right)^{2},italic_p ( over~ start_ARG bold_x end_ARG ) = italic_P ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_C end_POSTSUPERSCRIPT ( 1 - ∑ start_POSTSUBSCRIPT italic_j ∈ caligraphic_C start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT end_POSTSUBSCRIPT over~ start_ARG bold_x end_ARG start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (48)

with P𝑃Pitalic_P another appropriately large constant. The conditioner function cond𝑐𝑜𝑛𝑑conditalic_c italic_o italic_n italic_d simply rounded integer and Boolean values to the nearest integer value. For one-hot encoded features categorical features, the category with the largest value set to one and all other categories set to zero.

A.3.4 Additional Experimental Results

Here we show success bar charts similar to those found in figure 7 compare the efficacy of Trustworthy Actionable Perturbations, counterfactuals (Wachter et al., 2017; Mothilal et al., 2020) and adversarial examples from the Carlini Wagner 2subscript2\ell_{2}roman_ℓ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT attack (Carlini & Wagner, 2017b) for all data sets. These are similar to Figure 5, but include all data sets and an increased number of cost (ϵitalic-ϵ\epsilonitalic_ϵ) values.

Each bar chart refers to a particular data set and desired distance δ𝛿\deltaitalic_δ to the target set T𝑇Titalic_T. Inside of each chart, the bars show the percentage of individuals that a method was able to successfully move inside the goal δ𝛿\deltaitalic_δ at a variety of costs ϵitalic-ϵ\epsilonitalic_ϵ. Figure 7 shows data before the verification procedure has been performed and 7 shows the data after all . In these tests, the Trustworthy Actionable Perturbations (in blue) outperform the counterfactuals (in green and orange) in nearly all cases except for when both methods achieved 100%percent100100\%100 % success or the very high-cost (large ϵitalic-ϵ\epsilonitalic_ϵ) high reward (δ=0𝛿0\delta=0italic_δ = 0) scenarios. Carlini Wagner attacks (red) are only effective at larger δ𝛿\deltaitalic_δ values because they are designed to move a data point just barely inside the target class. The Carlini Wagner attacks are not required to be actionable (or even feasible), so they do not constitute useful advise. The verifier is able to recognize that these adversarial examples are untrustworthy in all cases.

Refer to caption
Figure 7: Performance comparison over entire datasets before verification: The graphs show average success rate for moving individuals within a variety of distances (δ𝛿\deltaitalic_δ) to the target set. The y-axis shows the percentage of individuals within the goal distance, and the x-axis, represents different costs (ϵitalic-ϵ\epsilonitalic_ϵ values) to achieve the goal. These values were obtained before applying the verification procedure.
Refer to caption
Figure 8: Performance comparison over entire datasets after verification: The graphs show average success rate for moving individuals within a variety of distances (δ𝛿\deltaitalic_δ) to the target set. The y-axis shows the percentage of individuals within the goal distance, and the x-axis, represents different costs (ϵitalic-ϵ\epsilonitalic_ϵ values) to achieve the goal. These values were obtained after applying the verification procedure with a 10%percent1010\%10 % chance of eliminating valid inputs.