Bias Detection via Signaling

Yiling Chen
Harvard University
[email protected]
&Tao Lin
Harvard University
[email protected]
&Ariel D. Procaccia
Harvard University
[email protected]
&Aaditya Ramdas
Carnegie Mellon University
[email protected]
&Itai Shapira
Harvard University
[email protected]
Abstract

We introduce and study the problem of detecting whether an agent is updating their prior beliefs given new evidence in an optimal way that is Bayesian, or whether they are biased towards their own prior. In our model, biased agents form posterior beliefs that are a convex combination of their prior and the Bayesian posterior, where the more biased an agent is, the closer their posterior is to the prior. Since we often cannot observe the agent’s beliefs directly, we take an approach inspired by information design. Specifically, we measure an agent’s bias by designing a signaling scheme and observing the actions they take in response to different signals, assuming that they are maximizing their own expected utility; our goal is to detect bias with a minimum number of signals. Our main results include a characterization of scenarios where a single signal suffices and a computationally efficient algorithm to compute optimal signaling schemes.

1 Introduction

A bag contains two coins that look and feel identical, but one is a fair coin that, on a flip, comes up heads with probability 0.5, and the other is an unfair coin with probability 0.9 of heads. You reach into the bag, grab one of the coins and flip it once; it lands on heads. Since you are (hopefully) familiar with Bayes’ rule, you conclude that the probability you are holding the fair coin is 0.36absent0.36\approx 0.36≈ 0.36. Now suppose you are offered the following deal: if you pay $1, you get to flip the same coin again, and if it comes up heads, you will receive $1.4. Since you now believe that the probability of heads is 0.76, you take the deal (assuming you are risk neutral) and earn 6 cents in expectation.

If, by contrast, another risk-neutral person in the same situation decides to decline the same deal, they must believe that the probability they are holding the fair coin is greater than 0.47. That is, their belief is still very close to the prior of 0.5. We think of such a person as being biased, in the sense that they are unwilling to significantly update their beliefs, despite evidence to the contrary.

Of course, failing to update one’s beliefs about coin flips is not the end of the world. But this example serves to illustrate a broader phenomenon that, in our view, is both important and ubiquitous. In particular, the “stickiness” of prior beliefs in the face of evidence plays a role in politics — think of the controversy over Russian collusion in the 2016 US presidential election or the existence of weapons of mass destruction in Iraq in 2003. It is also prevalent in science, as exemplified by the polarized debate over the origins of the Covid pandemic [3].

Our goal in this paper is to develop algorithms that are able to detect bias in the form of non-Bayesian updating of beliefs. To our knowledge, we are the first to formalize and analytically address this problem, and we aim to build an initial framework that future work would build on. In the long term, we believe such algorithms could have many applications, including understanding to what degree the foregoing type of bias contributes to disagreement and polarization, and discounting the opinions of biased agents to improve collective decision making.

Our approach. The first question we need to answer is how to quantify bias. In this first investigation, we adopt a linear model of bias that was proposed and used as a general belief updating model in economics [8, 10, 5]. If the prior is μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and the correct Bayesian posterior upon receiving a signal (or evidence) s𝑠sitalic_s is denoted μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT, we posit that an agent with bias w[0,1]𝑤01w\in[0,1]italic_w ∈ [ 0 , 1 ] adopts the belief wμ0+(1w)μs𝑤subscript𝜇01𝑤subscript𝜇𝑠w\mu_{0}+(1-w)\mu_{s}italic_w italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_w ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT. At the extremes, an agent with bias w=0𝑤0w=0italic_w = 0 performs perfect Bayesian updating and an agent with bias w=1𝑤1w=1italic_w = 1 cannot be convinced to budge from the prior.

The bigger conceptual question is how we can infer an agent’s bias. To address it, we take an approach that is inspired by the literature on information design [13]. In our context, suppose that we (the principal) and the agent have asymmetric information: while both share a common (say public) prior about the state of the world, the principal knows the true (realized) state of the world, but the agent does not. The principal publicly commits to a (randomized) signaling scheme that specifies the probability of sending each possible signal given each possible realized state of the world. Given their knowledge of the latter, the principal draws a signal from the specified distribution and sends it to the agent. Upon receiving such a signal, the agent updates their beliefs about the state of the world (from the common prior) and then takes an action that maximizes their expected utility according to a given utility function. Similarly to the example we started with, it is the action taken by the agent that can (indirectly) reveal their degree of bias.

The challenge, then, is to design signaling schemes that test whether the agent’s bias is above or below some threshold. (Through binary search, this makes it possible to estimate the level of bias.) We wish to do so in the most efficient way, that is, using a minimum number of signals in expectation.

Our results. We design a polynomial-time algorithm that computes optimal signaling schemes, in Section 4. We first show that constant algorithms, which repeatedly use the same signaling scheme, are as powerful as adaptive algorithms, which can vary the scheme over time based on historical data (Lemma 4.1); we can therefore restrict our attention to constant algorithms. In Lemma 4.5, we establish a version of the revelation principle for the bias detection problem, which asserts that optimal signaling schemes need only use signals that can be interpreted as action recommendations. Finally, building on these insights, we show that the optimal solution to our problem is obtained by solving a “small” linear program (Algorithm 1 and Theorem 4.6).

In Section 5, we present a geometric characterization of optimal signaling schemes (Theorem 5.2), which sheds additional light on the performance of the algorithm. In particular, the characterization provides sufficient and necessary conditions for the testability of bias, and also identifies cases where only a single sample is needed for this task.

Related work. There is a significant body of experimental work in the social sciences aiming to explain the failure of partisans to reach similar beliefs on factual questions where there is a large amount of publicly available evidence. The fact that biased belief updating occurs is undisputed (to our knowledge), and the focus is on understanding the factors that play a role. In particular, a prominent line of work supports the (perhaps counterintuitive) hypothesis that the more cognitively sophisticated a partisan is, the more politically biased is their belief update process [16, 17, 12, 11]. These results are challenged by more recent work by Tappin et al. [18], who found that greater analytical thinking is associated with belief updates that are less biased, using an experimental design that explicitly measures the proximity of belief updates to a correct Bayesian posterior. While these studies provide empirical underpinnings for our theoretical model, their research questions are orthogonal to ours: we aim to measure the magnitude of bias regardless of its source.

Classical work in information design [4, 13] studies how a principal can strategically provide information to induce an agent to take actions that are beneficial for the principal, assuming a perfectly Bayesian agent. Various relaxations of the perfectly Bayesian assumption have been investigated [1, 10, 7, 5, 9, 20, 14]. The closest to our work is that of de Clippel and Zhang [5], who study biased belief update models including the linear model. However, their goal is to maximize the principal’s utility with the agent’s bias fully known. In our problem the agent’s bias level is unknown, and the principal’s goal is to infer the agent’s bias level instead of maximizing their own utility.

2 Model

Biased agent. Consider a standard Bayesian setting: the relevant state of the world is θΘ𝜃Θ\theta\in\Thetaitalic_θ ∈ roman_Θ, distributed according to some known prior distribution μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. If an agent were perfectly Bayesian, when receiving some new information (“signal”) s𝑠sitalic_s and with the knowledge of the conditional distributions P(s|θ)𝑃conditional𝑠𝜃P(s|\theta)italic_P ( italic_s | italic_θ ) for all θ𝜃\thetaitalic_θ, they would update their belief about the state of the world according to Bayes’ Rule: μs(θ)=P(θ|s)=μ0(θ)P(s|θ)P(s)subscript𝜇𝑠𝜃𝑃conditional𝜃𝑠subscript𝜇0𝜃𝑃conditional𝑠𝜃𝑃𝑠\mu_{s}(\theta)=P(\theta|s)=\frac{\mu_{0}(\theta)P(s|\theta)}{P(s)}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_θ ) = italic_P ( italic_θ | italic_s ) = divide start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) italic_P ( italic_s | italic_θ ) end_ARG start_ARG italic_P ( italic_s ) end_ARG. We refer to μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT as the true posterior belief induced by s𝑠sitalic_s. Being biased, the agent’s belief after seeing s𝑠sitalic_s, denoted νssubscript𝜈𝑠\nu_{s}italic_ν start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT, is a convex combination of μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT and μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT:

νs=wμ0+(1w)μs,subscript𝜈𝑠𝑤subscript𝜇01𝑤subscript𝜇𝑠\nu_{s}=w\mu_{0}+(1-w)\mu_{s},italic_ν start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = italic_w italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_w ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ,

where w[0,1]𝑤01w\in[0,1]italic_w ∈ [ 0 , 1 ] is the unknown bias level, capturing the agent’s inclination to retain their prior belief in the presence of new information. This linear model was proposed and adopted in economics for non-Bayesian belief updating [8, 10], in order to capture people’s conservatism in processing new information and their tendency to protect their beliefs [19].

The agent can choose an action from a finite set A𝐴Aitalic_A and has a state-dependent utility function U:A×Θ:𝑈𝐴ΘU:A\times\Theta\to\mathbb{R}italic_U : italic_A × roman_Θ → blackboard_R. They receive utility U(a,θ)𝑈𝑎𝜃U(a,\theta)italic_U ( italic_a , italic_θ ) when taking action a𝑎aitalic_a in state θ𝜃\thetaitalic_θ. The agent will act according to their (biased) belief νssubscript𝜈𝑠\nu_{s}italic_ν start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT and choose an action a𝑎aitalic_a that maximizes their expected utility:

aargmaxaA𝔼θνs[U(a,θ)]=argmaxaAθΘνs(θ)U(a,θ).𝑎subscriptargmax𝑎𝐴subscript𝔼similar-to𝜃subscript𝜈𝑠delimited-[]𝑈𝑎𝜃subscriptargmax𝑎𝐴subscript𝜃Θsubscript𝜈𝑠𝜃𝑈𝑎𝜃a\in\operatorname*{arg\,max}_{a\in A}\,\mathbb{E}_{\theta\sim\nu_{s}}[U(a,% \theta)]=\operatorname*{arg\,max}_{a\in A}\,\sum_{\theta\in\Theta}\nu_{s}(% \theta)U(a,\theta).italic_a ∈ start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_a ∈ italic_A end_POSTSUBSCRIPT blackboard_E start_POSTSUBSCRIPT italic_θ ∼ italic_ν start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_POSTSUBSCRIPT [ italic_U ( italic_a , italic_θ ) ] = start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_a ∈ italic_A end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_ν start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ( italic_θ ) italic_U ( italic_a , italic_θ ) .

In the absence of any additional information, the agent operates based on the prior belief μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and will select an action deemed optimal with respect to μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. We introduce the following mild assumption to ensure the uniqueness of this action:

Assumption 2.1.

There is a unique action that maximizes the expected utility based on the prior belief μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT: |argmaxaA{θΘμ0(θ)U(a,θ)}|=1subscriptargmax𝑎𝐴subscript𝜃Θsubscript𝜇0𝜃𝑈𝑎𝜃1|\operatorname*{arg\,max}_{a\in A}\{\sum_{\theta\in\Theta}\mu_{0}(\theta)U(a,% \theta)\}|=1| start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_a ∈ italic_A end_POSTSUBSCRIPT { ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) italic_U ( italic_a , italic_θ ) } | = 1.

This assumption will be made throughout the paper. We denote the unique optimal action on the prior belief as a0=argmaxaA{θΘμ0(θ)U(a,θ)}subscript𝑎0subscriptargmax𝑎𝐴subscript𝜃Θsubscript𝜇0𝜃𝑈𝑎𝜃a_{0}=\operatorname*{arg\,max}_{a\in A}\{\sum_{\theta\in\Theta}\mu_{0}(\theta)% U(a,\theta)\}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = start_OPERATOR roman_arg roman_max end_OPERATOR start_POSTSUBSCRIPT italic_a ∈ italic_A end_POSTSUBSCRIPT { ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) italic_U ( italic_a , italic_θ ) }, and call it the default action.

Bias detection. The principal, who knows the prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and the agent’s utility function U𝑈Uitalic_U, seeks to infer the agent’s bias level from their action as efficiently as possible. The principal has an informational advantage — they observe the independent realizations of the state of the world at each time step. In other words, the principal knows θtsubscript𝜃𝑡\theta_{t}italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, an independent sample drawn according to μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT at time t𝑡titalic_t. The principal wants to design signaling schemes to strategically reveal information about θtsubscript𝜃𝑡\theta_{t}italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT to the agent, ho** to influence the agent’s biased belief in a way that the agent’s chosen actions reveal information about their bias level. Specifically, with a finite signal space S𝑆Sitalic_S, the principal can commit to a signaling scheme πt:ΘΔ(S):subscript𝜋𝑡ΘΔ𝑆\pi_{t}:\Theta\to\Delta(S)italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT : roman_Θ → roman_Δ ( italic_S ) at time t𝑡titalic_t, where πt(s|θ)subscript𝜋𝑡conditional𝑠𝜃\pi_{t}(s|\theta)italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_s | italic_θ ) specifies the probability of sending signal s𝑠sitalic_s in state θ𝜃\thetaitalic_θ at time t𝑡titalic_t. After seeing a signal stsubscript𝑠𝑡s_{t}italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT, drawn according to πt(s|θt)subscript𝜋𝑡conditional𝑠subscript𝜃𝑡\pi_{t}(s|\theta_{t})italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ( italic_s | italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) at time t𝑡titalic_t, the agent takes action atsubscript𝑎𝑡a_{t}italic_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT that is optimal for their biased belief νstsubscript𝜈subscript𝑠𝑡\nu_{s_{t}}italic_ν start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT end_POSTSUBSCRIPT. The principal infers information about bias w𝑤witalic_w from the history of signaling schemes, realized states, realized signals, and agent actions t={(π1,θ1,s1,a1),,(πt,θt,st,at)}subscript𝑡subscript𝜋1subscript𝜃1subscript𝑠1subscript𝑎1subscript𝜋𝑡subscript𝜃𝑡subscript𝑠𝑡subscript𝑎𝑡\mathcal{H}_{t}=\{(\pi_{1},\theta_{1},s_{1},a_{1}),\ldots,(\pi_{t},\theta_{t},% s_{t},a_{t})\}caligraphic_H start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = { ( italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , ( italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) }. We denote by ΠΠ\Piroman_Π an adaptive algorithm that the principal uses to decide on the signaling scheme at time t+1𝑡1t+1italic_t + 1 based on history tsubscript𝑡\mathcal{H}_{t}caligraphic_H start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT.

Given a threshold τ(0,1)𝜏01\tau\in(0,1)italic_τ ∈ ( 0 , 1 ), the principal wants to design ΠΠ\Piroman_Π to answer the following question:

  • Is the agent’s bias level w𝑤witalic_w greater than or equal to τ𝜏\tauitalic_τ or less than or equal to τ𝜏\tauitalic_τ?111One may want to test wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or w<τ𝑤𝜏w<\tauitalic_w < italic_τ instead. But this requires assumptions on tie-breaking when the agent has multiple optimal actions. Indifference at w=τ𝑤𝜏w=\tauitalic_w = italic_τ allows us to avoid such assumptions.

An algorithm ΠΠ\Piroman_Π terminates as soon as it can output a deterministic answer to this question. The number of time steps for ΠΠ\Piroman_Π to terminate, denoted by Tτ(Π,w)subscript𝑇𝜏Π𝑤T_{\tau}(\Pi,w)italic_T start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( roman_Π , italic_w ), is a random variable. The sample complexity of ΠΠ\Piroman_Π is defined to be the expected termination time in the worst case over w[0,1]𝑤01w\in[0,1]italic_w ∈ [ 0 , 1 ]:

Definition 2.1 (sample complexity).

The (worst-case) sample complexity of ΠΠ\Piroman_Π is defined as222Taking the worst case over w[0,1]𝑤01w\in[0,1]italic_w ∈ [ 0 , 1 ] is not overly pessimistic. As we will show later, the worst case in fact happens at w[τε,τ+ε]𝑤𝜏𝜀𝜏𝜀w\in[\tau-\varepsilon,\tau+\varepsilon]italic_w ∈ [ italic_τ - italic_ε , italic_τ + italic_ε ] for some ε>0𝜀0\varepsilon>0italic_ε > 0, which makes intuitive sense. Therefore, the sample complexity can be equivalently defined as Tτ(Π)=maxw[τε,τ+ε]𝔼[Tτ(Π,w)]subscript𝑇𝜏Πsubscript𝑤𝜏𝜀𝜏𝜀𝔼delimited-[]subscript𝑇𝜏Π𝑤T_{\tau}(\Pi)=\max_{w\in[\tau-\varepsilon,\tau+\varepsilon]}\mathbb{E}[T_{\tau% }(\Pi,w)]italic_T start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( roman_Π ) = roman_max start_POSTSUBSCRIPT italic_w ∈ [ italic_τ - italic_ε , italic_τ + italic_ε ] end_POSTSUBSCRIPT blackboard_E [ italic_T start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( roman_Π , italic_w ) ].

Tτ(Π)=maxw[0,1]𝔼[Tτ(Π,w)].subscript𝑇𝜏Πsubscript𝑤01𝔼delimited-[]subscript𝑇𝜏Π𝑤T_{\tau}(\Pi)=\max_{w\in[0,1]}\mathbb{E}[T_{\tau}(\Pi,w)].italic_T start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( roman_Π ) = roman_max start_POSTSUBSCRIPT italic_w ∈ [ 0 , 1 ] end_POSTSUBSCRIPT blackboard_E [ italic_T start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( roman_Π , italic_w ) ] .

Our objective is to develop an algorithm ΠΠ\Piroman_Π that can determine whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ with minimal sample complexity. Specifically, we want to solve the following minimax problem:

minΠmaxw[0,1]𝔼[Tτ(Π,w)].subscriptΠsubscript𝑤01𝔼delimited-[]subscript𝑇𝜏Π𝑤\min_{\Pi}\max_{w\in[0,1]}\mathbb{E}[T_{\tau}(\Pi,w)].roman_min start_POSTSUBSCRIPT roman_Π end_POSTSUBSCRIPT roman_max start_POSTSUBSCRIPT italic_w ∈ [ 0 , 1 ] end_POSTSUBSCRIPT blackboard_E [ italic_T start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( roman_Π , italic_w ) ] .

We say that an algorithm ΠΠ\Piroman_Π is constant if it keeps using the same signaling scheme repeatedly until termination. Constant algorithms are a special case of non-adaptive algorithms, which may vary the signaling schemes over time but remain independent of historical data.

Preliminaries. We now introduce the well-known splitting lemma from the information design literature [2, 13, 15]. It relates a signaling scheme with a set of induced true posteriors for a Bayesian agent and a distribution over the set of true posteriors.

Lemma 2.1 (Splitting Lemma, e.g., [13]).

Let π𝜋\piitalic_π be a signaling scheme where each signal sS𝑠𝑆s\in Sitalic_s ∈ italic_S is sent with unconditional probability π(s)=θΘμ0(θ)π(s|θ)𝜋𝑠subscript𝜃Θsubscript𝜇0𝜃𝜋conditional𝑠𝜃\pi(s)=\sum_{\theta\in\Theta}\mu_{0}(\theta)\pi(s|\theta)italic_π ( italic_s ) = ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) italic_π ( italic_s | italic_θ ) and induces true posterior μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT. Then, the prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT equals the convex combination of {μs}sSsubscriptsubscript𝜇𝑠𝑠𝑆\{\mu_{s}\}_{s\in S}{ italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT with weights {π(s)}sSsubscript𝜋𝑠𝑠𝑆\{\pi(s)\}_{s\in S}{ italic_π ( italic_s ) } start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT: μ0=sSπ(s)μssubscript𝜇0subscript𝑠𝑆𝜋𝑠subscript𝜇𝑠\mu_{0}=\sum_{s\in S}\pi(s)\mu_{s}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT italic_π ( italic_s ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT. Conversely, if the prior can be expressed as a convex combination of distributions μsΔ(Θ)subscriptsuperscript𝜇𝑠ΔΘ\mu^{\prime}_{s}\in\Delta(\Theta)italic_μ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ∈ roman_Δ ( roman_Θ ): μ0=sSpsμssubscript𝜇0subscript𝑠𝑆subscript𝑝𝑠subscriptsuperscript𝜇𝑠\mu_{0}=\sum_{s\in S}p_{s}\mu^{\prime}_{s}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_μ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT, where ps0,sSps=1formulae-sequencesubscript𝑝𝑠0subscript𝑠𝑆subscript𝑝𝑠1p_{s}\geq 0,\sum_{s\in S}p_{s}=1italic_p start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ≥ 0 , ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = 1, then there exists a signaling scheme π𝜋\piitalic_π where each signal s𝑠sitalic_s is sent with unconditional probability π(s)=ps𝜋𝑠subscript𝑝𝑠\pi(s)=p_{s}italic_π ( italic_s ) = italic_p start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT and induces posterior μssubscriptsuperscript𝜇𝑠\mu^{\prime}_{s}italic_μ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT.

The splitting lemma is also referred as the Bayesian consistency condition. It allows one to think about choosing a signaling scheme as choosing a set of true posteriors, {μs}sSsubscriptsubscript𝜇𝑠𝑠𝑆\{\mu_{s}\}_{s\in S}{ italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT, and a distribution over the set, {π(s)}sSsubscript𝜋𝑠𝑠𝑆\{\pi(s)\}_{s\in S}{ italic_π ( italic_s ) } start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT, in a Bayesian consistent way.

3 Warm-Up: A Two-State, Two-Action Example

How can the principal design a signaling scheme to learn the agent’s bias level? We use a simple two-state, two-action example to demonstrate how inducing a specific true posterior belief will allow the principal to determine whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ.

The two states of the world are represented as {GoodGood\mathrm{Good}roman_Good, BadBad\mathrm{Bad}roman_Bad}. The agent has two possible actions: ActiveActive\mathrm{Active}roman_Active and PassivePassive\mathrm{Passive}roman_Passive. Taking the PassivePassive\mathrm{Passive}roman_Passive action always yields a utility of 00, independently of the state. For the ActiveActive\mathrm{Active}roman_Active action, the utility is a𝑎aitalic_a if the state is GoodGood\mathrm{Good}roman_Good and b𝑏-b- italic_b otherwise; a,b>0𝑎𝑏0a,b>0italic_a , italic_b > 0. We use the probability of the GoodGood\mathrm{Good}roman_Good state to represent a belief, so the prior is a number μ0[0,1]subscript𝜇001\mu_{0}\in[0,1]italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ [ 0 , 1 ], which is only a slight abuse of notation. With belief μ[0,1]𝜇01\mu\in[0,1]italic_μ ∈ [ 0 , 1 ] for the GoodGood\mathrm{Good}roman_Good state (and 1μ1𝜇1-\mu1 - italic_μ for the BadBad\mathrm{Bad}roman_Bad state), the agent’s expected utility for choosing the ActiveActive\mathrm{Active}roman_Active action is aμb(1μ)=(a+b)μb𝑎𝜇𝑏1𝜇𝑎𝑏𝜇𝑏a\mu-b(1-\mu)=(a+b)\mu-bitalic_a italic_μ - italic_b ( 1 - italic_μ ) = ( italic_a + italic_b ) italic_μ - italic_b. Thus, the ActiveActive\mathrm{Active}roman_Active action is better than the PassivePassive\mathrm{Passive}roman_Passive action (so the agent will take ActiveActive\mathrm{Active}roman_Active) if

(a+b)μb>0μ>ba+b=:μ.(a+b)\mu-b>0\quad\;\Longleftrightarrow\;\quad\mu>\tfrac{b}{a+b}=:\mu^{*}.( italic_a + italic_b ) italic_μ - italic_b > 0 ⟺ italic_μ > divide start_ARG italic_b end_ARG start_ARG italic_a + italic_b end_ARG = : italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT . (1)

Conversely, the PassivePassive\mathrm{Passive}roman_Passive action is better if μ<μ𝜇superscript𝜇\mu<\mu^{*}italic_μ < italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT. Here, μ=ba+bsuperscript𝜇𝑏𝑎𝑏\mu^{*}=\frac{b}{a+b}italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = divide start_ARG italic_b end_ARG start_ARG italic_a + italic_b end_ARG is an indifference belief where the agent is indifferent between the two actions. We assume that the prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT satisfies 0<μ0<μ0subscript𝜇0superscript𝜇0<\mu_{0}<\mu^{*}0 < italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, so the agent chooses the PassivePassive\mathrm{Passive}roman_Passive action by default.

Consider the following constant signaling scheme πτsubscript𝜋𝜏\pi_{\tau}italic_π start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT with two signals {G,B}𝐺𝐵\{G,B\}{ italic_G , italic_B }:

  • If the state is GoodGood\mathrm{Good}roman_Good, send signal G𝐺Gitalic_G with probability one.

  • If the state is BadBad\mathrm{Bad}roman_Bad, send signal B𝐵Bitalic_B with probability μμ0(μτμ0)(1μ0)superscript𝜇subscript𝜇0superscript𝜇𝜏subscript𝜇01subscript𝜇0\frac{\mu^{*}-\mu_{0}}{(\mu^{*}-\tau\mu_{0})(1-\mu_{0})}divide start_ARG italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG ( italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ( 1 - italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_ARG and signal G𝐺Gitalic_G with the complement probability.

We will show that, by repeatedly using πτsubscript𝜋𝜏\pi_{\tau}italic_π start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT, we can test whether the agent’s bias w𝑤witalic_w is τabsent𝜏\leq\tau≤ italic_τ or τabsent𝜏\geq\tau≥ italic_τ. By Bayes’ Rule, the true posterior beliefs (for the GoodGood\mathrm{Good}roman_Good state) associated with the two signals are μB=0subscript𝜇𝐵0\mu_{B}=0italic_μ start_POSTSUBSCRIPT italic_B end_POSTSUBSCRIPT = 0 (i.e., on receiving B𝐵Bitalic_B, the agent knows the state is BadBad\mathrm{Bad}roman_Bad) and

μG=P(Good|G)=μ0πτ(G|Good)μ0πτ(G|Good)+(1μ0)πτ(G|Bad)=μτμ01τ.subscript𝜇𝐺𝑃conditionalGood𝐺subscript𝜇0subscript𝜋𝜏conditional𝐺Goodsubscript𝜇0subscript𝜋𝜏conditional𝐺Good1subscript𝜇0subscript𝜋𝜏conditional𝐺Badsuperscript𝜇𝜏subscript𝜇01𝜏\mu_{G}=P(\mathrm{Good}|G)=\tfrac{\mu_{0}\cdot\pi_{\tau}(G|\mathrm{Good})}{\mu% _{0}\cdot\pi_{\tau}(G|\mathrm{Good})+(1-\mu_{0})\cdot\pi_{\tau}(G|\mathrm{Bad}% )}=\tfrac{\mu^{*}-\tau\mu_{0}}{1-\tau}.italic_μ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT = italic_P ( roman_Good | italic_G ) = divide start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⋅ italic_π start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( italic_G | roman_Good ) end_ARG start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ⋅ italic_π start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( italic_G | roman_Good ) + ( 1 - italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ⋅ italic_π start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( italic_G | roman_Bad ) end_ARG = divide start_ARG italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_τ end_ARG .

Notably, the posterior μGsubscript𝜇𝐺\mu_{G}italic_μ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT satisfies the following property: if the agent’s bias level w𝑤witalic_w is exactly equal to τ𝜏\tauitalic_τ, then the agent’s biased belief is equal to the indifference belief:

when w=τ,νG=τμ0+(1τ)μG=μ.when w=τsubscript𝜈𝐺𝜏subscript𝜇01𝜏subscript𝜇𝐺superscript𝜇\text{when $w=\tau$},\quad\quad\nu_{G}=\tau\mu_{0}+(1-\tau)\mu_{G}=\mu^{*}.when italic_w = italic_τ , italic_ν start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT = italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT = italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT .

We also note the inequality μ0<μ<μGsubscript𝜇0superscript𝜇subscript𝜇𝐺\mu_{0}<\mu^{*}<\mu_{G}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT < italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT < italic_μ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT. As a result, if the agent’s bias level w𝑤witalic_w is greater than τ𝜏\tauitalic_τ, then the biased belief will be smaller than μsuperscript𝜇\mu^{*}italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT, and otherwise the opposite is true:

for w>τ,wμ0+(1w)μG<μ; for w<τ,wμ0+(1w)μG>μ.formulae-sequencefor w>τ,𝑤subscript𝜇01𝑤subscript𝜇𝐺superscript𝜇 for w<τ,𝑤subscript𝜇01𝑤subscript𝜇𝐺superscript𝜇\text{for $w>\tau$,}\quad w\mu_{0}+(1-w)\mu_{G}<\mu^{*};\quad\quad\text{ for $% w<\tau$,}\quad w\mu_{0}+(1-w)\mu_{G}>\mu^{*}.for italic_w > italic_τ , italic_w italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_w ) italic_μ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT < italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ; for italic_w < italic_τ , italic_w italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_w ) italic_μ start_POSTSUBSCRIPT italic_G end_POSTSUBSCRIPT > italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT .

By Equation (1), this means that the agent will take the PassivePassive\mathrm{Passive}roman_Passive action if w>τ𝑤𝜏w>\tauitalic_w > italic_τ, and the ActiveActive\mathrm{Active}roman_Active action if w<τ𝑤𝜏w<\tauitalic_w < italic_τ (on receiving G𝐺Gitalic_G). Therefore, by observing which action is taken by the agent when signal G𝐺Gitalic_G is sent, we can immediately tell whether wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ or wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ. This leads to the following:

Theorem 3.1.

In the two-state, two-action example, for any threshold τ[0,1μ1μ0]𝜏01superscript𝜇1subscript𝜇0\tau\in[0,\frac{1-\mu^{*}}{1-\mu_{0}}]italic_τ ∈ [ 0 , divide start_ARG 1 - italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG 1 - italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ], the above constant signaling scheme πτsubscript𝜋𝜏\pi_{\tau}italic_π start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT can test whether the agent’s bias w𝑤witalic_w satisfies wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ or wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ: specifically, whenever the signal G𝐺Gitalic_G is sent,

  • if the agent takes action ActiveActive\mathrm{Active}roman_Active, then wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ,

  • if the agent takes action PassivePassive\mathrm{Passive}roman_Passive, then wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ.

The sample complexity of this scheme is μμ0μ0(1τ)+1superscript𝜇subscript𝜇0subscript𝜇01𝜏1\frac{\mu^{*}-\mu_{0}}{\mu_{0}(1-\tau)}+1divide start_ARG italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( 1 - italic_τ ) end_ARG + 1, which increases with τ𝜏\tauitalic_τ.

Proof.

The range τ[0,1μ1μ0]𝜏01superscript𝜇1subscript𝜇0\tau\in[0,\frac{1-\mu^{*}}{1-\mu_{0}}]italic_τ ∈ [ 0 , divide start_ARG 1 - italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG 1 - italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG ] ensures that the probability πτ(B|Bad)=μμ0(μτμ0)(1μ0)subscript𝜋𝜏conditional𝐵Badsuperscript𝜇subscript𝜇0superscript𝜇𝜏subscript𝜇01subscript𝜇0\pi_{\tau}(B|\mathrm{Bad})=\frac{\mu^{*}-\mu_{0}}{(\mu^{*}-\tau\mu_{0})(1-\mu_% {0})}italic_π start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( italic_B | roman_Bad ) = divide start_ARG italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG ( italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) ( 1 - italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) end_ARG is in [0,1]01[0,1][ 0 , 1 ]. The two items in the theorem follow from the argument before the theorem statement. The sample complexity is equal to the expected number of time steps until a G𝐺Gitalic_G signal is sent, which is a geometric random variable with success probability P(G)=μ0πτ(G|Good)+(1μ0)πτ(G|Bad)=μ0(1τ)μτμ0𝑃𝐺subscript𝜇0subscript𝜋𝜏conditional𝐺Good1subscript𝜇0subscript𝜋𝜏conditional𝐺Badsubscript𝜇01𝜏superscript𝜇𝜏subscript𝜇0P(G)=\mu_{0}\pi_{\tau}(G|\mathrm{Good})+(1-\mu_{0})\pi_{\tau}(G|\mathrm{Bad})=% \frac{\mu_{0}(1-\tau)}{\mu^{*}-\tau\mu_{0}}italic_P ( italic_G ) = italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT italic_π start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( italic_G | roman_Good ) + ( 1 - italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) italic_π start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( italic_G | roman_Bad ) = divide start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( 1 - italic_τ ) end_ARG start_ARG italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG. So the sample complexity is equal to the mean 1P(G)=μμ0μ0(1τ)+11𝑃𝐺superscript𝜇subscript𝜇0subscript𝜇01𝜏1\frac{1}{P(G)}=\frac{\mu^{*}-\mu_{0}}{\mu_{0}(1-\tau)}+1divide start_ARG 1 end_ARG start_ARG italic_P ( italic_G ) end_ARG = divide start_ARG italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( 1 - italic_τ ) end_ARG + 1. ∎

The main intuition behind this result is that in order to test whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ, we want to design a signaling scheme where some of the signals induce posteriors under which the agent is indifferent between two actions if the agent’s bias level is exactly τ𝜏\tauitalic_τ. Then, the action actually taken by the agent will directly reveal whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ. Such signals are useful signals, but not all signals are necessarily useful. The sample complexity is then determined by the total probability of useful signals. This intuition will carry over to computing the optimal signaling scheme for the general case in Section 4.

Finally, we remark that using the constant signaling scheme πτsubscript𝜋𝜏\pi_{\tau}italic_π start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT constructed above to test wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ is in fact the optimal adaptive algorithm, according to the results we will present in Section 4. So, the minimal sample complexity to test whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ in this two-state, two-action example is exactly μμ0μ0(1τ)+1superscript𝜇subscript𝜇0subscript𝜇01𝜏1\frac{\mu^{*}-\mu_{0}}{\mu_{0}(1-\tau)}+1divide start_ARG italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_ARG start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( 1 - italic_τ ) end_ARG + 1 as shown in Theorem 3.1.

4 Computing the Optimal Signaling Scheme in the General Case

In this section, we generalize the initial observations from the previous section to the case with any number of actions and states and general utility function U𝑈Uitalic_U. We will show how to compute the optimal algorithm (signaling scheme) to test the agent’s bias level. There are three key ingredients. First, we prove that we can use a constant signaling scheme. Second, we develop a “revelation principle” to further simplify the space of signaling schemes. Building on these two steps, we show that the optimal signaling scheme can be computed by a linear program.

4.1 Optimality of Constant Signaling Schemes

In this subsection, we show that adaptive algorithms are no better than constant algorithms for the problem of testing whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ. Therefore, to find the algorithm with minimal sample complexity, we only need to consider constant algorithms/signaling schemes.

Lemma 4.1.

Fix τ(0,1)𝜏01\tau\in(0,1)italic_τ ∈ ( 0 , 1 ). For the problem of testing whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ, the sample complexity of any adaptive algorithm is at least that of the optimal constant algorithm (i.e., using a fixed signaling scheme repeatedly).

To prove this lemma, we introduce some notations. For any action aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }, define vector

ca=(ca,θ)θΘ=(U(a0,θ)U(a,θ))θΘ|Θ|,subscript𝑐𝑎subscriptsubscript𝑐𝑎𝜃𝜃Θsubscript𝑈subscript𝑎0𝜃𝑈𝑎𝜃𝜃ΘsuperscriptΘc_{a}=(c_{a,\theta})_{\theta\in\Theta}=\big{(}U(a_{0},\theta)-U(a,\theta)\big{% )}_{\theta\in\Theta}\in\mathbb{R}^{|\Theta|},italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT = ( italic_c start_POSTSUBSCRIPT italic_a , italic_θ end_POSTSUBSCRIPT ) start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT = ( italic_U ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ ) - italic_U ( italic_a , italic_θ ) ) start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT ∈ blackboard_R start_POSTSUPERSCRIPT | roman_Θ | end_POSTSUPERSCRIPT , (2)

whose components are the utility differences between the default action a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and any other action a𝑎aitalic_a at different states θΘ𝜃Θ\theta\in\Thetaitalic_θ ∈ roman_Θ. Let Ra0Δ(Θ)subscript𝑅subscript𝑎0ΔΘR_{a_{0}}\subseteq\Delta(\Theta)italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ⊆ roman_Δ ( roman_Θ ) be the region of beliefs under which the agent strictly prefers a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT over any other action:

Ra0={μΔ(Θ)aA{a0},caμ>0}.subscript𝑅subscript𝑎0conditional-set𝜇ΔΘformulae-sequencefor-all𝑎𝐴subscript𝑎0superscriptsubscript𝑐𝑎top𝜇0R_{a_{0}}=\big{\{}\mu\in\Delta(\Theta)\mid\forall a\in A\setminus\{a_{0}\},\,c% _{a}^{\top}\mu>0\big{\}}.italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = { italic_μ ∈ roman_Δ ( roman_Θ ) ∣ ∀ italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } , italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ > 0 } . (3)

It is the intersection of |A|1𝐴1|A|-1| italic_A | - 1 open halfspaces with the probability simplex Δ(Θ)ΔΘ\Delta(\Theta)roman_Δ ( roman_Θ ). As the agent strictly prefers a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT at the prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, we have μ0Ra0subscript𝜇0subscript𝑅subscript𝑎0\mu_{0}\in R_{a_{0}}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT. The boundary of this region, Ra0subscript𝑅subscript𝑎0\partial R_{a_{0}}∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, is the set of beliefs where the agent is indifferent between a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and at least one other action aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and a𝑎aitalic_a are both (weakly) better than any other action:

Ra0={μΔ(Θ)aA{a0},caμ=0andaA{a0},caμ0}.subscript𝑅subscript𝑎0conditional-set𝜇ΔΘformulae-sequenceformulae-sequence𝑎𝐴subscript𝑎0superscriptsubscript𝑐𝑎top𝜇0andfor-allsuperscript𝑎𝐴subscript𝑎0superscriptsubscript𝑐superscript𝑎top𝜇0\partial R_{a_{0}}=\big{\{}\mu\in\Delta(\Theta)\mid\exists a\in A\setminus\{a_% {0}\},c_{a}^{\top}\mu=0~{}\text{and}~{}\forall a^{\prime}\in A\setminus\{a_{0}% \},\,c_{a^{\prime}}^{\top}\mu\geq 0\big{\}}.∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = { italic_μ ∈ roman_Δ ( roman_Θ ) ∣ ∃ italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } , italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ = 0 and ∀ italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } , italic_c start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ ≥ 0 } . (4)

Lastly, the exterior of Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, denoted as extRa0extsubscript𝑅subscript𝑎0\mathrm{ext}R_{a_{0}}roman_ext italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, comprises the set of beliefs where the agent strictly prefers not to choose a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT:

extRa0=Δ(Θ)(Ra0Ra0)={μΔ(Θ)aA{a0},caμ<0}.extsubscript𝑅subscript𝑎0ΔΘsubscript𝑅subscript𝑎0subscript𝑅subscript𝑎0conditional-set𝜇ΔΘformulae-sequence𝑎𝐴subscript𝑎0superscriptsubscript𝑐𝑎top𝜇0\mathrm{ext}R_{a_{0}}=\Delta(\Theta)\setminus(R_{a_{0}}\cup\partial R_{a_{0}})% =\big{\{}\mu\in\Delta(\Theta)\mid\exists a\in A\setminus\{a_{0}\},\,c_{a}^{% \top}\mu<0\big{\}}.roman_ext italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = roman_Δ ( roman_Θ ) ∖ ( italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∪ ∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) = { italic_μ ∈ roman_Δ ( roman_Θ ) ∣ ∃ italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } , italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ < 0 } . (5)

Given a signaling scheme π𝜋\piitalic_π, we classify its signals into three types based on the location of the biased belief associated with the signal with respect to the region Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT.

Definition 4.1.

Let τ(0,1)𝜏01\tau\in(0,1)italic_τ ∈ ( 0 , 1 ) be a parameter. Let sS𝑠𝑆s\in Sitalic_s ∈ italic_S be a signal from a signaling scheme π𝜋\piitalic_π, with associated true posterior μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT and τ𝜏\tauitalic_τ-biased posterior μsτ=τμ0+(1τ)μssuperscriptsubscript𝜇𝑠𝜏𝜏subscript𝜇01𝜏subscript𝜇𝑠\mu_{s}^{\tau}=\tau\mu_{0}+(1-\tau)\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT = italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT. We say s𝑠sitalic_s is

  • an internal signal if μsτRa0superscriptsubscript𝜇𝑠𝜏subscript𝑅subscript𝑎0\mu_{s}^{\tau}\in R_{a_{0}}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT ∈ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT;

  • a boundary signal if μsτRa0superscriptsubscript𝜇𝑠𝜏subscript𝑅subscript𝑎0\mu_{s}^{\tau}\in\partial R_{a_{0}}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT ∈ ∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT;

  • an external signal if μsτextRa0superscriptsubscript𝜇𝑠𝜏extsubscript𝑅subscript𝑎0\mu_{s}^{\tau}\in\mathrm{ext}R_{a_{0}}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT ∈ roman_ext italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT.

The above classification helps to formalize the idea of whether a signal is “useful” for bias detection. A boundary signal is useful because the action taken by the agent after receiving a boundary signal immediately tells whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ:

Lemma 4.2.

When a boundary signal is realized, the agent’s action immediately reveals whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ. Specifically, if the agent chooses action a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, then wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ; otherwise, wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ.

Proof.

If the agent’s bias level satisfies w<τ𝑤𝜏w<\tauitalic_w < italic_τ, then the biased belief νs=wμ0+(1w)μssubscript𝜈𝑠𝑤subscript𝜇01𝑤subscript𝜇𝑠\nu_{s}=w\mu_{0}+(1-w)\mu_{s}italic_ν start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = italic_w italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_w ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT must be inside Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT (because μsτ=τμ0+(1τ)μssuperscriptsubscript𝜇𝑠𝜏𝜏subscript𝜇01𝜏subscript𝜇𝑠\mu_{s}^{\tau}=\tau\mu_{0}+(1-\tau)\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT = italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is on the boundary of Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT and μ0Ra0subscript𝜇0subscript𝑅subscript𝑎0\mu_{0}\in R_{a_{0}}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT), so the agent strictly prefers the default action a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. If w>τ𝑤𝜏w>\tauitalic_w > italic_τ, then the biased belief νssubscript𝜈𝑠\nu_{s}italic_ν start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is outside of Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, so the agent will not take action a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. ∎

An external signal might also be useful in revealing whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ if the agent is indifferent between some actions a1,a2subscript𝑎1subscript𝑎2a_{1},a_{2}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT other than a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT at the τ𝜏\tauitalic_τ-biased belief μsτsuperscriptsubscript𝜇𝑠𝜏\mu_{s}^{\tau}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT. However, the following lemma shows that, in such cases, we can always modify the signaling scheme to turn the external signal into a boundary signal. This modification will increase the total probability of useful signals and hence reduce the sample complexity. The proof of this lemma is in Appendix A.1.

Lemma 4.3.

Suppose ΠΠ\Piroman_Π is an adaptive algorithm that uses signaling schemes with internal, boundary, and external signals. Then, there exists another adaptive algorithm ΠsuperscriptΠ\Pi^{\prime}roman_Π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT with equal or lower sample complexity that employs only signaling schemes with internal and boundary signals.

An internal signal, on the other hand, is not useful for testing wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ, for the following reason. For an internal signal, the biased belief with bias level τ𝜏\tauitalic_τ, μsτsuperscriptsubscript𝜇𝑠𝜏\mu_{s}^{\tau}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT, lies inside Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Since Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT is an open region, there must exist a small number ε>0𝜀0\varepsilon>0italic_ε > 0 such that when the agent has bias level w=τ+ε𝑤𝜏𝜀w=\tau+\varepsilonitalic_w = italic_τ + italic_ε or τε𝜏𝜀\tau-\varepsilonitalic_τ - italic_ε, the biased belief with bias level w𝑤witalic_w, wμ0+(1w)μs𝑤subscript𝜇01𝑤subscript𝜇𝑠w\mu_{0}+(1-w)\mu_{s}italic_w italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_w ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT, is also inside the region Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, so the agent will take action a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. As the agent takes a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT under both w=τ+ε𝑤𝜏𝜀w=\tau+\varepsilonitalic_w = italic_τ + italic_ε and τε𝜏𝜀\tau-\varepsilonitalic_τ - italic_ε, we cannot distinguish these two cases, so this signal is not helpful in determining wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ. The following lemma (with proof in Appendix A.2) formalizes the idea that internal signals are not useful:

Lemma 4.4.

To test whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ, any adaptive algorithm that uses signaling schemes with boundary and internal signals cannot terminate until a boundary signal is sent.

With the above lemmas, we can now prove Lemma 4.1.

Proof of Lemma 4.1.

By Lemma 4.3, the optimal adaptive algorithm only uses signaling schemes with boundary and internal signals. By Lemma 4.4, the algorithm cannot terminate until a boundary signal is sent. By Lemma 4.2, the algorithm terminates when a boundary signal is sent. We conclude that the termination time of any adaptive algorithm cannot be better than the constant algorithm that keeps using the signaling scheme that maximizes the total probability of boundary signals. ∎

4.2 Revelation Principle

To compute the optimal constant signaling scheme, we need another technique that is similar to the revelation principle in the information design literature [13, 6]. The revelation principle says that, in some information design problems, it is without loss of generality to consider only “direct” signaling schemes where signals are recommendations of actions for the agent, that is, the signal space is S=A𝑆𝐴S=Aitalic_S = italic_A, and when the principal sends signal a𝑎aitalic_a, it should be optimal for the agent to take action a𝑎aitalic_a given the posterior belief induced by signal a𝑎aitalic_a.

Unlike classical information design problems where the agent is unbiased, our problem involves a biased agent, so we need a different revelation principle: the signals are still action recommendations, but when the principal sends signal a𝑎aitalic_a, action a𝑎aitalic_a is optimal for an agent with bias level exactly τ𝜏\tauitalic_τ; moreover, if aa0𝑎subscript𝑎0a\neq a_{0}italic_a ≠ italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, then an agent with bias level τ𝜏\tauitalic_τ will be indifferent between a𝑎aitalic_a and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. This insight is formalized in the following lemma:

Lemma 4.5 (revelation principle for bias detection).

Let π𝜋\piitalic_π be an arbitrary signaling scheme that can test wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ. Then, there exists another signaling scheme πsuperscript𝜋\pi^{\prime}italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT that can do so with signal space S=A𝑆𝐴S=Aitalic_S = italic_A such that:

  • (1)

    Given signal aA𝑎𝐴a\in Aitalic_a ∈ italic_A, action a𝑎aitalic_a is an optimal action for any agent with bias level w=τ𝑤𝜏w=\tauitalic_w = italic_τ.

  • (2)

    Given signal aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }, actions a𝑎aitalic_a and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT are both optimal for any agent with bias level w=τ𝑤𝜏w=\tauitalic_w = italic_τ. As a corollary, if the agent’s bias level w<τ𝑤𝜏w<\tauitalic_w < italic_τ, then the agent strictly prefers a𝑎aitalic_a over a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT; and if w>τ𝑤𝜏w>\tauitalic_w > italic_τ, then the agent strictly prefers a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT over any other actions.

  • (3)

    The sample complexity satisfies Tτ(π)Tτ(π)subscript𝑇𝜏superscript𝜋subscript𝑇𝜏𝜋T_{\tau}(\pi^{\prime})\leq T_{\tau}(\pi)italic_T start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ≤ italic_T start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( italic_π ).

In the above signaling scheme πsuperscript𝜋\pi^{\prime}italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT, every aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } is a boundary signal (Definition 4.1), which is useful for testing bias: given signal aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }, if the agent takes action a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, then it must be wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ; otherwise wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ. The signal a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is internal and not useful for determining wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ. So, the sample complexity of πsuperscript𝜋\pi^{\prime}italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is equal to the expected time steps until a signal in A{a0}𝐴subscript𝑎0A\setminus\{a_{0}\}italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } is sent.

The high-level idea behind Lemma 4.5 is combination of signals. Suppose there is a signaling scheme that can determine whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ with a signal space larger than A𝐴Aitalic_A. There must exist two signals s𝑠sitalic_s and ssuperscript𝑠s^{\prime}italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT under which the agent is indifferent between a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and some action aa0𝑎subscript𝑎0a\neq a_{0}italic_a ≠ italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT if the agent’s bias level is exactly τ𝜏\tauitalic_τ. Then, we can combine the two signals into one signal s′′superscript𝑠′′s^{\prime\prime}italic_s start_POSTSUPERSCRIPT ′ ′ end_POSTSUPERSCRIPT under which the agent is still indifferent between a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and a𝑎aitalic_a, obtaining a new signaling scheme with a smaller signal space. We can do this until the signal space is reduced to size |A|𝐴|A|| italic_A |. The full proof is in Appendix A.3.

4.3 Algorithm for Computing the Optimal Signaling Scheme

Finally, we present an algorithm to compute the optimal (minimal sample complexity) signaling scheme to test whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ. The revelation principle in the previous subsection ensures that we only need a direct signaling scheme where signals are action recommendations. The optimal direct signaling scheme turns out to be solvable by a linear program, detailed in Algorithm 1. In the linear program, the optimality constraint (7) ensures that whenever the principal recommends action aA𝑎𝐴a\in Aitalic_a ∈ italic_A, it is optimal for an agent with bias level τ𝜏\tauitalic_τ to take action a𝑎aitalic_a; this satisfies condition (1) in the revelation principle (Lemma 4.5). The indifference constraint (8) ensures that when the recommended action a𝑎aitalic_a is not a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, an agent with bias level τ𝜏\tauitalic_τ is indifferent between a𝑎aitalic_a and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT; this satisfies condition (2) in the revelation principle. The objective (6) is to maximize the probability of useful signals (those in A{a0}𝐴subscript𝑎0A\setminus\{a_{0}\}italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }), hence minimize the sample complexity. The resulting signaling scheme has the property that whenever the principal recommends an action a𝑎aitalic_a other than a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, if the agent indeed follows the recommendation (or takes any other action than a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT), then the bias must be small (wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ); if the agent takes a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT instead, the bias must be large (wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ).

Input : prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, utility function U𝑈Uitalic_U, and the parameter τ(0,1)𝜏01\tau\in(0,1)italic_τ ∈ ( 0 , 1 )
Variable : signaling scheme π𝜋\piitalic_π, consisting of conditional probabilities π(a|θ)𝜋conditional𝑎𝜃\pi(a|\theta)italic_π ( italic_a | italic_θ ) for aA,θΘformulae-sequence𝑎𝐴𝜃Θa\in A,\theta\in\Thetaitalic_a ∈ italic_A , italic_θ ∈ roman_Θ
Denote ΔU(a,a,θ)=U(a,θ)U(a,θ)Δ𝑈𝑎superscript𝑎𝜃𝑈𝑎𝜃𝑈superscript𝑎𝜃\Delta U(a,a^{\prime},\theta)=U(a,\theta)-U(a^{\prime},\theta)roman_Δ italic_U ( italic_a , italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_θ ) = italic_U ( italic_a , italic_θ ) - italic_U ( italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_θ ). Solve the following linear program:
MaximizeaA{a0}θΘπ(a|θ)μ0(θ)Maximizesubscript𝑎𝐴subscript𝑎0subscript𝜃Θ𝜋conditional𝑎𝜃subscript𝜇0𝜃\text{Maximize}\quad\sum_{a\in A\setminus\{a_{0}\}}\sum_{\theta\in\Theta}\pi(a% |\theta)\mu_{0}(\theta)Maximize ∑ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_π ( italic_a | italic_θ ) italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) (6)
subject to:
Optimality of a𝑎aitalic_a over other actions: aA,aA{a}formulae-sequencefor-all𝑎𝐴for-allsuperscript𝑎𝐴𝑎\forall a\in A,\forall a^{\prime}\in A\setminus\{a\}∀ italic_a ∈ italic_A , ∀ italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_A ∖ { italic_a }
θΘπ(a|θ)μ0(θ)[(1τ)ΔU(a,a,θ)+τθΘμ0(θ)ΔU(a,a,θ)]0;subscript𝜃Θ𝜋conditional𝑎𝜃subscript𝜇0𝜃delimited-[]1𝜏Δ𝑈𝑎superscript𝑎𝜃𝜏subscriptsuperscript𝜃Θsubscript𝜇0superscript𝜃Δ𝑈𝑎superscript𝑎superscript𝜃0\displaystyle\hskip 30.00005pt\sum_{\theta\in\Theta}\pi(a|\theta)\cdot\mu_{0}(% \theta)\Big{[}(1-\tau)\Delta U(a,a^{\prime},\theta)+\tau\sum_{\theta^{\prime}% \in\Theta}\mu_{0}(\theta^{\prime})\Delta U(a,a^{\prime},\theta^{\prime})\Big{]% }\geq 0;∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_π ( italic_a | italic_θ ) ⋅ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) [ ( 1 - italic_τ ) roman_Δ italic_U ( italic_a , italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_θ ) + italic_τ ∑ start_POSTSUBSCRIPT italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) roman_Δ italic_U ( italic_a , italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT , italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ] ≥ 0 ; (7)
Indifference between a𝑎aitalic_a and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT: aA{a0}for-all𝑎𝐴subscript𝑎0\forall a\in A\setminus\{a_{0}\}∀ italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT },
θΘπ(a|θ)μ0(θ)[(1τ)ΔU(a,a0,θ)+τθΘμ0(θ)ΔU(a,a0,θ)]=0;subscript𝜃Θ𝜋conditional𝑎𝜃subscript𝜇0𝜃delimited-[]1𝜏Δ𝑈𝑎subscript𝑎0𝜃𝜏subscriptsuperscript𝜃Θsubscript𝜇0superscript𝜃Δ𝑈𝑎subscript𝑎0superscript𝜃0\displaystyle\hskip 30.00005pt\sum_{\theta\in\Theta}\pi(a|\theta)\cdot\mu_{0}(% \theta)\Big{[}(1-\tau)\Delta U(a,a_{0},\theta)+\tau\sum_{\theta^{\prime}\in% \Theta}\mu_{0}(\theta^{\prime})\Delta U(a,a_{0},\theta^{\prime})\Big{]}=0;∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_π ( italic_a | italic_θ ) ⋅ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) [ ( 1 - italic_τ ) roman_Δ italic_U ( italic_a , italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ ) + italic_τ ∑ start_POSTSUBSCRIPT italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) roman_Δ italic_U ( italic_a , italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ] = 0 ; (8)
Probability distribution constraints: θΘfor-all𝜃Θ\forall\theta\in\Theta∀ italic_θ ∈ roman_Θ,
aAπ(a|θ)=1 and aA,π(a|θ)0.formulae-sequencesubscript𝑎𝐴𝜋conditional𝑎𝜃1 and formulae-sequencefor-all𝑎𝐴𝜋conditional𝑎𝜃0\displaystyle\hskip 30.00005pt\sum_{a\in A}\pi(a|\theta)=1\quad\text{ and }% \quad\forall a\in A,~{}\pi(a|\theta)\geq 0.∑ start_POSTSUBSCRIPT italic_a ∈ italic_A end_POSTSUBSCRIPT italic_π ( italic_a | italic_θ ) = 1 and ∀ italic_a ∈ italic_A , italic_π ( italic_a | italic_θ ) ≥ 0 .
Algorithm 1 Linear program to compute the optimal signaling scheme
Theorem 4.6.

Algorithm 1 finds a constant signaling scheme for testing wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or τabsent𝜏\leq\tau≤ italic_τ that is optimal among all adaptive signaling schemes.

The linear program in Algorithm 1 has a polynomial size in |A|𝐴|A|| italic_A | (number of actions) and |Θ|Θ|\Theta|| roman_Θ | (number of states), so it is a polynomial-time algorithm.

The rest of this section proves Theorem 4.6. The proof needs an additional lemma:

Lemma 4.7.

Given a signaling scheme π=(π(a|θ))aA,θΘ𝜋subscript𝜋conditional𝑎𝜃formulae-sequence𝑎𝐴𝜃Θ\pi=(\pi(a|\theta))_{a\in A,\theta\in\Theta}italic_π = ( italic_π ( italic_a | italic_θ ) ) start_POSTSUBSCRIPT italic_a ∈ italic_A , italic_θ ∈ roman_Θ end_POSTSUBSCRIPT and an agent’s bias level w𝑤witalic_w, after signal a𝑎aitalic_a is sent, the agent strictly prefers action a1subscript𝑎1a_{1}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT over action a2subscript𝑎2a_{2}italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT under the biased belief if and only if:

θΘπ(a|θ)μ0(θ)[(1w)ΔU(a1,a2,θ)+wθΘμ0(θ)ΔU(a1,a2,θ)]>0.subscript𝜃Θ𝜋conditional𝑎𝜃subscript𝜇0𝜃delimited-[]1𝑤Δ𝑈subscript𝑎1subscript𝑎2𝜃𝑤subscriptsuperscript𝜃Θsubscript𝜇0superscript𝜃Δ𝑈subscript𝑎1subscript𝑎2superscript𝜃0\sum_{\theta\in\Theta}\pi(a|\theta)\cdot\mu_{0}(\theta)\Big{[}(1-w)\Delta U(a_% {1},a_{2},\theta)+w\sum_{\theta^{\prime}\in\Theta}\mu_{0}(\theta^{\prime})% \Delta U(a_{1},a_{2},\theta^{\prime})\Big{]}>0.∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_π ( italic_a | italic_θ ) ⋅ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) [ ( 1 - italic_w ) roman_Δ italic_U ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_θ ) + italic_w ∑ start_POSTSUBSCRIPT italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) roman_Δ italic_U ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ] > 0 .
Proof.

The agent’s biased belief under signal a𝑎aitalic_a and bias level w𝑤witalic_w is given by

(1w)μ0(θ)π(a|θ)θΘμ0(θ)π(a|θ)+wμ0(θ),θΘ.1𝑤subscript𝜇0𝜃𝜋conditional𝑎𝜃subscriptsuperscript𝜃Θsubscript𝜇0superscript𝜃𝜋conditional𝑎superscript𝜃𝑤subscript𝜇0𝜃for-all𝜃Θ(1-w)\frac{\mu_{0}(\theta)\pi(a|\theta)}{\sum_{\theta^{\prime}\in\Theta}\mu_{0% }(\theta^{\prime})\pi(a|\theta^{\prime})}+w\mu_{0}(\theta),\quad\forall\theta% \in\Theta.( 1 - italic_w ) divide start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) italic_π ( italic_a | italic_θ ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) italic_π ( italic_a | italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_ARG + italic_w italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) , ∀ italic_θ ∈ roman_Θ .

The condition for the agent to strictly prefer a1subscript𝑎1a_{1}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT over a2subscript𝑎2a_{2}italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT is that the expected utility under the biased belief when choosing a1subscript𝑎1a_{1}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT is greater than when choosing a2subscript𝑎2a_{2}italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT:

θΘ((1w)μ0(θ)π(a|θ)θΘμ0(θ)π(a|θ)+wμ0(θ))ΔU(a1,a2,θ)>0,subscript𝜃Θ1𝑤subscript𝜇0𝜃𝜋conditional𝑎𝜃subscriptsuperscript𝜃Θsubscript𝜇0superscript𝜃𝜋conditional𝑎superscript𝜃𝑤subscript𝜇0𝜃Δ𝑈subscript𝑎1subscript𝑎2𝜃0\sum_{\theta\in\Theta}\left((1-w)\frac{\mu_{0}(\theta)\pi(a|\theta)}{\sum_{% \theta^{\prime}\in\Theta}\mu_{0}(\theta^{\prime})\pi(a|\theta^{\prime})}+w\mu_% {0}(\theta)\right)\Delta U(a_{1},a_{2},\theta)>0,∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT ( ( 1 - italic_w ) divide start_ARG italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) italic_π ( italic_a | italic_θ ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) italic_π ( italic_a | italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_ARG + italic_w italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) ) roman_Δ italic_U ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_θ ) > 0 ,

where ΔU(a1,a2,θ)=U(a1,θ)U(a2,θ)Δ𝑈subscript𝑎1subscript𝑎2𝜃𝑈subscript𝑎1𝜃𝑈subscript𝑎2𝜃\Delta U(a_{1},a_{2},\theta)=U(a_{1},\theta)-U(a_{2},\theta)roman_Δ italic_U ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_θ ) = italic_U ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_θ ) - italic_U ( italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_θ ). Multiplying by θΘμ0(θ)π(a|θ)subscriptsuperscript𝜃Θsubscript𝜇0superscript𝜃𝜋conditional𝑎superscript𝜃\sum_{\theta^{\prime}\in\Theta}\mu_{0}(\theta^{\prime})\pi(a|\theta^{\prime})∑ start_POSTSUBSCRIPT italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) italic_π ( italic_a | italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ), we obtain:

(1w)θΘμ0(θ)π(a|θ)ΔU(a1,a2,θ)+wθΘμ0(θ)θΘμ0(θ)π(a|θ)ΔU(a1,a2,θ)>0.1𝑤subscript𝜃Θsubscript𝜇0𝜃𝜋conditional𝑎𝜃Δ𝑈subscript𝑎1subscript𝑎2𝜃𝑤subscript𝜃Θsubscript𝜇0𝜃subscriptsuperscript𝜃Θsubscript𝜇0superscript𝜃𝜋conditional𝑎superscript𝜃Δ𝑈subscript𝑎1subscript𝑎2𝜃0(1-w)\sum_{\theta\in\Theta}\mu_{0}(\theta)\pi(a|\theta)\Delta U(a_{1},a_{2},% \theta)+w\sum_{\theta\in\Theta}\mu_{0}(\theta)\sum_{\theta^{\prime}\in\Theta}% \mu_{0}(\theta^{\prime})\pi(a|\theta^{\prime})\Delta U(a_{1},a_{2},\theta)>0.( 1 - italic_w ) ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) italic_π ( italic_a | italic_θ ) roman_Δ italic_U ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_θ ) + italic_w ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) ∑ start_POSTSUBSCRIPT italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) italic_π ( italic_a | italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) roman_Δ italic_U ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_θ ) > 0 .

Factoring out the terms, this can be rewritten as:

θΘπ(a|θ)μ0(θ)((1w)ΔU(a1,a2,θ)+wθΘμ0(θ)ΔU(a1,a2,θ))>0.subscript𝜃Θ𝜋conditional𝑎𝜃subscript𝜇0𝜃1𝑤Δ𝑈subscript𝑎1subscript𝑎2𝜃𝑤subscriptsuperscript𝜃Θsubscript𝜇0superscript𝜃Δ𝑈subscript𝑎1subscript𝑎2superscript𝜃0\sum_{\theta\in\Theta}\pi(a|\theta)\mu_{0}(\theta)\bigg{(}(1-w)\Delta U(a_{1},% a_{2},\theta)+w\sum_{\theta^{\prime}\in\Theta}\mu_{0}(\theta^{\prime})\Delta U% (a_{1},a_{2},\theta^{\prime})\bigg{)}>0.∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_π ( italic_a | italic_θ ) italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) ( ( 1 - italic_w ) roman_Δ italic_U ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_θ ) + italic_w ∑ start_POSTSUBSCRIPT italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) roman_Δ italic_U ( italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_θ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) ) > 0 .

This final expression is positive if and only if the agent to strictly prefer a1subscript𝑎1a_{1}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT over a2subscript𝑎2a_{2}italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT. ∎

Proof of Theorem 4.6.

According to Lemma 4.1 (constant algorithms are optimal) and Lemma 4.5 (revelation principle), to find an optimal adaptive algorithm we only need to find the optimal constant signaling scheme that satisfies the conditions in Lemma 4.5. We verify that the signaling scheme computed from the linear program in Algorithm 1 satisfies the conditions in Lemma 4.5:

  • The optimality constraint (7) in the linear program, together with Lemma 4.7, ensures that: whenever signal aA𝑎𝐴a\in Aitalic_a ∈ italic_A is sent, action a𝑎aitalic_a is weakly better than any other action for an agent with bias level w=τ𝑤𝜏w=\tauitalic_w = italic_τ. This satisfies the first condition in Lemma 4.5.

  • The indifference constraint (8), together with Lemma 4.7, ensures that: whenever aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } is sent, the agent is indifferent between action a𝑎aitalic_a and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT if the bias level w=τ𝑤𝜏w=\tauitalic_w = italic_τ. Then, by the optimality constraint (7), we have both a𝑎aitalic_a and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT being optimal actions. This satisfies the second condition in Lemma 4.5.

We then argue that the solution of the linear program is the optimal signaling scheme that satisfies the conditions of Lemma 4.5. According to our argument after Lemma 4.5, only the signals in A{a0}𝐴subscript𝑎0A\setminus\{a_{0}\}italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } are useful signals, so the sample complexity is equal to the expected number of time steps until a signal in A{a0}𝐴subscript𝑎0A\setminus\{a_{0}\}italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } is sent. The probability that a signal in A{a0}𝐴subscript𝑎0A\setminus\{a_{0}\}italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } is sent at each time step is equal to

aA{a0}π(a)=aA{a0}θΘμ0(θ)π(a|θ).subscript𝑎𝐴subscript𝑎0𝜋𝑎subscript𝑎𝐴subscript𝑎0subscript𝜃Θsubscript𝜇0𝜃𝜋conditional𝑎𝜃\sum_{a\in A\setminus\{a_{0}\}}\pi(a)=\sum_{a\in A\setminus\{a_{0}\}}\sum_{% \theta\in\Theta}\mu_{0}(\theta)\pi(a|\theta).∑ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT italic_π ( italic_a ) = ∑ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) italic_π ( italic_a | italic_θ ) .

The expected number of time steps is the inverse 1aA{a0}θΘμ0(θ)π(a|θ)1subscript𝑎𝐴subscript𝑎0subscript𝜃Θsubscript𝜇0𝜃𝜋conditional𝑎𝜃\frac{1}{\sum_{a\in A\setminus\{a_{0}\}}\sum_{\theta\in\Theta}\mu_{0}(\theta)% \pi(a|\theta)}divide start_ARG 1 end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) italic_π ( italic_a | italic_θ ) end_ARG (because the number of time steps is a geometric random variable). The linear program maximizes the probability aA{a0}θΘμ0(θ)π(a|θ)subscript𝑎𝐴subscript𝑎0subscript𝜃Θsubscript𝜇0𝜃𝜋conditional𝑎𝜃\sum_{a\in A\setminus\{a_{0}\}}\sum_{\theta\in\Theta}\mu_{0}(\theta)\pi(a|\theta)∑ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) italic_π ( italic_a | italic_θ ), so it minimizes the sample complexity. ∎

5 Geometric Characterization of the Testability of Bias

To complement the algorithmic solution presented in the previous section, this section provides a geometric characterization of the bias detection problem. We identify the conditions under which testing whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ can be done in only one sample, in finite number of samples, or cannot be done at all (the scenario where the linear program in Algorithm 1 is infeasible).

By Assumption 2.1 (a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is strictly better than other actions at prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT), we have:

caμ0=θΘμ0(θ)(U(a0,θ)U(a,θ))>0,aA{a0},formulae-sequencesuperscriptsubscript𝑐𝑎topsubscript𝜇0subscript𝜃Θsubscript𝜇0𝜃𝑈subscript𝑎0𝜃𝑈𝑎𝜃0for-all𝑎𝐴subscript𝑎0c_{a}^{\top}\mu_{0}=\sum_{\theta\in\Theta}\mu_{0}(\theta)\big{(}U(a_{0},\theta% )-U(a,\theta)\big{)}>0,\quad\forall a\in A\setminus\{a_{0}\},italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) ( italic_U ( italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_θ ) - italic_U ( italic_a , italic_θ ) ) > 0 , ∀ italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } , (9)

where casubscript𝑐𝑎c_{a}italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is as defined in Equation (2). Define Iasubscript𝐼𝑎I_{a}italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT as the set of indifference beliefs between action a𝑎aitalic_a and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, which is the intersection of the hyperplane {x|cax=0}conditional-set𝑥superscriptsubscript𝑐𝑎top𝑥0\{x|c_{a}^{\top}x=0\}{ italic_x | italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_x = 0 } and the probability simplex Δ(Θ)ΔΘ\Delta(\Theta)roman_Δ ( roman_Θ ):

Ia:={μΔ(Θ)caμ=0}.assignsubscript𝐼𝑎conditional-set𝜇ΔΘsuperscriptsubscript𝑐𝑎top𝜇0I_{a}:=\{\mu\in\Delta(\Theta)\mid c_{a}^{\top}\mu=0\}.italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT := { italic_μ ∈ roman_Δ ( roman_Θ ) ∣ italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ = 0 } .

Given a parameter τ(0,1)𝜏01\tau\in(0,1)italic_τ ∈ ( 0 , 1 ), for which we want to test whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ, let Ia,τsubscript𝐼𝑎𝜏I_{a,\tau}italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT be the set of posterior beliefs for which, if the agent’s bias level is exactly τ𝜏\tauitalic_τ, then the agent’s biased belief will fall within the indifference set Iasubscript𝐼𝑎I_{a}italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT:

Ia,τ:={μΔ(Θ)(1τ)μ+τμ0Ia}.assignsubscript𝐼𝑎𝜏conditional-set𝜇ΔΘ1𝜏𝜇𝜏subscript𝜇0subscript𝐼𝑎I_{a,\tau}:=\{\mu\in\Delta(\Theta)\mid(1-\tau)\mu+\tau\mu_{0}\in I_{a}\}.italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT := { italic_μ ∈ roman_Δ ( roman_Θ ) ∣ ( 1 - italic_τ ) italic_μ + italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT } .
Lemma 5.1.

Ia,τsubscript𝐼𝑎𝜏I_{a,\tau}italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT is equal to the intersection of the probability simplex Δ(Θ)ΔΘ\Delta(\Theta)roman_Δ ( roman_Θ ) and a translation of the hyperplane {xcax=0}conditional-set𝑥superscriptsubscript𝑐𝑎top𝑥0\{x\mid c_{a}^{\top}x=0\}{ italic_x ∣ italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_x = 0 }: Ia,τ={μΔ(Θ)caμ=τ1τcaμ0}subscript𝐼𝑎𝜏conditional-set𝜇ΔΘsuperscriptsubscript𝑐𝑎top𝜇𝜏1𝜏superscriptsubscript𝑐𝑎topsubscript𝜇0I_{a,\tau}=\big{\{}\mu\in\Delta(\Theta)\mid c_{a}^{\top}\mu=-\frac{\tau}{1-% \tau}c_{a}^{\top}\mu_{0}\big{\}}italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT = { italic_μ ∈ roman_Δ ( roman_Θ ) ∣ italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ = - divide start_ARG italic_τ end_ARG start_ARG 1 - italic_τ end_ARG italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }.

The proof of this lemma is in Appendix B.1. With this representation of Ia,τsubscript𝐼𝑎𝜏I_{a,\tau}italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT in hand, we can now present a geometric characterization of the testability of bias.

(0,0,1)001(0,0,1)( 0 , 0 , 1 )(1,0,0)100(1,0,0)( 1 , 0 , 0 )(0,1,0)010(0,1,0)( 0 , 1 , 0 )x𝑥xitalic_xy𝑦yitalic_yz𝑧zitalic_zμ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTIasubscript𝐼𝑎I_{a}italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT
(a) A single sample
(0,0,1)001(0,0,1)( 0 , 0 , 1 )(1,0,0)100(1,0,0)( 1 , 0 , 0 )(0,1,0)010(0,1,0)( 0 , 1 , 0 )x𝑥xitalic_xy𝑦yitalic_yz𝑧zitalic_zμ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTIasubscript𝐼𝑎I_{a}italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPTIa,τsubscript𝐼𝑎𝜏I_{a,\tau}italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT
(b) Finite sample complexity
(0,0,1)001(0,0,1)( 0 , 0 , 1 )(1,0,0)100(1,0,0)( 1 , 0 , 0 )(0,1,0)010(0,1,0)( 0 , 1 , 0 )x𝑥xitalic_xy𝑦yitalic_yz𝑧zitalic_zμ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPTIasubscript𝐼𝑎I_{a}italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPTIa,τ=subscript𝐼𝑎𝜏I_{a,\tau}=\emptysetitalic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT = ∅
(c) Cannot be solved
Figure 1: The three qualitatively different cases for detecting the level of bias, each illustrated within a simplex over three states where μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is the prior belief. Each point within the simplex maps to an optimal action for the agent. The green curves represent indifference curves between the default action a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and another action, indicating where the agent is indifferent between these two actions under the unbiased belief. The orange curves are translations of these indifference curves; a posterior lying on these curves means that the agent’s biased belief (at bias level τ𝜏\tauitalic_τ) aligns exactly with the green curves. τ𝜏\tauitalic_τ increases from (a) to (c), so the orange curves are translated further. In Figure 1(a), μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT can be expressed as a convex combination of points in the translated curves, and thus the bias level can be detected using a single sample. In Figure 1(b), only some signals are useful, and more than one sample is needed in the worst case. In Figure 1(c), the bias level cannot be tested against τ𝜏\tauitalic_τ.
Theorem 5.2 (geometric characterization).

Fix τ(0,1)𝜏01\tau\in(0,1)italic_τ ∈ ( 0 , 1 ). The problem of testing wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ

  • Can be solved with a single sample (the sample complexity is 1) if and only if the prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is in the convex hull formed by the translated sets Ia,τsubscript𝐼𝑎𝜏I_{a,\tau}italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT for all non-default actions aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }:

    μ0ConvexHull(aA{a0}Ia,τ).subscript𝜇0ConvexHullsubscript𝑎𝐴subscript𝑎0subscript𝐼𝑎𝜏\mu_{0}\in\mathrm{ConvexHull}\bigg{(}~{}\bigcup_{a\in A\setminus\{a_{0}\}}I_{a% ,\tau}~{}\bigg{)}.italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ roman_ConvexHull ( ⋃ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT ) . (10)
  • Can be solved (with finite sample complexity) if and only if Ia,τsubscript𝐼𝑎𝜏I_{a,\tau}\neq\emptysetitalic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT ≠ ∅ for at least one aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }.

  • Cannot be solved if Ia,τ=subscript𝐼𝑎𝜏I_{a,\tau}=\emptysetitalic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT = ∅ for all aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }.

Figure 1 illustrates the three cases of Theorem 5.2. In the first case, the solution of the linear program in Algorithm 1 satisfies aA{a0}θΘπ(a|θ)μ0(θ)=1subscript𝑎𝐴subscript𝑎0subscript𝜃Θ𝜋conditional𝑎𝜃subscript𝜇0𝜃1\sum_{a\in A\setminus\{a_{0}\}}\sum_{\theta\in\Theta}\pi(a|\theta)\mu_{0}(% \theta)=1∑ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_π ( italic_a | italic_θ ) italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) = 1, meaning that useful signals are sent with probability 1, which allows us to tell whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ immediately. In the second case, the total probability of useful signals satisfies aA{a0}θΘπ(a|θ)μ0(θ)<1subscript𝑎𝐴subscript𝑎0subscript𝜃Θ𝜋conditional𝑎𝜃subscript𝜇0𝜃1\sum_{a\in A\setminus\{a_{0}\}}\sum_{\theta\in\Theta}\pi(a|\theta)\mu_{0}(% \theta)<1∑ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_π ( italic_a | italic_θ ) italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) < 1, so the sample complexity is more than 1. In the third case, the linear program in Algorithm 1 is not feasible, so wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ cannot be determined; importantly, this is not a limitation of our particular algorithm, but a general impossibility in our model. The proof of Theorem 5.2 is in Appendix B.2.

6 Discussion

Our approach has some limitations; here we discuss the two that we view as most significant.

First, we have assumed a linear model of bias. While the linear model is common in the literature [8, 10, 5], we also consider a more general model of bias (in Appendix C): as the bias level w𝑤witalic_w increases from 00 to 1111, the agent’s belief changes from the true posterior μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT to the prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT according to some general continuous function ϕ(μ0,μs,w)italic-ϕsubscript𝜇0subscript𝜇𝑠𝑤\phi(\mu_{0},\mu_{s},w)italic_ϕ ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT , italic_w ). We show that, as long as the bias function ϕitalic-ϕ\phiitalic_ϕ satisfies a certain single-crossing property (as w𝑤witalic_w increases, once the agent starts to prefer the default action a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, they will not change the preferred action anymore), our results regarding the optimality of constant signaling schemes, the revelation principle, and the geometric characterization still hold, while the linear program algorithm no longer works because ϕitalic-ϕ\phiitalic_ϕ is not linear. We consider it an interesting challenge to come up with more general models of bias that are still tractable, in the sense that one can efficiently design good signaling schemes with reasonable sample complexity bounds.

Second, we have assumed that the agent’s prior is the same as the real prior from which states of the world are drawn. But what if the agent’s prior is different? Our results directly extend to the case where the agent has a wrong, known prior. If the agent’s prior is unknown, then our problem becomes significantly more challenging. In fact, we believe it may be flat-out impossible to measure bias, as the case of a slightly wrong prior with large bias may be indistinguishable from the case of a very wrong prior with small bias.

Despite these limitations, we view our paper as making significant progress on a novel problem that seems fundamental. Our results suggest that practical algorithms for detecting bias in belief update are within reach and, in the long term, may lead to new insights on issues of societal importance. In particular, we anticipate future research in more complex situations such as combining decisions of many experts (human or AI) after measuring and accounting for their individual biases.

References

  • Alonso and Câmara [2016] R. Alonso and O. Câmara. Bayesian persuasion with heterogeneous priors. Journal of Economic Theory, 165:672–706, 2016.
  • Aumann et al. [1995] R. J. Aumann, M. B. Maschler, and R. E. Stearns. Repeated Games with Incomplete Information. MIT Press, 1995.
  • Bloom et al. [2021] J. D. Bloom, Y. A. Chan, R. S. Baric, P. J. Bjorkman, S. Cobery, B. E. Deverman, D. N. Fisman, R. Gupta, A. Iwasaki, M. Lipsitch, R. Medzhitov, R. A. Neher, R. Nielsen, N. Patterson, T. Stearns, E. van Nimwegen, M. Worobey, and D. A. Relman. Investigate the origins of COVID-19. Science, 372(6543):694, 2021.
  • Crawford and Sobel [1982] V. P. Crawford and J. Sobel. Strategic information transmission. Econometrica, 50(6):1431–1451, 1982.
  • de Clippel and Zhang [2022] G. de Clippel and X. Zhang. Non-Bayesian persuasion. Journal of Political Economy, 130(10):2594–2642, 2022.
  • Dughmi and Xu [2016] S. Dughmi and H. Xu. Algorithmic Bayesian persuasion. In Proceedings of the 48th Annual ACM Symposium on Theory of Computing (STOC), pages 412–425, 2016.
  • Dworczak and Pavan [2022] P. Dworczak and A. Pavan. Preparing for the worst but ho** for the best: Robust (Bayesian) persuasion. Econometrica, 90(5):2017–2051, 2022.
  • Epstein et al. [2010] L. G. Epstein, J. Noor, and A. Sandroni. Non-Bayesian learning. The B.E. Journal of Theoretical Economics, 10(1):732–735, 2010.
  • Feng et al. [2024] Yiding Feng, Chien-Ju Ho, and Wei Tang. Rationality-robust information design: Bayesian persuasion under quantal response. In Proceedings of the 2024 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 501–546. SIAM, 2024.
  • Hagmann and Loewenstein [2017] D. Hagmann and G. Loewenstein. Persuasion with motivated beliefs. Carnegie Mellon University technical report, 2017.
  • Kahan [2013] D. M. Kahan. Ideology, motivated reasoning, and cognitive reflection. Judgment and Decision Making, 8(4):18, 2013.
  • Kahan et al. [2012] D. M. Kahan, E. Peters, M. Wittlin, P. Slovic, L. L. Ouellette, D. Braman, and G. Mandel. The polarizing impact of science literacy and numeracy on perceived climate change risks. Nature Climate Change, 2(10):732–735, 2012.
  • Kamenica and Gentzkow [2011] E. Kamenica and M. Gentzkow. Bayesian persuasion. American Economic Review, 101(6):2590–2615, 2011.
  • Lin and Chen [2024] T. Lin and Y. Chen. Persuading a learning agent. arXiv preprint arXiv:2402.09721, 2024.
  • Renault [2012] J. Renault. Repeated games with incomplete information. In R. A. Meyers, editor, Encyclopedia of Complexity and Systems Science. Springer, 2012.
  • Taber and Lodge [2006] C. S. Taber and M. Lodge. Motivated skepticism in the evaluation of political beliefs. American Journal of Political Science, 50(3):755–769, 2006.
  • Taber et al. [2009] C. S. Taber, D. Cann, and S. Kucsova. The motivated processing of political arguments. Political Behavior, 31(2):137–155, 2009.
  • Tappin et al. [2020] B. M. Tappin, G. Pennycook, and D. G. Rand. Bayesian or biased? Analytic thinking and political belief updating. Cognition, 204:104375, 2020.
  • Ward [2007] E. Ward. Conservatism in human information processing. In D. Kahneman, P. Slovic, and A. Tversky, editors, Judgment Under Uncertainty: Heuristics and Biases. Cambridge University Press, 2007.
  • Yang and Zhang [2024] K. Yang and H. Zhang. Computational aspects of Bayesian persuasion under approximate best response. arXiv preprint arXiv:2402.07426, 2024.

Appendix A Missing Proofs from Section 4

A.1 Proof of Lemma 4.3

Proof.

Suppose that, during its operation, ΠΠ\Piroman_Π selects a signaling scheme π𝜋\piitalic_π that includes an external signal sS𝑠𝑆s\in Sitalic_s ∈ italic_S. By definition, for an external signal, the τ𝜏\tauitalic_τ-biased belief μsτ=τμ0+(1τ)μssuperscriptsubscript𝜇𝑠𝜏𝜏subscript𝜇01𝜏subscript𝜇𝑠\mu_{s}^{\tau}=\tau\mu_{0}+(1-\tau)\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT = italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is in extRa0extsubscript𝑅subscript𝑎0\mathrm{ext}R_{a_{0}}roman_ext italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT. This implies that the true posterior μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT, derived from the signaling scheme π𝜋\piitalic_π and the prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, also lies in extRa0extsubscript𝑅subscript𝑎0\mathrm{ext}R_{a_{0}}roman_ext italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Consequently, the line segment connecting μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT and μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, represented as {(1t)μs+tμ0t[0,1]}conditional-set1𝑡subscript𝜇𝑠𝑡subscript𝜇0𝑡01\{(1-t)\mu_{s}+t\mu_{0}\mid t\in[0,1]\}{ ( 1 - italic_t ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + italic_t italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∣ italic_t ∈ [ 0 , 1 ] }, must intersect the boundary Ra0subscript𝑅subscript𝑎0\partial R_{a_{0}}∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT at some point. Denote this intersection by μ=(1t)μs+tμ0Ra0superscript𝜇1superscript𝑡subscript𝜇𝑠superscript𝑡subscript𝜇0subscript𝑅subscript𝑎0\mu^{*}=(1-t^{*})\mu_{s}+t^{*}\mu_{0}\in\partial R_{a_{0}}italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT = ( 1 - italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ ∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT.

We will adjust the original signaling scheme π𝜋\piitalic_π. To do so, define μ~ssubscript~𝜇𝑠\tilde{\mu}_{s}over~ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT as the belief whose τ𝜏\tauitalic_τ-biased version equals μsuperscript𝜇\mu^{*}italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT:

τμ0+(1τ)μ~s=μμ~s=(tτ)μ0+(1t)μs1τ.formulae-sequence𝜏subscript𝜇01𝜏subscript~𝜇𝑠superscript𝜇subscript~𝜇𝑠superscript𝑡𝜏subscript𝜇01superscript𝑡subscript𝜇𝑠1𝜏\tau\mu_{0}+(1-\tau)\tilde{\mu}_{s}=\mu^{*}\quad\;\Longleftrightarrow\;\quad% \tilde{\mu}_{s}=\frac{(t^{*}-\tau)\mu_{0}+(1-t^{*})\mu_{s}}{1-\tau}.italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) over~ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ⟺ over~ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = divide start_ARG ( italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT - italic_τ ) italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_τ end_ARG .

Under the original signaling scheme π𝜋\piitalic_π, according to the splitting lemma (Lemma 2.1), the prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT can be represented as a convex combination of μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT and the posteriors associated with other signals sS{s}superscript𝑠𝑆𝑠s^{\prime}\in S\setminus\{s\}italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_S ∖ { italic_s }:

μ0=psμs+sS{s}psμs.subscript𝜇0subscript𝑝𝑠subscript𝜇𝑠subscriptsuperscript𝑠𝑆𝑠subscript𝑝superscript𝑠subscript𝜇superscript𝑠\mu_{0}=p_{s}\mu_{s}+\sum_{s^{\prime}\in S\setminus\{s\}}p_{s^{\prime}}\mu_{s^% {\prime}}.italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_p start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_S ∖ { italic_s } end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT .

If we change μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT to μ~ssubscript~𝜇𝑠\tilde{\mu}_{s}over~ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT, then we obtain a new convex combination (this is valid because μ~ssubscript~𝜇𝑠\tilde{\mu}_{s}over~ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is on the line segment from μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT to μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT):

μ0=p~sμ~s+sS{s}p~sμs,subscript𝜇0subscript~𝑝𝑠subscript~𝜇𝑠subscriptsuperscript𝑠𝑆𝑠subscript~𝑝superscript𝑠subscript𝜇superscript𝑠\mu_{0}=\tilde{p}_{s}\tilde{\mu}_{s}+\sum_{s^{\prime}\in S\setminus\{s\}}% \tilde{p}_{s^{\prime}}\mu_{s^{\prime}},italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT over~ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_S ∖ { italic_s } end_POSTSUBSCRIPT over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT ,

where

p~s=ps1t+tpsandsS{s},p~s=1t1t+tpsps.formulae-sequencesubscript~𝑝𝑠subscript𝑝𝑠1superscript𝑡superscript𝑡subscript𝑝𝑠andformulae-sequencefor-allsuperscript𝑠𝑆𝑠subscript~𝑝superscript𝑠1superscript𝑡1superscript𝑡superscript𝑡subscript𝑝𝑠subscript𝑝superscript𝑠\tilde{p}_{s}=\frac{p_{s}}{1-t^{*}+t^{*}p_{s}}\quad\text{and}\quad\forall s^{% \prime}\in S\setminus\{s\},~{}\tilde{p}_{s^{\prime}}=\frac{1-t^{*}}{1-t^{*}+t^% {*}p_{s}}p_{s^{\prime}}.over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = divide start_ARG italic_p start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG and ∀ italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_S ∖ { italic_s } , over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT = divide start_ARG 1 - italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT end_ARG start_ARG 1 - italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG italic_p start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT .

Then, by the splitting lemma (Lemma 2.1), there exists a signaling scheme πsuperscript𝜋\pi^{\prime}italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT with |S|𝑆|S|| italic_S | signals where signal s𝑠sitalic_s induces posterior μ~ssubscript~𝜇𝑠\tilde{\mu}_{s}over~ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT and other signals ssuperscript𝑠s^{\prime}italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT induces μssubscript𝜇superscript𝑠\mu_{s^{\prime}}italic_μ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT. Note that the τ𝜏\tauitalic_τ-biased version of μ~ssubscript~𝜇𝑠\tilde{\mu}_{s}over~ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT satisfies τμ0+(1τ)μ~s=μRa0𝜏subscript𝜇01𝜏subscript~𝜇𝑠superscript𝜇subscript𝑅subscript𝑎0\tau\mu_{0}+(1-\tau)\tilde{\mu}_{s}=\mu^{*}\in\partial R_{a_{0}}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) over~ start_ARG italic_μ end_ARG start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = italic_μ start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT ∈ ∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, so s𝑠sitalic_s is a boundary signal under signaling scheme πsuperscript𝜋\pi^{\prime}italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT.

Since s𝑠sitalic_s is a boundary signal, we can immediately tell whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ according to Lemma 4.2 when s𝑠sitalic_s is sent and end the algorithm. If any signal ssuperscript𝑠s^{\prime}italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT other than s𝑠sitalic_s is sent, the induced posterior μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is the same as the posterior in the original signaling scheme π𝜋\piitalic_π, so the agent will take the same action, and we can just follow the rest of the original algorithm ΠΠ\Piroman_Π. But we note that the probability of signal s𝑠sitalic_s being sent under the new signaling scheme πsuperscript𝜋\pi^{\prime}italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is larger than or equal to the probability under the original signaling scheme π𝜋\piitalic_π:

p~s=ps1t+tpsps.subscript~𝑝𝑠subscript𝑝𝑠1superscript𝑡superscript𝑡subscript𝑝𝑠subscript𝑝𝑠\tilde{p}_{s}=\frac{p_{s}}{1-t^{*}+t^{*}p_{s}}\geq p_{s}.over~ start_ARG italic_p end_ARG start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = divide start_ARG italic_p start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG start_ARG 1 - italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT + italic_t start_POSTSUPERSCRIPT ∗ end_POSTSUPERSCRIPT italic_p start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG ≥ italic_p start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT . (11)

So, in expectation, we can end the algorithm faster by using π~~𝜋\tilde{\pi}over~ start_ARG italic_π end_ARG than using π𝜋\piitalic_π. Hence, by repeating the above procedure to replace all the signaling schemes in the original algorithm ΠΠ\Piroman_Π that use external signals, we obtain a new algorithm ΠsuperscriptΠ\Pi^{\prime}roman_Π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT that only uses boundary and internal signals with smaller or equal sample complexity. ∎

A.2 Proof of Lemma 4.4

Proof.

Let ΠΠ\Piroman_Π be any adaptive algorithm using signaling schemes with boundary and internal signals. Let t={(π1,θ1,s1,a1),,(πt,θt,st,at)}subscript𝑡subscript𝜋1subscript𝜃1subscript𝑠1subscript𝑎1subscript𝜋𝑡subscript𝜃𝑡subscript𝑠𝑡subscript𝑎𝑡\mathcal{H}_{t}=\{(\pi_{1},\theta_{1},s_{1},a_{1}),\ldots,(\pi_{t},\theta_{t},% s_{t},a_{t})\}caligraphic_H start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = { ( italic_π start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) , … , ( italic_π start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_θ start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_s start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) } be any history that can happen during the execution of ΠΠ\Piroman_Π. If no boundary signal has been sent, then every realized signal sksubscript𝑠𝑘s_{k}italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT is an internal signal in the respective signaling scheme πksubscript𝜋𝑘\pi_{k}italic_π start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, with the τ𝜏\tauitalic_τ-biased posterior satisfying μskτ=τμ0+(1τ)μskRa0superscriptsubscript𝜇subscript𝑠𝑘𝜏𝜏subscript𝜇01𝜏subscript𝜇subscript𝑠𝑘subscript𝑅subscript𝑎0\mu_{s_{k}}^{\tau}=\tau\mu_{0}+(1-\tau)\mu_{s_{k}}\in R_{a_{0}}italic_μ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT = italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Because Ra0={μΔ(Θ)aA{a0},caμ>0}subscript𝑅subscript𝑎0conditional-set𝜇ΔΘformulae-sequencefor-all𝑎𝐴subscript𝑎0superscriptsubscript𝑐𝑎top𝜇0R_{a_{0}}=\big{\{}\mu\in\Delta(\Theta)\mid\forall a\in A\setminus\{a_{0}\},\,c% _{a}^{\top}\mu>0\big{\}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = { italic_μ ∈ roman_Δ ( roman_Θ ) ∣ ∀ italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } , italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ > 0 } is an open region, there must exist some εk>0subscript𝜀𝑘0\varepsilon_{k}>0italic_ε start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT > 0 such that the 1subscript1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-norm ball Bεk(μskτ)={μΔ(Θ):μμskτ1εk}subscript𝐵subscript𝜀𝑘superscriptsubscript𝜇subscript𝑠𝑘𝜏conditional-set𝜇ΔΘsubscriptnorm𝜇superscriptsubscript𝜇subscript𝑠𝑘𝜏1subscript𝜀𝑘B_{\varepsilon_{k}}(\mu_{s_{k}}^{\tau})=\{\mu\in\Delta(\Theta):\|\mu-\mu_{s_{k% }}^{\tau}\|_{1}\leq\varepsilon_{k}\}italic_B start_POSTSUBSCRIPT italic_ε start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT ) = { italic_μ ∈ roman_Δ ( roman_Θ ) : ∥ italic_μ - italic_μ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_ε start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT } is a subset of Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Let ε=mink=1tεk>0𝜀superscriptsubscript𝑘1𝑡subscript𝜀𝑘0\varepsilon=\min_{k=1}^{t}\varepsilon_{k}>0italic_ε = roman_min start_POSTSUBSCRIPT italic_k = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_t end_POSTSUPERSCRIPT italic_ε start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT > 0. Then Bε(μskτ)Ra0subscript𝐵𝜀superscriptsubscript𝜇subscript𝑠𝑘𝜏subscript𝑅subscript𝑎0B_{\varepsilon}(\mu_{s_{k}}^{\tau})\subseteq R_{a_{0}}italic_B start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT ) ⊆ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT for every k=1,,t𝑘1𝑡k=1,\ldots,titalic_k = 1 , … , italic_t. Suppose the agent’s bias level w𝑤witalic_w is in the range [τε2,τ+ε2]𝜏𝜀2𝜏𝜀2[\tau-\frac{\varepsilon}{2},\tau+\frac{\varepsilon}{2}][ italic_τ - divide start_ARG italic_ε end_ARG start_ARG 2 end_ARG , italic_τ + divide start_ARG italic_ε end_ARG start_ARG 2 end_ARG ]. Then, for every signal sksubscript𝑠𝑘s_{k}italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT, the agent’s biased belief νsk=wμ0+(1w)μsksubscript𝜈subscript𝑠𝑘𝑤subscript𝜇01𝑤subscript𝜇subscript𝑠𝑘\nu_{s_{k}}=w\mu_{0}+(1-w)\mu_{s_{k}}italic_ν start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT = italic_w italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_w ) italic_μ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT satisfies:

νskμskτ1=(wτ)(μ0μsk)1|wτ|μ0μsk1ε.subscriptnormsubscript𝜈subscript𝑠𝑘superscriptsubscript𝜇subscript𝑠𝑘𝜏1subscriptnorm𝑤𝜏subscript𝜇0subscript𝜇subscript𝑠𝑘1𝑤𝜏subscriptnormsubscript𝜇0subscript𝜇subscript𝑠𝑘1𝜀\displaystyle\|\nu_{s_{k}}-\mu_{s_{k}}^{\tau}\|_{1}=\|(w-\tau)(\mu_{0}-\mu_{s_% {k}})\|_{1}\leq|w-\tau|\cdot\|\mu_{0}-\mu_{s_{k}}\|_{1}\leq\varepsilon.∥ italic_ν start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT - italic_μ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT = ∥ ( italic_w - italic_τ ) ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - italic_μ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ) ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ | italic_w - italic_τ | ⋅ ∥ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT - italic_μ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∥ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ≤ italic_ε .

This means

νskBε(μskτ)Ra0.subscript𝜈subscript𝑠𝑘subscript𝐵𝜀superscriptsubscript𝜇subscript𝑠𝑘𝜏subscript𝑅subscript𝑎0\nu_{s_{k}}\in B_{\varepsilon}(\mu_{s_{k}}^{\tau})\subseteq R_{a_{0}}.italic_ν start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ italic_B start_POSTSUBSCRIPT italic_ε end_POSTSUBSCRIPT ( italic_μ start_POSTSUBSCRIPT italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT ) ⊆ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT .

So, the agent should take action a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT given signal sksubscript𝑠𝑘s_{k}italic_s start_POSTSUBSCRIPT italic_k end_POSTSUBSCRIPT. Note that this holds for every k=1,,t𝑘1𝑡k=1,\ldots,titalic_k = 1 , … , italic_t and any w[τε2,τ+ε2]𝑤𝜏𝜀2𝜏𝜀2w\in[\tau-\frac{\varepsilon}{2},\tau+\frac{\varepsilon}{2}]italic_w ∈ [ italic_τ - divide start_ARG italic_ε end_ARG start_ARG 2 end_ARG , italic_τ + divide start_ARG italic_ε end_ARG start_ARG 2 end_ARG ]. So we cannot determine whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ so far. We have to run the algorithm until a boundary signal is sent. ∎

A.3 Proof of Lemma 4.5

Proof.

Let π𝜋\piitalic_π be a signaling scheme that can test whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ. According to Lemma 4.3, π𝜋\piitalic_π can be assumed to only use boundary and internal signals. Recall that a signal s𝑠sitalic_s is boundary if the τ𝜏\tauitalic_τ-biased belief μsτ=τμ0+(1τ)μssuperscriptsubscript𝜇𝑠𝜏𝜏subscript𝜇01𝜏subscript𝜇𝑠\mu_{s}^{\tau}=\tau\mu_{0}+(1-\tau)\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_τ end_POSTSUPERSCRIPT = italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT lies on the boundary set Ra0subscript𝑅subscript𝑎0\partial R_{a_{0}}∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT. For aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }, let Basubscript𝐵𝑎B_{a}italic_B start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT be the set of beliefs under which the agent is indifferent between a𝑎aitalic_a and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and a𝑎aitalic_a and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT are both better than other actions:

Ba={μΔ(Θ)caμ=0 and aA,caμ0}.subscript𝐵𝑎conditional-set𝜇ΔΘformulae-sequencesuperscriptsubscript𝑐𝑎top𝜇0 and for-allsuperscript𝑎𝐴superscriptsubscript𝑐superscript𝑎top𝜇0B_{a}=\{\mu\in\Delta(\Theta)\mid c_{a}^{\top}\mu=0~{}\text{ and }~{}\forall a^% {\prime}\in A,c_{a^{\prime}}^{\top}\mu\geq 0\}.italic_B start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT = { italic_μ ∈ roman_Δ ( roman_Θ ) ∣ italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ = 0 and ∀ italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_A , italic_c start_POSTSUBSCRIPT italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ ≥ 0 } . (12)

The boundary set Ra0subscript𝑅subscript𝑎0\partial R_{a_{0}}∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT can be written as the union of Basubscript𝐵𝑎B_{a}italic_B start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT for aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }:

Ra0=aA{a0}Ba.subscript𝑅subscript𝑎0subscript𝑎𝐴subscript𝑎0subscript𝐵𝑎\partial R_{a_{0}}=\bigcup_{a\in A\setminus\{a_{0}\}}B_{a}.∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = ⋃ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT italic_B start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT . (13)

Then, we classify the boundary signals into |A|1𝐴1|A|-1| italic_A | - 1 sets {Sa}aA{a0}subscriptsubscript𝑆𝑎𝑎𝐴subscript𝑎0\{S_{a}\}_{a\in A\setminus\{a_{0}\}}{ italic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT according to which Basubscript𝐵𝑎B_{a}italic_B start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT sets their τ𝜏\tauitalic_τ-biased beliefs belong to: namely, the set Sasubscript𝑆𝑎S_{a}italic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT contains boundary signals s𝑠sitalic_s under which

τμ0+(1τ)μsBa.𝜏subscript𝜇01𝜏subscript𝜇𝑠subscript𝐵𝑎\tau\mu_{0}+(1-\tau)\mu_{s}\in B_{a}.italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ∈ italic_B start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT . (14)

We then combine the signals in Sasubscript𝑆𝑎S_{a}italic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT. Specifically, consider the normalized weighted average of the true posterior beliefs associated with the signals in Sasubscript𝑆𝑎S_{a}italic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT, denoted by μasubscript𝜇𝑎\mu_{a}italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT:

μa=sSaπ(s)sSaπ(s)μs.subscript𝜇𝑎subscript𝑠subscript𝑆𝑎𝜋𝑠subscriptsuperscript𝑠subscript𝑆𝑎𝜋superscript𝑠subscript𝜇𝑠\mu_{a}=\sum_{s\in S_{a}}\frac{\pi(s)}{\sum_{s^{\prime}\in S_{a}}\pi(s^{\prime% })}\mu_{s}.italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG italic_π ( italic_s ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_π ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_ARG italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT . (15)

Note that the τ𝜏\tauitalic_τ-biased version of μasubscript𝜇𝑎\mu_{a}italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is also in the set Basubscript𝐵𝑎B_{a}italic_B start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT because Basubscript𝐵𝑎B_{a}italic_B start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is a convex set:

τμ0+(1τ)μa=sSaπ(s)sSaπ(s)(τμ0+(1τ)μs)Ba.𝜏subscript𝜇01𝜏subscript𝜇𝑎subscript𝑠subscript𝑆𝑎𝜋𝑠subscriptsuperscript𝑠subscript𝑆𝑎𝜋superscript𝑠𝜏subscript𝜇01𝜏subscript𝜇𝑠subscript𝐵𝑎\displaystyle\tau\mu_{0}+(1-\tau)\mu_{a}=\sum_{s\in S_{a}}\frac{\pi(s)}{\sum_{% s^{\prime}\in S_{a}}\pi(s^{\prime})}\Big{(}\tau\mu_{0}+(1-\tau)\mu_{s}\Big{)}% \in B_{a}.italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG italic_π ( italic_s ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_π ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_ARG ( italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) ∈ italic_B start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT . (16)

This means that if a signal a𝑎aitalic_a induces true posterior μasubscript𝜇𝑎\mu_{a}italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT, then this signal is a boundary signal.

After defining μasubscript𝜇𝑎\mu_{a}italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT as above for every aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }, let’s consider the set of internal signals of the signaling scheme π𝜋\piitalic_π, which we denote by SIsubscript𝑆𝐼S_{I}italic_S start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT. For each internal signal sSI𝑠subscript𝑆𝐼s\in S_{I}italic_s ∈ italic_S start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT, the τ𝜏\tauitalic_τ-biased belief satisfies

τμ0+(1τ)μsRa0.𝜏subscript𝜇01𝜏subscript𝜇𝑠subscript𝑅subscript𝑎0\tau\mu_{0}+(1-\tau)\mu_{s}\in R_{a_{0}}.italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ∈ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (17)

Similar to above, we combine all the signals in SIsubscript𝑆𝐼S_{I}italic_S start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT: define μa0subscript𝜇subscript𝑎0\mu_{a_{0}}italic_μ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT to be the normalized weighted average of the posteriors associated with all internal signals:

μa0=sSIπ(s)sSIπ(s)μs.subscript𝜇subscript𝑎0subscript𝑠subscript𝑆𝐼𝜋𝑠subscriptsuperscript𝑠subscript𝑆𝐼𝜋superscript𝑠subscript𝜇𝑠\mu_{a_{0}}=\sum_{s\in S_{I}}\frac{\pi(s)}{\sum_{s^{\prime}\in S_{I}}\pi(s^{% \prime})}\mu_{s}.italic_μ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG italic_π ( italic_s ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_S start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_π ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_ARG italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT . (18)

Then, the τ𝜏\tauitalic_τ-biased version of μa0subscript𝜇subscript𝑎0\mu_{a_{0}}italic_μ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT must be in Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT because Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT is a convex set:

τμ0+(1τ)μa0=sSIπ(s)sSIπ(s)(τμ0+(1τ)μs)Ra0.𝜏subscript𝜇01𝜏subscript𝜇subscript𝑎0subscript𝑠subscript𝑆𝐼𝜋𝑠subscriptsuperscript𝑠subscript𝑆𝐼𝜋superscript𝑠𝜏subscript𝜇01𝜏subscript𝜇𝑠subscript𝑅subscript𝑎0\tau\mu_{0}+(1-\tau)\mu_{a_{0}}=\sum_{s\in S_{I}}\frac{\pi(s)}{\sum_{s^{\prime% }\in S_{I}}\pi(s^{\prime})}\Big{(}\tau\mu_{0}+(1-\tau)\mu_{s}\Big{)}\in R_{a_{% 0}}.italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT end_POSTSUBSCRIPT divide start_ARG italic_π ( italic_s ) end_ARG start_ARG ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_S start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_π ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) end_ARG ( italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ) ∈ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT . (19)

This means that, if a signal induces posterior μa0subscript𝜇subscript𝑎0\mu_{a_{0}}italic_μ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, then this signal is internal.

From the splitting lemma (2.1), we know that the convex combination of the original posteriors sSπ(s)μssubscript𝑠𝑆𝜋𝑠subscript𝜇𝑠\sum_{s\in S}\pi(s)\mu_{s}∑ start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT italic_π ( italic_s ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is equal to the prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. This means that the following convex combination of the new posteriors {μa}aAa0subscriptsubscript𝜇𝑎𝑎𝐴subscript𝑎0\{\mu_{a}\}_{a\in A\setminus a_{0}}{ italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT and μa0subscript𝜇subscript𝑎0\mu_{a_{0}}italic_μ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT is also equal to the prior:

aAa0sSaπ(s)μa+sSIπ(s)μa0=aAa0sSaπ(s)μs+sSIπ(s)μs=sSπ(s)μs=μ0subscript𝑎𝐴subscript𝑎0subscriptsuperscript𝑠subscript𝑆𝑎𝜋superscript𝑠subscript𝜇𝑎subscriptsuperscript𝑠subscript𝑆𝐼𝜋superscript𝑠subscript𝜇subscript𝑎0subscript𝑎𝐴subscript𝑎0subscript𝑠subscript𝑆𝑎𝜋𝑠subscript𝜇𝑠subscript𝑠subscript𝑆𝐼𝜋𝑠subscript𝜇𝑠subscript𝑠𝑆𝜋𝑠subscript𝜇𝑠subscript𝜇0\sum_{a\in A\setminus a_{0}}\sum_{s^{\prime}\in S_{a}}\pi(s^{\prime})\mu_{a}+% \sum_{s^{\prime}\in S_{I}}\pi(s^{\prime})\mu_{a_{0}}=\sum_{a\in A\setminus a_{% 0}}\sum_{s\in S_{a}}\pi(s)\mu_{s}+\sum_{s\in S_{I}}\pi(s)\mu_{s}=\sum_{s\in S}% \pi(s)\mu_{s}=\mu_{0}∑ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_π ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_S start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_π ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) italic_μ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_π ( italic_s ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_π ( italic_s ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT italic_π ( italic_s ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT = italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT

where the convex combination weight of μasubscript𝜇𝑎\mu_{a}italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is sSaπ(s)subscriptsuperscript𝑠subscript𝑆𝑎𝜋superscript𝑠\sum_{s^{\prime}\in S_{a}}\pi(s^{\prime})∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_π ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) for every aA{a}𝑎𝐴𝑎a\in A\setminus\{a\}italic_a ∈ italic_A ∖ { italic_a } and the convex combination weight of μa0subscript𝜇subscript𝑎0\mu_{a_{0}}italic_μ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT is sSIπ(s)subscriptsuperscript𝑠subscript𝑆𝐼𝜋superscript𝑠\sum_{s^{\prime}\in S_{I}}\pi(s^{\prime})∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_S start_POSTSUBSCRIPT italic_I end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_π ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ). One can easily verify that the weights sum to 1. Then, by the splitting lemma (2.1), there exists a signaling scheme πsuperscript𝜋\pi^{\prime}italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT with signal space of size |A|𝐴|A|| italic_A | (so we simply denote the signal space by A𝐴Aitalic_A) where each signal aA𝑎𝐴a\in Aitalic_a ∈ italic_A induces posterior μasubscript𝜇𝑎\mu_{a}italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT. We show that this new signaling scheme πsuperscript𝜋\pi^{\prime}italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT satisfies the properties in Lemma 4.5:

  • Signal a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT induces posterior μa0subscript𝜇subscript𝑎0\mu_{a_{0}}italic_μ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT whose τ𝜏\tauitalic_τ-biased version satisfies τμ0+(1τ)μa0Ra0𝜏subscript𝜇01𝜏subscript𝜇subscript𝑎0subscript𝑅subscript𝑎0\tau\mu_{0}+(1-\tau)\mu_{a_{0}}\in R_{a_{0}}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT. So, given signal a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, action a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is the optimal action for an agent with bias level τ𝜏\tauitalic_τ.

  • For each signal aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }, the induced posterior μasubscript𝜇𝑎\mu_{a}italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT satisfies τμ0+(1τ)μaBaRa0𝜏subscript𝜇01𝜏subscript𝜇𝑎subscript𝐵𝑎subscript𝑅subscript𝑎0\tau\mu_{0}+(1-\tau)\mu_{a}\in B_{a}\subseteq\partial R_{a_{0}}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ∈ italic_B start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ⊆ ∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT. So, by the definition of Basubscript𝐵𝑎B_{a}italic_B start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT, an agent with bias level τ𝜏\tauitalic_τ is indifferent between actions a𝑎aitalic_a and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and these two actions are better than other actions. Also, this signal is a boundary signal by Definition 4.1, which satisfies the following according to Lemma 4.2: if the agent’s bias level w<τ𝑤𝜏w<\tauitalic_w < italic_τ, then the agent strictly prefers a𝑎aitalic_a over a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT; if w>τ𝑤𝜏w>\tauitalic_w > italic_τ, then the agent strictly prefers a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT over a𝑎aitalic_a.

  • The sample complexity of πsuperscript𝜋\pi^{\prime}italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is the same as π𝜋\piitalic_π because: (1) the sample complexity is equal to the inverse of the total probability of boundary signals (as a corollary of Lemma 4.4), and (2) the total probability of boundary signals of the two signaling schemes are the same:

    aA{a0}π(a)=aA{a0}sSaπ(s)=saA{a0}Saπ(s).subscript𝑎𝐴subscript𝑎0superscript𝜋𝑎subscript𝑎𝐴subscript𝑎0subscriptsuperscript𝑠subscript𝑆𝑎𝜋superscript𝑠subscript𝑠subscript𝑎𝐴subscript𝑎0subscript𝑆𝑎𝜋𝑠\sum_{a\in A\setminus\{a_{0}\}}\pi^{\prime}(a)=\sum_{a\in A\setminus\{a_{0}\}}% \sum_{s^{\prime}\in S_{a}}\pi(s^{\prime})=\sum_{s\in\cup_{a\in A\setminus\{a_{% 0}\}}S_{a}}\pi(s).∑ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ( italic_a ) = ∑ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ italic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_π ( italic_s start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = ∑ start_POSTSUBSCRIPT italic_s ∈ ∪ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT italic_S start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT end_POSTSUBSCRIPT italic_π ( italic_s ) . (20)

    So, Tτ(π)=Tτ(π)subscript𝑇𝜏superscript𝜋subscript𝑇𝜏𝜋T_{\tau}(\pi^{\prime})=T_{\tau}(\pi)italic_T start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ) = italic_T start_POSTSUBSCRIPT italic_τ end_POSTSUBSCRIPT ( italic_π ).

Appendix B Missing Proofs from Section 5

B.1 Proof of Lemma 5.1

Proof.

For μΔ(Θ)𝜇ΔΘ\mu\in\Delta(\Theta)italic_μ ∈ roman_Δ ( roman_Θ ), by convexity of Δ(Θ)ΔΘ\Delta(\Theta)roman_Δ ( roman_Θ ), we have (1τ)μ+τμ0Δ(Θ)1𝜏𝜇𝜏subscript𝜇0ΔΘ(1-\tau)\mu+\tau\mu_{0}\in\Delta(\Theta)( 1 - italic_τ ) italic_μ + italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ roman_Δ ( roman_Θ ). Then,

μIa,τ(1τ)μ+τμ0Ia𝜇subscript𝐼𝑎𝜏1𝜏𝜇𝜏subscript𝜇0subscript𝐼𝑎\displaystyle\mu\in I_{a,\tau}\;\Longleftrightarrow\;(1-\tau)\mu+\tau\mu_{0}% \in I_{a}italic_μ ∈ italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT ⟺ ( 1 - italic_τ ) italic_μ + italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ca((1τ)μ+τμ0)=0absentsuperscriptsubscript𝑐𝑎top1𝜏𝜇𝜏subscript𝜇00\displaystyle\;\Longleftrightarrow\;c_{a}^{\top}((1-\tau)\mu+\tau\mu_{0})=0⟺ italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT ( ( 1 - italic_τ ) italic_μ + italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ) = 0
(1τ)caμ+τcaμ0=0absent1𝜏superscriptsubscript𝑐𝑎top𝜇𝜏superscriptsubscript𝑐𝑎topsubscript𝜇00\displaystyle\;\Longleftrightarrow\;(1-\tau)c_{a}^{\top}\mu+\tau c_{a}^{\top}% \mu_{0}=0⟺ ( 1 - italic_τ ) italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ + italic_τ italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = 0
caμ=τ1τcaμ0.absentsuperscriptsubscript𝑐𝑎top𝜇𝜏1𝜏superscriptsubscript𝑐𝑎topsubscript𝜇0\displaystyle\;\Longleftrightarrow\;c_{a}^{\top}\mu=-\frac{\tau}{1-\tau}c_{a}^% {\top}\mu_{0}.⟺ italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ = - divide start_ARG italic_τ end_ARG start_ARG 1 - italic_τ end_ARG italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT .

B.2 Proof of Theorem 5.2

We first prove the first part of Theorem 5.2, then prove the the second and third parts.

B.2.1 Proof of Part 1 of Theorem 5.2

We want to prove that wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ can be tested with a single sample if and only if the prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is in the convex hull formed by the translated sets Ia,τsubscript𝐼𝑎𝜏I_{a,\tau}italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT for all non-default actions aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }: μ0ConvexHull(aA{a0}Ia,τ)subscript𝜇0ConvexHullsubscript𝑎𝐴subscript𝑎0subscript𝐼𝑎𝜏\mu_{0}\in\mathrm{ConvexHull}\big{(}~{}\cup_{a\in A\setminus\{a_{0}\}}I_{a,% \tau}~{}\big{)}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ roman_ConvexHull ( ∪ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT ).

The “if” part.

Suppose μ0ConvexHull(aA{a0}Ia,τ)subscript𝜇0ConvexHullsubscript𝑎𝐴subscript𝑎0subscript𝐼𝑎𝜏\mu_{0}\in\mathrm{ConvexHull}\big{(}~{}\cup_{a\in A\setminus\{a_{0}\}}I_{a,% \tau}~{}\big{)}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ roman_ConvexHull ( ∪ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT ), namely, there exist a set of positive weights {ps}sSsubscriptsubscript𝑝𝑠𝑠𝑆\{p_{s}\}_{s\in S}{ italic_p start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT and a set of posterior beliefs {μs}sSsubscriptsubscript𝜇𝑠𝑠𝑆\{\mu_{s}\}_{s\in S}{ italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT such that

μ0=sSpsμs,subscript𝜇0subscript𝑠𝑆subscript𝑝𝑠subscript𝜇𝑠\mu_{0}=\sum_{s\in S}p_{s}\mu_{s},italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ,

where each μsIa,τsubscript𝜇𝑠subscript𝐼𝑎𝜏\mu_{s}\in I_{a,\tau}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ∈ italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT for some aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }. By definition, the τ𝜏\tauitalic_τ-biased belief τμ0+(1τ)μs𝜏subscript𝜇01𝜏subscript𝜇𝑠\tau\mu_{0}+(1-\tau)\mu_{s}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is in the indifference set Iasubscript𝐼𝑎I_{a}italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT. Recall the definition of the boundary set Ra0subscript𝑅subscript𝑎0\partial R_{a_{0}}∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT (Equation 4), which is the set of beliefs under which the agent is indifferent between a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and some other action and these two actions are better any other actions. The τ𝜏\tauitalic_τ-biased belief τμ0+(1τ)μsIa𝜏subscript𝜇01𝜏subscript𝜇𝑠subscript𝐼𝑎\tau\mu_{0}+(1-\tau)\mu_{s}\in I_{a}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ∈ italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT may or may not belong to Ra0subscript𝑅subscript𝑎0\partial R_{a_{0}}∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, depending on whether a𝑎aitalic_a and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT are better than any other actions:

  • If τμ0+(1τ)μsRa0𝜏subscript𝜇01𝜏subscript𝜇𝑠subscript𝑅subscript𝑎0\tau\mu_{0}+(1-\tau)\mu_{s}\in\partial R_{a_{0}}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ∈ ∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, then s𝑠sitalic_s is a boundary signal (by Definition 4.1) and hence useful for testing whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ (Lemma 4.2). Denote μs=μssuperscriptsubscript𝜇𝑠subscript𝜇𝑠\mu_{s}^{\prime}=\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT in this case.

  • If τμ0+(1τ)μsRa0𝜏subscript𝜇01𝜏subscript𝜇𝑠subscript𝑅subscript𝑎0\tau\mu_{0}+(1-\tau)\mu_{s}\notin\partial R_{a_{0}}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ∉ ∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, then there must exist some action asuperscript𝑎a^{\prime}italic_a start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT that is strictly better than a𝑎aitalic_a and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT for the agent at the τ𝜏\tauitalic_τ-biased belief, hence τμ0+(1τ)μsextRa0𝜏subscript𝜇01𝜏subscript𝜇𝑠extsubscript𝑅subscript𝑎0\tau\mu_{0}+(1-\tau)\mu_{s}\in\mathrm{ext}R_{a_{0}}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ∈ roman_ext italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT (so s𝑠sitalic_s is an external signal). Then, according to the argument in Lemma 4.3, we can find another belief μssuperscriptsubscript𝜇𝑠\mu_{s}^{\prime}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT on the line segment between μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT and μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT such that the τ𝜏\tauitalic_τ-biased version of μssuperscriptsubscript𝜇𝑠\mu_{s}^{\prime}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT lies exactly on the boundary set Ra0subscript𝑅subscript𝑎0\partial R_{a_{0}}∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT:

    τμ0+(1τ)μsRa0,μs=tμs+(1t)μ0 for some t[0,1].formulae-sequence𝜏subscript𝜇01𝜏superscriptsubscript𝜇𝑠subscript𝑅subscript𝑎0superscriptsubscript𝜇𝑠𝑡subscript𝜇𝑠1𝑡subscript𝜇0 for some t[0,1]\tau\mu_{0}+(1-\tau)\mu_{s}^{\prime}\in\partial R_{a_{0}},~{}~{}~{}~{}\mu_{s}^% {\prime}=t\mu_{s}+(1-t)\mu_{0}\text{ for some $t\in[0,1]$}.italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT , italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_t italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT + ( 1 - italic_t ) italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT for some italic_t ∈ [ 0 , 1 ] . (21)

After the above discussion, we have found a μssuperscriptsubscript𝜇𝑠\mu_{s}^{\prime}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT that is either equal to μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT or on the line segment between μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT and μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, for every sS𝑠𝑆s\in Sitalic_s ∈ italic_S. So, μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT can be written as a convex combination of {μs}sSsubscriptsuperscriptsubscript𝜇𝑠𝑠𝑆\{\mu_{s}^{\prime}\}_{s\in S}{ italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT } start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT:

μ0=sSpsμs.subscript𝜇0subscript𝑠𝑆superscriptsubscript𝑝𝑠superscriptsubscript𝜇𝑠\mu_{0}=\sum_{s\in S}p_{s}^{\prime}\mu_{s}^{\prime}.italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_s ∈ italic_S end_POSTSUBSCRIPT italic_p start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT . (22)

Moreover, the μssuperscriptsubscript𝜇𝑠\mu_{s}^{\prime}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT defined above satisfies τμ0+(1τ)μsRa0𝜏subscript𝜇01𝜏superscriptsubscript𝜇𝑠subscript𝑅subscript𝑎0\tau\mu_{0}+(1-\tau)\mu_{s}^{\prime}\in\partial R_{a_{0}}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT. So, a signal inducing true posterior μssuperscriptsubscript𝜇𝑠\mu_{s}^{\prime}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT will be a boundary signal and useful for testing wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ (Lemma 4.2). Finally, by the splitting lemma (Lemma 2.1), we know that there must exist a signaling scheme πsuperscript𝜋\pi^{\prime}italic_π start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT with signal space S𝑆Sitalic_S where each signal sS𝑠𝑆s\in Sitalic_s ∈ italic_S indeed induces posterior μssuperscriptsubscript𝜇𝑠\mu_{s}^{\prime}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT. Such a signaling scheme sends useful (boundary) signals with probability 1. Hence, the sample complexity of it is 1.

The “only if” part.

Suppose whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ can be tested with a single sample. This means that the optimal signaling scheme obtained from the linear program in Algorithm 1 must satisfy aA{a0}π(a)=aA{a0}θΘπ(a|θ)μ0(θ)=1subscript𝑎𝐴subscript𝑎0𝜋𝑎subscript𝑎𝐴subscript𝑎0subscript𝜃Θ𝜋conditional𝑎𝜃subscript𝜇0𝜃1\sum_{a\in A\setminus\{a_{0}\}}\pi(a)=\sum_{a\in A\setminus\{a_{0}\}}\sum_{% \theta\in\Theta}\pi(a|\theta)\mu_{0}(\theta)=1∑ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT italic_π ( italic_a ) = ∑ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_π ( italic_a | italic_θ ) italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) = 1, namely, the total probability of useful signals (signals in A{a0}𝐴subscript𝑎0A\setminus\{a_{0}\}italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }) is 1. Then, by the splitting lemma, the prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT can be expressed as the convex combination

μ0=aA{a0}π(a)μasubscript𝜇0subscript𝑎𝐴subscript𝑎0𝜋𝑎subscript𝜇𝑎\mu_{0}=\sum_{a\in A\setminus\{a_{0}\}}\pi(a)\mu_{a}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = ∑ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT italic_π ( italic_a ) italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT (23)

where π(a)=θΘμ0(θ)π(a|θ)𝜋𝑎subscript𝜃Θsubscript𝜇0𝜃𝜋conditional𝑎𝜃\pi(a)=\sum_{\theta\in\Theta}\mu_{0}(\theta)\pi(a|\theta)italic_π ( italic_a ) = ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) italic_π ( italic_a | italic_θ ) is the unconditional probability of signal a𝑎aitalic_a and μasubscript𝜇𝑎\mu_{a}italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT is the true posterior induced by signal a𝑎aitalic_a. Moreover, the indifference constraint (8) in the linear program ensures that the agent is indifferent between a𝑎aitalic_a and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT upon receiving signal a𝑎aitalic_a if the agent has bias level τ𝜏\tauitalic_τ: mathematically, τμ0+(1τ)μaIa𝜏subscript𝜇01𝜏subscript𝜇𝑎subscript𝐼𝑎\tau\mu_{0}+(1-\tau)\mu_{a}\in I_{a}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ∈ italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT. This means μaIa,τsubscript𝜇𝑎subscript𝐼𝑎𝜏\mu_{a}\in I_{a,\tau}italic_μ start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT ∈ italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT by definition. So, we obtain

μ0ConvexHull(aA{a0}Ia,τ).subscript𝜇0ConvexHullsubscript𝑎𝐴subscript𝑎0subscript𝐼𝑎𝜏\mu_{0}\in\mathrm{ConvexHull}\bigg{(}~{}\bigcup_{a\in A\setminus\{a_{0}\}}I_{a% ,\tau}~{}\bigg{)}.italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ roman_ConvexHull ( ⋃ start_POSTSUBSCRIPT italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT ) . (24)

B.2.2 Proof of Parts 2 and 3 of Theorem 5.2

We first prove that, if whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ can be tested with finite sample complexity, then Ia,τsubscript𝐼𝑎𝜏I_{a,\tau}\neq\emptysetitalic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT ≠ ∅ for at least one aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }.

According to Lemma 4.1, if we can test whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ with finite sample complexity using adaptive algorithms, then we can do this using a constant signaling scheme. Lemma 4.3 further ensures that we can do this using a constant signaling scheme π𝜋\piitalic_π with only boundary and internal signals. But according to Lemma 4.4, internal signals are not useful for testing wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ. So, the signaling scheme π𝜋\piitalic_π must send some boundary signal s𝑠sitalic_s with positive probability. Let μssubscript𝜇𝑠\mu_{s}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT be the true posterior induced by s𝑠sitalic_s. By the definition of boundary signal, τμ0+(1τ)μsRa0𝜏subscript𝜇01𝜏subscript𝜇𝑠subscript𝑅subscript𝑎0\tau\mu_{0}+(1-\tau)\mu_{s}\in\partial R_{a_{0}}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ∈ ∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, implying that the agent is indifferent between a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and some action aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } if their belief is τμ0+(1τ)μs𝜏subscript𝜇01𝜏subscript𝜇𝑠\tau\mu_{0}+(1-\tau)\mu_{s}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT (and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT and a𝑎aitalic_a are better than any other actions). This means τμ0+(1τ)μsIa𝜏subscript𝜇01𝜏subscript𝜇𝑠subscript𝐼𝑎\tau\mu_{0}+(1-\tau)\mu_{s}\in I_{a}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ∈ italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT, so μsIa,τsubscript𝜇𝑠subscript𝐼𝑎𝜏\mu_{s}\in I_{a,\tau}italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ∈ italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT by definition. Hence, Ia,τsubscript𝐼𝑎𝜏I_{a,\tau}\neq\emptysetitalic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT ≠ ∅.

We then prove the opposite direction: if Ia,τsubscript𝐼𝑎𝜏I_{a,\tau}\neq\emptysetitalic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT ≠ ∅ for at least one aA{a0}𝑎𝐴subscript𝑎0a\in A\setminus\{a_{0}\}italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT }, then whether wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ can be tested with finite sample complexity.

Let a1A{a0}subscript𝑎1𝐴subscript𝑎0a_{1}\in A\setminus\{a_{0}\}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } be an action for which Ia1,τsubscript𝐼subscript𝑎1𝜏I_{a_{1},\tau}\neq\emptysetitalic_I start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_τ end_POSTSUBSCRIPT ≠ ∅. We claim that:

Claim B.1.

There exists a state θ1Θsubscript𝜃1Θ\theta_{1}\in\Thetaitalic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ∈ roman_Θ for which the agent weakly prefers action a1subscript𝑎1a_{1}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT over action a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT if the true posterior is state θ1subscript𝜃1\theta_{1}italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT with probability 1 and the agent has bias level τ𝜏\tauitalic_τ. In notation, let eθ1Δ(Θ)subscript𝑒subscript𝜃1ΔΘe_{\theta_{1}}\in\Delta(\Theta)italic_e start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ∈ roman_Δ ( roman_Θ ) be the vector whose θ1subscript𝜃1\theta_{1}italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPTth component is 1111 and other components are 00. The agent weakly prefers action a1subscript𝑎1a_{1}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT over action a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT under belief τμ0+(1τ)eθ1𝜏subscript𝜇01𝜏subscript𝑒subscript𝜃1\tau\mu_{0}+(1-\tau)e_{\theta_{1}}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_e start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT.

Proof.

Suppose on the contrary that no such state θ1subscript𝜃1\theta_{1}italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT exists. Then the agent strictly prefers a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT over a1subscript𝑎1a_{1}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT under belief τμ0+(1τ)eθ𝜏subscript𝜇01𝜏subscript𝑒𝜃\tau\mu_{0}+(1-\tau)e_{\theta}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_e start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT for every state θΘ𝜃Θ\theta\in\Thetaitalic_θ ∈ roman_Θ. This implies that, for any belief μΔ(Θ)𝜇ΔΘ\mu\in\Delta(\Theta)italic_μ ∈ roman_Δ ( roman_Θ ), the agent should also strictly prefer a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT over a1subscript𝑎1a_{1}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT under the belief τμ0+(1τ)μ𝜏subscript𝜇01𝜏𝜇\tau\mu_{0}+(1-\tau)\muitalic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ, due to linearity of the agent’s utility with respect to the belief. The agent strictly preferring a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT over a1subscript𝑎1a_{1}italic_a start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT implies τμ0+(1τ)μIa𝜏subscript𝜇01𝜏𝜇subscript𝐼𝑎\tau\mu_{0}+(1-\tau)\mu\notin I_{a}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ ∉ italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT, so μ𝜇\muitalic_μ cannot be in Ia,τsubscript𝐼𝑎𝜏I_{a,\tau}italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT by definition. This holds for any μΔ(Θ)𝜇ΔΘ\mu\in\Delta(\Theta)italic_μ ∈ roman_Δ ( roman_Θ ), so Ia,τ=subscript𝐼𝑎𝜏I_{a,\tau}=\emptysetitalic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT = ∅, a contradiction. ∎

Let θ1subscript𝜃1\theta_{1}italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT be the state in the above claim. The prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT can be trivially written as the convex combination of eθ1subscript𝑒subscript𝜃1e_{\theta_{1}}italic_e start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT and eθsubscript𝑒𝜃e_{\theta}italic_e start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT for other states θ𝜃\thetaitalic_θ:

μ0=μ0(θ1)eθ1+θΘ{θ1}μ0(θ)eθ.subscript𝜇0subscript𝜇0subscript𝜃1subscript𝑒subscript𝜃1subscript𝜃Θsubscript𝜃1subscript𝜇0𝜃subscript𝑒𝜃\mu_{0}=\mu_{0}(\theta_{1})e_{\theta_{1}}+\sum_{\theta\in\Theta\setminus\{% \theta_{1}\}}\mu_{0}(\theta)e_{\theta}.italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) italic_e start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ ∖ { italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ( italic_θ ) italic_e start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT . (25)

Since the agent does not prefer a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT under belief τμ0+(1τ)eθ1𝜏subscript𝜇01𝜏subscript𝑒subscript𝜃1\tau\mu_{0}+(1-\tau)e_{\theta_{1}}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_e start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, the belief τμ0+(1τ)eθ1𝜏subscript𝜇01𝜏subscript𝑒subscript𝜃1\tau\mu_{0}+(1-\tau)e_{\theta_{1}}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_e start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT cannot be in the region Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT. The prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT is in the region Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Consider the line segment connecting eθ1subscript𝑒subscript𝜃1e_{\theta_{1}}italic_e start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT and the prior μ0subscript𝜇0\mu_{0}italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. There must exist a point μ=teθ1+(1t)μ0superscript𝜇𝑡subscript𝑒subscript𝜃11𝑡subscript𝜇0\mu^{\prime}=te_{\theta_{1}}+(1-t)\mu_{0}italic_μ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT = italic_t italic_e start_POSTSUBSCRIPT italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT end_POSTSUBSCRIPT + ( 1 - italic_t ) italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT on the line segment such that the τ𝜏\tauitalic_τ-biased belief τμ0+(1τ)μ𝜏subscript𝜇01𝜏superscript𝜇\tau\mu_{0}+(1-\tau)\mu^{\prime}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT lies exactly on the boundary of Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT. Clearly, the prior can also be written as a convex combination of μsuperscript𝜇\mu^{\prime}italic_μ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT and eθsubscript𝑒𝜃e_{\theta}italic_e start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT for θΘ{θ1}𝜃Θsubscript𝜃1\theta\in\Theta\setminus\{\theta_{1}\}italic_θ ∈ roman_Θ ∖ { italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT }:

μ0=pμ+θΘ{θ1}pθeθ.subscript𝜇0superscript𝑝superscript𝜇subscript𝜃Θsubscript𝜃1subscriptsuperscript𝑝𝜃subscript𝑒𝜃\mu_{0}=p^{\prime}\mu^{\prime}+\sum_{\theta\in\Theta\setminus\{\theta_{1}\}}p^% {\prime}_{\theta}e_{\theta}.italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT = italic_p start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT italic_μ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT + ∑ start_POSTSUBSCRIPT italic_θ ∈ roman_Θ ∖ { italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT italic_p start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT italic_e start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT . (26)

Then by the splitting lemma (Lemma 2.1), there exists a signaling scheme with |Θ|Θ|\Theta|| roman_Θ | signals where one signal induces posterior μsuperscript𝜇\mu^{\prime}italic_μ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT and the other signals induce posteriors {eθ}θΘ{θ1}subscriptsubscript𝑒𝜃𝜃Θsubscript𝜃1\{e_{\theta}\}_{\theta\in\Theta\setminus\{\theta_{1}\}}{ italic_e start_POSTSUBSCRIPT italic_θ end_POSTSUBSCRIPT } start_POSTSUBSCRIPT italic_θ ∈ roman_Θ ∖ { italic_θ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT } end_POSTSUBSCRIPT. In particular, the signal inducing μsuperscript𝜇\mu^{\prime}italic_μ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT is a boundary signal since τμ0+(1τ)μRa0𝜏subscript𝜇01𝜏superscript𝜇subscript𝑅subscript𝑎0\tau\mu_{0}+(1-\tau)\mu^{\prime}\in\partial R_{a_{0}}italic_τ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_τ ) italic_μ start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT ∈ ∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT by construction. By Lemma 4.2, that signal is useful for testing wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ. When that signal is sent (which happens with positive probability p>0superscript𝑝0p^{\prime}>0italic_p start_POSTSUPERSCRIPT ′ end_POSTSUPERSCRIPT > 0 at each time step), we can tell wτ𝑤𝜏w\geq\tauitalic_w ≥ italic_τ or wτ𝑤𝜏w\leq\tauitalic_w ≤ italic_τ. This finishes the proof.

The two directions proved above together prove the parts 2 and 3 of Theorem 5.2.

Appendix C A More General Bias Model

We define a more general model of biased belief than the linear model. The agent’s bias is captured by some function ϕ:Δ(Θ)×Δ(Θ)×[0,1]Δ(Θ):italic-ϕΔΘΔΘ01ΔΘ\phi:\Delta(\Theta)\times\Delta(\Theta)\times[0,1]\to\Delta(\Theta)italic_ϕ : roman_Δ ( roman_Θ ) × roman_Δ ( roman_Θ ) × [ 0 , 1 ] → roman_Δ ( roman_Θ ). Given prior μ0Δ(Θ)subscript𝜇0ΔΘ\mu_{0}\in\Delta(\Theta)italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ roman_Δ ( roman_Θ ), true posterior μsΔ(Θ)subscript𝜇𝑠ΔΘ\mu_{s}\in\Delta(\Theta)italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT ∈ roman_Δ ( roman_Θ ), and bias level w[0,1]𝑤01w\in[0,1]italic_w ∈ [ 0 , 1 ], the agent has biased belief ϕ(μ0,μs,w)italic-ϕsubscript𝜇0subscript𝜇𝑠𝑤\phi(\mu_{0},\mu_{s},w)italic_ϕ ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT , italic_w ). The linear model is the special case where ϕ(μ0,μs,w)=wμ0+(1w)μsitalic-ϕsubscript𝜇0subscript𝜇𝑠𝑤𝑤subscript𝜇01𝑤subscript𝜇𝑠\phi(\mu_{0},\mu_{s},w)=w\mu_{0}+(1-w)\mu_{s}italic_ϕ ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT , italic_w ) = italic_w italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT + ( 1 - italic_w ) italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT. We make the following natural assumptions on ϕitalic-ϕ\phiitalic_ϕ:

Assumption C.1.
  • ϕ(μ0,μs,0)=μsitalic-ϕsubscript𝜇0subscript𝜇𝑠0subscript𝜇𝑠\phi(\mu_{0},\mu_{s},0)=\mu_{s}italic_ϕ ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT , 0 ) = italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT (no bias), ϕ(μ0,μs,1)=μ0italic-ϕsubscript𝜇0subscript𝜇𝑠1subscript𝜇0\phi(\mu_{0},\mu_{s},1)=\mu_{0}italic_ϕ ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT , 1 ) = italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT (full bias).

  • ϕ(μ0,μs,w)italic-ϕsubscript𝜇0subscript𝜇𝑠𝑤\phi(\mu_{0},\mu_{s},w)italic_ϕ ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT , italic_w ) is continuous in μ0,μs,wsubscript𝜇0subscript𝜇𝑠𝑤\mu_{0},\mu_{s},witalic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_μ start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT , italic_w.

We then make some joint assumptions on the bias model ϕitalic-ϕ\phiitalic_ϕ and the agent’s utility function U𝑈Uitalic_U. Recall that the notation Ra0={μΔ(Θ)aA{a0},caμ>0}subscript𝑅subscript𝑎0conditional-set𝜇ΔΘformulae-sequencefor-all𝑎𝐴subscript𝑎0superscriptsubscript𝑐𝑎top𝜇0R_{a_{0}}=\big{\{}\mu\in\Delta(\Theta)\mid\forall a\in A\setminus\{a_{0}\},\,c% _{a}^{\top}\mu>0\big{\}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT = { italic_μ ∈ roman_Δ ( roman_Θ ) ∣ ∀ italic_a ∈ italic_A ∖ { italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT } , italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ > 0 } is the region of beliefs under which the agent strictly prefers action a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, Ra0subscript𝑅subscript𝑎0\partial R_{a_{0}}∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT is the boundary of Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, and extRa0extsubscript𝑅subscript𝑎0\mathrm{ext}R_{a_{0}}roman_ext italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT is the exterior of Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT where the agent strictly not prefers a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT.

Assumption C.2.

When receiving no information, the agent will take the default action: μ0Δ(Θ),w[0,1]formulae-sequencefor-allsubscript𝜇0ΔΘfor-all𝑤01\forall\mu_{0}\in\Delta(\Theta),\forall w\in[0,1]∀ italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT ∈ roman_Δ ( roman_Θ ) , ∀ italic_w ∈ [ 0 , 1 ], ϕ(μ0,μ0,w)Ra0italic-ϕsubscript𝜇0subscript𝜇0𝑤subscript𝑅subscript𝑎0\phi(\mu_{0},\mu_{0},w)\in R_{a_{0}}italic_ϕ ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_w ) ∈ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT.

Definition C.1.

We say that a posterior belief μΔ(Θ)𝜇ΔΘ\mu\in\Delta(\Theta)italic_μ ∈ roman_Δ ( roman_Θ ) satisfies single-crossing if the curve {ϕ(μ0,μ,w):w[0,1]}conditional-setitalic-ϕsubscript𝜇0𝜇𝑤𝑤01\{\phi(\mu_{0},\mu,w):w\in[0,1]\}{ italic_ϕ ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_μ , italic_w ) : italic_w ∈ [ 0 , 1 ] } passes the boundary Ra0subscript𝑅subscript𝑎0\partial R_{a_{0}}∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT only once: namely, there exists w¯[0,1]¯𝑤01\overline{w}\in[0,1]over¯ start_ARG italic_w end_ARG ∈ [ 0 , 1 ] such that

{w[0,w¯),ϕ(μ0,μ,w)extRa0;ϕ(μ0,μ,w¯)Ra0;w(w¯,1],ϕ(μ0,μ,w)Ra0.casesfor-all𝑤0¯𝑤italic-ϕsubscript𝜇0𝜇𝑤extsubscript𝑅subscript𝑎0otherwiseitalic-ϕsubscript𝜇0𝜇¯𝑤subscript𝑅subscript𝑎0for-all𝑤¯𝑤1italic-ϕsubscript𝜇0𝜇𝑤subscript𝑅subscript𝑎0\begin{cases}\forall w\in[0,\overline{w}),&\phi(\mu_{0},\mu,w)\in\mathrm{ext}R% _{a_{0}};\\ &\phi(\mu_{0},\mu,\overline{w})\in\partial R_{a_{0}};\\ \forall w\in(\overline{w},1],&\phi(\mu_{0},\mu,w)\in R_{a_{0}}.\end{cases}{ start_ROW start_CELL ∀ italic_w ∈ [ 0 , over¯ start_ARG italic_w end_ARG ) , end_CELL start_CELL italic_ϕ ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_μ , italic_w ) ∈ roman_ext italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ; end_CELL end_ROW start_ROW start_CELL end_CELL start_CELL italic_ϕ ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_μ , over¯ start_ARG italic_w end_ARG ) ∈ ∂ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT ; end_CELL end_ROW start_ROW start_CELL ∀ italic_w ∈ ( over¯ start_ARG italic_w end_ARG , 1 ] , end_CELL start_CELL italic_ϕ ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_μ , italic_w ) ∈ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT . end_CELL end_ROW (27)

We assume that all posteriors outside Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT satisfy single-crossing, and all posteriors inside Ra0subscript𝑅subscript𝑎0R_{a_{0}}italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT do not cross the boundary when the bias level varies in [0,1]01[0,1][ 0 , 1 ]:

Assumption C.3.
  • Any μRa0𝜇subscript𝑅subscript𝑎0\mu\notin R_{a_{0}}italic_μ ∉ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT satisfies single-crossing.

  • For any μRa0𝜇subscript𝑅subscript𝑎0\mu\in R_{a_{0}}italic_μ ∈ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT, any w[0,1]𝑤01w\in[0,1]italic_w ∈ [ 0 , 1 ], ϕ(μ0,μ,w)Ra0italic-ϕsubscript𝜇0𝜇𝑤subscript𝑅subscript𝑎0\phi(\mu_{0},\mu,w)\in R_{a_{0}}italic_ϕ ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_μ , italic_w ) ∈ italic_R start_POSTSUBSCRIPT italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT end_POSTSUBSCRIPT.

Under the above general bias model with the stated assumptions, our results regarding the optimality of constant signaling schemes (Lemma 4.1) and the revelation principle (Lemma 4.5) still hold. The geometric characterization of testability of bias (Theorem 5.2) holds after redefining some notations. Let Ia={μΔ(Θ)caμ=0}subscript𝐼𝑎conditional-set𝜇ΔΘsuperscriptsubscript𝑐𝑎top𝜇0I_{a}=\{\mu\in\Delta(\Theta)\mid c_{a}^{\top}\mu=0\}italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT = { italic_μ ∈ roman_Δ ( roman_Θ ) ∣ italic_c start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ⊤ end_POSTSUPERSCRIPT italic_μ = 0 } still be the set of beliefs where the agent is indifferent between actions a𝑎aitalic_a and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT. Let Ia,τsubscript𝐼𝑎𝜏I_{a,\tau}italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT still be the set of posterior beliefs for which an agent with bias level τ𝜏\tauitalic_τ will be indifferent between a𝑎aitalic_a and a0subscript𝑎0a_{0}italic_a start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT, but with a more general expression than the linear model:

Ia,τ:={μΔ(Θ)ϕ(μ0,μ,τ)Ia}.assignsubscript𝐼𝑎𝜏conditional-set𝜇ΔΘitalic-ϕsubscript𝜇0𝜇𝜏subscript𝐼𝑎I_{a,\tau}:=\{\mu\in\Delta(\Theta)\mid\phi(\mu_{0},\mu,\tau)\in I_{a}\}.italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT := { italic_μ ∈ roman_Δ ( roman_Θ ) ∣ italic_ϕ ( italic_μ start_POSTSUBSCRIPT 0 end_POSTSUBSCRIPT , italic_μ , italic_τ ) ∈ italic_I start_POSTSUBSCRIPT italic_a end_POSTSUBSCRIPT } . (28)

With the above definition of Ia,τsubscript𝐼𝑎𝜏I_{a,\tau}italic_I start_POSTSUBSCRIPT italic_a , italic_τ end_POSTSUBSCRIPT, Theorem 5.2 still holds. We omit the proofs because they are almost identical to the proofs for the linear model.

The linear program algorithm for computing the optimal signaling scheme (Algorithm 1 and Theorem 4.6) does not apply to the general bias model because ϕitalic-ϕ\phiitalic_ϕ is not linear. We view designing an efficient algorithm for the general bias model an interesting future direction.