Bias Detection via Signaling
Abstract
We introduce and study the problem of detecting whether an agent is updating their prior beliefs given new evidence in an optimal way that is Bayesian, or whether they are biased towards their own prior. In our model, biased agents form posterior beliefs that are a convex combination of their prior and the Bayesian posterior, where the more biased an agent is, the closer their posterior is to the prior. Since we often cannot observe the agent’s beliefs directly, we take an approach inspired by information design. Specifically, we measure an agent’s bias by designing a signaling scheme and observing the actions they take in response to different signals, assuming that they are maximizing their own expected utility; our goal is to detect bias with a minimum number of signals. Our main results include a characterization of scenarios where a single signal suffices and a computationally efficient algorithm to compute optimal signaling schemes.
1 Introduction
A bag contains two coins that look and feel identical, but one is a fair coin that, on a flip, comes up heads with probability 0.5, and the other is an unfair coin with probability 0.9 of heads. You reach into the bag, grab one of the coins and flip it once; it lands on heads. Since you are (hopefully) familiar with Bayes’ rule, you conclude that the probability you are holding the fair coin is . Now suppose you are offered the following deal: if you pay $1, you get to flip the same coin again, and if it comes up heads, you will receive $1.4. Since you now believe that the probability of heads is 0.76, you take the deal (assuming you are risk neutral) and earn 6 cents in expectation.
If, by contrast, another risk-neutral person in the same situation decides to decline the same deal, they must believe that the probability they are holding the fair coin is greater than 0.47. That is, their belief is still very close to the prior of 0.5. We think of such a person as being biased, in the sense that they are unwilling to significantly update their beliefs, despite evidence to the contrary.
Of course, failing to update one’s beliefs about coin flips is not the end of the world. But this example serves to illustrate a broader phenomenon that, in our view, is both important and ubiquitous. In particular, the “stickiness” of prior beliefs in the face of evidence plays a role in politics — think of the controversy over Russian collusion in the 2016 US presidential election or the existence of weapons of mass destruction in Iraq in 2003. It is also prevalent in science, as exemplified by the polarized debate over the origins of the Covid pandemic [3].
Our goal in this paper is to develop algorithms that are able to detect bias in the form of non-Bayesian updating of beliefs. To our knowledge, we are the first to formalize and analytically address this problem, and we aim to build an initial framework that future work would build on. In the long term, we believe such algorithms could have many applications, including understanding to what degree the foregoing type of bias contributes to disagreement and polarization, and discounting the opinions of biased agents to improve collective decision making.
Our approach. The first question we need to answer is how to quantify bias. In this first investigation, we adopt a linear model of bias that was proposed and used as a general belief updating model in economics [8, 10, 5]. If the prior is and the correct Bayesian posterior upon receiving a signal (or evidence) is denoted , we posit that an agent with bias adopts the belief . At the extremes, an agent with bias performs perfect Bayesian updating and an agent with bias cannot be convinced to budge from the prior.
The bigger conceptual question is how we can infer an agent’s bias. To address it, we take an approach that is inspired by the literature on information design [13]. In our context, suppose that we (the principal) and the agent have asymmetric information: while both share a common (say public) prior about the state of the world, the principal knows the true (realized) state of the world, but the agent does not. The principal publicly commits to a (randomized) signaling scheme that specifies the probability of sending each possible signal given each possible realized state of the world. Given their knowledge of the latter, the principal draws a signal from the specified distribution and sends it to the agent. Upon receiving such a signal, the agent updates their beliefs about the state of the world (from the common prior) and then takes an action that maximizes their expected utility according to a given utility function. Similarly to the example we started with, it is the action taken by the agent that can (indirectly) reveal their degree of bias.
The challenge, then, is to design signaling schemes that test whether the agent’s bias is above or below some threshold. (Through binary search, this makes it possible to estimate the level of bias.) We wish to do so in the most efficient way, that is, using a minimum number of signals in expectation.
Our results. We design a polynomial-time algorithm that computes optimal signaling schemes, in Section 4. We first show that constant algorithms, which repeatedly use the same signaling scheme, are as powerful as adaptive algorithms, which can vary the scheme over time based on historical data (Lemma 4.1); we can therefore restrict our attention to constant algorithms. In Lemma 4.5, we establish a version of the revelation principle for the bias detection problem, which asserts that optimal signaling schemes need only use signals that can be interpreted as action recommendations. Finally, building on these insights, we show that the optimal solution to our problem is obtained by solving a “small” linear program (Algorithm 1 and Theorem 4.6).
In Section 5, we present a geometric characterization of optimal signaling schemes (Theorem 5.2), which sheds additional light on the performance of the algorithm. In particular, the characterization provides sufficient and necessary conditions for the testability of bias, and also identifies cases where only a single sample is needed for this task.
Related work. There is a significant body of experimental work in the social sciences aiming to explain the failure of partisans to reach similar beliefs on factual questions where there is a large amount of publicly available evidence. The fact that biased belief updating occurs is undisputed (to our knowledge), and the focus is on understanding the factors that play a role. In particular, a prominent line of work supports the (perhaps counterintuitive) hypothesis that the more cognitively sophisticated a partisan is, the more politically biased is their belief update process [16, 17, 12, 11]. These results are challenged by more recent work by Tappin et al. [18], who found that greater analytical thinking is associated with belief updates that are less biased, using an experimental design that explicitly measures the proximity of belief updates to a correct Bayesian posterior. While these studies provide empirical underpinnings for our theoretical model, their research questions are orthogonal to ours: we aim to measure the magnitude of bias regardless of its source.
Classical work in information design [4, 13] studies how a principal can strategically provide information to induce an agent to take actions that are beneficial for the principal, assuming a perfectly Bayesian agent. Various relaxations of the perfectly Bayesian assumption have been investigated [1, 10, 7, 5, 9, 20, 14]. The closest to our work is that of de Clippel and Zhang [5], who study biased belief update models including the linear model. However, their goal is to maximize the principal’s utility with the agent’s bias fully known. In our problem the agent’s bias level is unknown, and the principal’s goal is to infer the agent’s bias level instead of maximizing their own utility.
2 Model
Biased agent. Consider a standard Bayesian setting: the relevant state of the world is , distributed according to some known prior distribution . If an agent were perfectly Bayesian, when receiving some new information (“signal”) and with the knowledge of the conditional distributions for all , they would update their belief about the state of the world according to Bayes’ Rule: . We refer to as the true posterior belief induced by . Being biased, the agent’s belief after seeing , denoted , is a convex combination of and :
where is the unknown bias level, capturing the agent’s inclination to retain their prior belief in the presence of new information. This linear model was proposed and adopted in economics for non-Bayesian belief updating [8, 10], in order to capture people’s conservatism in processing new information and their tendency to protect their beliefs [19].
The agent can choose an action from a finite set and has a state-dependent utility function . They receive utility when taking action in state . The agent will act according to their (biased) belief and choose an action that maximizes their expected utility:
In the absence of any additional information, the agent operates based on the prior belief and will select an action deemed optimal with respect to . We introduce the following mild assumption to ensure the uniqueness of this action:
Assumption 2.1.
There is a unique action that maximizes the expected utility based on the prior belief : .
This assumption will be made throughout the paper. We denote the unique optimal action on the prior belief as , and call it the default action.
Bias detection. The principal, who knows the prior and the agent’s utility function , seeks to infer the agent’s bias level from their action as efficiently as possible. The principal has an informational advantage — they observe the independent realizations of the state of the world at each time step. In other words, the principal knows , an independent sample drawn according to at time . The principal wants to design signaling schemes to strategically reveal information about to the agent, ho** to influence the agent’s biased belief in a way that the agent’s chosen actions reveal information about their bias level. Specifically, with a finite signal space , the principal can commit to a signaling scheme at time , where specifies the probability of sending signal in state at time . After seeing a signal , drawn according to at time , the agent takes action that is optimal for their biased belief . The principal infers information about bias from the history of signaling schemes, realized states, realized signals, and agent actions . We denote by an adaptive algorithm that the principal uses to decide on the signaling scheme at time based on history .
Given a threshold , the principal wants to design to answer the following question:
-
Is the agent’s bias level greater than or equal to or less than or equal to ?111One may want to test or instead. But this requires assumptions on tie-breaking when the agent has multiple optimal actions. Indifference at allows us to avoid such assumptions.
An algorithm terminates as soon as it can output a deterministic answer to this question. The number of time steps for to terminate, denoted by , is a random variable. The sample complexity of is defined to be the expected termination time in the worst case over :
Definition 2.1 (sample complexity).
The (worst-case) sample complexity of is defined as222Taking the worst case over is not overly pessimistic. As we will show later, the worst case in fact happens at for some , which makes intuitive sense. Therefore, the sample complexity can be equivalently defined as .
Our objective is to develop an algorithm that can determine whether or with minimal sample complexity. Specifically, we want to solve the following minimax problem:
We say that an algorithm is constant if it keeps using the same signaling scheme repeatedly until termination. Constant algorithms are a special case of non-adaptive algorithms, which may vary the signaling schemes over time but remain independent of historical data.
Preliminaries. We now introduce the well-known splitting lemma from the information design literature [2, 13, 15]. It relates a signaling scheme with a set of induced true posteriors for a Bayesian agent and a distribution over the set of true posteriors.
Lemma 2.1 (Splitting Lemma, e.g., [13]).
Let be a signaling scheme where each signal is sent with unconditional probability and induces true posterior . Then, the prior equals the convex combination of with weights : . Conversely, if the prior can be expressed as a convex combination of distributions : , where , then there exists a signaling scheme where each signal is sent with unconditional probability and induces posterior .
The splitting lemma is also referred as the Bayesian consistency condition. It allows one to think about choosing a signaling scheme as choosing a set of true posteriors, , and a distribution over the set, , in a Bayesian consistent way.
3 Warm-Up: A Two-State, Two-Action Example
How can the principal design a signaling scheme to learn the agent’s bias level? We use a simple two-state, two-action example to demonstrate how inducing a specific true posterior belief will allow the principal to determine whether or .
The two states of the world are represented as {, }. The agent has two possible actions: and . Taking the action always yields a utility of , independently of the state. For the action, the utility is if the state is and otherwise; . We use the probability of the state to represent a belief, so the prior is a number , which is only a slight abuse of notation. With belief for the state (and for the state), the agent’s expected utility for choosing the action is . Thus, the action is better than the action (so the agent will take ) if
(1) |
Conversely, the action is better if . Here, is an indifference belief where the agent is indifferent between the two actions. We assume that the prior satisfies , so the agent chooses the action by default.
Consider the following constant signaling scheme with two signals :
-
•
If the state is , send signal with probability one.
-
•
If the state is , send signal with probability and signal with the complement probability.
We will show that, by repeatedly using , we can test whether the agent’s bias is or . By Bayes’ Rule, the true posterior beliefs (for the state) associated with the two signals are (i.e., on receiving , the agent knows the state is ) and
Notably, the posterior satisfies the following property: if the agent’s bias level is exactly equal to , then the agent’s biased belief is equal to the indifference belief:
We also note the inequality . As a result, if the agent’s bias level is greater than , then the biased belief will be smaller than , and otherwise the opposite is true:
By Equation (1), this means that the agent will take the action if , and the action if (on receiving ). Therefore, by observing which action is taken by the agent when signal is sent, we can immediately tell whether or . This leads to the following:
Theorem 3.1.
In the two-state, two-action example, for any threshold , the above constant signaling scheme can test whether the agent’s bias satisfies or : specifically, whenever the signal is sent,
-
•
if the agent takes action , then ,
-
•
if the agent takes action , then .
The sample complexity of this scheme is , which increases with .
Proof.
The range ensures that the probability is in . The two items in the theorem follow from the argument before the theorem statement. The sample complexity is equal to the expected number of time steps until a signal is sent, which is a geometric random variable with success probability . So the sample complexity is equal to the mean . ∎
The main intuition behind this result is that in order to test whether or , we want to design a signaling scheme where some of the signals induce posteriors under which the agent is indifferent between two actions if the agent’s bias level is exactly . Then, the action actually taken by the agent will directly reveal whether or . Such signals are useful signals, but not all signals are necessarily useful. The sample complexity is then determined by the total probability of useful signals. This intuition will carry over to computing the optimal signaling scheme for the general case in Section 4.
Finally, we remark that using the constant signaling scheme constructed above to test or is in fact the optimal adaptive algorithm, according to the results we will present in Section 4. So, the minimal sample complexity to test whether or in this two-state, two-action example is exactly as shown in Theorem 3.1.
4 Computing the Optimal Signaling Scheme in the General Case
In this section, we generalize the initial observations from the previous section to the case with any number of actions and states and general utility function . We will show how to compute the optimal algorithm (signaling scheme) to test the agent’s bias level. There are three key ingredients. First, we prove that we can use a constant signaling scheme. Second, we develop a “revelation principle” to further simplify the space of signaling schemes. Building on these two steps, we show that the optimal signaling scheme can be computed by a linear program.
4.1 Optimality of Constant Signaling Schemes
In this subsection, we show that adaptive algorithms are no better than constant algorithms for the problem of testing whether or . Therefore, to find the algorithm with minimal sample complexity, we only need to consider constant algorithms/signaling schemes.
Lemma 4.1.
Fix . For the problem of testing whether or , the sample complexity of any adaptive algorithm is at least that of the optimal constant algorithm (i.e., using a fixed signaling scheme repeatedly).
To prove this lemma, we introduce some notations. For any action , define vector
(2) |
whose components are the utility differences between the default action and any other action at different states . Let be the region of beliefs under which the agent strictly prefers over any other action:
(3) |
It is the intersection of open halfspaces with the probability simplex . As the agent strictly prefers at the prior , we have . The boundary of this region, , is the set of beliefs where the agent is indifferent between and at least one other action and and are both (weakly) better than any other action:
(4) |
Lastly, the exterior of , denoted as , comprises the set of beliefs where the agent strictly prefers not to choose :
(5) |
Given a signaling scheme , we classify its signals into three types based on the location of the biased belief associated with the signal with respect to the region .
Definition 4.1.
Let be a parameter. Let be a signal from a signaling scheme , with associated true posterior and -biased posterior . We say is
-
•
an internal signal if ;
-
•
a boundary signal if ;
-
•
an external signal if .
The above classification helps to formalize the idea of whether a signal is “useful” for bias detection. A boundary signal is useful because the action taken by the agent after receiving a boundary signal immediately tells whether or :
Lemma 4.2.
When a boundary signal is realized, the agent’s action immediately reveals whether or . Specifically, if the agent chooses action , then ; otherwise, .
Proof.
If the agent’s bias level satisfies , then the biased belief must be inside (because is on the boundary of and ), so the agent strictly prefers the default action . If , then the biased belief is outside of , so the agent will not take action . ∎
An external signal might also be useful in revealing whether or if the agent is indifferent between some actions other than at the -biased belief . However, the following lemma shows that, in such cases, we can always modify the signaling scheme to turn the external signal into a boundary signal. This modification will increase the total probability of useful signals and hence reduce the sample complexity. The proof of this lemma is in Appendix A.1.
Lemma 4.3.
Suppose is an adaptive algorithm that uses signaling schemes with internal, boundary, and external signals. Then, there exists another adaptive algorithm with equal or lower sample complexity that employs only signaling schemes with internal and boundary signals.
An internal signal, on the other hand, is not useful for testing or , for the following reason. For an internal signal, the biased belief with bias level , , lies inside . Since is an open region, there must exist a small number such that when the agent has bias level or , the biased belief with bias level , , is also inside the region , so the agent will take action . As the agent takes under both and , we cannot distinguish these two cases, so this signal is not helpful in determining or . The following lemma (with proof in Appendix A.2) formalizes the idea that internal signals are not useful:
Lemma 4.4.
To test whether or , any adaptive algorithm that uses signaling schemes with boundary and internal signals cannot terminate until a boundary signal is sent.
With the above lemmas, we can now prove Lemma 4.1.
Proof of Lemma 4.1.
By Lemma 4.3, the optimal adaptive algorithm only uses signaling schemes with boundary and internal signals. By Lemma 4.4, the algorithm cannot terminate until a boundary signal is sent. By Lemma 4.2, the algorithm terminates when a boundary signal is sent. We conclude that the termination time of any adaptive algorithm cannot be better than the constant algorithm that keeps using the signaling scheme that maximizes the total probability of boundary signals. ∎
4.2 Revelation Principle
To compute the optimal constant signaling scheme, we need another technique that is similar to the revelation principle in the information design literature [13, 6]. The revelation principle says that, in some information design problems, it is without loss of generality to consider only “direct” signaling schemes where signals are recommendations of actions for the agent, that is, the signal space is , and when the principal sends signal , it should be optimal for the agent to take action given the posterior belief induced by signal .
Unlike classical information design problems where the agent is unbiased, our problem involves a biased agent, so we need a different revelation principle: the signals are still action recommendations, but when the principal sends signal , action is optimal for an agent with bias level exactly ; moreover, if , then an agent with bias level will be indifferent between and . This insight is formalized in the following lemma:
Lemma 4.5 (revelation principle for bias detection).
Let be an arbitrary signaling scheme that can test or . Then, there exists another signaling scheme that can do so with signal space such that:
-
(1)
Given signal , action is an optimal action for any agent with bias level .
-
(2)
Given signal , actions and are both optimal for any agent with bias level . As a corollary, if the agent’s bias level , then the agent strictly prefers over ; and if , then the agent strictly prefers over any other actions.
-
(3)
The sample complexity satisfies .
In the above signaling scheme , every is a boundary signal (Definition 4.1), which is useful for testing bias: given signal , if the agent takes action , then it must be ; otherwise . The signal is internal and not useful for determining or . So, the sample complexity of is equal to the expected time steps until a signal in is sent.
The high-level idea behind Lemma 4.5 is combination of signals. Suppose there is a signaling scheme that can determine whether or with a signal space larger than . There must exist two signals and under which the agent is indifferent between and some action if the agent’s bias level is exactly . Then, we can combine the two signals into one signal under which the agent is still indifferent between and , obtaining a new signaling scheme with a smaller signal space. We can do this until the signal space is reduced to size . The full proof is in Appendix A.3.
4.3 Algorithm for Computing the Optimal Signaling Scheme
Finally, we present an algorithm to compute the optimal (minimal sample complexity) signaling scheme to test whether or . The revelation principle in the previous subsection ensures that we only need a direct signaling scheme where signals are action recommendations. The optimal direct signaling scheme turns out to be solvable by a linear program, detailed in Algorithm 1. In the linear program, the optimality constraint (7) ensures that whenever the principal recommends action , it is optimal for an agent with bias level to take action ; this satisfies condition (1) in the revelation principle (Lemma 4.5). The indifference constraint (8) ensures that when the recommended action is not , an agent with bias level is indifferent between and ; this satisfies condition (2) in the revelation principle. The objective (6) is to maximize the probability of useful signals (those in ), hence minimize the sample complexity. The resulting signaling scheme has the property that whenever the principal recommends an action other than , if the agent indeed follows the recommendation (or takes any other action than ), then the bias must be small (); if the agent takes instead, the bias must be large ().
(6) |
Optimality of over other actions: | |||||
(7) | |||||
Indifference between and : , | |||||
(8) | |||||
Probability distribution constraints: , | |||||
Theorem 4.6.
Algorithm 1 finds a constant signaling scheme for testing or that is optimal among all adaptive signaling schemes.
The linear program in Algorithm 1 has a polynomial size in (number of actions) and (number of states), so it is a polynomial-time algorithm.
The rest of this section proves Theorem 4.6. The proof needs an additional lemma:
Lemma 4.7.
Given a signaling scheme and an agent’s bias level , after signal is sent, the agent strictly prefers action over action under the biased belief if and only if:
Proof.
The agent’s biased belief under signal and bias level is given by
The condition for the agent to strictly prefer over is that the expected utility under the biased belief when choosing is greater than when choosing :
where . Multiplying by , we obtain:
Factoring out the terms, this can be rewritten as:
This final expression is positive if and only if the agent to strictly prefer over . ∎
Proof of Theorem 4.6.
According to Lemma 4.1 (constant algorithms are optimal) and Lemma 4.5 (revelation principle), to find an optimal adaptive algorithm we only need to find the optimal constant signaling scheme that satisfies the conditions in Lemma 4.5. We verify that the signaling scheme computed from the linear program in Algorithm 1 satisfies the conditions in Lemma 4.5:
- •
- •
We then argue that the solution of the linear program is the optimal signaling scheme that satisfies the conditions of Lemma 4.5. According to our argument after Lemma 4.5, only the signals in are useful signals, so the sample complexity is equal to the expected number of time steps until a signal in is sent. The probability that a signal in is sent at each time step is equal to
The expected number of time steps is the inverse (because the number of time steps is a geometric random variable). The linear program maximizes the probability , so it minimizes the sample complexity. ∎
5 Geometric Characterization of the Testability of Bias
To complement the algorithmic solution presented in the previous section, this section provides a geometric characterization of the bias detection problem. We identify the conditions under which testing whether or can be done in only one sample, in finite number of samples, or cannot be done at all (the scenario where the linear program in Algorithm 1 is infeasible).
By Assumption 2.1 ( is strictly better than other actions at prior ), we have:
(9) |
where is as defined in Equation (2). Define as the set of indifference beliefs between action and , which is the intersection of the hyperplane and the probability simplex :
Given a parameter , for which we want to test whether or , let be the set of posterior beliefs for which, if the agent’s bias level is exactly , then the agent’s biased belief will fall within the indifference set :
Lemma 5.1.
is equal to the intersection of the probability simplex and a translation of the hyperplane : .
The proof of this lemma is in Appendix B.1. With this representation of in hand, we can now present a geometric characterization of the testability of bias.
Theorem 5.2 (geometric characterization).
Fix . The problem of testing or
-
•
Can be solved with a single sample (the sample complexity is 1) if and only if the prior is in the convex hull formed by the translated sets for all non-default actions :
(10) -
•
Can be solved (with finite sample complexity) if and only if for at least one .
-
•
Cannot be solved if for all .
Figure 1 illustrates the three cases of Theorem 5.2. In the first case, the solution of the linear program in Algorithm 1 satisfies , meaning that useful signals are sent with probability 1, which allows us to tell whether or immediately. In the second case, the total probability of useful signals satisfies , so the sample complexity is more than 1. In the third case, the linear program in Algorithm 1 is not feasible, so or cannot be determined; importantly, this is not a limitation of our particular algorithm, but a general impossibility in our model. The proof of Theorem 5.2 is in Appendix B.2.
6 Discussion
Our approach has some limitations; here we discuss the two that we view as most significant.
First, we have assumed a linear model of bias. While the linear model is common in the literature [8, 10, 5], we also consider a more general model of bias (in Appendix C): as the bias level increases from to , the agent’s belief changes from the true posterior to the prior according to some general continuous function . We show that, as long as the bias function satisfies a certain single-crossing property (as increases, once the agent starts to prefer the default action , they will not change the preferred action anymore), our results regarding the optimality of constant signaling schemes, the revelation principle, and the geometric characterization still hold, while the linear program algorithm no longer works because is not linear. We consider it an interesting challenge to come up with more general models of bias that are still tractable, in the sense that one can efficiently design good signaling schemes with reasonable sample complexity bounds.
Second, we have assumed that the agent’s prior is the same as the real prior from which states of the world are drawn. But what if the agent’s prior is different? Our results directly extend to the case where the agent has a wrong, known prior. If the agent’s prior is unknown, then our problem becomes significantly more challenging. In fact, we believe it may be flat-out impossible to measure bias, as the case of a slightly wrong prior with large bias may be indistinguishable from the case of a very wrong prior with small bias.
Despite these limitations, we view our paper as making significant progress on a novel problem that seems fundamental. Our results suggest that practical algorithms for detecting bias in belief update are within reach and, in the long term, may lead to new insights on issues of societal importance. In particular, we anticipate future research in more complex situations such as combining decisions of many experts (human or AI) after measuring and accounting for their individual biases.
References
- Alonso and Câmara [2016] R. Alonso and O. Câmara. Bayesian persuasion with heterogeneous priors. Journal of Economic Theory, 165:672–706, 2016.
- Aumann et al. [1995] R. J. Aumann, M. B. Maschler, and R. E. Stearns. Repeated Games with Incomplete Information. MIT Press, 1995.
- Bloom et al. [2021] J. D. Bloom, Y. A. Chan, R. S. Baric, P. J. Bjorkman, S. Cobery, B. E. Deverman, D. N. Fisman, R. Gupta, A. Iwasaki, M. Lipsitch, R. Medzhitov, R. A. Neher, R. Nielsen, N. Patterson, T. Stearns, E. van Nimwegen, M. Worobey, and D. A. Relman. Investigate the origins of COVID-19. Science, 372(6543):694, 2021.
- Crawford and Sobel [1982] V. P. Crawford and J. Sobel. Strategic information transmission. Econometrica, 50(6):1431–1451, 1982.
- de Clippel and Zhang [2022] G. de Clippel and X. Zhang. Non-Bayesian persuasion. Journal of Political Economy, 130(10):2594–2642, 2022.
- Dughmi and Xu [2016] S. Dughmi and H. Xu. Algorithmic Bayesian persuasion. In Proceedings of the 48th Annual ACM Symposium on Theory of Computing (STOC), pages 412–425, 2016.
- Dworczak and Pavan [2022] P. Dworczak and A. Pavan. Preparing for the worst but ho** for the best: Robust (Bayesian) persuasion. Econometrica, 90(5):2017–2051, 2022.
- Epstein et al. [2010] L. G. Epstein, J. Noor, and A. Sandroni. Non-Bayesian learning. The B.E. Journal of Theoretical Economics, 10(1):732–735, 2010.
- Feng et al. [2024] Yiding Feng, Chien-Ju Ho, and Wei Tang. Rationality-robust information design: Bayesian persuasion under quantal response. In Proceedings of the 2024 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 501–546. SIAM, 2024.
- Hagmann and Loewenstein [2017] D. Hagmann and G. Loewenstein. Persuasion with motivated beliefs. Carnegie Mellon University technical report, 2017.
- Kahan [2013] D. M. Kahan. Ideology, motivated reasoning, and cognitive reflection. Judgment and Decision Making, 8(4):18, 2013.
- Kahan et al. [2012] D. M. Kahan, E. Peters, M. Wittlin, P. Slovic, L. L. Ouellette, D. Braman, and G. Mandel. The polarizing impact of science literacy and numeracy on perceived climate change risks. Nature Climate Change, 2(10):732–735, 2012.
- Kamenica and Gentzkow [2011] E. Kamenica and M. Gentzkow. Bayesian persuasion. American Economic Review, 101(6):2590–2615, 2011.
- Lin and Chen [2024] T. Lin and Y. Chen. Persuading a learning agent. arXiv preprint arXiv:2402.09721, 2024.
- Renault [2012] J. Renault. Repeated games with incomplete information. In R. A. Meyers, editor, Encyclopedia of Complexity and Systems Science. Springer, 2012.
- Taber and Lodge [2006] C. S. Taber and M. Lodge. Motivated skepticism in the evaluation of political beliefs. American Journal of Political Science, 50(3):755–769, 2006.
- Taber et al. [2009] C. S. Taber, D. Cann, and S. Kucsova. The motivated processing of political arguments. Political Behavior, 31(2):137–155, 2009.
- Tappin et al. [2020] B. M. Tappin, G. Pennycook, and D. G. Rand. Bayesian or biased? Analytic thinking and political belief updating. Cognition, 204:104375, 2020.
- Ward [2007] E. Ward. Conservatism in human information processing. In D. Kahneman, P. Slovic, and A. Tversky, editors, Judgment Under Uncertainty: Heuristics and Biases. Cambridge University Press, 2007.
- Yang and Zhang [2024] K. Yang and H. Zhang. Computational aspects of Bayesian persuasion under approximate best response. arXiv preprint arXiv:2402.07426, 2024.
Appendix A Missing Proofs from Section 4
A.1 Proof of Lemma 4.3
Proof.
Suppose that, during its operation, selects a signaling scheme that includes an external signal . By definition, for an external signal, the -biased belief is in . This implies that the true posterior , derived from the signaling scheme and the prior , also lies in . Consequently, the line segment connecting and , represented as , must intersect the boundary at some point. Denote this intersection by .
We will adjust the original signaling scheme . To do so, define as the belief whose -biased version equals :
Under the original signaling scheme , according to the splitting lemma (Lemma 2.1), the prior can be represented as a convex combination of and the posteriors associated with other signals :
If we change to , then we obtain a new convex combination (this is valid because is on the line segment from to ):
where
Then, by the splitting lemma (Lemma 2.1), there exists a signaling scheme with signals where signal induces posterior and other signals induces . Note that the -biased version of satisfies , so is a boundary signal under signaling scheme .
Since is a boundary signal, we can immediately tell whether or according to Lemma 4.2 when is sent and end the algorithm. If any signal other than is sent, the induced posterior is the same as the posterior in the original signaling scheme , so the agent will take the same action, and we can just follow the rest of the original algorithm . But we note that the probability of signal being sent under the new signaling scheme is larger than or equal to the probability under the original signaling scheme :
(11) |
So, in expectation, we can end the algorithm faster by using than using . Hence, by repeating the above procedure to replace all the signaling schemes in the original algorithm that use external signals, we obtain a new algorithm that only uses boundary and internal signals with smaller or equal sample complexity. ∎
A.2 Proof of Lemma 4.4
Proof.
Let be any adaptive algorithm using signaling schemes with boundary and internal signals. Let be any history that can happen during the execution of . If no boundary signal has been sent, then every realized signal is an internal signal in the respective signaling scheme , with the -biased posterior satisfying . Because is an open region, there must exist some such that the -norm ball is a subset of . Let . Then for every . Suppose the agent’s bias level is in the range . Then, for every signal , the agent’s biased belief satisfies:
This means
So, the agent should take action given signal . Note that this holds for every and any . So we cannot determine whether or so far. We have to run the algorithm until a boundary signal is sent. ∎
A.3 Proof of Lemma 4.5
Proof.
Let be a signaling scheme that can test whether or . According to Lemma 4.3, can be assumed to only use boundary and internal signals. Recall that a signal is boundary if the -biased belief lies on the boundary set . For , let be the set of beliefs under which the agent is indifferent between and and and are both better than other actions:
(12) |
The boundary set can be written as the union of for :
(13) |
Then, we classify the boundary signals into sets according to which sets their -biased beliefs belong to: namely, the set contains boundary signals under which
(14) |
We then combine the signals in . Specifically, consider the normalized weighted average of the true posterior beliefs associated with the signals in , denoted by :
(15) |
Note that the -biased version of is also in the set because is a convex set:
(16) |
This means that if a signal induces true posterior , then this signal is a boundary signal.
After defining as above for every , let’s consider the set of internal signals of the signaling scheme , which we denote by . For each internal signal , the -biased belief satisfies
(17) |
Similar to above, we combine all the signals in : define to be the normalized weighted average of the posteriors associated with all internal signals:
(18) |
Then, the -biased version of must be in because is a convex set:
(19) |
This means that, if a signal induces posterior , then this signal is internal.
From the splitting lemma (2.1), we know that the convex combination of the original posteriors is equal to the prior . This means that the following convex combination of the new posteriors and is also equal to the prior:
where the convex combination weight of is for every and the convex combination weight of is . One can easily verify that the weights sum to 1. Then, by the splitting lemma (2.1), there exists a signaling scheme with signal space of size (so we simply denote the signal space by ) where each signal induces posterior . We show that this new signaling scheme satisfies the properties in Lemma 4.5:
-
•
Signal induces posterior whose -biased version satisfies . So, given signal , action is the optimal action for an agent with bias level .
-
•
For each signal , the induced posterior satisfies . So, by the definition of , an agent with bias level is indifferent between actions and and these two actions are better than other actions. Also, this signal is a boundary signal by Definition 4.1, which satisfies the following according to Lemma 4.2: if the agent’s bias level , then the agent strictly prefers over ; if , then the agent strictly prefers over .
-
•
The sample complexity of is the same as because: (1) the sample complexity is equal to the inverse of the total probability of boundary signals (as a corollary of Lemma 4.4), and (2) the total probability of boundary signals of the two signaling schemes are the same:
(20) So, .
∎
Appendix B Missing Proofs from Section 5
B.1 Proof of Lemma 5.1
Proof.
For , by convexity of , we have . Then,
∎
B.2 Proof of Theorem 5.2
We first prove the first part of Theorem 5.2, then prove the the second and third parts.
B.2.1 Proof of Part 1 of Theorem 5.2
We want to prove that or can be tested with a single sample if and only if the prior is in the convex hull formed by the translated sets for all non-default actions : .
The “if” part.
Suppose , namely, there exist a set of positive weights and a set of posterior beliefs such that
where each for some . By definition, the -biased belief is in the indifference set . Recall the definition of the boundary set (Equation 4), which is the set of beliefs under which the agent is indifferent between and some other action and these two actions are better any other actions. The -biased belief may or may not belong to , depending on whether and are better than any other actions:
- •
-
•
If , then there must exist some action that is strictly better than and for the agent at the -biased belief, hence (so is an external signal). Then, according to the argument in Lemma 4.3, we can find another belief on the line segment between and such that the -biased version of lies exactly on the boundary set :
(21)
After the above discussion, we have found a that is either equal to or on the line segment between and , for every . So, can be written as a convex combination of :
(22) |
Moreover, the defined above satisfies . So, a signal inducing true posterior will be a boundary signal and useful for testing or (Lemma 4.2). Finally, by the splitting lemma (Lemma 2.1), we know that there must exist a signaling scheme with signal space where each signal indeed induces posterior . Such a signaling scheme sends useful (boundary) signals with probability 1. Hence, the sample complexity of it is 1.
The “only if” part.
Suppose whether or can be tested with a single sample. This means that the optimal signaling scheme obtained from the linear program in Algorithm 1 must satisfy , namely, the total probability of useful signals (signals in ) is 1. Then, by the splitting lemma, the prior can be expressed as the convex combination
(23) |
where is the unconditional probability of signal and is the true posterior induced by signal . Moreover, the indifference constraint (8) in the linear program ensures that the agent is indifferent between and upon receiving signal if the agent has bias level : mathematically, . This means by definition. So, we obtain
(24) |
B.2.2 Proof of Parts 2 and 3 of Theorem 5.2
We first prove that, if whether or can be tested with finite sample complexity, then for at least one .
According to Lemma 4.1, if we can test whether or with finite sample complexity using adaptive algorithms, then we can do this using a constant signaling scheme. Lemma 4.3 further ensures that we can do this using a constant signaling scheme with only boundary and internal signals. But according to Lemma 4.4, internal signals are not useful for testing or . So, the signaling scheme must send some boundary signal with positive probability. Let be the true posterior induced by . By the definition of boundary signal, , implying that the agent is indifferent between and some action if their belief is (and and are better than any other actions). This means , so by definition. Hence, .
We then prove the opposite direction: if for at least one , then whether or can be tested with finite sample complexity.
Let be an action for which . We claim that:
Claim B.1.
There exists a state for which the agent weakly prefers action over action if the true posterior is state with probability 1 and the agent has bias level . In notation, let be the vector whose th component is and other components are . The agent weakly prefers action over action under belief .
Proof.
Suppose on the contrary that no such state exists. Then the agent strictly prefers over under belief for every state . This implies that, for any belief , the agent should also strictly prefer over under the belief , due to linearity of the agent’s utility with respect to the belief. The agent strictly preferring over implies , so cannot be in by definition. This holds for any , so , a contradiction. ∎
Let be the state in the above claim. The prior can be trivially written as the convex combination of and for other states :
(25) |
Since the agent does not prefer under belief , the belief cannot be in the region . The prior is in the region . Consider the line segment connecting and the prior . There must exist a point on the line segment such that the -biased belief lies exactly on the boundary of . Clearly, the prior can also be written as a convex combination of and for :
(26) |
Then by the splitting lemma (Lemma 2.1), there exists a signaling scheme with signals where one signal induces posterior and the other signals induce posteriors . In particular, the signal inducing is a boundary signal since by construction. By Lemma 4.2, that signal is useful for testing or . When that signal is sent (which happens with positive probability at each time step), we can tell or . This finishes the proof.
The two directions proved above together prove the parts 2 and 3 of Theorem 5.2.
Appendix C A More General Bias Model
We define a more general model of biased belief than the linear model. The agent’s bias is captured by some function . Given prior , true posterior , and bias level , the agent has biased belief . The linear model is the special case where . We make the following natural assumptions on :
Assumption C.1.
-
•
(no bias), (full bias).
-
•
is continuous in .
We then make some joint assumptions on the bias model and the agent’s utility function . Recall that the notation is the region of beliefs under which the agent strictly prefers action , is the boundary of , and is the exterior of where the agent strictly not prefers .
Assumption C.2.
When receiving no information, the agent will take the default action: , .
Definition C.1.
We say that a posterior belief satisfies single-crossing if the curve passes the boundary only once: namely, there exists such that
(27) |
We assume that all posteriors outside satisfy single-crossing, and all posteriors inside do not cross the boundary when the bias level varies in :
Assumption C.3.
-
•
Any satisfies single-crossing.
-
•
For any , any , .
Under the above general bias model with the stated assumptions, our results regarding the optimality of constant signaling schemes (Lemma 4.1) and the revelation principle (Lemma 4.5) still hold. The geometric characterization of testability of bias (Theorem 5.2) holds after redefining some notations. Let still be the set of beliefs where the agent is indifferent between actions and . Let still be the set of posterior beliefs for which an agent with bias level will be indifferent between and , but with a more general expression than the linear model:
(28) |
With the above definition of , Theorem 5.2 still holds. We omit the proofs because they are almost identical to the proofs for the linear model.