-
A Non-Terminating Game of Beggar-My-Neighbor
Authors:
Brayden Casella,
Philip M. Anderson,
Michael Kleber,
Richard P. Mann,
Reed Nessler,
William Rucklidge,
Samuel G. Williams,
Nicolas Wu
Abstract:
We demonstrate the existence of a non-terminating game of Beggar-My-Neighbor, discovered by lead author Brayden Casella. We detail the method for constructing this game and identify a cyclical structure of 62 tricks that is reached by 30 distinct starting hands. We further present a short history of the search for this solution since the problem was posed, and a record of previously found longest terminating games. The existence of this non-terminating game provides a solution to a long-standing question which John H. Conway called an `anti-Hilbert problem.'
Submitted 19 March, 2024;
originally announced March 2024.
-
Optimising collective accuracy among rational individuals in sequential decision-making with competition
Authors:
Richard P Mann
Abstract:
Theoretical results underpinning the Wisdom of Crowds, such as the Condorcet Jury Theorem, point to substantial accuracy gains through aggregation of decisions or opinions, but the foundations of this theorem are routinely undermined in circumstances where individuals are able to adapt their own choices after observing what other agents have chosen. In sequential decision-making, rational agents use the choices of others as a source of information about the correct decision, creating powerful correlations between different agents' choices that violate the assumptions of independence on which the Condorcet Jury Theorem depends. In this paper I show how such correlations emerge when agents are rewarded solely based on their individual accuracy, and the impact of this on collective accuracy. I then demonstrate how a simple competitive reward scheme, where agents' rewards are greater if they correctly choose options that few have already chosen, can induce rational agents to make independent choices, returning the group to optimal levels of collective accuracy. I further show that this reward scheme is robust, offering improvements to collective accuracy across a wide range of competition strengths, suggesting that such schemes could be effectively implemented in real-world contexts to improve collective wisdom.
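As a rough illustration of the accuracy gain that the Condorcet Jury Theorem describes, the sketch below computes the probability that a simple majority is correct. It is not taken from the paper: the function name `majority_accuracy` and the parameters `n` and `p` are hypothetical, and the calculation assumes an odd number of fully independent voters who each choose correctly with the same probability `p`. That independence is exactly the assumption the abstract says breaks down in sequential decision-making.

```python
from math import comb


def majority_accuracy(n: int, p: float) -> float:
    """Probability that a strict majority of n voters is correct.

    Assumes (for illustration only) that votes are independent and that
    every voter is correct with the same probability p.
    """
    if n < 1 or n % 2 == 0:
        raise ValueError("n must be a positive odd integer so that ties cannot occur")
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be a probability between 0 and 1")
    # Sum the binomial probabilities of k correct votes for every k above n/2.
    return sum(
        comb(n, k) * p**k * (1 - p) ** (n - k)
        for k in range(n // 2 + 1, n + 1)
    )


if __name__ == "__main__":
    # Compare a lone voter with progressively larger independent groups.
    for size in (1, 3, 11, 101):
        print(size, majority_accuracy(size, 0.6))
```

Under these assumptions, majority accuracy rises with group size whenever `p` exceeds 0.5. Correlated choices of the kind the abstract describes would not follow this binomial model.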
Submitted 10 September, 2022;
originally announced September 2022.
-
Collective decision-making under changing social environments among agents adapted to sparse connectivity
Authors:
Richard P. Mann
Abstract:
Humans and other animals often follow the decisions made by others because these are indicative of the quality of possible choices, resulting in `social response rules': observed relationships between the probability that an agent will make a specific choice and the decisions other individuals have made. The form of social responses can be understood by considering the behaviour of rational agents that seek to maximise their expected utility using both social and private information. Previous derivations of social responses assume that agents observe all others within a group, but real interaction networks are often characterised by sparse connectivity. Here I analyse the observable behaviour of rational agents that attend to the decisions made by a subset of others in the group. This reveals an adaptive strategy in sparsely-connected networks based on highly-simplified social information: the difference in the observed number of agents choosing each option. Where agents employ this strategy, collective outcomes and decision-making efficacy are controlled by the social connectivity at the time of the decision, rather than that to which the agents are accustomed, providing an important caveat for sociality observed in the laboratory and suggesting a basis for the social dynamics of highly-connected online communities.
Submitted 26 October, 2021;
originally announced October 2021.
-
Modeling the effects of environmental and perceptual uncertainty using deterministic reinforcement learning dynamics with partial observability
Authors:
Wolfram Barfuss,
Richard P. Mann
Abstract:
Assessing the systemic effects of uncertainty that arises from agents' partial observation of the true states of the world is critical for understanding a wide range of scenarios. Yet, previous modeling work on agent learning and decision-making either lacks a systematic way to describe this source of uncertainty or puts the focus on obtaining optimal policies using complex models of the world that would impose an unrealistically high cognitive demand on real agents. In this work we aim to efficiently describe the emergent behavior of biologically plausible and parsimonious learning agents faced with partially observable worlds. Therefore we derive and present deterministic reinforcement learning dynamics where the agents observe the true state of the environment only partially. We showcase the broad applicability of our dynamics across different classes of partially observable agent-environment systems. We find that partial observability creates unintuitive benefits in a number of specific contexts, pointing the way to further research on a general understanding of such effects. For instance, partially observant agents can learn better outcomes faster, in a more stable way and even overcome social dilemmas. Furthermore, our method allows the application of dynamical systems theory to partially observable multiagent learning. In this regard we find the emergence of catastrophic limit cycles, a critical slowing down of the learning processes between reward regimes and the separation of the learning dynamics into fast and slow directions, all caused by partial observability. Therefore, the presented dynamics have the potential to become a formal, yet practical, lightweight and robust tool for researchers in biology, social science and machine learning to systematically investigate the effects of interacting partially observant agents.
Submitted 14 April, 2022; v1 submitted 15 September, 2021;
originally announced September 2021.
-
Optimal incentives for collective intelligence
Authors:
Richard P. Mann,
Dirk Helbing
Abstract:
Collective intelligence is the ability of a group to perform more effectively than any individual alone. Diversity among group members is a key condition for the emergence of collective intelligence, but maintaining diversity is challenging in the face of social pressure to imitate one's peers. We investigate the role incentives play in maintaining useful diversity through an evolutionary game-theoretic model of collective prediction. We show that market-based incentive systems produce herding effects, reduce information available to the group and suppress collective intelligence. In response, we propose a new incentive scheme that rewards accurate minority predictions, and show that this produces optimal diversity and collective predictive accuracy. We conclude that real-world systems should reward those who have demonstrated accuracy when majority opinion has been in error.
Submitted 17 October, 2017; v1 submitted 11 November, 2016;
originally announced November 2016.
-
Towards a fully predictive model of flight paths in pigeons navigating in the familiar area: prediction across differing individuals
Authors:
Richard P. Mann
Abstract:
This paper will detail the basis of our previously developed predictive model for pigeon flight paths based on observations of the specific individual being predicted. We will then describe how this model can be adapted to predict the flight of a new, unobserved bird, based on observations of other individuals from the same release site. We will test the accuracy of these predictions relative to naive models with no previous flight information and those trained on the focal bird's own previous flights, and discuss the implications of these results for the nature of navigational cue use in the familiar area. Finally we will discuss how visual cues may be explicitly encoded in the model in future work.
Submitted 20 April, 2016;
originally announced April 2016.
-
Escape path complexity and its context dependency in Pacific blue-eyes (Pseudomugil signifer)
Authors:
James E. Herbert-Read,
Ashley J. W. Ward,
David J. T. Sumpter,
Richard P. Mann
Abstract:
The escape trajectories animals take following a predatory attack appear to show a high degree of 'randomness' - a property that has been described as 'protean behaviour'. Here we present a method of quantifying the escape trajectories of individual animals using a path complexity approach. When fish (Pseudomugil signifer) were attacked either on their own or in groups, we found that an individual's path rapidly increases in entropy (our measure of complexity) following the attack. For individuals on their own, this entropy remains elevated (indicating a more random path) for a sustained period (10 seconds) after the attack, whilst it falls more quickly for individuals in groups. The entropy of the path is context dependent. When attacks towards single fish come from greater distances, a fish's path shows less complexity compared to attacks that come from short range. This context dependency effect did not exist, however, when individuals were in groups. Nor did the path complexity of individuals in groups depend on a fish's local density of neighbours. We separate out the components of speed and direction changes to determine which of these components contributes to the overall increase in path complexity following an attack. We found that both speed and direction measures contribute similarly to the complexity of an individual's path in absolute terms. Our work highlights the adaptive behavioural tactics that animals use to avoid predators and also provides a novel method for quantifying the escape trajectories of animals.
Submitted 26 February, 2015;
originally announced February 2015.
-
The entropic basis of collective behaviour
Authors:
Richard P. Mann,
Roman Garnett
Abstract:
In this paper, we identify a radically new viewpoint on the collective behaviour of groups of intelligent agents. We first develop a highly general abstract model for the possible future lives that these agents may encounter as a result of their decisions. In the context of these possible futures, we show that the causal entropic principle, whereby agents follow behavioural rules that maximise their entropy over all paths through the future, predicts many of the observed features of social interactions between individuals in both human and animal groups. Our results indicate that agents are often able to maximise their future path entropy by remaining cohesive as a group, and that this cohesion leads to collectively intelligent outcomes that depend strongly on the distribution of the number of future paths that are possible. We derive social interaction rules that are consistent with maximum-entropy group behaviour for both discrete and continuous decision spaces. Our analysis further predicts that social interactions are likely to be fundamentally based on Weber's law of response to proportional stimuli, supporting many studies that find a neurological basis for this stimulus-response mechanism, and providing a novel basis for the common assumption of linearly additive 'social forces' in simulation studies of collective behaviour.
Submitted 24 September, 2014;
originally announced September 2014.
-
A note on the duality between interaction responses and mutual positions in flocking and schooling
Authors:
Andrea Perna,
Guillaume Gregoire,
Richard P. Mann
Abstract:
Background: Recent research in animal behaviour has helped to determine how alignment, turning responses, and changes of speed mediate flocking and schooling interactions in different animal species. Here, we address specifically the problem of what interaction responses support different nearest neighbour configurations in terms of mutual position and distance. Results: We find that the different interaction rules observed in different animal species may be a simple consequence of the relative positions that individuals assume when they move together, and of the noise inherent in the movement of animals, or associated with tracking inaccuracy. Conclusions: The anisotropic positioning of individuals with respect to their neighbours, in combination with noise, can explain several aspects of the movement responses observed in real animal groups, and should be considered explicitly in future models of flocking and schooling. By making a distinction between interaction responses involved in maintaining a preferred flock configuration, and interaction responses directed at changing it, we provide a frame to discriminate movement interactions that signal directional conflict from those underlying consensual group motion.
Submitted 4 July, 2014; v1 submitted 28 April, 2014;
originally announced April 2014.