Mimicry and the Emergence of Cooperative Communication

Abstract: In many situations, communication between agents is a critical component of cooperative multi-agent systems; however, it can be difficult to learn or evolve. In this paper, we investigate a simple way in which the emergence of communication may be facilitated. Namely, we explore what happens when agents can mimic preexisting, externally generated useful signals. The key idea here is that these signals incentivise listeners to develop positive responses that can then also be invoked by speakers mimicking those signals. This investigation starts with formalising this problem, and demonstrating that this form of mimicry changes optimisation dynamics and may provide the opportunity to escape non-communicative local optima. We then explore the problem empirically with a simulation in which spatially situated agents must communicate to collect resources. Our results show that both evolutionary optimisation and reinforcement learning may benefit from this intervention.

Submitted 26 May, 2024; originally announced May 2024.

Comments: Accepted for publication in the proceedings of the 2024 International Conference on Artificial Life (ALIFE24)

arXiv:2403.18415

The Topos of Transformer Networks

Authors: Mattia Jacopo Villani, Peter McBurney

Abstract: The transformer neural network has significantly out-shined all other neural network architectures as the engine behind large language models. We provide a theoretical analysis of the expressivity of the transformer architecture through the lens of topos theory. From this viewpoint, we show that many common neural network architectures, such as the convolutional, recurrent and graph convolutional networks, can be embedded in a pretopos of piecewise-linear functions, but that the transformer necessarily lives in its topos completion. In particular, this suggests that the two network families instantiate different fragments of logic: the former are first order, whereas transformers are higher-order reasoners. Furthermore, we draw parallels with architecture search and gradient descent, integrating our analysis in the framework of cybernetic agents.

Submitted 5 May, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

Comments: Requires major revision

arXiv:2402.16247 [pdf, other]

Learning Translations: Emergent Communication Pretraining for Cooperative Language Acquisition

Authors: Dylan Cope, Peter McBurney

Abstract: In Emergent Communication (EC), agents learn to communicate with one another, but the protocols that they develop are specialised to their training community. This observation led to research into Zero-Shot Coordination (ZSC) for learning communication strategies that are robust to agents not encountered during training. However, ZSC typically assumes that no prior data is available about the agents that will be encountered in the zero-shot setting. In many cases, this presents an unnecessarily hard problem and rules out communication via preestablished conventions. We propose a novel AI challenge called a Cooperative Language Acquisition Problem (CLAP) in which the ZSC assumptions are relaxed by allowing a 'joiner' agent to learn from a dataset of interactions between agents in a target community. We propose and compare two methods for solving CLAPs: Imitation Learning (IL), and Emergent Communication pretraining and Translation Learning (ECTL), in which an agent is trained in self-play with EC and then learns from the data to translate between the emergent protocol and the target community's protocol.

Submitted 25 February, 2024; originally announced February 2024.

arXiv:2305.12235 [pdf, ps, other]

Joining the Conversation: Towards Language Acquisition for Ad Hoc Team Play

Authors: Dylan Cope, Peter McBurney

Abstract: In this paper, we propose and consider the problem of cooperative language acquisition as a particular form of the ad hoc team play problem. We then present a probabilistic model for inferring a speaker's intentions and a listener's semantics from observing communications between a team of language-users. This model builds on the assumptions that speakers are engaged in positive signalling and listeners are exhibiting positive listening, which is to say the messages convey hidden information from the listener, that then causes them to change their behaviour. Further, it accounts for potential sub-optimality in the speaker's ability to convey the right information (according to the given task). Finally, we discuss further work for testing and developing this framework.

Submitted 20 May, 2023; originally announced May 2023.

Comments: Published as a workshop paper at EmeCom at ICLR 2022

arXiv:2305.12233 [pdf, ps, other]

A Measure of Explanatory Effectiveness

Authors: Dylan Cope, Peter McBurney

Abstract: In most conversations about explanation and AI, the recipient of the explanation (the explainee) is suspiciously absent, despite the problem being ultimately communicative in nature. We pose the problem of 'explaining AI systems' in terms of a two-player cooperative game in which each agent seeks to maximise our proposed measure of explanatory effectiveness. This measure serves as a foundation for the automated assessment of explanations, in terms of the effects that any given action in the game has on the internal state of the explainee.

Submitted 20 May, 2023; originally announced May 2023.

Comments: Presented at the 1st International Workshop on Trusted Automated Decision-Making (TADM) co-located with ETAPS 2021

arXiv:2305.09424 [pdf, ps, other]

Unwrapping All ReLU Networks

Authors: Mattia Jacopo Villani, Peter McBurney

Abstract: Deep ReLU Networks can be decomposed into a collection of linear models, each defined in a region of a partition of the input space. This paper provides three results extending this theory. First, we extend this linear decomposition to Graph Neural Networks and tensor convolutional networks, as well as networks with multiplicative interactions. Second, we provide proofs that neural networks can be understood as interpretable models such as Multivariate Decision trees and logical theories. Finally, we show how this model leads to computing cheap and exact SHAP values. We validate the theory through experiments on Graph Neural Networks.

Submitted 16 May, 2023; originally announced May 2023.

arXiv:2105.04666 [pdf, ps, other]

The Influence of Memory in Multi-Agent Consensus

Authors: David Kohan Marzagão, Luciana Basualdo Bonatto, Tiago Madeira, Marcelo Matheus Gauy, Peter McBurney

Abstract: Multi-agent consensus problems can often be seen as a sequence of autonomous and independent local choices between a finite set of decision options, with each local choice undertaken simultaneously, and with a shared goal of achieving a global consensus state. Being able to estimate probabilities for the different outcomes and to predict how long it takes for a consensus to be formed, if ever, are core issues for such protocols. Little attention has been given to protocols in which agents can remember past or outdated states. In this paper, we propose a framework to study what we call a memory consensus protocol. We show that the employment of memory allows such processes to always converge and, in some scenarios such as cycles, to converge faster. We provide a theoretical analysis of the probability of each option eventually winning such processes based on the initial opinions expressed by agents. Further, we perform experiments to investigate network topologies in which agents benefit from memory on the expected time needed for consensus.

Submitted 10 May, 2021; originally announced May 2021.

Comments: Accepted at AAAI 2021

ACM Class: I.2.11; G.3

arXiv:1708.09327 [pdf, ps, other]

doi 10.1007/978-3-319-09578-3_7

Spontaneous Segregation of Agents Across Double Auction Markets

Authors: Aleksandra Alorić, Peter Sollich, Peter McBurney

Abstract: In this paper we investigate the possibility of spontaneous segregation into groups of traders that have to choose among several markets. Even in the simplest case of two markets and Zero Intelligence traders, we are able to observe segregation effects below a critical value Tc of the temperature T; the latter regulates how strongly traders bias their decisions towards choices with large accumulated scores. It is notable that segregation occurs even though the traders are statistically homogeneous. Traders can in principle change their loyalty to a market, but the relevant persistence times become long below Tc.

Submitted 30 August, 2017; originally announced August 2017.

Comments: 12 pages, 7 figures; Artificial Economics 2014 conference; Published online: 17 October 2014

Journal ref: Advances in Artificial Economics (2015) pp 79-90. Lecture Notes in Economics and Mathematical Systems, vol 676. Springer, Cham

arXiv:1511.00740 [pdf, other]

Learning Unfair Trading: a Market Manipulation Analysis From the Reinforcement Learning Perspective

Authors: Enrique Martínez-Miranda, Peter McBurney, Matthew J. Howard

Abstract: Market manipulation is a strategy used by traders to alter the price of financial securities. One type of manipulation is based on the process of buying or selling assets by using several trading strategies; among them, spoofing is a popular strategy and is considered illegal by market regulators. Some promising tools have been developed to detect manipulation, but cases can still be found in the markets. In this paper we model spoofing and pinging trading, two strategies that differ in the legal background but share the same elemental concept of market manipulation. We use a reinforcement learning framework within the full and partial observability of Markov decision processes and analyse the underlying behaviour of the manipulators by finding the causes of what encourages the traders to perform fraudulent activities. This reveals procedures to counter the problem that may be helpful to market regulators as our model predicts the activity of spoofers.

Submitted 2 November, 2015; originally announced November 2015.

Comments: 7 pages, 4 figures, 3 tables

arXiv:1510.07927 [pdf, ps, other]

doi 10.1371/journal.pone.0154606

Emergence of Cooperative Long-term Market Loyalty in Double Auction Markets

Authors: Aleksandra Aloric, Peter Sollich, Peter McBurney, Tobias Galla

Abstract: Loyal buyer-seller relationships can arise by design, e.g. when a seller tailors a product to a specific market niche to accomplish the best possible returns, and buyers respond to the dedicated efforts the seller makes to meet their needs. We ask whether it is possible, instead, for loyalty to arise spontaneously, and in particular as a consequence of repeated interaction and co-adaptation among the agents in a market. We devise a stylized model of double auction markets and adaptive traders that incorporates these features. Traders choose where to trade (which market) and how to trade (to buy or to sell) based on their previous experience. We find that when the typical scale of market returns (or, at fixed scale of returns, the intensity of choice) becomes higher than some threshold, the preferred state of the system is segregated: both buyers and sellers are segmented into subgroups that are persistently loyal to one market over another. We characterize the segregated state analytically in the limit of large markets: it is stabilized by some agents acting cooperatively to enable trade, and provides higher rewards than its unsegregated counterpart both for individual traders and the population as a whole.

Submitted 30 August, 2017; v1 submitted 20 October, 2015; originally announced October 2015.

Comments: 33 pages, 11 figures; referee remarks included, published: April 27, 2016

arXiv:1301.3874 [pdf]

Risk Agoras: Dialectical Argumentation for Scientific Reasoning

Authors: Peter McBurney, Simon Parsons

Abstract: We propose a formal framework for intelligent systems which can reason about scientific domains, in particular about the carcinogenicity of chemicals, and we study its properties. Our framework is grounded in a philosophy of scientific enquiry and discourse, and uses a model of dialectical argumentation. The formalism enables representation of scientific uncertainty and conflict in a manner suitable for qualitative reasoning about the domain.

Submitted 16 January, 2013; originally announced January 2013.

Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

Report number: UAI-P-2000-PG-371-379

arXiv:1301.0585 [pdf]

Formalizing Scenario Analysis

Authors: Peter McBurney, Simon Parsons

Abstract: We propose a formal treatment of scenarios in the context of a dialectical argumentation formalism for qualitative reasoning about uncertain propositions. Our formalism extends prior work in which arguments for and against uncertain propositions were presented and compared in interaction spaces called Agoras. We now define the notion of a scenario in this framework and use it to define a set of qualitative uncertainty labels for propositions across a collection of scenarios. This work is intended to lead to a formal theory of scenarios and scenario analysis.

Submitted 12 December, 2012; originally announced January 2013.

Comments: Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

Report number: UAI-P-2002-PG-327-334

Showing 1–12 of 12 results for author: McBurney, P