-
Assisting Unknown Teammates in Unknown Tasks: Ad Hoc Teamwork under Partial Observability
Authors:
João G. Ribeiro,
Cassandro Martinho,
Alberto Sardinha,
Francisco S. Melo
Abstract:
In this paper, we present a novel Bayesian online prediction algorithm for the problem setting of ad hoc teamwork under partial observability (ATPO), which enables on-the-fly collaboration with unknown teammates performing an unknown task without needing a pre-coordination protocol. Unlike previous works that assume a fully observable state of the environment, ATPO accommodates partial observabili…
▽ More
In this paper, we present a novel Bayesian online prediction algorithm for the problem setting of ad hoc teamwork under partial observability (ATPO), which enables on-the-fly collaboration with unknown teammates performing an unknown task without needing a pre-coordination protocol. Unlike previous works that assume a fully observable state of the environment, ATPO accommodates partial observability, using the agent's observations to identify which task is being performed by the teammates. Our approach assumes neither that the teammate's actions are visible nor an environment reward signal. We evaluate ATPO in three domains -- two modified versions of the Pursuit domain with partial observability and the overcooked domain. Our results show that ATPO is effective and robust in identifying the teammate's task from a large library of possible tasks, efficient at solving it in near-optimal time, and scalable in adapting to increasingly larger problem sizes.
△ Less
Submitted 10 January, 2022;
originally announced January 2022.
-
An enhanced simulation-based iterated local search metaheuristic for gravity fed water distribution network design optimization
Authors:
Willian C. S. Martinho,
Rafael A. Melo,
Kenneth Sörensen
Abstract:
The gravity fed water distribution network design (WDND) optimization problem consists in determining the pipe diameters of a water network such that hydraulic constraints are satisfied and the total cost is minimized. Traditionally, such design decisions are made on the basis of expert experience. When networks increase in size, however, rules of thumb will rarely lead to near optimal decisions.…
▽ More
The gravity fed water distribution network design (WDND) optimization problem consists in determining the pipe diameters of a water network such that hydraulic constraints are satisfied and the total cost is minimized. Traditionally, such design decisions are made on the basis of expert experience. When networks increase in size, however, rules of thumb will rarely lead to near optimal decisions. Over the past thirty years, a large number of techniques have been developed to tackle the problem of optimally designing a water distribution network. In this paper, we tackle the NP-hard water distribution network design (WDND) optimization problem in a multi-period setting where time varying demand patterns occur. We propose a new simulation-based iterated local search metaheuristic which further explores the structure of the problem in an attempt to obtain high quality solutions. Computational experiments show that our approach is very competitive as it is able to improve over a state-of-the-art metaheuristic for most of the performed tests. Furthermore, it converges much faster to low cost solutions and demonstrates a more robust performance in that it obtains smaller deviations from the best known solutions.
△ Less
Submitted 7 June, 2021; v1 submitted 2 September, 2020;
originally announced September 2020.
-
The Influence of Reward on the Social Valence of Interactions
Authors:
Tomás Alves,
Samuel Gomes,
João Dias,
Carlos Martinho
Abstract:
Throughout the years, social norms have been promoted as an informal enforcement mechanism for achieving beneficial collective outcomes. Among the most used methods to foster interactions, framing the context of a situation or setting in-game rules have shown strong results as mediators on how an individual interacts with their peers. Nevertheless, we found that there is a lack of research regardi…
▽ More
Throughout the years, social norms have been promoted as an informal enforcement mechanism for achieving beneficial collective outcomes. Among the most used methods to foster interactions, framing the context of a situation or setting in-game rules have shown strong results as mediators on how an individual interacts with their peers. Nevertheless, we found that there is a lack of research regarding the use of incentives such as scores to promote social interactions differing in valence. Weighing how incentives influence in-game behavior, we propose the use of rewards to promote interactions varying in valence, i.e. positive or negative, in a two-player scenario. To do so, we defined social valence as a continuous scale with two poles represented by Complicate and Help. Then, we performed user tests where participants where asked to play a game with two reward-based systems to test on whether the scoring system influenced the social interaction valence. The results indicate that the developed reward-based systems were able to foster interactions diverging in social valence scores, providing insights on how factors such as incentives overlap individual's established social norms. These findings empower game developers and designers with a low-cost and effective policy tool that is able to promote in-game behavior changes.
△ Less
Submitted 31 March, 2020; v1 submitted 27 March, 2020;
originally announced March 2020.
-
Reward-Mediated Individual and Altruistic Behavior
Authors:
Samuel Gomes,
Tomás Alves,
João Dias,
Carlos Martinho
Abstract:
Recent research has taken particular interest in observing the dynamics between altruistic and individual behavior. This is a commonly approached problem when reasoning about social dilemmas, which have a plethora of real world counterparts in the fields of education, health and economics. Weighing how incentives influence in-game behavior, our study examines individual and altruistic interactions…
▽ More
Recent research has taken particular interest in observing the dynamics between altruistic and individual behavior. This is a commonly approached problem when reasoning about social dilemmas, which have a plethora of real world counterparts in the fields of education, health and economics. Weighing how incentives influence in-game behavior, our study examines individual and altruistic interactions, by analyzing the players' strategies and interaction motives when facing different reward attribution strategies. Consequently, a model for interaction motives is also proposed, with the premise that the motives for interactions can be defined as a continuous space, ranging from self-oriented (associated to self-improvement behaviors) to others-oriented (associated to extreme altruism behaviors) motives. To evaluate the promotion of individual and altruistic behavior, we leverage Message Across, an in-loco two-player videogame with adaptable reward attribution systems. We conducted several user tests (N = 66) to verify to what extent individual and altruistic reward attribution systems led players to vary their strategies and motives orientation. Our results indicate that players' strategies and self-reported orientation of interaction motives varied highly significantly upon the deployment of individual and altruistic reward systems, which leads us to believe on the suitability of applying an incentive-based strategy to moderate the emergence of individual and altruistic behavior in games.
△ Less
Submitted 31 March, 2020; v1 submitted 21 March, 2020;
originally announced March 2020.
-
Dynamic Social Interaction Mechanics CrossAnt
Authors:
Samuel Gomes,
Carlos Martinho,
João Dias
Abstract:
Nowadays, big effort is being put to study gamification and how game elements can be used to engage players. In this scope, we believe there is a growing need to explore the impact game mechanics have on the players' interactions and perception. This work focuses on the application of game mechanics to lead players to achieve certain types of social interaction (we named this type of mechanics soc…
▽ More
Nowadays, big effort is being put to study gamification and how game elements can be used to engage players. In this scope, we believe there is a growing need to explore the impact game mechanics have on the players' interactions and perception. This work focuses on the application of game mechanics to lead players to achieve certain types of social interaction (we named this type of mechanics social interaction mechanics). A word matching game called CrossAnt was modified so that it could dynamically generate different social interaction mechanics. These mechanics consisted in different key combinations needed to play the game and were aimed to promote what we think are three important types of social interactions: cooperation, competition and individual exploration. Our evaluation consisted on the execution of several sessions where two players interacted with the game for several levels and had to find for themselves how to perform the actions needed to succeed. While some of the levels required the input from both players in order to be completed, others could be completed by each player independently. Our results show that cooperation was perceived when both players had to intervene to perform the game actions. However, longer interactions may still be needed so that the other types of interactions are promoted.
△ Less
Submitted 24 March, 2019; v1 submitted 17 November, 2018;
originally announced November 2018.