Showing 1–11 of 11 results for author: Papoudakis, G

  1. arXiv:2312.14878  [pdf, other]

    cs.AI cs.LG

    Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning

    Authors: Filippos Christianos, Georgios Papoudakis, Matthieu Zimmer, Thomas Coste, Zhihao Wu, Jingxuan Chen, Khyati Khandelwal, James Doran, Xidong Feng, Jiacheng Liu, Zheng Xiong, Yicheng Luo, Jianye Hao, Kun Shao, Haitham Bou-Ammar, Jun Wang

    Abstract: A key method for creating Artificial Intelligence (AI) agents is Reinforcement Learning (RL). However, constructing a standalone RL policy that maps perception to action directly encounters severe problems, chief among them being its lack of generality across multiple tasks and the need for a large amount of training data. The leading cause is that it cannot effectively integrate prior information…

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: paper and appendix, 27 pages

  2. arXiv:2212.11498  [pdf, other]

    cs.LG cs.AI cs.MA cs.RO

    Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers

    Authors: Aleksandar Krnjaic, Raul D. Steleac, Jonathan D. Thomas, Georgios Papoudakis, Lukas Schäfer, Andrew Wing Keung To, Kuan-Ho Lao, Murat Cubuktepe, Matthew Haley, Peter Börsting, Stefano V. Albrecht

    Abstract: We envision a warehouse in which dozens of mobile robots and human pickers work together to collect and deliver items within the warehouse. The fundamental problem we tackle, called the order-picking problem, is how these worker agents must coordinate their movement and actions in the warehouse to maximise performance (e.g. order throughput). Established industry methods using heuristic approaches…

    Submitted 7 July, 2023; v1 submitted 22 December, 2022; originally announced December 2022.

  3. arXiv:2209.14344  [pdf, other]

    cs.LG cs.MA

    Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning

    Authors: Filippos Christianos, Georgios Papoudakis, Stefano V. Albrecht

    Abstract: This work focuses on equilibrium selection in no-conflict multi-agent games, where we specifically study the problem of selecting a Pareto-optimal Nash equilibrium among several existing equilibria. It has been shown that many state-of-the-art multi-agent reinforcement learning (MARL) algorithms are prone to converging to Pareto-dominated equilibria due to the uncertainty each agent has about the…

    Submitted 14 October, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR); Reviewed on OpenReview: https://openreview.net/forum?id=3AzqYa18ah

  4. arXiv:2208.01769  [pdf, other]

    cs.MA cs.AI cs.LG

    Deep Reinforcement Learning for Multi-Agent Interaction

    Authors: Ibrahim H. Ahmed, Cillian Brewitt, Ignacio Carlucho, Filippos Christianos, Mhairi Dunion, Elliot Fosong, Samuel Garcin, Shangmin Guo, Balint Gyevnar, Trevor McInroe, Georgios Papoudakis, Arrasy Rahman, Lukas Schäfer, Massimiliano Tamborski, Giuseppe Vecchio, Cheng Wang, Stefano V. Albrecht

    Abstract: The development of autonomous agents which can interact with other agents to accomplish a given task is a core area of research in artificial intelligence and machine learning. Towards this goal, the Autonomous Agents Research Group develops novel machine learning algorithms for autonomous systems control, with a specific focus on deep reinforcement learning and multi-agent reinforcement learning.…

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: Published in AI Communications Special Issue on Multi-Agent Systems Research in the UK

  5. arXiv:2102.07475  [pdf, other]

    cs.MA cs.LG

    Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing

    Authors: Filippos Christianos, Georgios Papoudakis, Arrasy Rahman, Stefano V. Albrecht

    Abstract: Sharing parameters in multi-agent deep reinforcement learning has played an essential role in allowing algorithms to scale to a large number of agents. Parameter sharing between agents significantly decreases the number of trainable parameters, shortening training times to tractable levels, and has been linked to more efficient learning. However, having all agents share the same parameters can als…

    Submitted 12 June, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: To be published in Proceedings of the 38th International Conference on Machine Learning (ICML), 2021

  6. arXiv:2006.09447  [pdf, other]

    cs.LG cs.MA stat.ML

    Agent Modelling under Partial Observability for Deep Reinforcement Learning

    Authors: Georgios Papoudakis, Filippos Christianos, Stefano V. Albrecht

    Abstract: Modelling the behaviours of other agents is essential for understanding how agents interact and making effective decisions. Existing methods for agent modelling commonly assume knowledge of the local observations and chosen actions of the modelled agents during execution. To eliminate this assumption, we extract representations from the local information of the controlled agent using encoder-decod…

    Submitted 9 November, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Published in the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  7. arXiv:2006.07869  [pdf, other]

    cs.LG cs.AI cs.MA stat.ML

    Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks

    Authors: Georgios Papoudakis, Filippos Christianos, Lukas Schäfer, Stefano V. Albrecht

    Abstract: Multi-agent deep reinforcement learning (MARL) suffers from a lack of commonly-used evaluation tasks and criteria, making comparisons between approaches difficult. In this work, we provide a systematic evaluation and comparison of three different classes of MARL algorithms (independent learning, centralised multi-agent policy gradient, value decomposition) in a diverse range of cooperative multi-a…

    Submitted 9 November, 2021; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: Published in the 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks

  8. arXiv:2001.10829  [pdf, other]

    cs.LG stat.ML

    Variational Autoencoders for Opponent Modeling in Multi-Agent Systems

    Authors: Georgios Papoudakis, Stefano V. Albrecht

    Abstract: Multi-agent systems exhibit complex behaviors that emanate from the interactions of multiple agents in a shared environment. In this work, we are interested in controlling one agent in a multi-agent system and successfully learn to interact with the other agents that have fixed policies. Modeling the behavior of other agents (opponents) is essential in understanding the interactions of the agents…

    Submitted 29 January, 2020; originally announced January 2020.

    Comments: AAAI-20 Workshop on Reinforcement Learning in Games

  9. arXiv:1906.04737  [pdf, other]

    cs.LG cs.AI cs.MA stat.ML

    Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning

    Authors: Georgios Papoudakis, Filippos Christianos, Arrasy Rahman, Stefano V. Albrecht

    Abstract: Recent developments in deep reinforcement learning are concerned with creating decision-making agents which can perform well in various complex domains. A particular approach which has received increasing attention is multi-agent reinforcement learning, in which multiple agents learn concurrently to coordinate their actions. In such multi-agent environments, additional learning problems arise due…

    Submitted 11 June, 2019; originally announced June 2019.

  10. arXiv:1807.01960  [pdf, other]

    cs.LG cs.AI stat.ML

    Deep Reinforcement Learning for Doom using Unsupervised Auxiliary Tasks

    Authors: Georgios Papoudakis, Kyriakos C. Chatzidimitriou, Pericles A. Mitkas

    Abstract: Recent developments in deep reinforcement learning have enabled the creation of agents for solving a large variety of games given a visual input. These methods have been proven successful for 2D games, like the Atari games, or for simple tasks, like navigating in mazes. It is still an open question, how to address more complex environments, in which the reward is sparse and the state space is huge…

    Submitted 5 July, 2018; originally announced July 2018.

    Comments: 4 pages, 3 figures, 3 tables

  11. A generative model for sparse, evolving digraphs

    Authors: Georgios Papoudakis, Philippe Preux, Martin Monperrus

    Abstract: Generating graphs that are similar to real ones is an open problem, while the similarity notion is quite elusive and hard to formalize. In this paper, we focus on sparse digraphs and propose SDG, an algorithm that aims at generating graphs similar to real ones. Since real graphs are evolving and this evolution is important to study in order to understand the underlying dynamical system, we tackle…

    Submitted 17 October, 2017; originally announced October 2017.

    Journal ref: 6th International Conference on Complex Networks and their applications, Nov 2017, Lyon, France