Skip to main content

Showing 1–16 of 16 results for author: Christianos, F

.
  1. arXiv:2312.14878  [pdf, other

    cs.AI cs.LG

    Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning

    Authors: Filippos Christianos, Georgios Papoudakis, Matthieu Zimmer, Thomas Coste, Zhihao Wu, **gxuan Chen, Khyati Khandelwal, James Doran, Xidong Feng, Jiacheng Liu, Zheng Xiong, Yicheng Luo, Jianye Hao, Kun Shao, Haitham Bou-Ammar, Jun Wang

    Abstract: A key method for creating Artificial Intelligence (AI) agents is Reinforcement Learning (RL). However, constructing a standalone RL policy that maps perception to action directly encounters severe problems, chief among them being its lack of generality across multiple tasks and the need for a large amount of training data. The leading cause is that it cannot effectively integrate prior information… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: paper and appendix, 27 pages

  2. arXiv:2310.18127  [pdf, other

    cs.LG cs.AI cs.CL

    Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models

    Authors: Xue Yan, Yan Song, Xinyu Cui, Filippos Christianos, Haifeng Zhang, David Henry Mguni, Jun Wang

    Abstract: Large language models (LLMs) demonstrate their promise in tackling complicated practical challenges by combining action-based policies with chain of thought (CoT) reasoning. Having high-quality prompts on hand, however, is vital to the framework's effectiveness. Currently, these prompts are handcrafted utilising extensive human labor, resulting in CoT policies that frequently fail to generalise. H… ▽ More

    Submitted 28 February, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

  3. arXiv:2309.16347  [pdf, other

    cs.RO cs.CL cs.LG

    Intrinsic Language-Guided Exploration for Complex Long-Horizon Robotic Manipulation Tasks

    Authors: Eleftherios Triantafyllidis, Filippos Christianos, Zhibin Li

    Abstract: Current reinforcement learning algorithms struggle in sparse and complex environments, most notably in long-horizon manipulation tasks entailing a plethora of different sequences. In this work, we propose the Intrinsically Guided Exploration from Large Language Models (IGE-LLMs) framework. By leveraging LLMs as an assistive intrinsic reward, IGE-LLMs guides the exploratory process in reinforcement… ▽ More

    Submitted 7 March, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: Accepted at the International Conference on Robotics and Automation (ICRA), 2024. The manuscript consists of 10 pages and 6 figures

  4. arXiv:2305.05566  [pdf, other

    cs.LG cs.AI cs.MA

    SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning

    Authors: Adam Michalski, Filippos Christianos, Stefano V. Albrecht

    Abstract: There is a lack of standard benchmarks for Multi-Agent Reinforcement Learning (MARL) algorithms. The Starcraft Multi-Agent Challenge (SMAC) has been widely used in MARL research, but is built on top of a heavy, closed-source computer game, StarCraft II. Thus, SMAC is computationally expensive and requires knowledge and the use of proprietary tools specific to the game for any meaningful alteration… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  5. arXiv:2302.11793  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Revisiting the Gumbel-Softmax in MADDPG

    Authors: Callum Rhys Tilbury, Filippos Christianos, Stefano V. Albrecht

    Abstract: MADDPG is an algorithm in multi-agent reinforcement learning (MARL) that extends the popular single-agent method, DDPG, to multi-agent scenarios. Importantly, DDPG is an algorithm designed for continuous action spaces, where the gradient of the state-action value function exists. For this algorithm to work in discrete action spaces, discrete gradient estimation must be performed. For MADDPG, the G… ▽ More

    Submitted 14 June, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: Presented at AAMAS Workshop on Adaptive and Learning Agents, 2023

  6. arXiv:2210.14584  [pdf, other

    cs.LG cs.RO

    Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models

    Authors: Filippos Christianos, Peter Karkus, Boris Ivanovic, Stefano V. Albrecht, Marco Pavone

    Abstract: Reasoning with occluded traffic agents is a significant open challenge for planning for autonomous vehicles. Recent deep learning models have shown impressive results for predicting occluded agents based on the behaviour of nearby visible agents; however, as we show in experiments, these models are difficult to integrate into downstream planning. To this end, we propose Bi-level Variational Occlus… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: 7 pages, 6 figures

  7. arXiv:2209.14344  [pdf, other

    cs.LG cs.MA

    Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning

    Authors: Filippos Christianos, Georgios Papoudakis, Stefano V. Albrecht

    Abstract: This work focuses on equilibrium selection in no-conflict multi-agent games, where we specifically study the problem of selecting a Pareto-optimal Nash equilibrium among several existing equilibria. It has been shown that many state-of-the-art multi-agent reinforcement learning (MARL) algorithms are prone to converging to Pareto-dominated equilibria due to the uncertainty each agent has about the… ▽ More

    Submitted 14 October, 2023; v1 submitted 28 September, 2022; originally announced September 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR); Reviewed on OpenReview: https://openreview.net/forum?id=3AzqYa18ah

  8. arXiv:2208.01769  [pdf, other

    cs.MA cs.AI cs.LG

    Deep Reinforcement Learning for Multi-Agent Interaction

    Authors: Ibrahim H. Ahmed, Cillian Brewitt, Ignacio Carlucho, Filippos Christianos, Mhairi Dunion, Elliot Fosong, Samuel Garcin, Shangmin Guo, Balint Gyevnar, Trevor McInroe, Georgios Papoudakis, Arrasy Rahman, Lukas Schäfer, Massimiliano Tamborski, Giuseppe Vecchio, Cheng Wang, Stefano V. Albrecht

    Abstract: The development of autonomous agents which can interact with other agents to accomplish a given task is a core area of research in artificial intelligence and machine learning. Towards this goal, the Autonomous Agents Research Group develops novel machine learning algorithms for autonomous systems control, with a specific focus on deep reinforcement learning and multi-agent reinforcement learning.… ▽ More

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: Published in AI Communications Special Issue on Multi-Agent Systems Research in the UK

  9. arXiv:2207.02249  [pdf, other

    cs.MA cs.AI cs.LG

    Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning

    Authors: Lukas Schäfer, Filippos Christianos, Amos Storkey, Stefano V. Albrecht

    Abstract: Successful deployment of multi-agent reinforcement learning often requires agents to adapt their behaviour. In this work, we discuss the problem of teamwork adaptation in which a team of agents needs to adapt their policies to solve novel tasks with limited fine-tuning. Motivated by the intuition that agents need to be able to identify and distinguish tasks in order to adapt their behaviour to the… ▽ More

    Submitted 20 November, 2023; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: To be presented at the Seventh Workshop on Generalization in Planning at the NeurIPS 2023 conference

  10. arXiv:2107.08966  [pdf, other

    cs.LG cs.AI

    Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration

    Authors: Lukas Schäfer, Filippos Christianos, Josiah P. Hanna, Stefano V. Albrecht

    Abstract: Intrinsic rewards can improve exploration in reinforcement learning, but the exploration process may suffer from instability caused by non-stationary reward sha** and strong dependency on hyperparameters. In this work, we introduce Decoupled RL (DeRL) as a general framework which trains separate policies for intrinsically-motivated exploration and exploitation. Such decoupling allows DeRL to lev… ▽ More

    Submitted 9 February, 2022; v1 submitted 19 July, 2021; originally announced July 2021.

    Comments: Published at the International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS) 2022

  11. arXiv:2102.07475  [pdf, other

    cs.MA cs.LG

    Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing

    Authors: Filippos Christianos, Georgios Papoudakis, Arrasy Rahman, Stefano V. Albrecht

    Abstract: Sharing parameters in multi-agent deep reinforcement learning has played an essential role in allowing algorithms to scale to a large number of agents. Parameter sharing between agents significantly decreases the number of trainable parameters, shortening training times to tractable levels, and has been linked to more efficient learning. However, having all agents share the same parameters can als… ▽ More

    Submitted 12 June, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: To be published In Proceedings of the 38th International Conference on Machine Learning (ICML), 2021

  12. arXiv:2006.10412  [pdf, other

    cs.LG cs.MA stat.ML

    Towards Open Ad Hoc Teamwork Using Graph-based Policy Learning

    Authors: Arrasy Rahman, Niklas Höpner, Filippos Christianos, Stefano V. Albrecht

    Abstract: Ad hoc teamwork is the challenging problem of designing an autonomous agent which can adapt quickly to collaborate with teammates without prior coordination mechanisms, including joint training. Prior work in this area has focused on closed teams in which the number of agents is fixed. In this work, we consider open teams by allowing agents with different fixed policies to enter and leave the envi… ▽ More

    Submitted 9 June, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: Published in the 38th International Conference on Machine Learning (ICML 2021)

  13. arXiv:2006.09447  [pdf, other

    cs.LG cs.MA stat.ML

    Agent Modelling under Partial Observability for Deep Reinforcement Learning

    Authors: Georgios Papoudakis, Filippos Christianos, Stefano V. Albrecht

    Abstract: Modelling the behaviours of other agents is essential for understanding how agents interact and making effective decisions. Existing methods for agent modelling commonly assume knowledge of the local observations and chosen actions of the modelled agents during execution. To eliminate this assumption, we extract representations from the local information of the controlled agent using encoder-decod… ▽ More

    Submitted 9 November, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Published in the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  14. arXiv:2006.07869  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks

    Authors: Georgios Papoudakis, Filippos Christianos, Lukas Schäfer, Stefano V. Albrecht

    Abstract: Multi-agent deep reinforcement learning (MARL) suffers from a lack of commonly-used evaluation tasks and criteria, making comparisons between approaches difficult. In this work, we provide a systematic evaluation and comparison of three different classes of MARL algorithms (independent learning, centralised multi-agent policy gradient, value decomposition) in a diverse range of cooperative multi-a… ▽ More

    Submitted 9 November, 2021; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: Published in 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks

  15. arXiv:2006.07169  [pdf, other

    cs.MA cs.LG

    Shared Experience Actor-Critic for Multi-Agent Reinforcement Learning

    Authors: Filippos Christianos, Lukas Schäfer, Stefano V. Albrecht

    Abstract: Exploration in multi-agent reinforcement learning is a challenging problem, especially in environments with sparse rewards. We propose a general method for efficient exploration by sharing experience amongst agents. Our proposed algorithm, called Shared Experience Actor-Critic (SEAC), applies experience sharing in an actor-critic framework. We evaluate SEAC in a collection of sparse-reward multi-a… ▽ More

    Submitted 19 May, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: Published in 34th Conference on Neural Information Processing Systems (NeurIPS), see https://proceedings.neurips.cc/paper/2020/hash/7967cc8e3ab559e68cc944c44b1cf3e8-Abstract.html - This updated version of the paper is identical to the original paper published at NeurIPS 2020 but includes minor clarifications following recommendations in http://agents.inf.ed.ac.uk/blog/multiagent-rl-inaccuracies/

    Journal ref: Advances in Neural Information Processing System 33 (2020) 10707-10717

  16. arXiv:1906.04737  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning

    Authors: Georgios Papoudakis, Filippos Christianos, Arrasy Rahman, Stefano V. Albrecht

    Abstract: Recent developments in deep reinforcement learning are concerned with creating decision-making agents which can perform well in various complex domains. A particular approach which has received increasing attention is multi-agent reinforcement learning, in which multiple agents learn concurrently to coordinate their actions. In such multi-agent environments, additional learning problems arise due… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.