Showing 1–29 of 29 results for author: Norman, T

  1. arXiv:2406.07277  [pdf, other]

    cs.CL cs.AI cs.MA

    Speaking Your Language: Spatial Relationships in Interpretable Emergent Communication

    Authors: Olaf Lipinski, Adam J. Sobey, Federico Cerutti, Timothy J. Norman

    Abstract: Effective communication requires the ability to refer to specific parts of an observation in relation to others. While emergent communication literature shows success in developing various language properties, no research has shown the emergence of such positional references. This paper demonstrates how agents can communicate about spatial relationships within their observations. The results indic…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 16 pages, 3 figures

  2. arXiv:2402.11653  [pdf, other]

    cs.AI cs.DC cs.NI

    Combinatorial Client-Master Multiagent Deep Reinforcement Learning for Task Offloading in Mobile Edge Computing

    Authors: Tesfay Zemuy Gebrekidan, Sebastian Stein, Timothy J. Norman

    Abstract: Recently, there has been an explosion of mobile applications that perform computationally intensive tasks such as video streaming, data mining, virtual reality, augmented reality, image processing, video processing, face recognition, and online gaming. However, user devices (UDs), such as tablets and smartphones, have a limited ability to perform the computation needs of the tasks. Mobile edge com…

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: 11 pages, 5 figures, 2 tables

    ACM Class: I.2.11

  3. arXiv:2402.04898  [pdf]

    cs.AI cs.LG

    The Strain of Success: A Predictive Model for Injury Risk Mitigation and Team Success in Soccer

    Authors: Gregory Everett, Ryan Beal, Tim Matthews, Timothy J. Norman, Sarvapali D. Ramchurn

    Abstract: In this paper, we present a novel sequential team selection model in soccer. Specifically, we model the stochastic process of player injury and unavailability using player-specific information learned from real-world soccer data. Monte-Carlo Tree Search is used to select teams for games that optimise long-term team performance across a soccer season by reasoning over player injury probability. We…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 19 pages (16 main, 2 references, 1 appendix), 10 figures (9 main, 1 appendix). Accepted at the MIT Sloan Sports Analytics Conference 2024 Research Paper Competition

  4. arXiv:2401.11202  [pdf, other]

    cs.LG cs.DC cs.PL

    PartIR: Composing SPMD Partitioning Strategies for Machine Learning

    Authors: Sami Alabed, Daniel Belov, Bart Chrzaszcz, Juliana Franco, Dominik Grewe, Dougal Maclaurin, James Molloy, Tom Natan, Tamara Norman, Xiaoyue Pan, Adam Paszke, Norman A. Rink, Michael Schaarschmidt, Timur Sitdikov, Agnieszka Swietlik, Dimitrios Vytiniotis, Joel Wee

    Abstract: Training of modern large neural networks (NN) requires a combination of parallelization strategies encompassing data, model, or optimizer sharding. When strategies increase in complexity, it becomes necessary for partitioning tools to be 1) expressive, allowing the composition of simpler strategies, and 2) predictable to estimate performance analytically. We present PartIR, our design for a NN par…

    Submitted 3 March, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

  5. arXiv:2312.15667  [pdf, other]

    cs.MA cs.AI

    TAPE: Leveraging Agent Topology for Cooperative Multi-Agent Policy Gradient

    Authors: Xingzhou Lou, Junge Zhang, Timothy J. Norman, Kaiqi Huang, Yali Du

    Abstract: Multi-Agent Policy Gradient (MAPG) has made significant progress in recent years. However, centralized critics in state-of-the-art MAPG methods still face the centralized-decentralized mismatch (CDM) issue, which means sub-optimal actions by some agents will affect other agents' policy learning. While using individual critics for policy updates can avoid this issue, they severely limit cooperation…

    Submitted 15 January, 2024; v1 submitted 25 December, 2023; originally announced December 2023.

  6. arXiv:2310.06555  [pdf, other]

    cs.CL cs.AI cs.LG cs.MA

    It's About Time: Temporal References in Emergent Communication

    Authors: Olaf Lipinski, Adam J. Sobey, Federico Cerutti, Timothy J. Norman

    Abstract: Emergent communication studies the development of language between autonomous agents, aiming to improve understanding of natural language evolution and increase communication efficiency. While temporal aspects of language have been considered in computational linguistics, there has been no research on temporal references in emergent communication. This paper addresses this gap, by exploring how ag…

    Submitted 3 May, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: 26 pages main body and 36 pages supplementary material, 8 figures in main body. Code available at https://github.com/olipinski/TRG

  7. arXiv:2305.08664  [pdf, other]

    cs.AI

    MADDM: Multi-Advisor Dynamic Binary Decision-Making by Maximizing the Utility

    Authors: Zhaori Guo, Timothy J. Norman, Enrico H. Gerding

    Abstract: Being able to infer ground truth from the responses of multiple imperfect advisors is a problem of crucial importance in many decision-making applications, such as lending, trading, investment, and crowd-sourcing. In practice, however, gathering answers from a set of advisors has a cost. Therefore, finding an advisor selection strategy that retrieves a reliable answer and maximizes the overall uti…

    Submitted 15 May, 2023; originally announced May 2023.

  8. arXiv:2305.07559  [pdf, other]

    q-fin.TR q-fin.CP

    PRIME: A Price-Reverting Impact Model of a cryptocurrency Exchange

    Authors: Christopher J. Cho, Timothy J. Norman, Manuel Nunes

    Abstract: In a financial exchange, market impact is a measure of the price change of an asset following a transaction. This is an important element of market microstructure, which determines the behaviour of the market following a trade. In this paper, we first provide a discussion on the market impact observed in the BTC/USD Futures market, then we present a novel multi-agent market simulation that can fol…

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: Pre-print for the Cryptocurrency Research Conference 2023

  9. arXiv:2302.06569  [pdf, other]

    cs.LG cs.MA

    Inferring Player Location in Sports Matches: Multi-Agent Spatial Imputation from Limited Observations

    Authors: Gregory Everett, Ryan J. Beal, Tim Matthews, Joseph Early, Timothy J. Norman, Sarvapali D. Ramchurn

    Abstract: Understanding agent behaviour in Multi-Agent Systems (MAS) is an important problem in domains such as autonomous driving, disaster response, and sports analytics. Existing MAS problems typically use uniform timesteps with observations for all agents. In this work, we analyse the problem of agent location imputation, specifically posed in environments with non-uniform timesteps and limited agent ob…

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: 11 Pages (8 main, 1 references, 2 appendix), 8 figures (7 main, 1 appendix). Accepted at AAMAS 2023 Main Track

  10. arXiv:2210.08050  [pdf, other]

    cs.LG

    Multi-trainer Interactive Reinforcement Learning System

    Authors: Zhaori Guo, Timothy J. Norman, Enrico H. Gerding

    Abstract: Interactive reinforcement learning can effectively facilitate the agent training via human feedback. However, such methods often require the human teacher to know what is the correct action that the agent should take. In other words, if the human teacher is not always reliable, then it will not be consistently able to guide the agent through its training. In this paper, we propose a more effective…

    Submitted 14 October, 2022; originally announced October 2022.

  11. arXiv:2210.06352  [pdf, other]

    cs.DC cs.LG cs.NE

    Automatic Discovery of Composite SPMD Partitioning Strategies in PartIR

    Authors: Sami Alabed, Dominik Grewe, Juliana Franco, Bart Chrzaszcz, Tom Natan, Tamara Norman, Norman A. Rink, Dimitrios Vytiniotis, Michael Schaarschmidt

    Abstract: Large neural network models are commonly trained through a combination of advanced parallelism strategies in a single program, multiple data (SPMD) paradigm. For example, training large transformer models requires combining data, model, and pipeline partitioning; and optimizer sharding techniques. However, identifying efficient combinations for many model architectures and accelerator systems requ…

    Submitted 7 October, 2022; originally announced October 2022.

  12. From Intelligent Agents to Trustworthy Human-Centred Multiagent Systems

    Authors: Mohammad Divband Soorati, Enrico H. Gerding, Enrico Marchioni, Pavel Naumov, Timothy J. Norman, Sarvapali D. Ramchurn, Bahar Rastegari, Adam Sobey, Sebastian Stein, Danesh Tarpore, Vahid Yazdanpanah, Jie Zhang

    Abstract: The Agents, Interaction and Complexity research group at the University of Southampton has a long track record of research in multiagent systems (MAS). We have made substantial scientific contributions across learning in MAS, game-theoretic techniques for coordinating agent systems, and formal methods for representation and reasoning. We highlight key results achieved by the group and elaborate on…

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: Appears in the Special Issue on Multi-Agent Systems Research in the United Kingdom

    Journal ref: AI Communications, vol. 35, no. 4, pp. 443-457, 2022

  13. arXiv:2202.12259  [pdf]

    cs.CV cs.AI

    Learning from the Pros: Extracting Professional Goalkeeper Technique from Broadcast Footage

    Authors: Matthew Wear, Ryan Beal, Tim Matthews, Tim Norman, Sarvapali Ramchurn

    Abstract: As an amateur goalkeeper playing grassroots soccer, who better to learn from than top professional goalkeepers? In this paper, we harness computer vision and machine learning models to appraise the save technique of professionals in a way those at lower levels can learn from. We train an unsupervised machine learning model using 3D body pose data extracted from broadcast footage to learn professio…

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: 17 pages, 15 figures, MIT Sloan Sports Analytics Conference, March 4-5 2022, Boston, USA

  14. arXiv:2202.02080  [pdf, other]

    cs.LG

    Robust Linear Regression for General Feature Distribution

    Authors: Tom Norman, Nir Weinberger, Kfir Y. Levy

    Abstract: We investigate robust linear regression where data may be contaminated by an oblivious adversary, i.e., an adversary that may know the data distribution but is otherwise oblivious to the realizations of the data samples. This model has been previously analyzed under strong assumptions. Concretely, $\textbf{(i)}$ all previous works assume that the covariance matrix of the features is positive defin…

    Submitted 4 February, 2022; originally announced February 2022.

  15. arXiv:2112.02958  [pdf, other]

    cs.LG cs.DC

    Automap: Towards Ergonomic Automated Parallelism for ML Models

    Authors: Michael Schaarschmidt, Dominik Grewe, Dimitrios Vytiniotis, Adam Paszke, Georg Stefan Schmid, Tamara Norman, James Molloy, Jonathan Godwin, Norman Alexander Rink, Vinod Nair, Dan Belov

    Abstract: The rapid rise in demand for training large neural network architectures has brought into focus the need for partitioning strategies, for example by using data, model, or pipeline parallelism. Implementing these methods is increasingly supported through program primitives, but identifying efficient partitioning strategies requires expensive experimentation and expertise. We present the prototype o…

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Workshop on ML for Systems at NeurIPS 2021

  16. arXiv:2110.10548  [pdf, other]

    cs.PL cs.DC cs.LG

    Synthesizing Optimal Parallelism Placement and Reduction Strategies on Hierarchical Systems for Deep Learning

    Authors: Ningning Xie, Tamara Norman, Dominik Grewe, Dimitrios Vytiniotis

    Abstract: We present a novel characterization of the mapping of multiple parallelism forms (e.g. data and model parallelism) onto hierarchical accelerator systems that is hierarchy-aware and greatly reduces the space of software-to-hardware mapping. We experimentally verify the substantial effect of these mappings on all-reduce performance (up to 448x). We offer a novel syntax-guided program synthesis frame…

    Submitted 16 November, 2021; v1 submitted 20 October, 2021; originally announced October 2021.

  17. arXiv:2102.09469  [pdf, other]

    cs.AI cs.GT

    Optimising Long-Term Outcomes using Real-World Fluent Objectives: An Application to Football

    Authors: Ryan Beal, Georgios Chalkiadakis, Timothy J. Norman, Sarvapali D. Ramchurn

    Abstract: In this paper, we present a novel approach for optimising long-term tactical and strategic decision-making in football (soccer) by encapsulating events in a league environment across a given time frame. We model the teams' objectives for a season and track how these evolve as games unfold to give a fluent objective that can aid in decision-making games. We develop Markov chain Monte Carlo and deep…

    Submitted 18 February, 2021; originally announced February 2021.

    Comments: Pre-Print - Accepted for publication at AAMAS-21

  18. arXiv:2012.04380  [pdf, other]

    cs.CL cs.AI

    Combining Machine Learning and Human Experts to Predict Match Outcomes in Football: A Baseline Model

    Authors: Ryan Beal, Stuart E. Middleton, Timothy J. Norman, Sarvapali D. Ramchurn

    Abstract: In this paper, we present a new application-focused benchmark dataset and results from a set of baseline Natural Language Processing and Machine Learning models for prediction of match outcomes for games of football (soccer). By doing so we give a baseline for the prediction accuracy that can be achieved exploiting both statistical match data and contextual articles from human sports journalists.…

    Submitted 8 December, 2020; originally announced December 2020.

    Comments: Pre-print. Accepted at: The Thirty-Third Annual Conference on Innovative Applications of Artificial Intelligence (IAAI-21). 5 pages

  19. arXiv:2009.09806  [pdf, ps, other]

    cs.LO cs.AI cs.DB

    SHACL Satisfiability and Containment (Extended Paper)

    Authors: Paolo Pareti, George Konstantinidis, Fabio Mogavero, Timothy J. Norman

    Abstract: The Shapes Constraint Language (SHACL) is a recent W3C recommendation language for validating RDF data. Specifically, SHACL documents are collections of constraints that enforce particular shapes on an RDF graph. Previous work on the topic has provided theoretical and practical results for the validation problem, but did not consider the standard decision problems of satisfiability and containment…

    Submitted 5 November, 2020; v1 submitted 31 August, 2020; originally announced September 2020.

  20. arXiv:2006.00979  [pdf, other]

    cs.LG cs.AI

    Acme: A Research Framework for Distributed Reinforcement Learning

    Authors: Matthew W. Hoffman, Bobak Shahriari, John Aslanides, Gabriel Barth-Maron, Nikola Momchev, Danila Sinopalnikov, Piotr Stańczyk, Sabela Ramos, Anton Raichuk, Damien Vincent, Léonard Hussenot, Robert Dadashi, Gabriel Dulac-Arnold, Manu Orsini, Alexis Jacq, Johan Ferret, Nino Vieillard, Seyed Kamyar Seyed Ghasemipour, Sertan Girgin, Olivier Pietquin, Feryal Behbahani, Tamara Norman, Abbas Abdolmaleki, Albin Cassirer, Fan Yang, et al. (14 additional authors not shown)

    Abstract: Deep reinforcement learning (RL) has led to many recent and groundbreaking advances. However, these advances have often come at the cost of both increased scale in the underlying architectures being trained as well as increased complexity of the RL algorithms used to train them. These increases have in turn made it more difficult for researchers to rapidly prototype new ideas or reproduce publishe…

    Submitted 20 September, 2022; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: This work presents a second version of the paper which coincides with an increase in modularity, additional emphasis on offline, imitation and learning from demonstrations algorithms, as well as various new agents implemented as part of Acme

  21. arXiv:2003.10294  [pdf, other]

    cs.AI cs.GT cs.MA

    Optimising Game Tactics for Football

    Authors: Ryan Beal, Georgios Chalkiadakis, Timothy J. Norman, Sarvapali D. Ramchurn

    Abstract: In this paper we present a novel approach to optimise tactical and strategic decision making in football (soccer). We model the game of football as a multi-stage game which is made up from a Bayesian game to model the pre-match decisions and a stochastic game to model the in-match state transitions and decisions. Using this formulation, we propose a method to predict the probability of game outcom…

    Submitted 23 March, 2020; originally announced March 2020.

    Comments: AAMAS 2020 Pre-Print Version

  22. arXiv:1911.06657  [pdf, other]

    cs.AI

    A Policy Editor for Semantic Sensor Networks

    Authors: Paolo Pareti, George Konstantinidis, Timothy J. Norman

    Abstract: An important use of sensors and actuator networks is to comply with health and safety policies in hazardous environments. In order to deal with increasingly large and dynamic environments, and to quickly react to emergencies, tools are needed to simplify the process of translating high-level policies into executable queries and rules. We present a framework to produce such tools, which uses rules…

    Submitted 15 November, 2019; originally announced November 2019.

    Comments: Demo paper presented at the 18th International Semantic Web Conference (ISWC 2019)

  23. SHACL Constraints with Inference Rules

    Authors: Paolo Pareti, George Konstantinidis, Timothy J. Norman, Murat Şensoy

    Abstract: The Shapes Constraint Language (SHACL) has been recently introduced as a W3C recommendation to define constraints that can be validated against RDF graphs. Interactions of SHACL with other Semantic Web technologies, such as ontologies or reasoners, are a matter of ongoing research. In this paper we study the interaction of a subset of SHACL with inference rules expressed in datalog. On the one hand…

    Submitted 1 November, 2019; originally announced November 2019.

    Journal ref: In International Semantic Web Conference, pp. 539-557. Springer, Cham, 2019

  24. arXiv:1907.01627  [pdf, other]

    cs.DB cs.AI

    Rule Applicability on RDF Triplestore Schemas

    Authors: Paolo Pareti, George Konstantinidis, Timothy J. Norman, Murat Şensoy

    Abstract: Rule-based systems play a critical role in health and safety, where policies created by experts are usually formalised as rules. When dealing with increasingly large and dynamic sources of data, as in the case of Internet of Things (IoT) applications, it becomes important not only to efficiently apply rules, but also to reason about their applicability on datasets confined by a certain schema. In…

    Submitted 2 July, 2019; originally announced July 2019.

    Comments: AI for Internet of Things Workshop, co-located with the 28th International Joint Conference on Artificial Intelligence (IJCAI-19)

  25. arXiv:1706.04033  [pdf, other]

    cs.AI

    On Natural Language Generation of Formal Argumentation

    Authors: Federico Cerutti, Alice Toniolo, Timothy J. Norman

    Abstract: In this paper we provide a first analysis of the research questions that arise when dealing with the problem of communicating pieces of formal argumentation through natural language interfaces. It is a generally held opinion that formal models of argumentation naturally capture human argument, and some preliminary studies have focused on justifying this view. Unfortunately, the results are not onl…

    Submitted 13 June, 2017; originally announced June 2017.

    Comments: 17 pages, 4 figures, technical report

  26. arXiv:1607.00091  [pdf, ps, other]

    stat.AP

    Reducing overfitting in challenge-based competitions

    Authors: Elias Chaibub Neto, Bruce R Hoff, Chris Bare, Brian M Bot, Thomas Yu, Lara Magravite, Andrew D Trister, Thea Norman, Pablo Meyer, Julio Saez-Rodrigues, James C Costello, Justin Guinney, Gustavo Stolovitzky

    Abstract: Over-fitting is a dreaded foe in challenge-based competitions. Because participants rely on public leaderboards to evaluate and refine their models, there is always the danger they might over-fit to the holdout data supporting the leaderboard. The recently published Ladder algorithm aims to address this problem by preventing the participants from exploiting willingly or inadvertently minor fluctua…

    Submitted 30 June, 2016; originally announced July 2016.

  27. arXiv:1407.3910  [pdf, ps, other]

    math.OC

    Approachability in Population Games

    Authors: Dario Bauso, Thomas W L Norman

    Abstract: This paper reframes approachability theory within the context of population games. Thus, whilst one player aims at driving her average payoff to a predefined set, her opponent is not malevolent but rather extracted randomly from a population of individuals with given distribution on actions. First, convergence conditions are revisited based on the common prior on the population distribution, and w…

    Submitted 15 July, 2014; originally announced July 2014.

    Comments: 24 pages, 5 figures

    MSC Class: 91A13

  28. arXiv:1312.4828  [pdf, ps, other]

    cs.CR cs.AI cs.LO

    Subjective Logic Operators in Trust Assessment: an Empirical Study

    Authors: Federico Cerutti, Alice Toniolo, Nir Oren, Timothy J. Norman

    Abstract: Computational trust mechanisms aim to produce trust ratings from both direct and indirect information about agents' behaviour. Subjective Logic (SL) has been widely adopted as the core of such systems via its fusion and discount operators. In recent research we revisited the semantics of these operators to explore an alternative, geometric interpretation. In this paper we present a principled desi…

    Submitted 19 November, 2013; originally announced December 2013.

    Comments: Submitted to Information Systems Frontiers Journal

  29. arXiv:1309.4994  [pdf, other]

    cs.OH

    Context-dependent Trust Decisions with Subjective Logic

    Authors: Federico Cerutti, Alice Toniolo, Nir Oren, Timothy J. Norman

    Abstract: A decision procedure implemented over a computational trust mechanism aims to allow for decisions to be made regarding whether some entity or information should be trusted. As recognised in the literature, trust is contextual, and we describe how such a context often translates into a confidence level which should be used to modify an underlying trust value. Jøsang's Subjective Logic has long been…

    Submitted 19 September, 2013; originally announced September 2013.

    Comments: 19 pages, 4 figures, technical report of the University of Aberdeen (preprint version)