Showing 1–14 of 14 results for author: Mets, K

Searching in archive cs.
  1. arXiv:2406.09860

    cs.LG cs.AI cs.CV

    Dataset Condensation with Latent Quantile Matching

    Authors: Wei Wei, Tom De Schepper, Kevin Mets

    Abstract: Dataset condensation (DC) methods aim to learn a smaller synthesized dataset with informative data records to accelerate the training of machine learning models. Current distribution matching (DM) based DC methods learn a synthesized dataset by matching the mean of the latent embeddings between the synthetic and the real dataset. However, two distributions with the same mean can still be vastly dif…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted by CVPR Workshop 2024: 1st Workshop on Dataset Distillation for Computer Vision

  2. Benchmarking Sensitivity of Continual Graph Learning for Skeleton-Based Action Recognition

    Authors: Wei Wei, Tom De Schepper, Kevin Mets

    Abstract: Continual learning (CL) is the research field that aims to build machine learning models that can accumulate knowledge continuously over different tasks without retraining from scratch. Previous studies have shown that pre-training graph neural networks (GNN) may lead to negative transfer (Hu et al., 2020) after fine-tuning, a setting which is closely related to CL. Thus, we focus on studying GNN…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: This work is accepted at VISAPP 2024 as a short paper

  3. Safety Aware Autonomous Path Planning Using Model Predictive Reinforcement Learning for Inland Waterways

    Authors: Astrid Vanneste, Simon Vanneste, Olivier Vasseur, Robin Janssens, Mattias Billast, Ali Anwar, Kevin Mets, Tom De Schepper, Siegfried Mercelis, Peter Hellinckx

    Abstract: In recent years, interest in autonomous shipping in urban waterways has increased significantly due to the trend of keeping cars and trucks out of city centers. Classical approaches such as Frenet frame based planning and potential field navigation often require tuning of many configuration parameters and sometimes even require a different configuration depending on the situation. In this paper, w…

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.

  4. arXiv:2308.04938

    cs.LG cs.AI cs.MA

    An In-Depth Analysis of Discretization Methods for Communication Learning using Backpropagation with Multi-Agent Reinforcement Learning

    Authors: Astrid Vanneste, Simon Vanneste, Kevin Mets, Tom De Schepper, Siegfried Mercelis, Peter Hellinckx

    Abstract: Communication is crucial in multi-agent reinforcement learning when agents are not able to observe the full state of the environment. The most common approach to allow learned communication between agents is the use of a differentiable communication channel that allows gradients to flow between agents as a form of feedback. However, this is challenging when we want to use discrete messages to redu…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2204.05669

  5. arXiv:2308.04844

    cs.LG cs.AI cs.MA

    Scalability of Message Encoding Techniques for Continuous Communication Learned with Multi-Agent Reinforcement Learning

    Authors: Astrid Vanneste, Thomas Somers, Simon Vanneste, Kevin Mets, Tom De Schepper, Siegfried Mercelis, Peter Hellinckx

    Abstract: Many multi-agent systems require inter-agent communication to properly achieve their goal. By learning the communication protocol alongside the action protocol using multi-agent reinforcement learning techniques, the agents gain the flexibility to determine which information should be shared. However, when the number of agents increases we need to create an encoding of the information contained in…

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: Paper accepted to the BNAIC/BeNeLearn 2022 conference

  6. Deep set conditioned latent representations for action recognition

    Authors: Akash Singh, Tom De Schepper, Kevin Mets, Peter Hellinckx, Jose Oramas, Steven Latre

    Abstract: In recent years multi-label, multi-class video action recognition has gained significant popularity. While reasoning over temporally connected atomic actions is mundane for intelligent species, standard artificial neural networks (ANN) still struggle to classify them. In the real world, atomic actions often temporally connect to form more complex composite actions. The challenge lies in recognisin…

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: Conference VISAPP 2022, 11 pages, 5 figures, 2 tables, 6 plots

    Journal ref: In Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications - Volume 5: VISAPP, ISBN 978-989-758-555-5; ISSN 2184-4321, year 2022, pages 456-466

  7. arXiv:2211.08467

    cs.AI

    Structured Exploration Through Instruction Enhancement for Object Navigation

    Authors: Matthias Hutsebaut-Buysse, Kevin Mets, Tom De Schepper, Steven Latré

    Abstract: Finding an object of a specific class in an unseen environment remains an unsolved navigation problem. Hence, we propose a hierarchical learning-based method for object navigation. The top-level is capable of high-level planning, and building a memory on a floorplan-level (e.g., which room makes the most sense for the agent to visit next, where has the agent already been?). While the lower-level i…

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: Paper accepted to the BNAIC/BeNeLearn 2022 conference

  8. arXiv:2204.05669

    cs.LG cs.MA

    An Analysis of Discretization Methods for Communication Learning with Multi-Agent Reinforcement Learning

    Authors: Astrid Vanneste, Simon Vanneste, Kevin Mets, Tom De Schepper, Siegfried Mercelis, Steven Latré, Peter Hellinckx

    Abstract: Communication is crucial in multi-agent reinforcement learning when agents are not able to observe the full state of the environment. The most common approach to allow learned communication between agents is the use of a differentiable communication channel that allows gradients to flow between agents as a form of feedback. However, this is challenging when we want to use discrete messages to redu…

    Submitted 12 April, 2022; originally announced April 2022.

    Comments: Accepted at Adaptive and Learning Agents Workshop (ALA 2022) https://ala2022.github.io/

  9. Learning to Communicate with Reinforcement Learning for an Adaptive Traffic Control System

    Authors: Simon Vanneste, Gauthier de Borrekens, Stig Bosmans, Astrid Vanneste, Kevin Mets, Siegfried Mercelis, Steven Latré, Peter Hellinckx

    Abstract: Recent work in multi-agent reinforcement learning has investigated inter agent communication which is learned simultaneously with the action policy in order to improve the team reward. In this paper, we investigate independent Q-learning (IQL) without communication and differentiable inter-agent learning (DIAL) with learned communication on an adaptive traffic control system (ATCS). In real world…

    Submitted 29 October, 2021; originally announced October 2021.

  10. Mixed Cooperative-Competitive Communication Using Multi-Agent Reinforcement Learning

    Authors: Astrid Vanneste, Wesley Van Wijnsberghe, Simon Vanneste, Kevin Mets, Siegfried Mercelis, Steven Latré, Peter Hellinckx

    Abstract: By using communication between multiple agents in multi-agent environments, one can reduce the effects of partial observability by combining one agent's observation with that of others in the same dynamic environment. While a lot of successful research has been done towards communication learning in cooperative settings, communication learning in mixed cooperative-competitive settings is also impo…

    Submitted 29 October, 2021; originally announced October 2021.

  11. arXiv:2009.08880

    cs.LG cs.AI stat.ML

    HTMRL: Biologically Plausible Reinforcement Learning with Hierarchical Temporal Memory

    Authors: Jakob Struye, Kevin Mets, Steven Latré

    Abstract: Building Reinforcement Learning (RL) algorithms which are able to adapt to continuously evolving tasks is an open research challenge. One technology that is known to inherently handle such non-stationary input patterns well is Hierarchical Temporal Memory (HTM), a general and biologically plausible computational model for the human neocortex. As the RL paradigm is inspired by human learning, HTM i…

    Submitted 18 September, 2020; originally announced September 2020.

  12. arXiv:2007.05196

    cs.LG cs.AI stat.ML

    Pre-trained Word Embeddings for Goal-conditional Transfer Learning in Reinforcement Learning

    Authors: Matthias Hutsebaut-Buysse, Kevin Mets, Steven Latré

    Abstract: Reinforcement learning (RL) algorithms typically start tabula rasa, without any prior knowledge of the environment, and without any prior skills. This however often leads to low sample efficiency, requiring a large amount of interaction with the environment. This is especially true in a lifelong learning setting, in which the agent needs to continually extend its capabilities. In this paper, we ex…

    Submitted 10 July, 2020; originally announced July 2020.

    Comments: Paper accepted to the ICML 2020 Language in Reinforcement Learning (LaReL) Workshop

  13. arXiv:2006.07200

    cs.LG cs.MA stat.ML

    Learning to Communicate Using Counterfactual Reasoning

    Authors: Simon Vanneste, Astrid Vanneste, Kevin Mets, Tom De Schepper, Ali Anwar, Siegfried Mercelis, Steven Latré, Peter Hellinckx

    Abstract: Learning to communicate in order to share state information is an active problem in the area of multi-agent reinforcement learning (MARL). The credit assignment problem, the non-stationarity of the communication environment and the creation of influenceable agents are major challenges within this research field which need to be overcome in order to learn a valid communication protocol. This paper…

    Submitted 26 April, 2022; v1 submitted 12 June, 2020; originally announced June 2020.

    Comments: Accepted at Adaptive and Learning Agents Workshop (ALA 2022) https://ala2022.github.io/

  14. arXiv:1910.04040

    cs.AI

    Fast Task-Adaptation for Tasks Labeled Using Natural Language in Reinforcement Learning

    Authors: Matthias Hutsebaut-Buysse, Kevin Mets, Steven Latré

    Abstract: Over its lifetime, a reinforcement learning agent is often tasked with different tasks. How to efficiently adapt a previously learned control policy from one task to another, remains an open research question. In this paper, we investigate how instructions formulated in natural language can enable faster and more effective task adaptation. This can serve as the basis for developing language instru…

    Submitted 9 October, 2019; originally announced October 2019.