Showing 1–14 of 14 results for author: Paolo, G

Searching in archive cs.
  1. arXiv:2402.10198

    cs.LG stat.ML

    SAMformer: Unlocking the Potential of Transformers in Time Series Forecasting with Sharpness-Aware Minimization and Channel-Wise Attention

    Authors: Romain Ilbert, Ambroise Odonnat, Vasilii Feofanov, Aladin Virmaux, Giuseppe Paolo, Themis Palpanas, Ievgen Redko

    Abstract: Transformer-based architectures achieved breakthrough performance in natural language processing and computer vision, yet they remain inferior to simpler linear baselines in multivariate long-term forecasting. To better understand this phenomenon, we start by studying a toy linear forecasting problem for which we show that transformers are incapable of converging to their true solution despite the…

    Submitted 3 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: Accepted as an Oral at ICML 2024, Vienna. The first two authors contributed equally

  2. arXiv:2402.03824

    cs.AI

    A call for embodied AI

    Authors: Giuseppe Paolo, Jonas Gonzalez-Billandon, Balázs Kégl

    Abstract: We propose Embodied AI as the next fundamental step in the pursuit of Artificial General Intelligence, juxtaposing it against current AI advancements, particularly Large Language Models. We traverse the evolution of the embodiment concept across diverse fields - philosophy, psychology, neuroscience, and robotics - to highlight how EAI distinguishes itself from the classical paradigm of static lear…

    Submitted 28 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: Published in ICML 2024 Position paper track

  3. arXiv:2402.03146

    cs.LG stat.ML

    A Multi-step Loss Function for Robust Learning of the Dynamics in Model-based Reinforcement Learning

    Authors: Abdelhakim Benechehab, Albert Thomas, Giuseppe Paolo, Maurizio Filippone, Balázs Kégl

    Abstract: In model-based reinforcement learning, most algorithms rely on simulating trajectories from one-step models of the dynamics learned on data. A critical challenge of this approach is the compounding of one-step prediction errors as the length of the trajectory grows. In this paper we tackle this issue by using a multi-step objective to train one-step models. Our objective is a weighted sum of the m…

    Submitted 5 February, 2024; originally announced February 2024.

  4. arXiv:2401.10107

    eess.SP cs.LG physics.med-ph

    Comparison analysis between standard polysomnographic data and in-ear-EEG signals: A preliminary study

    Authors: Gianpaolo Palo, Luigi Fiorillo, Giuliana Monachino, Michal Bechny, Mark Melnykowycz, Athina Tzovara, Valentina Agostini, Francesca Dalia Faraci

    Abstract: Study Objectives: Polysomnography (PSG) currently serves as the benchmark for evaluating sleep disorders. Its discomfort, impracticality for home-use, and introduction of bias in sleep quality assessment necessitate the exploration of less invasive, cost-effective, and portable alternatives. One promising contender is the in-ear-EEG sensor, which offers advantages in terms of comfort, fixed electr…

    Submitted 30 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 12 figures, 1 table

  5. arXiv:2310.05672

    cs.LG stat.ML

    Multi-timestep models for Model-based Reinforcement Learning

    Authors: Abdelhakim Benechehab, Giuseppe Paolo, Albert Thomas, Maurizio Filippone, Balázs Kégl

    Abstract: In model-based reinforcement learning (MBRL), most algorithms rely on simulating trajectories from one-step dynamics models learned on data. A critical challenge of this approach is the compounding of one-step prediction errors as length of the trajectory grows. In this paper we tackle this issue by using a multi-timestep objective to train one-step models. Our objective is a weighted sum of a los…

    Submitted 11 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  6. arXiv:2206.09743

    cs.AI cs.LG eess.SY

    Guided Safe Shooting: model based reinforcement learning with safety constraints

    Authors: Giuseppe Paolo, Jonas Gonzalez-Billandon, Albert Thomas, Balázs Kégl

    Abstract: In the last decade, reinforcement learning successfully solved complex control tasks and decision-making problems, like the Go board game. Yet, there are few success stories when it comes to deploying those algorithms to real-world scenarios. One of the reasons is the lack of guarantees when dealing with and avoiding unsafe states, a fundamental requirement in critical control engineering systems.…

    Submitted 20 June, 2022; originally announced June 2022.

  7. arXiv:2203.01027

    cs.LG cs.AI cs.NE

    Learning in Sparse Rewards settings through Quality-Diversity algorithms

    Authors: Giuseppe Paolo

    Abstract: In the Reinforcement Learning (RL) framework, the learning is guided through a reward signal. This means that in situations of sparse rewards the agent has to focus on exploration, in order to discover which action, or set of actions leads to the reward. RL agents usually struggle with this. Exploration is the focus of Quality-Diversity (QD) methods. In this thesis, we approach the problem of spar…

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: PhD Thesis

  8. arXiv:2111.01919

    cs.LG cs.AI cs.NE cs.RO

    Discovering and Exploiting Sparse Rewards in a Learned Behavior Space

    Authors: Giuseppe Paolo, Miranda Coninx, Alban Laflaquière, Stephane Doncieux

    Abstract: Learning optimal policies in sparse rewards settings is difficult as the learning agent has little to no feedback on the quality of its actions. In these situations, a good strategy is to focus on exploration, hopefully leading to the discovery of a reward signal to improve on. A learning algorithm capable of dealing with this kind of settings has to be able to (1) explore possible agent behaviors…

    Submitted 26 September, 2023; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: 25 pages. Published by the Evolutionary Computation Journal, MIT Press

  9. arXiv:2102.03140

    cs.NE cs.AI cs.LG cs.RO

    Sparse Reward Exploration via Novelty Search and Emitters

    Authors: Giuseppe Paolo, Alexandre Coninx, Stephane Doncieux, Alban Laflaquière

    Abstract: Reward-based optimization algorithms require both exploration, to find rewards, and exploitation, to maximize performance. The need for efficient exploration is even more significant in sparse reward settings, in which performance feedback is given sparingly, thus rendering it unsuitable for guiding the search process. In this work, we introduce the SparsE Reward Exploration via Novelty and Emitte…

    Submitted 16 April, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: In 2021 Genetic and Evolutionary Computation Conference (GECCO 21), July, 2021, Lille, France. ACM, New York, NY, USA, 11 pages

  10. arXiv:2005.06224

    cs.NE cs.AI cs.LG cs.RO

    Novelty Search makes Evolvability Inevitable

    Authors: Stephane Doncieux, Giuseppe Paolo, Alban Laflaquière, Alexandre Coninx

    Abstract: Evolvability is an important feature that impacts the ability of evolutionary processes to find interesting novel solutions and to deal with changing conditions of the problem to solve. The estimation of evolvability is not straightforward and is generally too expensive to be directly used as selective pressure in the evolutionary process. Indirectly promoting evolvability as a side effect of othe…

    Submitted 13 May, 2020; originally announced May 2020.

  11. arXiv:1909.05508

    cs.RO cs.AI cs.LG cs.NE

    Unsupervised Learning and Exploration of Reachable Outcome Space

    Authors: Giuseppe Paolo, Alban Laflaquière, Alexandre Coninx, Stephane Doncieux

    Abstract: Performing Reinforcement Learning in sparse rewards settings, with very little prior knowledge, is a challenging problem since there is no signal to properly guide the learning process. In such situations, a good search strategy is fundamental. At the same time, not having to adapt the algorithm to every single problem is very desirable. Here we introduce TAXONS, a Task Agnostic eXploration of Out…

    Submitted 4 May, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

    Comments: Published at IEEE International Conference on Robotics and Automation (ICRA) 2020

  12. arXiv:1709.08528

    cs.RO

    A Data-driven Model for Interaction-aware Pedestrian Motion Prediction in Object Cluttered Environments

    Authors: Mark Pfeiffer, Giuseppe Paolo, Hannes Sommer, Juan Nieto, Roland Siegwart, Cesar Cadena

    Abstract: This paper reports on a data-driven, interaction-aware motion prediction approach for pedestrians in environments cluttered with static obstacles. When navigating in such workspaces shared with humans, robots need accurate motion predictions of the surrounding pedestrians. Human navigation behavior is mostly influenced by their surrounding pedestrians and by the static obstacles in their vicinity.…

    Submitted 26 February, 2018; v1 submitted 25 September, 2017; originally announced September 2017.

    Comments: 8 pages, accepted for publication at the IEEE International Conference on Robotics and Automation (ICRA) 2018

  13. arXiv:1709.08430

    cs.RO cs.AI cs.LG

    Towards continuous control of flippers for a multi-terrain robot using deep reinforcement learning

    Authors: Giuseppe Paolo, Lei Tai, Ming Liu

    Abstract: In this paper we focus on developing a control algorithm for multi-terrain tracked robots with flippers using a reinforcement learning (RL) approach. The work is based on the deep deterministic policy gradient (DDPG) algorithm, proven to be very successful in simple simulation environments. The algorithm works in an end-to-end fashion in order to control the continuous position of the flippers. Th…

    Submitted 25 September, 2017; originally announced September 2017.

    Comments: 12 pages, single column, submitted to International Journal of Robotics and Automation (IJRA)

  14. arXiv:1703.00420

    cs.RO cs.AI cs.LG

    Virtual-to-real Deep Reinforcement Learning: Continuous Control of Mobile Robots for Mapless Navigation

    Authors: Lei Tai, Giuseppe Paolo, Ming Liu

    Abstract: We present a learning-based mapless motion planner by taking the sparse 10-dimensional range findings and the target position with respect to the mobile robot coordinate frame as input and the continuous steering commands as output. Traditional motion planners for mobile ground robots with a laser range sensor mostly depend on the obstacle map of the navigation environment where both the highly pr…

    Submitted 21 July, 2017; v1 submitted 1 March, 2017; originally announced March 2017.

    Comments: video: https://www.youtube.com/watch?v=9AOIwBYIBbs, 6 pages, 9 figures, to appear in the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2017), final submission version