Showing 1–2 of 2 results for author: Baldi, S

  1. arXiv:2006.05799  [pdf, other]

    cs.MA eess.SY

    A Separation-Based Methodology to Consensus Tracking of Switched High-Order Nonlinear Multi-Agent Systems

    Authors: Maolong Lv, Wenwu Yu, Jinde Cao, Simone Baldi

    Abstract: This work investigates a reduced-complexity adaptive methodology to consensus tracking for a team of uncertain high-order nonlinear systems with switched (possibly asynchronous) dynamics. It is well known that high-order nonlinear systems are intrinsically challenging as feedback linearization and backstepping methods successfully developed for low-order systems fail to work. At the same time, eve…

    Submitted 10 June, 2020; originally announced June 2020.

  2. arXiv:2005.07404  [pdf, other]

    cs.AI cs.LG

    Think Too Fast Nor Too Slow: The Computational Trade-off Between Planning And Reinforcement Learning

    Authors: Thomas M. Moerland, Anna Deichler, Simone Baldi, Joost Broekens, Catholijn M. Jonker

    Abstract: Planning and reinforcement learning are two key approaches to sequential decision making. Multi-step approximate real-time dynamic programming, a recently successful algorithm class of which AlphaZero [Silver et al., 2018] is an example, combines both by nesting planning within a learning loop. However, the combination of planning and learning introduces a new question: how should we balance time…

    Submitted 15 May, 2020; originally announced May 2020.