Showing 1–20 of 20 results for author: Calvo-Fullana, M

  1. arXiv:2406.01782  [pdf, other]

    eess.SY cs.AI cs.LG cs.MA

    Multi-agent assignment via state augmented reinforcement learning

    Authors: Leopoldo Agorio, Sean Van Alen, Miguel Calvo-Fullana, Santiago Paternain, Juan Andres Bazerque

    Abstract: We address the conflicting requirements of a multi-agent assignment problem through constrained reinforcement learning, emphasizing the inadequacy of standard regularization techniques for this purpose. Instead, we recur to a state augmentation approach in which the oscillation of dual variables is exploited by agents to alternate between tasks. In addition, we coordinate the actions of the multip…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 12 pages, 3 figures, 6th Annual Conference on Learning for Dynamics and Control

    MSC Class: 93E35

    Journal ref: Proceedings of Machine Learning Research vol 242:1–12, 2024. 6th Annual Conference on Learning for Dynamics and Control

  2. arXiv:2306.08737  [pdf, other]

    cs.RO cs.IT math.OC

    A Networked Multi-Agent System for Mobile Wireless Infrastructure on Demand

    Authors: Miguel Calvo-Fullana, Mikhail Gerasimenko, Daniel Mox, Leopoldo Agorio, Mariana del Castillo, Vijay Kumar, Alejandro Ribeiro, Juan Andres Bazerque

    Abstract: Despite the prevalence of wireless connectivity in urban areas around the globe, there remain numerous and diverse situations where connectivity is insufficient or unavailable. To address this, we introduce mobile wireless infrastructure on demand, a system of UAVs that can be rapidly deployed to establish an ad-hoc wireless network. This network has the capability of reconfiguring itself dynamica…

    Submitted 14 June, 2023; originally announced June 2023.

  3. arXiv:2211.11038  [pdf, other]

    cs.RO math.OC

    Mission-Aware Value of Information Censoring for Distributed Filtering

    Authors: Miguel Calvo-Fullana, Jonathan P. How

    Abstract: In this paper, we study the problem of distributed estimation with an emphasis on communication-efficiency. The proposed algorithm is based on a windowed maximum a posteriori (MAP) estimation problem, wherein each agent in the network locally computes a Kalman-like filter estimate that approximates the centralized MAP solution. Information sharing among agents is restricted to their neighbors only…

    Submitted 20 November, 2022; originally announced November 2022.

  4. arXiv:2210.15573  [pdf, other]

    cs.LG

    Multi-task Bias-Variance Trade-off Through Functional Constraints

    Authors: Juan Cervino, Juan Andres Bazerque, Miguel Calvo-Fullana, Alejandro Ribeiro

    Abstract: Multi-task learning aims to acquire a set of functions, either regressors or classifiers, that perform well for diverse tasks. At its core, the idea behind multi-task learning is to exploit the intrinsic similarity across data sources to aid in the learning process for each individual domain. In this paper we draw intuition from the two extreme learning scenarios -- a single function for all tasks…

    Submitted 27 October, 2022; originally announced October 2022.

  5. arXiv:2204.00474  [pdf, other]

    cs.RO math.OC

    Distributed Filtering with Value of Information Censoring

    Authors: Miguel Calvo-Fullana, Jonathan P. How

    Abstract: This work presents a distributed estimation algorithm that efficiently uses the available communication resources. The approach is based on Bayesian filtering that is distributed across a network by using the logarithmic opinion pool operator. Communication efficiency is achieved by having only agents with high Value of Information (VoI) share their estimates, and the algorithm provides a tunable…

    Submitted 1 April, 2022; originally announced April 2022.

  6. arXiv:2203.00851  [pdf, other]

    cs.RO math.OC

    Distributed Riemannian Optimization with Lazy Communication for Collaborative Geometric Estimation

    Authors: Yulun Tian, Amrit Singh Bedi, Alec Koppel, Miguel Calvo-Fullana, David M. Rosen, Jonathan P. How

    Abstract: We present the first distributed optimization algorithm with lazy communication for collaborative geometric estimation, the backbone of modern collaborative simultaneous localization and mapping (SLAM) and structure-from-motion (SfM) applications. Our method allows agents to cooperatively reconstruct a shared geometric model on a central server by fusing individual observations, but without the ne…

    Submitted 29 July, 2022; v1 submitted 1 March, 2022; originally announced March 2022.

    Comments: technical report (17 pages, 3 figures); to appear at IROS 2022

  7. arXiv:2103.05134  [pdf, other]

    cs.LG math.ST stat.ML

    Constrained Learning with Non-Convex Losses

    Authors: Luiz F. O. Chamon, Santiago Paternain, Miguel Calvo-Fullana, Alejandro Ribeiro

    Abstract: Though learning has become a core component of modern information processing, there is now ample evidence that it can lead to biased, unsafe, and prejudiced systems. The need to impose requirements on learning is therefore paramount, especially as it reaches critical applications in social, industrial, and medical domains. However, the non-convexity of most modern statistical problems is only exac…

    Submitted 19 October, 2022; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: IEEE Transactions on Information Theory

  8. arXiv:2102.12585  [pdf, other]

    cs.LG math.OC

    Towards Safe Continuing Task Reinforcement Learning

    Authors: Miguel Calvo-Fullana, Luiz F. O. Chamon, Santiago Paternain

    Abstract: Safety is a critical feature of controller design for physical systems. When designing control policies, several approaches to guarantee this aspect of autonomy have been proposed, such as robust controllers or control barrier functions. However, these solutions strongly rely on the model of the system being available to the designer. As a parallel development, reinforcement learning provides mode…

    Submitted 24 February, 2021; originally announced February 2021.

  9. arXiv:2102.11941  [pdf, other]

    cs.LG cs.RO math.OC

    State Augmented Constrained Reinforcement Learning: Overcoming the Limitations of Learning with Rewards

    Authors: Miguel Calvo-Fullana, Santiago Paternain, Luiz F. O. Chamon, Alejandro Ribeiro

    Abstract: A common formulation of constrained reinforcement learning involves multiple rewards that must individually accumulate to given thresholds. In this class of problems, we show a simple example in which the desired optimal policy cannot be induced by any weighted linear combination of rewards. Hence, there exist constrained reinforcement learning problems for which neither regularized nor classical…

    Submitted 21 September, 2023; v1 submitted 23 February, 2021; originally announced February 2021.

  10. arXiv:2101.10113  [pdf, other]

    cs.RO

    ROS-NetSim: A Framework for the Integration of Robotic and Network Simulators

    Authors: Miguel Calvo-Fullana, Daniel Mox, Alexander Pyattaev, Jonathan Fink, Vijay Kumar, Alejandro Ribeiro

    Abstract: Multi-agent systems play an important role in modern robotics. Due to the nature of these systems, coordination among agents via communication is frequently necessary. Indeed, Perception-Action-Communication (PAC) loops, or Perception-Action loops closed over a communication channel, are a critical component of multi-robot systems. However, we lack appropriate tools for simulating PAC loops. To th…

    Submitted 25 January, 2021; originally announced January 2021.

  11. arXiv:2010.12993  [pdf, other]

    cs.LG eess.SP stat.ML

    Multi-task Supervised Learning via Cross-learning

    Authors: Juan Cervino, Juan Andres Bazerque, Miguel Calvo-Fullana, Alejandro Ribeiro

    Abstract: In this paper we consider a problem known as multi-task learning, consisting of fitting a set of classifier or regression functions intended for solving different tasks. In our novel formulation, we couple the parameters of these functions, so that they learn in their task specific domains while staying close to each other. This facilitates cross-fertilization in which data collected across differ…

    Submitted 26 May, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

  12. Multi-task Reinforcement Learning in Reproducing Kernel Hilbert Spaces via Cross-learning

    Authors: Juan Cervino, Juan Andres Bazerque, Miguel Calvo-Fullana, Alejandro Ribeiro

    Abstract: Reinforcement learning (RL) is a framework to optimize a control policy using rewards that are revealed by the system as a response to a control action. In its standard form, RL involves a single agent that uses its policy to accomplish a specific task. These methods require large amounts of reward samples to achieve good performance, and may not generalize well when the task is modified, even if…

    Submitted 26 August, 2020; originally announced August 2020.

  13. arXiv:2002.05183  [pdf, other]

    cs.LG math.OC stat.ML

    The empirical duality gap of constrained statistical learning

    Authors: Luiz F. O. Chamon, Santiago Paternain, Miguel Calvo-Fullana, Alejandro Ribeiro

    Abstract: This paper is concerned with the study of constrained statistical learning problems, the unconstrained versions of which are at the core of virtually all of modern information processing. Accounting for constraints, however, is paramount to incorporate prior knowledge and impose desired structural and statistical properties on the solutions. Still, solving constrained statistical problems remains c…

    Submitted 12 February, 2020; originally announced February 2020.

  14. arXiv:2002.03026  [pdf, other]

    cs.RO eess.SY

    Mobile Wireless Network Infrastructure on Demand

    Authors: Daniel Mox, Miguel Calvo-Fullana, Mikhail Gerasimenko, Jonathan Fink, Vijay Kumar, Alejandro Ribeiro

    Abstract: In this work, we introduce Mobile Wireless Infrastructure on Demand: a framework for providing wireless connectivity to multi-robot teams via autonomously reconfiguring ad-hoc networks. In many cases, previous multi-agent systems either assumed the availability of existing communication infrastructure or were required to create a network in addition to completing their objective. Instead our syst…

    Submitted 24 March, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

    Comments: 7 pages, 7 figures, accepted to ICRA 2020

  15. arXiv:1911.09101  [pdf, other]

    eess.SY cs.LG math.OC

    Safe Policies for Reinforcement Learning via Primal-Dual Methods

    Authors: Santiago Paternain, Miguel Calvo-Fullana, Luiz F. O. Chamon, Alejandro Ribeiro

    Abstract: In this paper, we study the learning of safe policies in the setting of reinforcement learning problems. That is, we aim to control a Markov Decision Process (MDP) of which we do not know the transition probabilities, but we have access to sample trajectories through experience. We define safety as the agent remaining in a desired safe set with high probability during the operation time. We theref…

    Submitted 12 January, 2022; v1 submitted 20 November, 2019; originally announced November 2019.

    Comments: arXiv admin note: text overlap with arXiv:1910.13393

  16. arXiv:1910.13393  [pdf, other]

    cs.LG math.OC stat.ML

    Constrained Reinforcement Learning Has Zero Duality Gap

    Authors: Santiago Paternain, Luiz F. O. Chamon, Miguel Calvo-Fullana, Alejandro Ribeiro

    Abstract: Autonomous agents must often deal with conflicting requirements, such as completing tasks using the least amount of time/energy, learning multiple tasks, or dealing with multiple opponents. In the context of reinforcement learning (RL), these problems are addressed by (i) designing a reward function that simultaneously describes all requirements or (ii) combining modular value functions that encod…

    Submitted 29 October, 2019; originally announced October 2019.

  17. arXiv:1801.10141  [pdf, other]

    math.OC cs.IT

    Random Access Communication for Wireless Control Systems with Energy Harvesting Sensors

    Authors: Miguel Calvo-Fullana, Carles Antón-Haro, Javier Matamoros, Alejandro Ribeiro

    Abstract: In this paper, we study wireless networked control systems in which the sensing devices are powered by energy harvesting. We consider a scenario with multiple plants, where the sensors communicate their measurements to their respective controllers over a shared wireless channel. Due to the shared nature of the medium, sensors transmitting simultaneously can lead to packet collisions. In order to d…

    Submitted 30 January, 2018; originally announced January 2018.

  18. Stochastic Routing and Scheduling Policies for Energy Harvesting Communication Networks

    Authors: Miguel Calvo-Fullana, Carles Antón-Haro, Javier Matamoros, Alejandro Ribeiro

    Abstract: In this paper, we study the joint routing-scheduling problem in energy harvesting communication networks. Our policies, which are based on stochastic subgradient methods on the dual domain, act as an energy harvesting variant of the stochastic family of backpressure algorithms. Specifically, we propose two policies: (i) the Stochastic Backpressure with Energy Harvesting (SBP-EH), in which a node's…

    Submitted 2 November, 2017; originally announced November 2017.

  19. arXiv:1701.06960  [pdf, ps, other]

    cs.IT

    Reconstruction of Correlated Sources with Energy Harvesting Constraints in Delay-constrained and Delay-tolerant Communication Scenarios

    Authors: Miguel Calvo-Fullana, Javier Matamoros, Carles Antón-Haro

    Abstract: In this paper, we investigate the reconstruction of time-correlated sources in a point-to-point communications scenario comprising an energy-harvesting sensor and a Fusion Center (FC). Our goal is to minimize the average distortion in the reconstructed observations by using data from previously encoded sources as side information. First, we analyze a delay-constrained scenario, where the sources m…

    Submitted 24 January, 2017; originally announced January 2017.

  20. arXiv:1608.03875  [pdf, ps, other]

    cs.IT

    Sensor Selection and Power Allocation Strategies for Energy Harvesting Wireless Sensor Networks

    Authors: Miguel Calvo-Fullana, Javier Matamoros, Carles Antón-Haro

    Abstract: In this paper, we investigate the problem of jointly selecting a predefined number of energy-harvesting (EH) sensors and computing the optimal power allocation. The ultimate goal is to minimize the reconstruction distortion at the fusion center. This optimization problem is, unfortunately, non-convex. To circumvent that, we propose two suboptimal strategies: (i) a joint sensor selection and power…

    Submitted 12 August, 2016; originally announced August 2016.