Skip to main content

Showing 1–50 of 50 results for author: Leonard, N

Searching in archive math. Search in all archives.
.
  1. arXiv:2402.14174  [pdf, other

    cs.RO cs.AI eess.SY math.OC

    Blending Data-Driven Priors in Dynamic Games

    Authors: Justin Lidard, Haimin Hu, Asher Hancock, Zixu Zhang, Albert Gimó Contreras, Vikash Modi, Jonathan DeCastro, Deepak Gopinath, Guy Rosman, Naomi Ehrich Leonard, María Santos, Jaime Fernández Fisac

    Abstract: As intelligent robots like autonomous vehicles become increasingly deployed in the presence of people, the extent to which these systems should leverage model-based game-theoretic planners versus data-driven policies for safe, interaction-aware motion planning remains an open question. Existing dynamic game formulations assume all agents are task-driven and behave optimally. However, in reality, h… ▽ More

    Submitted 6 July, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 20 pages, 12 figures

  2. arXiv:2312.06395  [pdf, other

    cs.RO math.DS math.OC

    Threshold Decision-Making Dynamics Adaptive to Physical Constraints and Changing Environment

    Authors: Giovanna Amorim, María Santos, Shinkyu Park, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We propose a threshold decision-making framework for controlling the physical dynamics of an agent switching between two spatial tasks. Our framework couples a nonlinear opinion dynamics model that represents the evolution of an agent's preference for a particular task with the physical dynamics of the agent. We prove the bifurcation that governs the behavior of the coupled dynamics. We show by me… ▽ More

    Submitted 7 June, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  3. arXiv:2311.02204  [pdf, other

    q-bio.PE eess.SY math.DS

    Active risk aversion in SIS epidemics on networks

    Authors: Anastasia Bizyaeva, Marcela Ordorica Arango, Yunxiu Zhou, Simon Levin, Naomi Ehrich Leonard

    Abstract: We present and analyze an actively controlled Susceptible-Infected-Susceptible (actSIS) model of interconnected populations to study how risk aversion strategies, such as social distancing, affect network epidemics. A population using a risk aversion strategy reduces its contact rate with other populations when it perceives an increase in infection risk. The network actSIS model relies on two dist… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

  4. arXiv:2308.02755  [pdf, other

    physics.soc-ph cs.MA cs.SI math.DS math.OC

    Multi-topic belief formation through bifurcations over signed social networks

    Authors: Anastasia Bizyaeva, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We propose and analyze a nonlinear dynamic model of continuous-time multi-dimensional belief formation over signed social networks. Our model accounts for the effects of a structured belief system, self-appraisal, internal biases, and various sources of cognitive dissonance posited by recent theories in social psychology. We prove that agents become opinionated as a consequence of a bifurcation. W… ▽ More

    Submitted 2 July, 2024; v1 submitted 4 August, 2023; originally announced August 2023.

    Comments: 16 pages, 7 figures

  5. arXiv:2305.17600  [pdf, other

    cs.LG cs.CV cs.GT cs.RO math.OC

    NashFormer: Leveraging Local Nash Equilibria for Semantically Diverse Trajectory Prediction

    Authors: Justin Lidard, Oswin So, Yanxia Zhang, Jonathan DeCastro, Xiongyi Cui, Xin Huang, Yen-Ling Kuo, John Leonard, Avinash Balachandran, Naomi Leonard, Guy Rosman

    Abstract: Interactions between road agents present a significant challenge in trajectory prediction, especially in cases involving multiple agents. Because existing diversity-aware predictors do not account for the interactive nature of multi-agent predictions, they may miss these important interaction outcomes. In this paper, we propose NashFormer, a framework for trajectory prediction that leverages game-… ▽ More

    Submitted 11 November, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: 8 pages, 6 figures

  6. arXiv:2210.00353  [pdf, other

    math.OC cs.MA cs.SI math.DS physics.soc-ph

    Sustained oscillations in multi-topic belief dynamics over signed networks

    Authors: Anastasia Bizyaeva, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We study the dynamics of belief formation on multiple interconnected topics in networks of agents with a shared belief system. We establish sufficient conditions and necessary conditions under which sustained oscillations of beliefs arise on the network in a Hopf bifurcation and characterize the role of the communication graph and the belief system graph in sha** the relative phase and amplitude… ▽ More

    Submitted 22 March, 2023; v1 submitted 1 October, 2022; originally announced October 2022.

    Comments: 6 pages, 6 figures, accepted for publication in the 2023 American Control Conference proceedings

  7. arXiv:2206.14893  [pdf, other

    math.DS cs.MA cs.SI math.OC

    Breaking indecision in multi-agent, multi-option dynamics

    Authors: Alessio Franci, Martin Golubitsky, Ian Stewart, Anastasia Bizyaeva, Naomi Ehrich Leonard

    Abstract: How does a group of agents break indecision when deciding about options with qualities that are hard to distinguish? Biological and artificial multi-agent systems, from honeybees and bird flocks to bacteria, robots, and humans, often need to overcome indecision when choosing among options in situations in which the performance or even the survival of the group are at stake. Breaking indecision is… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: 36 pages

  8. arXiv:2203.11703  [pdf, other

    math.OC eess.SY math.DS

    Switching transformations for decentralized control of opinion patterns in signed networks: application to dynamic task allocation

    Authors: Anastasia Bizyaeva, Giovanna Amorim, Maria Santos, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We propose a new decentralized design method to control opinion patterns on signed networks of agents making decisions about two options and to switch the network from any opinion pattern to a new desired one. Our method relies on switching transformations, which switch the sign of an agent's opinion at a stable equilibrium by flip** the sign of its local interactions with its neighbors. The glo… ▽ More

    Submitted 31 May, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

  9. arXiv:2201.13288  [pdf, other

    math.OC cs.LG stat.ML

    A Regret Minimization Approach to Multi-Agent Control

    Authors: Udaya Ghai, Udari Madhushani, Naomi Leonard, Elad Hazan

    Abstract: We study the problem of multi-agent control of a dynamical system with known dynamics and adversarial disturbances. Our study focuses on optimal control without centralized precomputed policies, but rather with adaptive control policies for the different agents that are only equipped with a stabilizing controller. We give a reduction from any (standard) regret minimizing control method to a distri… ▽ More

    Submitted 25 February, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:7422-7434, 2022

  10. arXiv:2110.07392  [pdf, other

    cs.LG cs.MA math.OC

    Provably Efficient Multi-Agent Reinforcement Learning with Fully Decentralized Communication

    Authors: Justin Lidard, Udari Madhushani, Naomi Ehrich Leonard

    Abstract: A challenge in reinforcement learning (RL) is minimizing the cost of sampling associated with exploration. Distributed exploration reduces sampling complexity in multi-agent RL (MARL). We investigate the benefits to performance in MARL when exploration is fully decentralized. Specifically, we consider a class of online, episodic, tabular $Q$-learning problems under time-varying reward and transiti… ▽ More

    Submitted 2 May, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted as a conference paper to American Control Conference (ACC) 2022

  11. arXiv:2108.00966  [pdf, other

    physics.soc-ph cs.MA cs.SI math.DS math.OC

    Tuning Cooperative Behavior in Games with Nonlinear Opinion Dynamics

    Authors: Shinkyu Park, Anastasia Bizyaeva, Mari Kawakatsu, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We examine the tuning of cooperative behavior in repeated multi-agent games using an analytically tractable, continuous-time, nonlinear model of opinion dynamics. Each modeled agent updates its real-valued opinion about each available strategy in response to payoffs and other agent opinions, as observed over a network. We show how the model provides a principled and systematic means to investigate… ▽ More

    Submitted 23 November, 2021; v1 submitted 2 August, 2021; originally announced August 2021.

  12. arXiv:2103.14764  [pdf, ps, other

    math.OC cs.MA eess.SY

    Control of Agreement and Disagreement Cascades with Distributed Inputs

    Authors: Anastasia Bizyaeva, Timothy Sorochkin, Alessio Franci, Naomi Ehrich Leonard

    Abstract: For a group of autonomous communicating agents, the ability to distinguish a meaningful input from disturbance, and come to collective agreement or disagreement in response to that input, is paramount for carrying out coordinated objectives. In this work we study how a cascade of opinion formation spreads through a group of networked decision-makers in response to a distributed input signal. Using… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: 7 pages, 4 figures

  13. arXiv:2103.12223  [pdf, other

    physics.soc-ph math.OC

    Analysis and control of agreement and disagreement opinion cascades

    Authors: Alessio Franci, Anastasia Bizyaeva, Shinkyu Park, Naomi Ehrich Leonard

    Abstract: We introduce and analyze a continuous time and state-space model of opinion cascades on networks of large numbers of agents that form opinions about two or more options. By leveraging our recent results on the emergence of agreement and disagreement states, we introduce novel tools to analyze and control agreement and disagreement opinion cascades. New notions of agreement and disagreement central… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

  14. arXiv:2011.07720  [pdf, other

    stat.ML cs.LG math.PR

    Distributed Bandits: Probabilistic Communication on $d$-regular Graphs

    Authors: Udari Madhushani, Naomi Ehrich Leonard

    Abstract: We study the decentralized multi-agent multi-armed bandit problem for agents that communicate with probability over a network defined by a $d$-regular graph. Every edge in the graph has probabilistic weight $p$ to account for the ($1\!-\!p$) probability of a communication link failure. At each time step, each agent chooses an arm and receives a numerical reward associated with the chosen arm. Afte… ▽ More

    Submitted 8 October, 2021; v1 submitted 15 November, 2020; originally announced November 2020.

  15. arXiv:2011.05927  [pdf, other

    cs.LG eess.SY math.OC

    On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension

    Authors: Udari Madhushani, Biswadip Dey, Naomi Ehrich Leonard, Amit Chakraborty

    Abstract: Value function based reinforcement learning (RL) algorithms, for example, $Q$-learning, learn optimal policies from datasets of actions, rewards, and state transitions. However, when the underlying state transition dynamics are stochastic and evolve on a high-dimensional space, generating independent and identically distributed (IID) data samples for creating these datasets poses a significant cha… ▽ More

    Submitted 28 March, 2022; v1 submitted 11 November, 2020; originally announced November 2020.

  16. arXiv:2009.13600  [pdf, other

    math.OC cs.SI eess.SY math.DS

    Patterns of Nonlinear Opinion Formation on Networks

    Authors: Anastasia Bizyaeva, Ayanna Matthews, Alessio Franci, Naomi Ehrich Leonard

    Abstract: When communicating agents form opinions about a set of possible options, agreement and disagreement are both possible outcomes. Depending on the context, either can be desirable or undesirable. We show that for nonlinear opinion dynamics on networks, and a variety of network structures, the spectral properties of the underlying adjacency matrix fully characterize the occurrence of either agreement… ▽ More

    Submitted 26 March, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: 6 pages, 4 figures; accepted to appear in 2021 American Control Conference proceedings

  17. arXiv:2009.04332  [pdf, other

    math.OC cs.SI eess.SY math.DS

    Nonlinear Opinion Dynamics with Tunable Sensitivity

    Authors: Anastasia Bizyaeva, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We propose a continuous-time multi-option nonlinear generalization of classical linear weighted-average opinion dynamics. Nonlinearity is introduced by saturating opinion exchanges, and this is enough to enable a significantly greater range of opinion-forming behaviors with our model as compared to existing linear and nonlinear models. For a group of agents that communicate opinions over a network… ▽ More

    Submitted 30 July, 2021; v1 submitted 9 September, 2020; originally announced September 2020.

  18. arXiv:2009.01339  [pdf, other

    math.OC stat.ML

    Heterogeneous Explore-Exploit Strategies on Multi-Star Networks

    Authors: Udari Madhushani, Naomi Leonard

    Abstract: We investigate the benefits of heterogeneity in multi-agent explore-exploit decision making where the goal of the agents is to maximize cumulative group reward. To do so we study a class of distributed stochastic bandit problems in which agents communicate over a multi-star network and make sequential choices among options in the same uncertain environment. Typically, in multi-agent bandit problem… ▽ More

    Submitted 1 December, 2020; v1 submitted 2 September, 2020; originally announced September 2020.

  19. arXiv:2008.04383  [pdf, other

    math.OC cs.MA math.DS

    Influence Spread in the Heterogeneous Multiplex Linear Threshold Model

    Authors: Yaofeng Desmond Zhong, Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: The linear threshold model (LTM) has been used to study spread on single-layer networks defined by one inter-agent sensing modality and agents homogeneous in protocol. We define and analyze the heterogeneous multiplex LTM to study spread on multi-layer networks with each layer representing a different sensing modality and agents heterogeneous in protocol. Protocols are designed to distinguish sign… ▽ More

    Submitted 10 August, 2020; originally announced August 2020.

  20. arXiv:2007.01424  [pdf, ps, other

    physics.soc-ph math.DS q-bio.PE

    Active Control and Sustained Oscillations in actSIS Epidemic Dynamics

    Authors: Yunxiu Zhou, Simon A. Levin, Naomi E. Leonard

    Abstract: An actively controlled Susceptible-Infected-Susceptible (actSIS) contagion model is presented for studying epidemic dynamics with continuous-time feedback control of infection rates. Our work is inspired by the observation that epidemics can be controlled through decentralized disease-control strategies such as quarantining, sheltering in place, social distancing, etc., where individuals actively… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

  21. arXiv:2004.06171  [pdf, other

    cs.LG math.OC stat.ML

    Distributed Learning: Sequential Decision Making in Resource-Constrained Environments

    Authors: Udari Madhushani, Naomi Ehrich Leonard

    Abstract: We study cost-effective communication strategies that can be used to improve the performance of distributed learning systems in resource-constrained environments. For distributed learning in sequential decision making, we propose a new cost-effective partial communication protocol. We illustrate that with this protocol the group obtains the same order of performance that it obtains with full commu… ▽ More

    Submitted 13 April, 2020; originally announced April 2020.

  22. arXiv:2004.03793  [pdf, other

    math.OC cs.LG

    A Dynamic Observation Strategy for Multi-agent Multi-armed Bandit Problem

    Authors: Udari Madhushani, Naomi Ehrich Leonard

    Abstract: We define and analyze a multi-agent multi-armed bandit problem in which decision-making agents can observe the choices and rewards of their neighbors under a linear observation cost. Neighbors are defined by a network graph that encodes the inherent observation constraints of the system. We define a cost associated with observations such that at every instance an agent makes an observation it rece… ▽ More

    Submitted 7 April, 2020; originally announced April 2020.

  23. arXiv:2003.01312  [pdf, other

    math.OC cs.LG stat.ML

    Distributed Cooperative Decision Making in Multi-agent Multi-armed Bandits

    Authors: Peter Landgren, Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: We study a distributed decision-making problem in which multiple agents face the same multi-armed bandit (MAB), and each agent makes sequential choices among arms to maximize its own individual reward. The agents cooperate by sharing their estimates over a fixed communication graph. We consider an unconstrained reward model in which two or more agents can choose the same arm and collect independen… ▽ More

    Submitted 11 August, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

  24. arXiv:1909.11852  [pdf, other

    math.OC eess.SY math.DS

    A Continuous Threshold Model of Cascade Dynamics

    Authors: Yaofeng Desmond Zhong, Naomi Ehrich Leonard

    Abstract: We present a continuous threshold model (CTM) of cascade dynamics for a network of agents with real-valued activity levels that change continuously in time. The model generalizes the linear threshold model (LTM) from the literature, where an agent becomes active (adopts an innovation) if the fraction of its neighbors that are active is above a threshold. With the CTM we study the influence on casc… ▽ More

    Submitted 25 September, 2019; originally announced September 2019.

  25. arXiv:1909.05765  [pdf, other

    math.OC math.DS physics.soc-ph q-bio.QM

    A model-independent theory of consensus and dissensus decision making

    Authors: Alessio Franci, Martin Golubitsky, Anastasia Bizyaeva, Naomi Ehrich Leonard

    Abstract: We develop a model-independent framework to study the dynamics of decision-making in opinion networks for an arbitrary number of agents and an arbitrary number of options. Model-independence means that the analysis is not performed on a specific set of equations, in contrast to classical approaches to decision making that fix a specific model and analyze it. Rather, the general features of decisio… ▽ More

    Submitted 8 September, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

  26. arXiv:1907.08829  [pdf, other

    math.OC cs.SI math.DS physics.soc-ph

    Adaptive Susceptibility and Heterogeneity in Contagion Models on Networks

    Authors: Renato Pagliara, Naomi E. Leonard

    Abstract: Contagious processes, such as spread of infectious diseases, social behaviors, or computer viruses, affect biological, social, and technological systems. Epidemic models for large populations and finite populations on networks have been used to understand and control both transient and steady-state behaviors. Typically it is assumed that after recovery from an infection, every agent will either re… ▽ More

    Submitted 11 April, 2020; v1 submitted 20 July, 2019; originally announced July 2019.

    Comments: 14 pages, 5 figures

  27. arXiv:1905.08731  [pdf, other

    math.OC cs.LG

    Heterogeneous Stochastic Interactions for Multiple Agents in a Multi-armed Bandit Problem

    Authors: Udari Madhushani, Naomi Ehrich Leonard

    Abstract: We define and analyze a multi-agent multi-armed bandit problem in which decision-making agents can observe the choices and rewards of their neighbors. Neighbors are defined by a network graph with heterogeneous and stochastic interconnections. These interactions are determined by the sociability of each agent, which corresponds to the probability that the agent observes its neighbors. We design an… ▽ More

    Submitted 21 May, 2019; originally announced May 2019.

  28. arXiv:1812.07117  [pdf, other

    physics.soc-ph eess.SY math.DS

    Social decision-making driven by artistic explore-exploit tension

    Authors: Kayhan Ozcimder, Biswadip Dey, Alessio Franci, Rebecca Lazier, Daniel Trueman, Naomi Ehrich Leonard

    Abstract: We studied social decision-making in the rule-based improvisational dance $There$ $Might$ $Be$ $Others$, where dancers make in-the-moment compositional choices. Rehearsals provided a natural test-bed with communication restricted to non-verbal cues. We observed a key artistic explore-exploit tension in which the dancers switched between exploitation of existing artistic opportunities and riskier e… ▽ More

    Submitted 17 December, 2018; originally announced December 2018.

    Journal ref: K. Ozcimder, B. Dey, A. Franci, R. Lazier, D. Trueman, and N. E. Leonard (2018): Social decision-making driven by artistic explore-exploit tension, Interdisciplinary Science Reviews

  29. Mixed mode oscillations and phase locking in coupled FitzHugh-Nagumo model neurons

    Authors: Elizabeth N. Davison, Zahra Aminzare, Biswadip Dey, Naomi Ehrich Leonard

    Abstract: We study the dynamics of a low-dimensional system of coupled model neurons as a step towards understanding the vastly complex network of neurons in the brain. We analyze the bifurcation structure of a system of two model neurons with unidirectional coupling as a function of two physiologically relevant parameters: the external current input only to the first neuron and the strength of the coupling… ▽ More

    Submitted 27 July, 2018; originally announced July 2018.

    MSC Class: 34A26; 34C15; 34C60; 34D15; 34E17; 37G05; 37G10; 92B20; 92C20

  30. arXiv:1711.11578  [pdf, other

    math.OC

    Multi-agent decision-making dynamics inspired by honeybees

    Authors: Rebecca Gray, Alessio Franci, Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: When choosing between candidate nest sites, a honeybee swarm reliably chooses the most valuable site and even when faced with the choice between near-equal value sites, it makes highly efficient decisions. Value-sensitive decision-making is enabled by a distributed social effort among the honeybees, and it leads to decision-making dynamics of the swarm that are remarkably robust to perturbation an… ▽ More

    Submitted 22 January, 2018; v1 submitted 30 November, 2017; originally announced November 2017.

  31. arXiv:1606.00911  [pdf, other

    eess.SY cs.LG math.OC

    Distributed Cooperative Decision-Making in Multiarmed Bandits: Frequentist and Bayesian Algorithms

    Authors: Peter Landgren, Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: We study distributed cooperative decision-making under the explore-exploit tradeoff in the multiarmed bandit (MAB) problem. We extend the state-of-the-art frequentist and Bayesian algorithms for single-agent MAB problems to cooperative distributed algorithms for multi-agent MAB problems in which agents communicate according to a fixed network graph. We rely on a running consensus algorithm for eac… ▽ More

    Submitted 17 September, 2019; v1 submitted 2 June, 2016; originally announced June 2016.

    Comments: This revision provides a correction to the original paper, which appeared in the Proceedings of the 2016 IEEE Conference on Decision and Control (CDC). The second statement of Proposition 1 and Theorem 1 are new from arXiv:1512.06888v3 and Lemma 1 is new. These are used to prove regret bounds in Theorems 2 and 3

  32. arXiv:1512.07638  [pdf, other

    cs.LG math.OC stat.ML

    Satisficing in multi-armed bandit problems

    Authors: Paul Reverdy, Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: Satisficing is a relaxation of maximizing and allows for less risky decision making in the face of uncertainty. We propose two sets of satisficing objectives for the multi-armed bandit problem, where the objective is to achieve reward-based decision-making performance above a given threshold. We show that these new problems are equivalent to various standard multi-armed bandit problems with maximi… ▽ More

    Submitted 19 December, 2016; v1 submitted 23 December, 2015; originally announced December 2015.

    Comments: To appear in IEEE Transactions on Automatic Control

  33. arXiv:1512.06888  [pdf, other

    eess.SY cs.MA math.OC stat.ML

    On Distributed Cooperative Decision-Making in Multiarmed Bandits

    Authors: Peter Landgren, Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: We study the explore-exploit tradeoff in distributed cooperative decision-making using the context of the multiarmed bandit (MAB) problem. For the distributed cooperative MAB problem, we design the cooperative UCB algorithm that comprises two interleaved distributed processes: (i) running consensus algorithms for estimation of rewards, and (ii) upper-confidence-bound-based heuristics for selection… ▽ More

    Submitted 16 September, 2019; v1 submitted 21 December, 2015; originally announced December 2015.

    Comments: This revision provides a correction to the original paper, which appeared in the Proceedings of the 2016 European Control Conference (ECC). The second statement of Proposition 1, Theorem 1 and their proofs are new. The new Theorem 1 is used to prove the regret bounds in Theorem 2

  34. arXiv:1508.03373  [pdf, other

    math.PR math.OC q-bio.NC q-fin.MF

    A martingale analysis of first passage times of time-dependent Wiener diffusion models

    Authors: Vaibhav Srivastava, Samuel F. Feng, Jonathan D. Cohen, Naomi Ehrich Leonard, Amitai Shenhav

    Abstract: Research in psychology and neuroscience has successfully modeled decision making as a process of noisy evidence accumulation to a decision bound. While there are several variants and implementations of this idea, the majority of these models make use of a noisy accumulation between two absorbing boundaries. A common assumption of these models is that decision parameters, e.g., the rate of accumula… ▽ More

    Submitted 30 September, 2016; v1 submitted 13 August, 2015; originally announced August 2015.

  35. arXiv:1507.01160  [pdf, other

    math.OC cs.LG stat.ML

    Correlated Multiarmed Bandit Problem: Bayesian Algorithms and Regret Analysis

    Authors: Vaibhav Srivastava, Paul Reverdy, Naomi Ehrich Leonard

    Abstract: We consider the correlated multiarmed bandit (MAB) problem in which the rewards associated with each arm are modeled by a multivariate Gaussian random variable, and we investigate the influence of the assumptions in the Bayesian prior on the performance of the upper credible limit (UCL) algorithm and a new correlated UCL algorithm. We rigorously characterize the influence of accuracy, confidence,… ▽ More

    Submitted 7 July, 2015; v1 submitted 4 July, 2015; originally announced July 2015.

  36. arXiv:1503.08526  [pdf, other

    math.OC

    A Realization Theory for Bio-inspired Collective Decision-Making

    Authors: Alessio Franci, Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: The collective decision-making exhibited by animal groups provides enormous inspiration for multi-agent control system design as it embodies several features that are desirable in engineered networks, including robustness and adaptability, low computational effort, and an intrinsically decentralized architecture. However, many of the mechanistic models for collective decision-making are described… ▽ More

    Submitted 30 November, 2017; v1 submitted 29 March, 2015; originally announced March 2015.

  37. arXiv:1502.04635  [pdf, other

    math.OC cs.LG stat.ML

    Parameter estimation in softmax decision-making models with linear objective functions

    Authors: Paul Reverdy, Naomi E. Leonard

    Abstract: With an eye towards human-centered automation, we contribute to the development of a systematic means to infer features of human decision-making from behavioral data. Motivated by the common use of softmax selection in models of human decision-making, we study the maximum likelihood parameter estimation problem for softmax decision-making models with linear objective functions. We present conditio… ▽ More

    Submitted 29 August, 2015; v1 submitted 16 February, 2015; originally announced February 2015.

    Comments: In press

    MSC Class: 93E10

  38. arXiv:1407.1569  [pdf, other

    math.OC eess.SY

    Joint Centrality Distinguishes Optimal Leaders in Noisy Networks

    Authors: Katherine E. Fitch, Naomi Ehrich Leonard

    Abstract: We study the performance of a network of agents tasked with tracking an external unknown signal in the presence of stochastic disturbances and under the condition that only a limited subset of agents, known as leaders, can measure the signal directly. We investigate the optimal leader selection problem for a prescribed maximum number of leaders, where the optimal leader set minimizes total system… ▽ More

    Submitted 15 June, 2015; v1 submitted 6 July, 2014; originally announced July 2014.

    Comments: Conditionally accepted to IEEE TCNS

  39. arXiv:1402.3634  [pdf, other

    math.OC cs.MA eess.SY

    Collective Decision-Making in Ideal Networks: The Speed-Accuracy Tradeoff

    Authors: Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: We study collective decision-making in a model of human groups, with network interactions, performing two alternative choice tasks. We focus on the speed-accuracy tradeoff, i.e., the tradeoff between a quick decision and a reliable decision, for individuals in the network. We model the evidence aggregation process across the network using a coupled drift diffusion model (DDM) and consider the free… ▽ More

    Submitted 14 February, 2014; originally announced February 2014.

    Comments: to appear in IEEE TCNS

  40. arXiv:1310.5168  [pdf, other

    math.OC eess.SY

    A New Notion of Effective Resistance for Directed Graphs-Part II: Computing Resistances

    Authors: George Forrest Young, Luca Scardovi, Naomi Ehrich Leonard

    Abstract: In Part I of this work we defined a generalization of the concept of effective resistance to directed graphs, and we explored some of the properties of this new definition. Here, we use the theory developed in Part I to compute effective resistances in some prototypical directed graphs. This exploration highlights cases where our notion of effective resistance for directed graphs behaves analogous… ▽ More

    Submitted 21 October, 2013; v1 submitted 18 October, 2013; originally announced October 2013.

  41. arXiv:1310.5163  [pdf, other

    math.OC eess.SY

    A New Notion of Effective Resistance for Directed Graphs-Part I: Definition and Properties

    Authors: George Forrest Young, Luca Scardovi, Naomi Ehrich Leonard

    Abstract: The graphical notion of effective resistance has found wide-ranging applications in many areas of pure mathematics, applied mathematics and control theory. By the nature of its construction, effective resistance can only be computed in undirected graphs and yet in several areas of its application, directed graphs arise as naturally (or more naturally) than undirected ones. In part I of this work,… ▽ More

    Submitted 21 October, 2013; v1 submitted 18 October, 2013; originally announced October 2013.

  42. arXiv:1310.4188  [pdf, other

    math.OC eess.SY

    Nonuniform Line Coverage from Noisy Scalar Measurements

    Authors: P. Davison, N. E. Leonard, A. Olshevsky, M. Schwemmer

    Abstract: We study the problem of distributed coverage control in a network of mobile agents arranged on a line. The goal is to design distributed dynamics for the agents to achieve optimal coverage positions with respect to a scalar density field that measures the relative importance of each point on the line. Unlike previous work, which has implicitly assumed the agents know this density field, we only as… ▽ More

    Submitted 21 November, 2014; v1 submitted 15 October, 2013; originally announced October 2013.

  43. arXiv:1307.6134  [pdf, other

    cs.LG math.OC stat.ML

    Modeling Human Decision-making in Generalized Gaussian Multi-armed Bandits

    Authors: Paul Reverdy, Vaibhav Srivastava, Naomi E. Leonard

    Abstract: We present a formal model of human decision-making in explore-exploit tasks using the context of multi-armed bandit problems, where the decision-maker must choose among multiple options with uncertain rewards. We address the standard multi-armed bandit problem, the multi-armed bandit problem with transition costs, and the multi-armed bandit problem on graphs. We focus on the case of Gaussian rewar… ▽ More

    Submitted 20 December, 2019; v1 submitted 23 July, 2013; originally announced July 2013.

    Comments: 25 pages. Appendix G included in this version details minor modifications that correct for an oversight in the previously-published proofs. The remainder of the text reflects the previously-published version

    Journal ref: Proceedings of the IEEE, vol. 102, iss. 4, p. 544-571, 2014

  44. arXiv:1210.4235  [pdf, ps, other

    eess.SY math.OC

    Node Classification in Networks of Stochastic Evidence Accumulators

    Authors: Ioannis Poulakakis, Luca Scardovi, Naomi Ehrich Leonard

    Abstract: This paper considers a network of stochastic evidence accumulators, each represented by a drift-diffusion model accruing evidence towards a decision in continuous time by observing a noisy signal and by exchanging information with other units according to a fixed communication graph. We bring into focus the relationship between the location of each unit in the communication graph and its certainty… ▽ More

    Submitted 15 October, 2012; originally announced October 2012.

    Comments: 32 pages

  45. arXiv:1209.2194  [pdf, ps, other

    math.OC cs.LG cs.MA eess.SY

    Cooperative learning in multi-agent systems from intermittent measurements

    Authors: Naomi Ehrich Leonard, Alex Olshevsky

    Abstract: Motivated by the problem of tracking a direction in a decentralized way, we consider the general problem of cooperative learning in multi-agent systems with time-varying connectivity and intermittent measurements. We propose a distributed learning protocol capable of learning an unknown vector $μ$ from noisy measurements made independently by autonomous nodes. Our protocol is completely distribute… ▽ More

    Submitted 15 December, 2014; v1 submitted 10 September, 2012; originally announced September 2012.

  46. arXiv:1105.2541  [pdf, other

    math.OC eess.SY

    Rearranging trees for robust consensus

    Authors: George Forrest Young, Luca Scardovi, Naomi Ehrich Leonard

    Abstract: In this paper, we use the H2 norm associated with a communication graph to characterize the robustness of consensus to noise. In particular, we restrict our attention to trees and by systematic attention to the effect of local changes in topology, we derive a partial ordering for undirected trees according to the H2 norm. Our approach for undirected trees provides a constructive method for derivin… ▽ More

    Submitted 20 June, 2011; v1 submitted 12 May, 2011; originally announced May 2011.

    Comments: Submitted to CDC 2011

  47. arXiv:1104.0457  [pdf, other

    math.OC eess.SY

    Nonuniform Coverage Control on the Line

    Authors: Naomi Ehrich Leonard, Alex Olshevsky

    Abstract: This paper investigates control laws allowing mobile, autonomous agents to optimally position themselves on the line for distributed sensing in a nonuniform field. We show that a simple static control law, based only on local measurements of the field by each agent, drives the agents close to the optimal positions after the agents execute in parallel a number of sensing/movement/computation rounds… ▽ More

    Submitted 7 November, 2012; v1 submitted 3 April, 2011; originally announced April 2011.

  48. arXiv:0902.3710  [pdf, other

    math.OC

    Tensegrity Models and Shape Control of Vehicle Formations

    Authors: Benjamin Nabet, Naomi Ehrich Leonard

    Abstract: Using dynamic models of tensegrity structures, we derive provable, distributed control laws for stabilizing and changing the shape of a formation of vehicles in the plane. Tensegrity models define the desired, controlled, multi-vehicle system dynamics, where each node in the tensegrity structure maps to a vehicle and each interconnecting strut or cable in the structure maps to a virtual intercon… ▽ More

    Submitted 23 February, 2009; originally announced February 2009.

    Comments: 31 pages, 6 figures, Submitted

  49. arXiv:0806.3442  [pdf, other

    math.OC

    Stabilization of Three-Dimensional Collective Motion

    Authors: Luca Scardovi, Naomi Leonard, Rodolphe Sepulchre

    Abstract: This paper proposes a methodology to stabilize relative equilibria in a model of identical, steered particles moving in three-dimensional Euclidean space. Exploiting the Lie group structure of the resulting dynamical system, the stabilization problem is reduced to a consensus problem on the Lie algebra. The resulting equilibria correspond to parallel, circular and helical formations. We first de… ▽ More

    Submitted 21 June, 2008; v1 submitted 20 June, 2008; originally announced June 2008.

    Comments: 15 pages, 4 figures, Submitted

  50. arXiv:math/0205017  [pdf, ps, other

    math.OC

    Singular trajectories in multi-input time-optimal problems: Application to controlled mechanical systems

    Authors: M. Chyba, N. E. Leonard, E. D. Sontag

    Abstract: This paper addresses the time-optimal control problem for a class of control systems which includes controlled mechanical systems with possible dissipation terms. The Lie algebras associated with such mechanical systems enjoy certain special properties. These properties are explored and are used in conjunction with the Pontryagin maximum principle to determine the structure of singular extremals… ▽ More

    Submitted 2 May, 2002; originally announced May 2002.

    Comments: See http://www.math.rutgers.edu/~sontag for related work

    MSC Class: 93B05;57R27;37N05