Showing 1–37 of 37 results for author: Leonard, N E

Searching in archive cs.
  1. arXiv:2406.09810  [pdf, other]

    cs.RO eess.SY

    Think Deep and Fast: Learning Neural Nonlinear Opinion Dynamics from Inverse Dynamic Games for Split-Second Interactions

    Authors: Haimin Hu, Jonathan DeCastro, Deepak Gopinath, Guy Rosman, Naomi Ehrich Leonard, Jaime Fernández Fisac

    Abstract: Non-cooperative interactions commonly occur in multi-agent scenarios such as car racing, where an ego vehicle can choose to overtake the rival, or stay behind it until a safe overtaking "corridor" opens. While an expert human can do well at making such time-sensitive decisions, the development of safe and efficient game-theoretic trajectory planners capable of rapidly reasoning discrete options is…

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2405.20593  [pdf, other]

    eess.SY cs.RO

    Excitable crawling

    Authors: Juncal Arbelaiz, Alessio Franci, Naomi Ehrich Leonard, Rodolphe Sepulchre, Bassam Bamieh

    Abstract: We propose and analyze the suitability of a spiking controller to engineer the locomotion of a soft robotic crawler. Inspired by the FitzHugh-Nagumo model of neural excitability, we design a bistable controller with an electrical flipflop circuit representation capable of generating spikes on-demand when coupled to the passive crawler mechanics. A proprioceptive sensory signal from the crawler mec…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 5 pages, MTNS 2024 extended abstract

  3. arXiv:2312.06395  [pdf, other]

    cs.RO math.DS math.OC

    Threshold Decision-Making Dynamics Adaptive to Physical Constraints and Changing Environment

    Authors: Giovanna Amorim, María Santos, Shinkyu Park, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We propose a threshold decision-making framework for controlling the physical dynamics of an agent switching between two spatial tasks. Our framework couples a nonlinear opinion dynamics model that represents the evolution of an agent's preference for a particular task with the physical dynamics of the agent. We prove the bifurcation that governs the behavior of the coupled dynamics. We show by me…

    Submitted 7 June, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  4. arXiv:2308.14666  [pdf, other]

    cs.CV cs.CE cs.LG

    Learning to Predict 3D Rotational Dynamics from Images of a Rigid Body with Unknown Mass Distribution

    Authors: Justice Mason, Christine Allen-Blanchette, Nicholas Zolman, Elizabeth Davison, Naomi Ehrich Leonard

    Abstract: In many real-world settings, image observations of freely rotating 3D rigid bodies may be available when low-dimensional measurements are not. However, the high-dimensionality of image data precludes the use of classical estimation techniques to learn the dynamics. The usefulness of standard deep learning methods is also limited, because an image of a rigid body reveals nothing about the distribut…

    Submitted 10 April, 2024; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: Previously appeared as arXiv:2209.11355v2, which was submitted as a replacement by accident. arXiv admin note: text overlap with arXiv:2209.11355

  5. arXiv:2308.02755  [pdf, other]

    physics.soc-ph cs.MA cs.SI math.DS math.OC

    Multi-topic belief formation through bifurcations over signed social networks

    Authors: Anastasia Bizyaeva, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We propose and analyze a nonlinear dynamic model of continuous-time multi-dimensional belief formation over signed social networks. Our model accounts for the effects of a structured belief system, self-appraisal, internal biases, and various sources of cognitive dissonance posited by recent theories in social psychology. We prove that strong beliefs emerge on the network as a consequence of a bif…

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: 16 pages, 7 figures

  6. arXiv:2304.02687  [pdf, other]

    eess.SY cs.RO

    Emergent Coordination through Game-Induced Nonlinear Opinion Dynamics

    Authors: Haimin Hu, Kensuke Nakamura, Kai-Chieh Hsu, Naomi Ehrich Leonard, Jaime Fernández Fisac

    Abstract: We present a multi-agent decision-making framework for the emergent coordination of autonomous agents whose intents are initially undecided. Dynamic non-cooperative games have been used to encode multi-agent interaction, but ambiguity arising from factors such as goal preference or the presence of multiple equilibria may lead to coordination issues, ranging from the "freezing robot" problem to uns…

    Submitted 5 April, 2023; originally announced April 2023.

  7. arXiv:2210.01642  [pdf, other]

    cs.RO

    Proactive Opinion-Driven Robot Navigation around Human Movers

    Authors: Charlotte Cathcart, María Santos, Shinkyu Park, Naomi Ehrich Leonard

    Abstract: We propose, analyze, and experimentally verify a new proactive approach for robot social navigation driven by the robot's "opinion" for which way and by how much to pass human movers crossing its path. The robot forms an opinion over time according to nonlinear dynamics that depend on the robot's observations of human movers and its level of attention to these social cues. For these dynamics, it i…

    Submitted 11 September, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: 8 pages, 7 figures

  8. arXiv:2210.00353  [pdf, other]

    math.OC cs.MA cs.SI math.DS physics.soc-ph

    Sustained oscillations in multi-topic belief dynamics over signed networks

    Authors: Anastasia Bizyaeva, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We study the dynamics of belief formation on multiple interconnected topics in networks of agents with a shared belief system. We establish sufficient conditions and necessary conditions under which sustained oscillations of beliefs arise on the network in a Hopf bifurcation and characterize the role of the communication graph and the belief system graph in shaping the relative phase and amplitude…

    Submitted 22 March, 2023; v1 submitted 1 October, 2022; originally announced October 2022.

    Comments: 6 pages, 6 figures, accepted for publication in the 2023 American Control Conference proceedings

  9. arXiv:2208.01800  [pdf, other]

    cs.RO cs.MA

    Decentralized Learning With Limited Communications for Multi-robot Coverage of Unknown Spatial Fields

    Authors: Kensuke Nakamura, María Santos, Naomi Ehrich Leonard

    Abstract: This paper presents an algorithm for a team of mobile robots to simultaneously learn a spatial field over a domain and spatially distribute themselves to optimally cover it. Drawing from previous approaches that estimate the spatial field through a centralized Gaussian process, this work leverages the spatial structure of the coverage problem and presents a decentralized strategy where samples are…

    Submitted 2 August, 2022; originally announced August 2022.

    Comments: Accepted IROS 2022

  10. arXiv:2206.14893  [pdf, other]

    math.DS cs.MA cs.SI math.OC

    Breaking indecision in multi-agent, multi-option dynamics

    Authors: Alessio Franci, Martin Golubitsky, Ian Stewart, Anastasia Bizyaeva, Naomi Ehrich Leonard

    Abstract: How does a group of agents break indecision when deciding about options with qualities that are hard to distinguish? Biological and artificial multi-agent systems, from honeybees and bird flocks to bacteria, robots, and humans, often need to overcome indecision when choosing among options in situations in which the performance or even the survival of the group are at stake. Breaking indecision is…

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: 36 pages

  11. arXiv:2111.12482  [pdf, other]

    stat.ML cs.LG

    One More Step Towards Reality: Cooperative Bandits with Imperfect Communication

    Authors: Udari Madhushani, Abhimanyu Dubey, Naomi Ehrich Leonard, Alex Pentland

    Abstract: The cooperative bandit problem is increasingly becoming relevant due to its applications in large-scale decision-making. However, most research for this problem focuses exclusively on the setting with perfect communication, whereas in most real-world distributed settings, communication is often over stochastic networks, with arbitrary corruptions and delays. In this paper, we study cooperative ban…

    Submitted 24 November, 2021; originally announced November 2021.

    Journal ref: Conference on Neural Information Processing Systems, 2021

  12. arXiv:2110.07392  [pdf, other]

    cs.LG cs.MA math.OC

    Provably Efficient Multi-Agent Reinforcement Learning with Fully Decentralized Communication

    Authors: Justin Lidard, Udari Madhushani, Naomi Ehrich Leonard

    Abstract: A challenge in reinforcement learning (RL) is minimizing the cost of sampling associated with exploration. Distributed exploration reduces sampling complexity in multi-agent RL (MARL). We investigate the benefits to performance in MARL when exploration is fully decentralized. Specifically, we consider a class of online, episodic, tabular $Q$-learning problems under time-varying reward and transiti…

    Submitted 2 May, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: Accepted as a conference paper to American Control Conference (ACC) 2022

  13. arXiv:2108.00966  [pdf, other]

    physics.soc-ph cs.MA cs.SI math.DS math.OC

    Tuning Cooperative Behavior in Games with Nonlinear Opinion Dynamics

    Authors: Shinkyu Park, Anastasia Bizyaeva, Mari Kawakatsu, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We examine the tuning of cooperative behavior in repeated multi-agent games using an analytically tractable, continuous-time, nonlinear model of opinion dynamics. Each modeled agent updates its real-valued opinion about each available strategy in response to payoffs and other agent opinions, as observed over a network. We show how the model provides a principled and systematic means to investigate…

    Submitted 23 November, 2021; v1 submitted 2 August, 2021; originally announced August 2021.

  14. arXiv:2103.14764  [pdf, ps, other]

    math.OC cs.MA eess.SY

    Control of Agreement and Disagreement Cascades with Distributed Inputs

    Authors: Anastasia Bizyaeva, Timothy Sorochkin, Alessio Franci, Naomi Ehrich Leonard

    Abstract: For a group of autonomous communicating agents, the ability to distinguish a meaningful input from disturbance, and come to collective agreement or disagreement in response to that input, is paramount for carrying out coordinated objectives. In this work we study how a cascade of opinion formation spreads through a group of networked decision-makers in response to a distributed input signal. Using…

    Submitted 26 March, 2021; originally announced March 2021.

    Comments: 7 pages, 4 figures

  15. arXiv:2011.07720  [pdf, other]

    stat.ML cs.LG math.PR

    Distributed Bandits: Probabilistic Communication on $d$-regular Graphs

    Authors: Udari Madhushani, Naomi Ehrich Leonard

    Abstract: We study the decentralized multi-agent multi-armed bandit problem for agents that communicate with probability over a network defined by a $d$-regular graph. Every edge in the graph has probabilistic weight $p$ to account for the ($1\!-\!p$) probability of a communication link failure. At each time step, each agent chooses an arm and receives a numerical reward associated with the chosen arm. Afte…

    Submitted 8 October, 2021; v1 submitted 15 November, 2020; originally announced November 2020.

  16. arXiv:2011.05927  [pdf, other]

    cs.LG eess.SY math.OC

    On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension

    Authors: Udari Madhushani, Biswadip Dey, Naomi Ehrich Leonard, Amit Chakraborty

    Abstract: Value function based reinforcement learning (RL) algorithms, for example, $Q$-learning, learn optimal policies from datasets of actions, rewards, and state transitions. However, when the underlying state transition dynamics are stochastic and evolve on a high-dimensional space, generating independent and identically distributed (IID) data samples for creating these datasets poses a significant cha…

    Submitted 28 March, 2022; v1 submitted 11 November, 2020; originally announced November 2020.

  17. arXiv:2010.12932  [pdf, other]

    cs.LG cs.CV

    LagNetViP: A Lagrangian Neural Network for Video Prediction

    Authors: Christine Allen-Blanchette, Sushant Veer, Anirudha Majumdar, Naomi Ehrich Leonard

    Abstract: The dominant paradigms for video prediction rely on opaque transition models where neither the equations of motion nor the underlying physical quantities of the system are easily inferred. The equations of motion, as defined by Newton's second law, describe the time evolution of a physical system state and can therefore be applied toward the determination of future system states. In this paper, we…

    Submitted 24 October, 2020; originally announced October 2020.

  18. arXiv:2009.13600  [pdf, other]

    math.OC cs.SI eess.SY math.DS

    Patterns of Nonlinear Opinion Formation on Networks

    Authors: Anastasia Bizyaeva, Ayanna Matthews, Alessio Franci, Naomi Ehrich Leonard

    Abstract: When communicating agents form opinions about a set of possible options, agreement and disagreement are both possible outcomes. Depending on the context, either can be desirable or undesirable. We show that for nonlinear opinion dynamics on networks, and a variety of network structures, the spectral properties of the underlying adjacency matrix fully characterize the occurrence of either agreement…

    Submitted 26 March, 2021; v1 submitted 28 September, 2020; originally announced September 2020.

    Comments: 6 pages, 4 figures; accepted to appear in 2021 American Control Conference proceedings

  19. arXiv:2009.04332  [pdf, other]

    math.OC cs.SI eess.SY math.DS

    Nonlinear Opinion Dynamics with Tunable Sensitivity

    Authors: Anastasia Bizyaeva, Alessio Franci, Naomi Ehrich Leonard

    Abstract: We propose a continuous-time multi-option nonlinear generalization of classical linear weighted-average opinion dynamics. Nonlinearity is introduced by saturating opinion exchanges, and this is enough to enable a significantly greater range of opinion-forming behaviors with our model as compared to existing linear and nonlinear models. For a group of agents that communicate opinions over a network…

    Submitted 30 July, 2021; v1 submitted 9 September, 2020; originally announced September 2020.

  20. arXiv:2008.04383  [pdf, other]

    math.OC cs.MA math.DS

    Influence Spread in the Heterogeneous Multiplex Linear Threshold Model

    Authors: Yaofeng Desmond Zhong, Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: The linear threshold model (LTM) has been used to study spread on single-layer networks defined by one inter-agent sensing modality and agents homogeneous in protocol. We define and analyze the heterogeneous multiplex LTM to study spread on multi-layer networks with each layer representing a different sensing modality and agents heterogeneous in protocol. Protocols are designed to distinguish sign…

    Submitted 10 August, 2020; originally announced August 2020.

  21. arXiv:2007.01926  [pdf, other]

    cs.LG eess.SY stat.ML

    Unsupervised Learning of Lagrangian Dynamics from Images for Prediction and Control

    Authors: Yaofeng Desmond Zhong, Naomi Ehrich Leonard

    Abstract: Recent approaches for modelling dynamics of physical systems with neural networks enforce Lagrangian or Hamiltonian structure to improve prediction and generalization. However, when coordinates are embedded in high-dimensional data such as images, these approaches either lose interpretability or can only be applied to one particular example. We introduce a new unsupervised neural network model tha…

    Submitted 31 August, 2022; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: This version corrects an error in Equation (3) of the 2020 NeurIPS Proceedings paper

  22. arXiv:2004.06171  [pdf, other]

    cs.LG math.OC stat.ML

    Distributed Learning: Sequential Decision Making in Resource-Constrained Environments

    Authors: Udari Madhushani, Naomi Ehrich Leonard

    Abstract: We study cost-effective communication strategies that can be used to improve the performance of distributed learning systems in resource-constrained environments. For distributed learning in sequential decision making, we propose a new cost-effective partial communication protocol. We illustrate that with this protocol the group obtains the same order of performance that it obtains with full commu…

    Submitted 13 April, 2020; originally announced April 2020.

  23. arXiv:2004.03793  [pdf, other]

    math.OC cs.LG

    A Dynamic Observation Strategy for Multi-agent Multi-armed Bandit Problem

    Authors: Udari Madhushani, Naomi Ehrich Leonard

    Abstract: We define and analyze a multi-agent multi-armed bandit problem in which decision-making agents can observe the choices and rewards of their neighbors under a linear observation cost. Neighbors are defined by a network graph that encodes the inherent observation constraints of the system. We define a cost associated with observations such that at every instance an agent makes an observation it rece…

    Submitted 7 April, 2020; originally announced April 2020.

  24. arXiv:2003.01312  [pdf, other]

    math.OC cs.LG stat.ML

    Distributed Cooperative Decision Making in Multi-agent Multi-armed Bandits

    Authors: Peter Landgren, Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: We study a distributed decision-making problem in which multiple agents face the same multi-armed bandit (MAB), and each agent makes sequential choices among arms to maximize its own individual reward. The agents cooperate by sharing their estimates over a fixed communication graph. We consider an unconstrained reward model in which two or more agents can choose the same arm and collect independen…

    Submitted 11 August, 2020; v1 submitted 2 March, 2020; originally announced March 2020.

  25. arXiv:1907.08829  [pdf, other]

    math.OC cs.SI math.DS physics.soc-ph

    Adaptive Susceptibility and Heterogeneity in Contagion Models on Networks

    Authors: Renato Pagliara, Naomi E. Leonard

    Abstract: Contagious processes, such as spread of infectious diseases, social behaviors, or computer viruses, affect biological, social, and technological systems. Epidemic models for large populations and finite populations on networks have been used to understand and control both transient and steady-state behaviors. Typically it is assumed that after recovery from an infection, every agent will either re…

    Submitted 11 April, 2020; v1 submitted 20 July, 2019; originally announced July 2019.

    Comments: 14 pages, 5 figures

  26. arXiv:1905.08731  [pdf, other]

    math.OC cs.LG

    Heterogeneous Stochastic Interactions for Multiple Agents in a Multi-armed Bandit Problem

    Authors: Udari Madhushani, Naomi Ehrich Leonard

    Abstract: We define and analyze a multi-agent multi-armed bandit problem in which decision-making agents can observe the choices and rewards of their neighbors. Neighbors are defined by a network graph with heterogeneous and stochastic interconnections. These interactions are determined by the sociability of each agent, which corresponds to the probability that the agent observes its neighbors. We design an…

    Submitted 21 May, 2019; originally announced May 2019.

  27. arXiv:1808.07842  [pdf, other]

    physics.soc-ph cs.SI

    In the Dance Studio: An Art and Engineering Exploration of Human Flocking

    Authors: Naomi E. Leonard, George F. Young, Kelsey Hochgraf, Daniel T. Swain, Aaron Trippe, Willa Chen, Katherine Fitch, Susan Marshall

    Abstract: Flock Logic was developed as an art and engineering project to explore how the feedback laws used to model flocking translate when applied by dancers. The artistic goal was to create choreographic tools that leverage multi-agent system dynamics with designed feedback and interaction. The engineering goal was to provide insights and design principles for multi-agent systems, such as human crowds, a…

    Submitted 22 August, 2018; originally announced August 2018.

    Journal ref: Leonard N.E. et al. (2014) In the Dance Studio: An Art and Engineering Exploration of Human Flocking. In: LaViers A., Egerstedt M. (eds) Controls and Art. Springer, Cham

  28. arXiv:1710.00450  [pdf, other]

    cs.LG

    Asymptotic Allocation Rules for a Class of Dynamic Multi-armed Bandit Problems

    Authors: T. W. U. Madhushani, D. H. S. Maithripala, N. E. Leonard

    Abstract: This paper presents a class of Dynamic Multi-Armed Bandit problems where the reward can be modeled as the noisy output of a time varying linear stochastic dynamic system that satisfies some boundedness constraints. The class allows many seemingly different problems with time varying option characteristics to be considered in a single framework. It also opens up the possibility of considering many…

    Submitted 7 October, 2017; v1 submitted 1 October, 2017; originally announced October 2017.

    Comments: Pre-print submitted to 2018 American Control Conference

    MSC Class: 60-01

  29. arXiv:1606.00911  [pdf, other]

    eess.SY cs.LG math.OC

    Distributed Cooperative Decision-Making in Multiarmed Bandits: Frequentist and Bayesian Algorithms

    Authors: Peter Landgren, Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: We study distributed cooperative decision-making under the explore-exploit tradeoff in the multiarmed bandit (MAB) problem. We extend the state-of-the-art frequentist and Bayesian algorithms for single-agent MAB problems to cooperative distributed algorithms for multi-agent MAB problems in which agents communicate according to a fixed network graph. We rely on a running consensus algorithm for eac…

    Submitted 17 September, 2019; v1 submitted 2 June, 2016; originally announced June 2016.

    Comments: This revision provides a correction to the original paper, which appeared in the Proceedings of the 2016 IEEE Conference on Decision and Control (CDC). The second statement of Proposition 1 and Theorem 1 are new from arXiv:1512.06888v3 and Lemma 1 is new. These are used to prove regret bounds in Theorems 2 and 3

  30. arXiv:1512.07638  [pdf, other]

    cs.LG math.OC stat.ML

    Satisficing in multi-armed bandit problems

    Authors: Paul Reverdy, Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: Satisficing is a relaxation of maximizing and allows for less risky decision making in the face of uncertainty. We propose two sets of satisficing objectives for the multi-armed bandit problem, where the objective is to achieve reward-based decision-making performance above a given threshold. We show that these new problems are equivalent to various standard multi-armed bandit problems with maximi…

    Submitted 19 December, 2016; v1 submitted 23 December, 2015; originally announced December 2015.

    Comments: To appear in IEEE Transactions on Automatic Control

  31. arXiv:1512.06888  [pdf, other]

    eess.SY cs.MA math.OC stat.ML

    On Distributed Cooperative Decision-Making in Multiarmed Bandits

    Authors: Peter Landgren, Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: We study the explore-exploit tradeoff in distributed cooperative decision-making using the context of the multiarmed bandit (MAB) problem. For the distributed cooperative MAB problem, we design the cooperative UCB algorithm that comprises two interleaved distributed processes: (i) running consensus algorithms for estimation of rewards, and (ii) upper-confidence-bound-based heuristics for selection…

    Submitted 16 September, 2019; v1 submitted 21 December, 2015; originally announced December 2015.

    Comments: This revision provides a correction to the original paper, which appeared in the Proceedings of the 2016 European Control Conference (ECC). The second statement of Proposition 1, Theorem 1 and their proofs are new. The new Theorem 1 is used to prove the regret bounds in Theorem 2

  32. arXiv:1507.01160  [pdf, other]

    math.OC cs.LG stat.ML

    Correlated Multiarmed Bandit Problem: Bayesian Algorithms and Regret Analysis

    Authors: Vaibhav Srivastava, Paul Reverdy, Naomi Ehrich Leonard

    Abstract: We consider the correlated multiarmed bandit (MAB) problem in which the rewards associated with each arm are modeled by a multivariate Gaussian random variable, and we investigate the influence of the assumptions in the Bayesian prior on the performance of the upper credible limit (UCL) algorithm and a new correlated UCL algorithm. We rigorously characterize the influence of accuracy, confidence,…

    Submitted 7 July, 2015; v1 submitted 4 July, 2015; originally announced July 2015.

  33. arXiv:1502.04635  [pdf, other]

    math.OC cs.LG stat.ML

    Parameter estimation in softmax decision-making models with linear objective functions

    Authors: Paul Reverdy, Naomi E. Leonard

    Abstract: With an eye towards human-centered automation, we contribute to the development of a systematic means to infer features of human decision-making from behavioral data. Motivated by the common use of softmax selection in models of human decision-making, we study the maximum likelihood parameter estimation problem for softmax decision-making models with linear objective functions. We present conditio…

    Submitted 29 August, 2015; v1 submitted 16 February, 2015; originally announced February 2015.

    Comments: In press

    MSC Class: 93E10

  34. arXiv:1402.3634  [pdf, other]

    math.OC cs.MA eess.SY

    Collective Decision-Making in Ideal Networks: The Speed-Accuracy Tradeoff

    Authors: Vaibhav Srivastava, Naomi Ehrich Leonard

    Abstract: We study collective decision-making in a model of human groups, with network interactions, performing two alternative choice tasks. We focus on the speed-accuracy tradeoff, i.e., the tradeoff between a quick decision and a reliable decision, for individuals in the network. We model the evidence aggregation process across the network using a coupled drift diffusion model (DDM) and consider the free…

    Submitted 14 February, 2014; originally announced February 2014.

    Comments: to appear in IEEE TCNS

  35. arXiv:1307.6134  [pdf, other]

    cs.LG math.OC stat.ML

    Modeling Human Decision-making in Generalized Gaussian Multi-armed Bandits

    Authors: Paul Reverdy, Vaibhav Srivastava, Naomi E. Leonard

    Abstract: We present a formal model of human decision-making in explore-exploit tasks using the context of multi-armed bandit problems, where the decision-maker must choose among multiple options with uncertain rewards. We address the standard multi-armed bandit problem, the multi-armed bandit problem with transition costs, and the multi-armed bandit problem on graphs. We focus on the case of Gaussian rewar…

    Submitted 20 December, 2019; v1 submitted 23 July, 2013; originally announced July 2013.

    Comments: 25 pages. Appendix G included in this version details minor modifications that correct for an oversight in the previously-published proofs. The remainder of the text reflects the previously-published version

    Journal ref: Proceedings of the IEEE, vol. 102, iss. 4, p. 544-571, 2014

  36. arXiv:1303.2242  [pdf, other]

    nlin.AO cs.SI physics.soc-ph q-bio.PE

    Adaptive Network Dynamics and Evolution of Leadership in Collective Migration

    Authors: Darren Pais, Naomi Ehrich Leonard

    Abstract: The evolution of leadership in migratory populations depends not only on costs and benefits of leadership investments but also on the opportunities for individuals to rely on cues from others through social interactions. We derive an analytically tractable adaptive dynamic network model of collective migration with fast timescale migration dynamics and slow timescale adaptive dynamics of individua…

    Submitted 9 March, 2013; originally announced March 2013.

    Comments: Submitted to Physica D: Nonlinear Phenomena

  37. arXiv:1209.2194  [pdf, ps, other]

    math.OC cs.LG cs.MA eess.SY

    Cooperative learning in multi-agent systems from intermittent measurements

    Authors: Naomi Ehrich Leonard, Alex Olshevsky

    Abstract: Motivated by the problem of tracking a direction in a decentralized way, we consider the general problem of cooperative learning in multi-agent systems with time-varying connectivity and intermittent measurements. We propose a distributed learning protocol capable of learning an unknown vector $μ$ from noisy measurements made independently by autonomous nodes. Our protocol is completely distribute…

    Submitted 15 December, 2014; v1 submitted 10 September, 2012; originally announced September 2012.