Skip to main content

Showing 101–150 of 152 results for author: How, J

.
  1. arXiv:1907.04904  [pdf, other

    cs.RO cs.DS

    A Resource-Aware Approach to Collaborative Loop Closure Detection with Provable Performance Guarantees

    Authors: Yulun Tian, Kasra Khosoussi, Jonathan P. How

    Abstract: This paper presents resource-aware algorithms for distributed inter-robot loop closure detection for applications such as collaborative simultaneous localization and map** (CSLAM) and distributed image retrieval. In real-world scenarios, this process is resource-intensive as it involves exchanging many observations and geometrically verifying a large number of potential matches. This poses sever… ▽ More

    Submitted 10 July, 2019; originally announced July 2019.

    Comments: submitted to IJRR (extension of WAFR 2018 paper invited to IJRR)

  2. arXiv:1903.06592  [pdf, other

    cs.LG cs.AI stat.ML

    Policy Distillation and Value Matching in Multiagent Reinforcement Learning

    Authors: Samir Wadhwania, Dong-Ki Kim, Shayegan Omidshafiei, Jonathan P. How

    Abstract: Multiagent reinforcement learning algorithms (MARL) have been demonstrated on complex tasks that require the coordination of a team of multiple agents to complete. Existing works have focused on sharing information between agents via centralized critics to stabilize learning or through communication to increase performance, but do not generally look at how information can be shared between agents… ▽ More

    Submitted 15 March, 2019; originally announced March 2019.

    Comments: Submitted as a conference paper to IROS 2019

  3. arXiv:1903.03558  [pdf, other

    cs.RO

    FASTER: Fast and Safe Trajectory Planner for Flights in Unknown Environments

    Authors: Jesus Tordesillas, Brett T. Lopez, Jonathan P. How

    Abstract: High-speed trajectory planning through unknown environments requires algorithmic techniques that enable fast reaction times while maintaining safety as new information about the operating environment is obtained. The requirement of computational tractability typically leads to optimization problems that do not include the obstacle constraints (collision checks are done on the solutions) or use a c… ▽ More

    Submitted 13 May, 2020; v1 submitted 8 March, 2019; originally announced March 2019.

    Comments: IROS 2019

  4. arXiv:1903.03216  [pdf, other

    cs.LG cs.AI cs.MA

    Learning Hierarchical Teaching Policies for Cooperative Agents

    Authors: Dong-Ki Kim, Miao Liu, Shayegan Omidshafiei, Sebastian Lopez-Cot, Matthew Riemer, Golnaz Habibi, Gerald Tesauro, Sami Mourad, Murray Campbell, Jonathan P. How

    Abstract: Collective learning can be greatly enhanced when agents effectively exchange knowledge with their peers. In particular, recent work studying agents that learn to teach other teammates has demonstrated that action advising accelerates team-wide learning. However, the prior work has simplified the learning of advising policies by using simple function approximations and only considered advising with… ▽ More

    Submitted 18 May, 2020; v1 submitted 7 March, 2019; originally announced March 2019.

    Comments: Presented at AAMAS 2020; arXiv version added with the appendix

  5. arXiv:1903.00597  [pdf, ps, other

    math.OC cs.LG

    Block-Coordinate Minimization for Large SDPs with Block-Diagonal Constraints

    Authors: Yulun Tian, Kasra Khosoussi, Jonathan P. How

    Abstract: The so-called Burer-Monteiro method is a well-studied technique for solving large-scale semidefinite programs (SDPs) via low-rank factorization. The main idea is to solve rank-restricted, albeit non-convex, surrogates instead of the SDP. Recent works have shown that, in an important class of SDPs with elegant geometric structure, one can find globally optimal solutions to the SDP by finding rank-d… ▽ More

    Submitted 27 August, 2019; v1 submitted 1 March, 2019; originally announced March 2019.

    Comments: Technical report

  6. arXiv:1902.05644  [pdf, other

    cs.AI

    Active Perception in Adversarial Scenarios using Maximum Entropy Deep Reinforcement Learning

    Authors: Macheng Shen, Jonathan P How

    Abstract: We pose an active perception problem where an autonomous agent actively interacts with a second agent with potentially adversarial behaviors. Given the uncertainty in the intent of the other agent, the objective is to collect further evidence to help discriminate potential threats. The main technical challenges are the partial observability of the agent intent, the adversary modeling, and the corr… ▽ More

    Submitted 18 September, 2019; v1 submitted 14 February, 2019; originally announced February 2019.

  7. arXiv:1902.02256  [pdf, other

    cs.RO cs.CV cs.MA

    CLEAR: A Consistent Lifting, Embedding, and Alignment Rectification Algorithm for Multi-View Data Association

    Authors: Kaveh Fathian, Kasra Khosoussi, Yulun Tian, Parker Lusk, Jonathan P. How

    Abstract: Many robotics applications require alignment and fusion of observations obtained at multiple views to form a global model of the environment. Multi-way data association methods provide a mechanism to improve alignment accuracy of pairwise associations and ensure their consistency. However, existing methods that solve this computationally challenging problem are often too slow for real-time applica… ▽ More

    Submitted 4 March, 2020; v1 submitted 6 February, 2019; originally announced February 2019.

  8. arXiv:1901.05925  [pdf, ps, other

    cs.RO cs.DS

    Resource-Aware Algorithms for Distributed Loop Closure Detection with Provable Performance Guarantees

    Authors: Yulun Tian, Kasra Khosoussi, Jonathan P. How

    Abstract: Inter-robot loop closure detection, e.g., for collaborative simultaneous localization and map** (CSLAM), is a fundamental capability for many multirobot applications in GPS-denied regimes. In real-world scenarios, this is a resource-intensive process that involves exchanging observations and verifying potential matches. This poses severe challenges especially for small-size and low-cost robots w… ▽ More

    Submitted 17 January, 2019; originally announced January 2019.

    Comments: International Workshop on the Algorithmic Foundations of Robotics (WAFR) 2018 (Extended Version)

  9. arXiv:1810.08700  [pdf, other

    cs.RO cs.AI cs.LG

    Safe Reinforcement Learning with Model Uncertainty Estimates

    Authors: Björn Lütjens, Michael Everett, Jonathan P. How

    Abstract: Many current autonomous systems are being designed with a strong reliance on black box predictions from deep neural networks (DNNs). However, DNNs tend to be overconfident in predictions on unseen data and can give unpredictable results for far-from-distribution test data. The importance of predictions that are robust to this distributional shift is evident for safety-critical applications, such a… ▽ More

    Submitted 1 March, 2019; v1 submitted 19 October, 2018; originally announced October 2018.

    Comments: ICRA 2019; Presented at IROS 2018 Workshop on Machine Learning in Robot Motion Planning

  10. arXiv:1810.01035  [pdf, other

    cs.RO

    Real-Time Planning with Multi-Fidelity Models for Agile Flights in Unknown Environments

    Authors: Jesus Tordesillas, Brett T. Lopez, John Carter, John Ware, Jonathan P. How

    Abstract: Autonomous navigation through unknown environments is a challenging task that entails real-time localization, perception, planning, and control. UAVs with this capability have begun to emerge in the literature with advances in lightweight sensing and computing. Although the planning methodologies vary from platform to platform, many algorithms adopt a hierarchical planning architecture where a slo… ▽ More

    Submitted 15 June, 2019; v1 submitted 1 October, 2018; originally announced October 2018.

    Comments: ICRA 2019

  11. arXiv:1809.09646  [pdf, other

    cs.RO

    Efficient Constellation-Based Map-Merging for Semantic SLAM

    Authors: Kristoffer M. Frey, Ted J. Steiner, Jonathan P. How

    Abstract: Data association in SLAM is fundamentally challenging, and handling ambiguity well is crucial to achieve robust operation in real-world environments. When ambiguous measurements arise, conservatism often mandates that the measurement is discarded or a new landmark is initialized rather than risking an incorrect association. To address the inevitable `duplicate' landmarks that arise, we present an… ▽ More

    Submitted 5 March, 2019; v1 submitted 25 September, 2018; originally announced September 2018.

    Comments: Accepted to IEEE International Conference on Robotics and Automation (ICRA) 2019

  12. arXiv:1806.09453  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Context-Aware Pedestrian Motion Prediction In Urban Intersections

    Authors: Golnaz Habibi, Nikita Jaipuria, Jonathan P. How

    Abstract: This paper presents a novel context-based approach for pedestrian motion prediction in crowded, urban intersections, with the additional flexibility of prediction in similar, but new, environments. Previously, Chen et. al. combined Markovian-based and clustering-based approaches to learn motion primitives in a grid-based world and subsequently predict pedestrian trajectories by modeling the transi… ▽ More

    Submitted 25 June, 2018; originally announced June 2018.

  13. arXiv:1806.09444  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    A Transferable Pedestrian Motion Prediction Model for Intersections with Different Geometries

    Authors: Nikita Jaipuria, Golnaz Habibi, Jonathan P. How

    Abstract: This paper presents a novel framework for accurate pedestrian intent prediction at intersections. Given some prior knowledge of the curbside geometry, the presented framework can accurately predict pedestrian trajectories, even in new intersections that it has not been trained on. This is achieved by making use of the contravariant components of trajectories in the curbside coordinate system, whic… ▽ More

    Submitted 25 June, 2018; originally announced June 2018.

  14. Partial Replanning for Decentralized Dynamic Task Allocation

    Authors: Noam Buckman, Han-Lim Choi, Jonathan P. How

    Abstract: In time-sensitive and dynamic missions, multi-UAV teams must respond quickly to new information and objectives. This paper presents a dynamic decentralized task allocation algorithm for allocating new tasks that appear online during the solving of the task allocation problem. Our algorithm extends the Consensus-Based Bundle Algorithm (CBBA), a decentralized task allocation algorithm, allowing for… ▽ More

    Submitted 25 October, 2018; v1 submitted 12 June, 2018; originally announced June 2018.

    Comments: 11 pages, Accepted to AIAA GNC 2019

  15. arXiv:1806.00188  [pdf, ps, other

    cs.RO cs.DS

    Near-Optimal Budgeted Data Exchange for Distributed Loop Closure Detection

    Authors: Yulun Tian, Kasra Khosoussi, Matthew Giamou, Jonathan P. How, Jonathan Kelly

    Abstract: Inter-robot loop closure detection is a core problem in collaborative SLAM (CSLAM). Establishing inter-robot loop closures is a resource-demanding process, during which robots must consume a substantial amount of mission-critical resources (e.g., battery and bandwidth) to exchange sensory data. However, even with the most resource-efficient techniques, the resources available onboard may be insuff… ▽ More

    Submitted 1 June, 2018; originally announced June 2018.

    Comments: RSS 2018 Extended Version

  16. arXiv:1805.09266  [pdf, ps, other

    cs.LG cs.DC stat.ML

    Collective Online Learning of Gaussian Processes in Massive Multi-Agent Systems

    Authors: Trong Nghia Hoang, Quang Minh Hoang, Kian Hsiang Low, Jonathan How

    Abstract: Distributed machine learning (ML) is a modern computation paradigm that divides its workload into independent tasks that can be simultaneously achieved by multiple machines (i.e., agents) for better scalability. However, a typical distributed system is usually implemented with a central server that collects data statistics from multiple independent machines operating on different subsets of data t… ▽ More

    Submitted 12 November, 2018; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: Extended version with proofs

  17. arXiv:1805.07830  [pdf, other

    cs.MA cs.AI

    Learning to Teach in Cooperative Multiagent Reinforcement Learning

    Authors: Shayegan Omidshafiei, Dong-Ki Kim, Miao Liu, Gerald Tesauro, Matthew Riemer, Christopher Amato, Murray Campbell, Jonathan P. How

    Abstract: Collective human knowledge has clearly benefited from the fact that innovations by individuals are taught to others through communication. Similar to human social groups, agents in distributed learning systems would likely benefit from communication to share knowledge and teach skills. The problem of teaching to improve agent learning has been investigated by prior works, but these approaches make… ▽ More

    Submitted 31 August, 2018; v1 submitted 20 May, 2018; originally announced May 2018.

  18. arXiv:1805.01956  [pdf, other

    cs.RO cs.AI cs.LG

    Motion Planning Among Dynamic, Decision-Making Agents with Deep Reinforcement Learning

    Authors: Michael Everett, Yu Fan Chen, Jonathan P. How

    Abstract: Robots that navigate among pedestrians use collision avoidance algorithms to enable safe and efficient operation. Recent works present deep reinforcement learning as a framework to model the complex interactions and cooperation. However, they are implemented using key assumptions about other agents' behavior that deviate from reality as the number of agents in the environment increases. This work… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

  19. arXiv:1804.00495  [pdf, other

    cs.CV cs.LG

    Transferable Pedestrian Motion Prediction Models at Intersections

    Authors: Macheng Shen, Golnaz Habibi, Jonathan P. How

    Abstract: One desirable capability of autonomous cars is to accurately predict the pedestrian motion near intersections for safe and efficient trajectory planning. We are interested in develo** transfer learning algorithms that can be trained on the pedestrian trajectories collected at one intersection and yet still provide accurate predictions of the trajectories at another, previously unseen intersectio… ▽ More

    Submitted 18 September, 2019; v1 submitted 15 March, 2018; originally announced April 2018.

  20. arXiv:1711.10314  [pdf, other

    cs.AI

    Crossmodal Attentive Skill Learner

    Authors: Shayegan Omidshafiei, Dong-Ki Kim, Jason Pazis, Jonathan P. How

    Abstract: This paper presents the Crossmodal Attentive Skill Learner (CASL), integrated with the recently-introduced Asynchronous Advantage Option-Critic (A2OC) architecture [Harb et al., 2017] to enable hierarchical reinforcement learning across multiple sensory inputs. We provide concrete examples where the approach not only improves performance in a single task, but accelerates transfer to new tasks. We… ▽ More

    Submitted 22 May, 2018; v1 submitted 28 November, 2017; originally announced November 2017.

    Comments: International Conference on Autonomous Agents and Multiagent Systems (AAMAS) 2018, NIPS 2017 Deep Reinforcement Learning Symposium

  21. arXiv:1710.06525  [pdf, other

    cs.AI cs.MA

    Near-Optimal Adversarial Policy Switching for Decentralized Asynchronous Multi-Agent Systems

    Authors: Trong Nghia Hoang, Yuchen Xiao, Kavinayan Sivakumar, Christopher Amato, Jonathan How

    Abstract: A key challenge in multi-robot and multi-agent systems is generating solutions that are robust to other self-interested or even adversarial parties who actively try to prevent the agents from achieving their goals. The practicality of existing works addressing this challenge is limited to only small-scale synchronous decision-making scenarios or a single agent planning its best response against a… ▽ More

    Submitted 17 October, 2017; originally announced October 2017.

  22. arXiv:1709.06821  [pdf, other

    cs.RO

    Complexity Analysis and Efficient Measurement Selection Primitives for High-Rate Graph SLAM

    Authors: Kristoffer M. Frey, Ted J. Steiner, Jonathan P. How

    Abstract: Sparsity has been widely recognized as crucial for efficient optimization in graph-based SLAM. Because the sparsity and structure of the SLAM graph reflect the set of incorporated measurements, many methods for sparsification have been proposed in hopes of reducing computation. These methods often focus narrowly on reducing edge count without regard for structure at a global level. Such structural… ▽ More

    Submitted 2 March, 2018; v1 submitted 20 September, 2017; originally announced September 2017.

    Comments: Pre-print accepted to ICRA 2018

  23. arXiv:1709.06675  [pdf, ps, other

    cs.RO

    Talk Resource-Efficiently to Me: Optimal Communication Planning for Distributed Loop Closure Detection

    Authors: Matthew Giamou, Kasra Khosoussi, Jonathan P. How

    Abstract: Due to the distributed nature of cooperative simultaneous localization and map** (CSLAM), detecting inter-robot loop closures necessitates sharing sensory data with other robots. A naïve approach to data sharing can easily lead to a waste of mission-critical resources. This paper investigates the logistical aspects of CSLAM. Particularly, we present a general resource-efficient communication pla… ▽ More

    Submitted 10 March, 2018; v1 submitted 19 September, 2017; originally announced September 2017.

    Comments: ICRA 2018; final version

  24. arXiv:1709.06645  [pdf, other

    eess.SY

    Closed-Loop Statistical Verification of Stochastic Nonlinear Systems Subject to Parametric Uncertainties

    Authors: John F. Quindlen, Ufuk Topcu, Girish Chowdhary, Jonathan P. How

    Abstract: This paper proposes a statistical verification framework using Gaussian processes (GPs) for simulation-based verification of stochastic nonlinear systems with parametric uncertainties. Given a small number of stochastic simulations, the proposed framework constructs a GP regression model and predicts the system's performance over the entire set of possible uncertainties. Included in the framework… ▽ More

    Submitted 1 October, 2017; v1 submitted 19 September, 2017; originally announced September 2017.

    Comments: 8 pages, submitted to ACC 2018

  25. arXiv:1707.08493  [pdf, other

    stat.ML

    Dynamic Clustering Algorithms via Small-Variance Analysis of Markov Chain Mixture Models

    Authors: Trevor Campbell, Brian Kulis, Jonathan How

    Abstract: Bayesian nonparametrics are a class of probabilistic models in which the model size is inferred from data. A recently developed methodology in this field is small-variance asymptotic analysis, a mathematical technique for deriving learning algorithms that capture much of the flexibility of Bayesian nonparametric inference algorithms, but are simpler to implement and less computationally expensive.… ▽ More

    Submitted 26 July, 2017; originally announced July 2017.

    Comments: 27 pages

  26. arXiv:1707.07399  [pdf, other

    cs.MA cs.LG cs.RO

    Learning for Multi-robot Cooperation in Partially Observable Stochastic Environments with Macro-actions

    Authors: Miao Liu, Kavinayan Sivakumar, Shayegan Omidshafiei, Christopher Amato, Jonathan P. How

    Abstract: This paper presents a data-driven approach for multi-robot coordination in partially-observable domains based on Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs) and macro-actions (MAs). Dec-POMDPs provide a general framework for cooperative sequential decision making under uncertainty and MAs allow temporally extended and asynchronous action execution. To date, most metho… ▽ More

    Submitted 17 August, 2017; v1 submitted 24 July, 2017; originally announced July 2017.

    Comments: Accepted to the 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2017)

  27. arXiv:1706.04268  [pdf, other

    eess.SY

    Active Sampling-based Binary Verification of Dynamical Systems

    Authors: John F. Quindlen, Ufuk Topcu, Girish Chowdhary, Jonathan P. How

    Abstract: Nonlinear, adaptive, or otherwise complex control techniques are increasingly relied upon to ensure the safety of systems operating in uncertain environments. However, the nonlinearity of the resulting closed-loop system complicates verification that the system does in fact satisfy those requirements at all possible operating conditions. While analytical proof-based techniques and finite abstracti… ▽ More

    Submitted 16 January, 2018; v1 submitted 13 June, 2017; originally announced June 2017.

    Comments: 23 pages

  28. arXiv:1705.01471  [pdf, other

    eess.SY

    Active Sampling for Constrained Simulation-based Verification of Uncertain Nonlinear Systems

    Authors: John F. Quindlen, Ufuk Topcu, Girish Chowdhary, Jonathan P. How

    Abstract: Increasingly demanding performance requirements for dynamical systems motivates the adoption of nonlinear and adaptive control techniques. One challenge is the nonlinearity of the resulting closed-loop system complicates verification that the system does satisfy the requirements at all possible operating conditions. This paper presents a data-driven procedure for efficient simulation-based, statis… ▽ More

    Submitted 1 October, 2017; v1 submitted 3 May, 2017; originally announced May 2017.

    Comments: 8 pages, submitted to ACC 2018

  29. arXiv:1704.05959  [pdf, other

    cs.CV cs.RO

    SLAM with Objects using a Nonparametric Pose Graph

    Authors: Beipeng Mu, Shih-Yuan Liu, Liam Paull, John Leonard, Jonathan How

    Abstract: Map** and self-localization in unknown environments are fundamental capabilities in many robotic applications. These tasks typically involve the identification of objects as unique features or landmarks, which requires the objects both to be detected and then assigned a unique identifier that can be maintained when viewed from different perspectives and in different images. The \textit{data asso… ▽ More

    Submitted 19 April, 2017; originally announced April 2017.

    Comments: published at IROS 2016

  30. arXiv:1703.08862  [pdf, other

    cs.RO cs.AI cs.HC

    Socially Aware Motion Planning with Deep Reinforcement Learning

    Authors: Yu Fan Chen, Michael Everett, Miao Liu, Jonathan P. How

    Abstract: For robotic vehicles to navigate safely and efficiently in pedestrian-rich environments, it is important to model subtle human behaviors and navigation rules (e.g., passing on the right). However, while instinctive to humans, socially compliant navigation is still difficult to quantify due to the stochasticity in people's behaviors. Existing works are mostly focused on using feature-matching techn… ▽ More

    Submitted 4 May, 2018; v1 submitted 26 March, 2017; originally announced March 2017.

    Comments: 8 pages

  31. arXiv:1703.06182  [pdf, other

    cs.LG cs.AI cs.MA

    Deep Decentralized Multi-task Multi-Agent Reinforcement Learning under Partial Observability

    Authors: Shayegan Omidshafiei, Jason Pazis, Christopher Amato, Jonathan P. How, John Vian

    Abstract: Many real-world tasks involve multiple agents with partial observability and limited communication. Learning is challenging in these settings due to local viewpoints of agents, which perceive the world as non-stationary due to concurrently-exploring teammates. Approaches that learn specialized policies for individual tasks face problems when applied to the real world: not only do agents have to le… ▽ More

    Submitted 13 July, 2017; v1 submitted 17 March, 2017; originally announced March 2017.

    Comments: Accepted to ICML 2017

    Journal ref: Proceedings of the 34th International Conference on Machine Learning (ICML 2017), Sydney, Australia, PMLR 70:2681-2690, 2017

  32. arXiv:1703.05626  [pdf, other

    cs.MA cs.RO

    Scalable Accelerated Decentralized Multi-Robot Policy Search in Continuous Observation Spaces

    Authors: Shayegan Omidshafiei, Christopher Amato, Miao Liu, Michael Everett, Jonathan P. How, John Vian

    Abstract: This paper presents the first ever approach for solving \emph{continuous-observation} Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs) and their semi-Markovian counterparts, Dec-POSMDPs. This contribution is especially important in robotics, where a vast number of sensors provide continuous observation data. A continuous-observation policy representation is introduced usin… ▽ More

    Submitted 16 March, 2017; originally announced March 2017.

  33. arXiv:1703.05623  [pdf, other

    cs.MA cs.RO

    Semantic-level Decentralized Multi-Robot Decision-Making using Probabilistic Macro-Observations

    Authors: Shayegan Omidshafiei, Shih-Yuan Liu, Michael Everett, Brett T. Lopez, Christopher Amato, Miao Liu, Jonathan P. How, John Vian

    Abstract: Robust environment perception is essential for decision-making on robots operating in complex domains. Intelligent task execution requires principled treatment of uncertainty sources in a robot's observation model. This is important not only for low-level observations (e.g., accelerometer data), but also for high-level observations such as semantic object labels. This paper formalizes the concept… ▽ More

    Submitted 16 March, 2017; originally announced March 2017.

  34. Dynamic Arrival Rate Estimation for Campus Mobility on Demand Network Graphs

    Authors: Justin Miller, Andres Hasfura, Shih-Yuan Liu, Jonathan P. How

    Abstract: Mobility On Demand (MOD) systems are revolutionizing transportation in urban settings by improving vehicle utilization and reducing parking congestion. A key factor in the success of an MOD system is the ability to measure and respond to real-time customer arrival data. Real time traffic arrival rate data is traditionally difficult to obtain due to the need to install fixed sensors throughout the… ▽ More

    Submitted 6 March, 2017; originally announced March 2017.

    Comments: Appears in 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). http://ieeexplore.ieee.org/abstract/document/7759357/

    Journal ref: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Daejeon, 2016, pp. 2285-2292 (2016)

  35. arXiv:1703.02130  [pdf, other

    cs.RO

    Demand Estimation and Chance-Constrained Fleet Management for Ride Hailing

    Authors: Justin Miller, Jonathan P. How

    Abstract: In autonomous Mobility on Demand (MOD) systems, customers request rides from a fleet of shared vehicles that can be automatically positioned in response to customer demand. Recent approaches to MOD systems have focused on environments where customers can only request rides through an app or by waiting at a station. This paper develops MOD fleet management approaches for ride hailing, where custome… ▽ More

    Submitted 22 July, 2017; v1 submitted 6 March, 2017; originally announced March 2017.

    Comments: 9 pages, 5 figures

  36. arXiv:1609.08116  [pdf, other

    cs.MA

    Predictive Positioning and Quality Of Service Ridesharing for Campus Mobility On Demand Systems

    Authors: Justin Miller, Jonathan P. How

    Abstract: Autonomous Mobility On Demand (MOD) systems can utilize fleet management strategies in order to provide a high customer quality of service (QoS). Previous works on autonomous MOD systems have developed methods for rebalancing single capacity vehicles, where QoS is maintained through large fleet sizing. This work focuses on MOD systems utilizing a small number of vehicles, such as those found on a… ▽ More

    Submitted 6 March, 2017; v1 submitted 26 September, 2016; originally announced September 2016.

    Comments: 8 pages, 5 figures

  37. arXiv:1609.07845  [pdf, other

    cs.MA

    Decentralized Non-communicating Multiagent Collision Avoidance with Deep Reinforcement Learning

    Authors: Yu Fan Chen, Miao Liu, Michael Everett, Jonathan P. How

    Abstract: Finding feasible, collision-free paths for multiagent systems can be challenging, particularly in non-communicating scenarios where each agent's intent (e.g. goal) is unobservable to the others. In particular, finding time efficient paths often requires anticipating interaction with neighboring agents, the process of which can be computationally prohibitive. This work presents a decentralized mult… ▽ More

    Submitted 28 September, 2016; v1 submitted 26 September, 2016; originally announced September 2016.

    Comments: 8 pages, 10 figures

  38. arXiv:1609.06757  [pdf, other

    stat.AP eess.SY math.ST

    Quickest Change Detection Approach to Optimal Control in Markov Decision Processes with Model Changes

    Authors: Taposh Banerjee, Miao Liu, Jonathan P. How

    Abstract: Optimal control in non-stationary Markov decision processes (MDP) is a challenging problem. The aim in such a control problem is to maximize the long-term discounted reward when the transition dynamics or the reward function can change over time. When a prior knowledge of change statistics is available, the standard Bayesian approach to this problem is to reformulate it as a partially observable M… ▽ More

    Submitted 1 March, 2017; v1 submitted 21 September, 2016; originally announced September 2016.

    Comments: In Proceedings of American Control Conference 2017, 7 pages

  39. arXiv:1607.06407  [pdf, other

    cs.CV math.ST stat.AP

    Small-Variance Nonparametric Clustering on the Hypersphere

    Authors: Julian Straub, Trevor Campbell, Jonathan P. How, John W. Fisher III

    Abstract: Structural regularities in man-made environments reflect in the distribution of their surface normals. Describing these surface normal distributions is important in many computer vision applications, such as scene understanding, plane segmentation, and regularization of 3D reconstructions. Based on the small-variance limit of Bayesian nonparametric von-Mises-Fisher (vMF) mixture distributions, we… ▽ More

    Submitted 21 July, 2016; originally announced July 2016.

    Journal ref: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (pp. 334-342). (2015)

  40. arXiv:1605.01042  [pdf, other

    cs.CV

    Hierarchical Bayesian Noise Inference for Robust Real-time Probabilistic Object Classification

    Authors: Shayegan Omidshafiei, Brett T. Lopez, Jonathan P. How, John Vian

    Abstract: Robust environment perception is essential for decision-making on robots operating in complex domains. Principled treatment of uncertainty sources in a robot's observation model is necessary for accurate map** and object detection. This is important not only for low-level observations (e.g., accelerometer data), but for high-level observations such as semantic object labels as well. This paper p… ▽ More

    Submitted 13 July, 2016; v1 submitted 3 May, 2016; originally announced May 2016.

  41. arXiv:1603.04868  [pdf, other

    cs.CV

    Efficient Global Point Cloud Alignment using Bayesian Nonparametric Mixtures

    Authors: Julian Straub, Trevor Campbell, Jonathan P. How, John W. Fisher III

    Abstract: Point cloud alignment is a common problem in computer vision and robotics, with applications ranging from 3D object recognition to reconstruction. We propose a novel approach to the alignment problem that utilizes Bayesian nonparametrics to describe the point cloud and surface normal densities, and branch and bound (BB) optimization to recover the relative transformation. BB uses a novel, refinabl… ▽ More

    Submitted 21 November, 2016; v1 submitted 15 March, 2016; originally announced March 2016.

  42. arXiv:1603.00861  [pdf, other

    math.ST stat.ME

    Truncated Random Measures

    Authors: Trevor Campbell, Jonathan H. Huggins, Jonathan P. How, Tamara Broderick

    Abstract: Completely random measures (CRMs) and their normalizations are a rich source of Bayesian nonparametric priors. Examples include the beta, gamma, and Dirichlet processes. In this paper we detail two major classes of sequential CRM representations---series representations and superposition representations---within which we organize both novel and existing sequential representations that can be used… ▽ More

    Submitted 18 February, 2019; v1 submitted 2 March, 2016; originally announced March 2016.

    Comments: To appear in Bernoulli; 58 pages, 3 figures

    Journal ref: Bernoulli, Volume 25, Number 2 (2019), 1256-1288

  43. arXiv:1510.09161  [pdf, other

    cs.LG stat.ML

    Streaming, Distributed Variational Inference for Bayesian Nonparametrics

    Authors: Trevor Campbell, Julian Straub, John W. Fisher III, Jonathan P. How

    Abstract: This paper presents a methodology for creating streaming, distributed inference algorithms for Bayesian nonparametric (BNP) models. In the proposed framework, processing nodes receive a sequence of data minibatches, compute a variational posterior for each, and make asynchronous streaming updates to a central model. In contrast to previous algorithms, the proposed framework is truly streaming, dis… ▽ More

    Submitted 30 October, 2015; originally announced October 2015.

    Comments: This paper was presented at NIPS 2015. Please use the following BibTeX citation: @inproceedings{Campbell15_NIPS, Author = {Trevor Campbell and Julian Straub and John W. {Fisher III} and Jonathan P. How}, Title = {Streaming, Distributed Variational Inference for Bayesian Nonparametrics}, Booktitle = {Advances in Neural Information Processing Systems (NIPS)}, Year = {2015}}

  44. arXiv:1509.08155  [pdf, ps, other

    cs.RO

    Information-based Active SLAM via Topological Feature Graphs

    Authors: Beipeng Mu, Matthew Giamou, Liam Paull, Ali-akbar Agha-mohammadi, John Leonard, Jonathan How

    Abstract: Active SLAM is the task of actively planning robot paths while simultaneously building a map and localizing within. Existing work has focused on planning paths with occupancy grid maps, which do not scale well and suffer from long term drift. This work proposes a Topological Feature Graph (TFG) representation that scales well and develops an active SLAM algorithm with it. The TFG uses graphical mo… ▽ More

    Submitted 29 August, 2016; v1 submitted 27 September, 2015; originally announced September 2015.

    Comments: published in CDC 2016

  45. arXiv:1505.00274  [pdf, other

    cs.AI eess.SY stat.ML

    Stick-Breaking Policy Learning in Dec-POMDPs

    Authors: Miao Liu, Christopher Amato, Xuejun Liao, Lawrence Carin, Jonathan P. How

    Abstract: Expectation maximization (EM) has recently been shown to be an efficient algorithm for learning finite-state controllers (FSCs) in large decentralized POMDPs (Dec-POMDPs). However, current methods use fixed-size FSCs and often converge to maxima that are far from optimal. This paper considers a variable-size FSC to represent the local policy of each agent. These variable-size FSCs are constructed… ▽ More

    Submitted 23 November, 2015; v1 submitted 1 May, 2015; originally announced May 2015.

  46. arXiv:1502.06030  [pdf, other

    cs.MA cs.AI cs.RO

    Decentralized Control of Partially Observable Markov Decision Processes using Belief Space Macro-actions

    Authors: Shayegan Omidshafiei, Ali-akbar Agha-mohammadi, Christopher Amato, Jonathan P. How

    Abstract: The focus of this paper is on solving multi-robot planning problems in continuous spaces with partial observability. Decentralized partially observable Markov decision processes (Dec-POMDPs) are general models for multi-robot coordination problems, but representing and solving Dec-POMDPs is often intractable for large problems. To allow for a high-level representation that is natural for multi-rob… ▽ More

    Submitted 20 February, 2015; originally announced February 2015.

  47. arXiv:1409.7808  [pdf, ps, other

    cs.IT

    Resource-Constrained Adaptive Search for Sparse Multi-Class Targets with Varying Importance

    Authors: Gregory E. Newstadt, Beipeng Mu, Dennis Wei, Jonathan P. How, Alfred O. Hero III

    Abstract: In sparse target inference problems it has been shown that significant gains can be achieved by adaptive sensing using convex criteria. We generalize previous work on adaptive sensing to (a) include multiple classes of targets with different levels of importance and (b) accommodate multiple sensor models. New optimization policies are developed to allocate a limited resource budget to simultaneous… ▽ More

    Submitted 27 September, 2014; originally announced September 2014.

    Comments: 49 pages, 9 figures

  48. arXiv:1405.5581  [pdf, other

    cs.RO

    Real-Time Predictive Modeling and Robust Avoidance of Pedestrians with Uncertain, Changing Intentions

    Authors: Sarah Ferguson, Brandon Luders, Robert C. Grande, Jonathan P. How

    Abstract: To plan safe trajectories in urban environments, autonomous vehicles must be able to quickly assess the future intentions of dynamic agents. Pedestrians are particularly challenging to model, as their motion patterns are often uncertain and/or unknown a priori. This paper presents a novel changepoint detection and clustering algorithm that, when coupled with offline unsupervised learning of a Gaus… ▽ More

    Submitted 21 May, 2014; originally announced May 2014.

    Comments: Submitted to 2014 International Workshop on the Algorithmic Foundations of Robotics

    MSC Class: 68T40

  49. arXiv:1403.7471  [pdf, other

    cs.LG

    Approximate Decentralized Bayesian Inference

    Authors: Trevor Campbell, Jonathan P. How

    Abstract: This paper presents an approximate method for performing Bayesian inference in models with conditional independence over a decentralized network of learning agents. The method first employs variational inference on each individual learning agent to generate a local approximate posterior, the agents transmit their local posteriors to other agents in the network, and finally each agent combines its… ▽ More

    Submitted 12 June, 2014; v1 submitted 28 March, 2014; originally announced March 2014.

    Comments: This paper was presented at UAI 2014. Please use the following BibTeX citation: @inproceedings{Campbell14_UAI, Author = {Trevor Campbell and Jonathan P. How}, Title = {Approximate Decentralized Bayesian Inference}, Booktitle = {Uncertainty in Artificial Intelligence (UAI)}, Year = {2014}}

  50. arXiv:1402.2871  [pdf, other

    cs.RO cs.AI cs.MA

    Planning for Decentralized Control of Multiple Robots Under Uncertainty

    Authors: Christopher Amato, George D. Konidaris, Gabriel Cruz, Christopher A. Maynor, Jonathan P. How, Leslie P. Kaelbling

    Abstract: We describe a probabilistic framework for synthesizing control policies for general multi-robot systems, given environment and sensor models and a cost function. Decentralized, partially observable Markov decision processes (Dec-POMDPs) are a general model of decision processes where a team of agents must cooperate to optimize some objective (specified by a shared reward or cost function) in the p… ▽ More

    Submitted 12 February, 2014; originally announced February 2014.

    ACM Class: I.2.9; I.2.11