
Showing 1–17 of 17 results for author: So, O

  1. arXiv:2402.09387  [pdf, other]

    physics.plasm-ph cs.LG

    Active Disruption Avoidance and Trajectory Design for Tokamak Ramp-downs with Neural Differential Equations and Reinforcement Learning

    Authors: Allen M. Wang, Oswin So, Charles Dawson, Darren T. Garnier, Cristina Rea, Chuchu Fan

    Abstract: The tokamak offers a promising path to fusion energy, but plasma disruptions pose a major economic risk, motivating considerable advances in disruption avoidance. This work develops a reinforcement learning approach to this problem by training a policy to safely ramp down the plasma current while avoiding limits on a number of quantities correlated with disruptions. The policy training environment…

    Submitted 14 February, 2024; originally announced February 2024.

  2. arXiv:2401.14554  [pdf, other]

    cs.RO math.OC

    GCBF+: A Neural Graph Control Barrier Function Framework for Distributed Safe Multi-Agent Control

    Authors: Songyuan Zhang, Oswin So, Kunal Garg, Chuchu Fan

    Abstract: Distributed, scalable, and safe control of large-scale multi-agent systems (MAS) is a challenging problem. In this paper, we design a distributed framework for safe multi-agent control in large-scale environments with obstacles, where a large number of agents are required to maintain safety using only local information and reach their goal locations. We introduce a new class of certificates, terme…

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 18 pages, 12 figures, submitted to IEEE T-RO. arXiv admin note: text overlap with arXiv:2311.13014

  3. arXiv:2312.02430  [pdf, ps, other]

    math.OC cs.RO

    Almost-Sure Safety Guarantees of Stochastic Zero-Control Barrier Functions Do Not Hold

    Authors: Oswin So, Andrew Clark, Chuchu Fan

    Abstract: The 2021 paper "Control barrier functions for stochastic systems" provides theorems that give almost sure safety guarantees given a stochastic zero control barrier function (ZCBF). Unfortunately, both the theorem and its proof are invalid. In this letter, we illustrate on a toy example that the almost sure safety guarantees for stochastic ZCBF do not hold and explain why the proof is flawed. Although…

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Under Review

  4. arXiv:2311.13714  [pdf, other]

    cs.RO cs.MA eess.SY math.OC

    Learning Safe Control for Multi-Robot Systems: Methods, Verification, and Open Challenges

    Authors: Kunal Garg, Songyuan Zhang, Oswin So, Charles Dawson, Chuchu Fan

    Abstract: In this survey, we review the recent advances in control design methods for robotic multi-agent systems (MAS), focussing on learning-based methods with safety considerations. We start by reviewing various notions of safety and liveness properties, and modeling frameworks used for problem formulation of MAS. Then we provide a comprehensive review of learning-based methods for safe control design fo…

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: Submitted to Annual Reviews in Control

  5. arXiv:2310.15478  [pdf, other]

    math.OC cs.RO

    How to Train Your Neural Control Barrier Function: Learning Safety Filters for Complex Input-Constrained Systems

    Authors: Oswin So, Zachary Serlin, Makai Mann, Jake Gonzales, Kwesi Rutledge, Nicholas Roy, Chuchu Fan

    Abstract: Control barrier functions (CBF) have become popular as a safety filter to guarantee the safety of nonlinear dynamical systems for arbitrary inputs. However, it is difficult to construct functions that satisfy the CBF constraints for high relative degree systems with input constraints. To address these challenges, recent work has explored learning CBFs using neural networks via neural CBF (NCBF). H…

    Submitted 4 December, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

    Comments: Submitted to ICRA 2024. Project page can be found at https://mit-realm.github.io/pncbf

  6. arXiv:2305.17600  [pdf, other]

    cs.LG cs.CV cs.GT cs.RO math.OC

    NashFormer: Leveraging Local Nash Equilibria for Semantically Diverse Trajectory Prediction

    Authors: Justin Lidard, Oswin So, Yanxia Zhang, Jonathan DeCastro, Xiongyi Cui, Xin Huang, Yen-Ling Kuo, John Leonard, Avinash Balachandran, Naomi Leonard, Guy Rosman

    Abstract: Interactions between road agents present a significant challenge in trajectory prediction, especially in cases involving multiple agents. Because existing diversity-aware predictors do not account for the interactive nature of multi-agent predictions, they may miss these important interaction outcomes. In this paper, we propose NashFormer, a framework for trajectory prediction that leverages game-…

    Submitted 11 November, 2023; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: 8 pages, 6 figures

  7. arXiv:2305.14154  [pdf, other]

    cs.RO math.OC

    Solving Stabilize-Avoid Optimal Control via Epigraph Form and Deep Reinforcement Learning

    Authors: Oswin So, Chuchu Fan

    Abstract: Tasks for autonomous robotic systems commonly require stabilization to a desired region while maintaining safety specifications. However, solving this multi-objective problem is challenging when the dynamics are nonlinear and high-dimensional, as traditional methods do not scale well and are often limited to specific problem structures. To address this issue, we propose a novel approach to solve t…

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted to Robotics: Science and Systems 2023. Project page can be found at https://mit-realm.github.io/efppo

  8. arXiv:2211.11878  [pdf, other]

    math.OC

    Sampling-Based Optimization for Multi-Agent Model Predictive Control

    Authors: Ziyi Wang, Augustinos D. Saravanos, Hassan Almubarak, Oswin So, Evangelos A. Theodorou

    Abstract: We systematically review the Variational Optimization, Variational Inference and Stochastic Search perspectives on sampling-based dynamic optimization and discuss their connections to state-of-the-art optimizers and Stochastic Optimal Control (SOC) theory. A general convergence and sample complexity analysis on the three perspectives is provided through the unifying Stochastic Search perspective.…

    Submitted 21 November, 2022; originally announced November 2022.

  9. arXiv:2210.10814  [pdf, other]

    cs.GT cs.RO math.OC

    MPOGames: Efficient Multimodal Partially Observable Dynamic Games

    Authors: Oswin So, Paul Drews, Thomas Balch, Velin Dimitrov, Guy Rosman, Evangelos A. Theodorou

    Abstract: Game theoretic methods have become popular for planning and prediction in situations involving rich multi-agent interactions. However, these methods often assume the existence of a single local Nash equilibrium and are hence unable to handle uncertainty in the intentions of different agents. While maximum entropy (MaxEnt) dynamic games try to address this issue, practical approaches solve for MaxEn…

    Submitted 23 May, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted to ICRA 2023

  10. arXiv:2210.00090  [pdf, other]

    cs.LG

    Data-driven discovery of non-Newtonian astronomy via learning non-Euclidean Hamiltonian

    Authors: Oswin So, Gongjie Li, Evangelos A. Theodorou, Molei Tao

    Abstract: Incorporating the Hamiltonian structure of physical dynamics into deep learning models provides a powerful way to improve the interpretability and prediction accuracy. While previous works are mostly limited to the Euclidean spaces, their extension to the Lie group manifold is needed when rotations form a key component of the dynamics, such as the higher-order physics beyond simple point-mass dyna…

    Submitted 30 September, 2022; originally announced October 2022.

  11. arXiv:2209.09893  [pdf, other]

    stat.ML cs.GT cs.LG math.OC

    Deep Generalized Schrödinger Bridge

    Authors: Guan-Horng Liu, Tianrong Chen, Oswin So, Evangelos A. Theodorou

    Abstract: Mean-Field Game (MFG) serves as a crucial mathematical framework in modeling the collective behavior of individual agents interacting stochastically with a large population. In this work, we aim at solving a challenging class of MFGs in which the differentiability of these interacting preferences may not be available to the solver, and the population is urged to converge exactly to some desired di…

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: NeurIPS 2022

  12. arXiv:2202.10658  [pdf, other]

    cs.MA cs.LG cs.RO eess.SY

    Decentralized Safe Multi-agent Stochastic Optimal Control using Deep FBSDEs and ADMM

    Authors: Marcus A. Pereira, Augustinos D. Saravanos, Oswin So, Evangelos A. Theodorou

    Abstract: In this work, we propose a novel safe and scalable decentralized solution for multi-agent control in the presence of stochastic disturbances. Safety is mathematically encoded using stochastic control barrier functions and safe controls are computed by solving quadratic programs. Decentralization is achieved by augmenting to each agent's optimization variables, copy variables, for its neighbors. Th…

    Submitted 7 June, 2022; v1 submitted 21 February, 2022; originally announced February 2022.

    Journal ref: Robotics: Science and Systems (RSS), 2022

  13. arXiv:2201.12925  [pdf, other]

    math.OC cs.RO

    Multimodal Maximum Entropy Dynamic Games

    Authors: Oswin So, Kyle Stachowicz, Evangelos A. Theodorou

    Abstract: Environments with multi-agent interactions often result in a rich set of modalities of behavior between agents due to the inherent suboptimality of decision making processes when agents settle for satisfactory decisions. However, existing algorithms for solving these dynamic games are strictly unimodal and fail to capture the intricate multimodal behaviors of the agents. In this paper, we propose MME…

    Submitted 2 February, 2022; v1 submitted 30 January, 2022; originally announced January 2022.

    Comments: Under review for RSS 2022. Supplementary Video: https://youtu.be/7molN_Q38dk

  14. arXiv:2110.06451  [pdf, other]

    math.OC cs.RO

    Maximum Entropy Differential Dynamic Programming

    Authors: Oswin So, Ziyi Wang, Evangelos A. Theodorou

    Abstract: In this paper, we present a novel maximum entropy formulation of the Differential Dynamic Programming algorithm and derive two variants using unimodal and multimodal value function parameterizations. By combining the maximum entropy Bellman equations with a particular approximation of the cost function, we are able to obtain a new formulation of Differential Dynamic Programming which is able to e…

    Submitted 28 February, 2022; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: Accepted to ICRA 2022. Supplementary video available at https://youtu.be/NHr9Kj_jnAI

  15. arXiv:2104.04044  [pdf, other]

    math.OC physics.app-ph

    Spatio-Temporal Differential Dynamic Programming for Control of Fields

    Authors: Ethan N. Evans, Oswin So, Andrew P. Kendall, Guan-Horng Liu, Evangelos A. Theodorou

    Abstract: We consider the optimal control problem of a general nonlinear spatio-temporal system described by Partial Differential Equations (PDEs). Theory and algorithms for control of spatio-temporal systems are of rising interest among the automatic control community and exhibit numerous challenging characteristics from a control standpoint. Recent methods focus on finite-dimensional optimization technique…

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: 28 pages, 7 figures. Submitted to IEEE Transactions on Automatic Control

  16. arXiv:2104.00241  [pdf, other]

    cs.LG

    Variational Inference MPC using Tsallis Divergence

    Authors: Ziyi Wang, Oswin So, Jason Gibson, Bogdan Vlahov, Manan S. Gandhi, Guan-Horng Liu, Evangelos A. Theodorou

    Abstract: In this paper, we provide a generalized framework for Variational Inference-Stochastic Optimal Control by using the non-extensive Tsallis divergence. By incorporating the deformed exponential function into the optimality likelihood function, a novel Tsallis Variational Inference-Model Predictive Control algorithm is derived, which includes prior works such as Variational Inference-Model Predictive…

    Submitted 1 April, 2021; originally announced April 2021.

  17. arXiv:2009.01090  [pdf, other]

    math.OC cs.RO

    Adaptive Risk Sensitive Model Predictive Control with Stochastic Search

    Authors: Ziyi Wang, Oswin So, Keuntaek Lee, Camilo A. Duarte, Evangelos A. Theodorou

    Abstract: We present a general framework for optimizing the Conditional Value-at-Risk for dynamical systems using stochastic search. The framework is capable of handling the uncertainty from the initial condition, stochastic dynamics, and uncertain parameters in the model. The algorithm is compared against a risk-sensitive distributional reinforcement learning framework and demonstrates outperformance on a…

    Submitted 12 February, 2021; v1 submitted 2 September, 2020; originally announced September 2020.