Skip to main content

Showing 1–16 of 16 results for author: Acikmese, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2304.01583  [pdf, other

    cs.RO cs.CV eess.SY math.OC

    HALO: Hazard-Aware Landing Optimization for Autonomous Systems

    Authors: Christopher R. Hayner, Samuel C. Buckner, Daniel Broyles, Evelyn Madewell, Karen Leung, Behcet Acikmese

    Abstract: With autonomous aerial vehicles enacting safety-critical missions, such as the Mars Science Laboratory Curiosity rover's landing on Mars, the tasks of automatically identifying and reasoning about potentially hazardous landing sites is paramount. This paper presents a coupled perception-planning solution which addresses the hazard detection, optimal landing trajectory generation, and contingency p… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: The first two authors have contributed equally to this work. This work is to be published in the proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA)

  2. arXiv:2207.07271  [pdf, other

    cs.LG math.OC

    Set-based value operators for non-stationary Markovian environments

    Authors: Sarah H. Q. Li, Assalé Adjé, Pierre-Loïc Garoche, Behçet Açıkmeşe

    Abstract: This paper analyzes finite state Markov Decision Processes (MDPs) with uncertain parameters in compact sets and re-examines results from robust MDP via set-based fixed point theory. To this end, we generalize the Bellman and policy evaluation operators to contracting operators on the value function space and denote them as \emph{value operators}. We lift these value operators to act on \emph{sets}… ▽ More

    Submitted 8 August, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: 17 pages, 11 figures, 1 table

  3. arXiv:2203.12133  [pdf, other

    cs.MA

    Congestion-aware path coordination game with Markov decision process dynamics

    Authors: Sarah H. Q. Li, Dan Calderone, Behcet Acikmese

    Abstract: Inspired by the path coordination problem arising from robo-taxis, warehouse management, and mixed-vehicle routing problems, we model a group of heterogeneous players responding to stochastic demands as a congestion game under Markov decision process dynamics. Players share a common state-action space but have unique transition dynamics, and each player's unique cost is a {function} of the joint s… ▽ More

    Submitted 5 July, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

    Comments: 6 pages, 4 figures

  4. Guided Policy Search using Sequential Convex Programming for Initialization of Trajectory Optimization Algorithms

    Authors: Taewan Kim, Purnanand Elango, Danylo Malyuta, Behcet Acikmese

    Abstract: Nonlinear trajectory optimization algorithms have been developed to handle optimal control problems with nonlinear dynamics and nonconvex constraints in trajectory planning. The performance and computational efficiency of many trajectory optimization methods are sensitive to the initial guess, i.e., the trajectory guess needed by the recursive trajectory optimization algorithm. Motivated by this o… ▽ More

    Submitted 19 May, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Presented in American Control Conference (ACC) 2022

  5. arXiv:2108.02335  [pdf, other

    math.OC cs.LG eess.SY

    Advances in Trajectory Optimization for Space Vehicle Control

    Authors: Danylo Malyuta, Yue Yu, Purnanand Elango, Behcet Acikmese

    Abstract: Space mission design places a premium on cost and operational efficiency. The search for new science and life beyond Earth calls for spacecraft that can deliver scientific payloads to geologically rich yet hazardous landing sites. At the same time, the last four decades of optimization research have put a suite of powerful optimization tools at the fingertips of the controls engineer. As we enter… ▽ More

    Submitted 23 August, 2021; v1 submitted 4 August, 2021; originally announced August 2021.

    Comments: 100 pages, 18 figures, 1 table; accepted in Elsevier Annual Reviews in Control

  6. arXiv:2106.09125  [pdf, other

    math.OC cs.RO eess.SY

    Convex Optimization for Trajectory Generation

    Authors: Danylo Malyuta, Taylor P. Reynolds, Michael Szmuk, Thomas Lew, Riccardo Bonalli, Marco Pavone, Behcet Acikmese

    Abstract: Reliable and efficient trajectory generation methods are a fundamental need for autonomous dynamical systems of tomorrow. The goal of this article is to provide a comprehensive tutorial of three major convex optimization-based trajectory generation methods: lossless convexification (LCvx), and two sequential convex programming algorithms known as SCvx and GuSTO. In this article, trajectory generat… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: 68 pages, 42 figures, 5 tables. This work has been submitted to the IEEE for possible publication

  7. arXiv:2012.02303  [pdf, other

    math.OC cs.MA math.DS math.PR

    Decentralized State-Dependent Markov Chain Synthesis with an Application to Swarm Guidance

    Authors: Samet Uzun, Nazim Kemal Ure, Behcet Acikmese

    Abstract: This paper introduces a decentralized state-dependent Markov chain synthesis (DSMC) algorithm for finite-state Markov chains. We present a state-dependent consensus protocol that achieves exponential convergence under mild technical conditions, without relying on any connectivity assumptions regarding the dynamic network topology. Utilizing the proposed consensus protocol, we develop the DSMC algo… ▽ More

    Submitted 26 April, 2024; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: arXiv admin note: text overlap with arXiv:2012.01928

  8. arXiv:2011.05562  [pdf, other

    cs.GT eess.SY

    Stability of Gradient Learning Dynamics in Continuous Games: Vector Action Spaces

    Authors: Benjamin J. Chasnov, Daniel Calderone, Behçet Açıkmeşe, Samuel A. Burden, Lillian J. Ratliff

    Abstract: Towards characterizing the optimization landscape of games, this paper analyzes the stability of gradient-based dynamics near fixed points of two-player continuous games. We introduce the quadratic numerical range as a method to characterize the spectrum of game dynamics and prove the robustness of equilibria to variations in learning rates. By decomposing the game Jacobian into symmetric and skew… ▽ More

    Submitted 13 January, 2021; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: extension of arXiv:2011.03650 to vector action spaces. Submitted to IEEE L-CSS

  9. arXiv:2011.03650  [pdf, other

    cs.GT eess.SY

    Stability of Gradient Learning Dynamics in Continuous Games: Scalar Action Spaces

    Authors: Benjamin J. Chasnov, Daniel Calderone, Behçet Açıkmeşe, Samuel A. Burden, Lillian J. Ratliff

    Abstract: Learning processes in games explain how players grapple with one another in seeking an equilibrium. We study a natural model of learning based on individual gradients in two-player continuous games. In such games, the arguably natural notion of a local equilibrium is a differential Nash equilibrium. However, the set of locally exponentially stable equilibria of the learning dynamics do not necessa… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

    Comments: Accepted to 2020 IEEE Conference on Decision and Control

  10. Disturbance Decoupling for Gradient-based Multi-Agent Learning with Quadratic Costs

    Authors: Sarah H. Q. Li, Lillian Ratliff, Behçet Açıkmeşe

    Abstract: Motivated by applications of multi-agent learning in noisy environments, this paper studies the robustness of gradient-based learning dynamics with respect to disturbances. While disturbances injected along a coordinate corresponding to any individual player's actions can always affect the overall learning dynamics, a subset of players can be disturbance decoupled---i.e., such players' actions are… ▽ More

    Submitted 10 October, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

    Journal ref: IEEE Control Systems Letters, vol. 5, no. 1, pp. 223-228, Jan. 2021

  11. Bounding Fixed Points of Set-Based Bellman Operator and Nash Equilibria of Stochastic Games

    Authors: Sarah H. Q. Li, Assalé, Adjé, Pierre-Loïc Garoche, Behçet Açıkmeşe

    Abstract: Motivated by uncertain parameters encountered in Markov decision processes (MDPs) and stochastic games, we study the effect of parameter uncertainty on Bellman operator-based algorithms under a set-based framework. Specifically, we first consider a family of MDPs where the cost parameters are in a given compact set; we then define a Bellman operator acting on a set of value functions to produce a… ▽ More

    Submitted 10 October, 2020; v1 submitted 22 January, 2020; originally announced January 2020.

    Comments: 15 pages, 4 figures

  12. arXiv:2001.04535  [pdf, ps, other

    math.OC cs.GT

    Fixed Points of the Set-Based Bellman Operator

    Authors: Sarah H. Q. Li, Assalé Adjé, Pierre-Loïc Garoche, Behçet Açıkmeşe

    Abstract: Motivated by uncertain parameters encountered in Markov decision processes (MDPs), we study the effect of parameter uncertainty on Bellman operator-based methods. Specifically, we consider a family of MDPs where the cost parameters are from a given compact set. We then define a Bellman operator acting on an input set of value functions to produce a new set of value functions as the output under al… ▽ More

    Submitted 29 February, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

    Comments: 4 pages

  13. Sensitivity Analysis for Markov Decision Process Congestion Games

    Authors: Sarah H. Q. Li, Daniel Calderone, Lillian Ratliff, Behcet Acikmese

    Abstract: We consider a non-atomic congestion game where each decision maker performs selfish optimization over states of a common MDP. The decision makers optimize for their own expected costs, and influence each other through congestion effects on the state-action costs. We analyze on the sensitivity of MDP congestion game equilibria to uncertainty and perturbations in the state-action costs by applying a… ▽ More

    Submitted 12 September, 2019; v1 submitted 9 September, 2019; originally announced September 2019.

  14. arXiv:1907.08912  [pdf, other

    cs.GT math.OC

    Adaptive Constraint Satisfaction for Markov Decision Process Congestion Games: Application to Transportation Networks

    Authors: Sarah H. Q. Li, Yue Yu, Nicolas Miguel, Dan Calderone, Lillian J. Ratliff, Behcet Acikmese

    Abstract: Under the Markov decision process (MDP) congestion game framework, we study the problem of enforcing population distribution constraints on a population of players with stochastic dynamics and coupled congestion costs. Existing research demonstrates that the constraints on the players' population distribution can be satisfied by enforcing tolls. However, computing the minimum toll value for constr… ▽ More

    Submitted 14 August, 2022; v1 submitted 21 July, 2019; originally announced July 2019.

    Comments: 10 pages, 5 figures

  15. Tolling for Constraint Satisfaction in Markov Decision Process Congestion Games

    Authors: Sarah H. Q. Li, Yue Yu, Daniel Calderone, Lillian Ratliff, Behcet Acikmese

    Abstract: Markov decision process (MDP) congestion game is an extension of classic congestion games, where a continuous population of selfish agents solves Markov decision processes with congestion: the payoff of a strategy decreases as more population uses it. We draw parallels between key concepts from capacitated congestion games and MDP. In particular, we show that population mass constraints in MDP con… ▽ More

    Submitted 2 March, 2019; originally announced March 2019.

    Comments: 7 pages, 6 figures, accepted to American Control Conference 2019

  16. arXiv:1607.01478  [pdf, other

    cs.RO cs.AI eess.SY

    Mixed Strategy for Constrained Stochastic Optimal Control

    Authors: Masahiro Ono, Mahmoud El Chamie, Marco Pavone, Behcet Acikmese

    Abstract: Choosing control inputs randomly can result in a reduced expected cost in optimal control problems with stochastic constraints, such as stochastic model predictive control (SMPC). We consider a controller with initial randomization, meaning that the controller randomly chooses from K+1 control sequences at the beginning (called K-randimization).It is known that, for a finite-state, finite-action M… ▽ More

    Submitted 6 July, 2016; originally announced July 2016.

    Comments: 11 pages. 9 figures.Preliminary version of a working journal paper