Search | arXiv e-print repository

Six-Degree-of-Freedom Aircraft Landing Trajectory Planning with Runway Alignment

Authors: Taewan Kim, Abhinav G. Kamath, Niyousha Rahimi, Jasper Corleis, Behçet Açıkmeşe, Mehran Mesbahi

Abstract: This paper presents a numerical optimization algorithm for generating approach and landing trajectories for a six-degree-of-freedom (6-DoF) aircraft. We improve on the existing research on aircraft landing trajectory generation by formulating the trajectory optimization problem with additional real-world operational constraints, including 6-DoF aircraft dynamics, runway alignment, constant wind fi… ▽ More This paper presents a numerical optimization algorithm for generating approach and landing trajectories for a six-degree-of-freedom (6-DoF) aircraft. We improve on the existing research on aircraft landing trajectory generation by formulating the trajectory optimization problem with additional real-world operational constraints, including 6-DoF aircraft dynamics, runway alignment, constant wind field, and obstacle avoidance, to obtain a continuous-time nonconvex optimal control problem. Particularly, the runway alignment constraint enforces the trajectory of the aircraft to be aligned with the runway only during the final approach phase. This is a novel feature that is essential for preventing an approach that is either too steep or too shallow. The proposed method models the runway alignment constraint through a multi-phase trajectory planning scheme, imposing alignment conditions exclusively during the final approach phase. We compare this formulation with the existing state-triggered constraint formulation for runway alignment. To solve the formulated problem, we design a novel sequential convex programming algorithm called xPTR that extends the penalized trust-region (PTR) algorithm by incorporating an extrapolation step to expedite convergence. We validate the proposed method through extensive numerical simulations, including a Monte Carlo study, to evaluate the robustness of the algorithm to varying initial conditions. △ Less

Submitted 26 May, 2024; originally announced May 2024.

Comments: This article was submitted to JGCD

arXiv:2405.10996 [pdf, other]

Optimization with Temporal and Logical Specifications via Generalized Mean-based Smooth Robustness Measures

Authors: Samet Uzun, Purnanand Elango, Pierre-Loic Garoche, Behcet Acikmese

Abstract: This paper introduces a generalized mean-based C^1-smooth robustness measure over discrete-time signals (D-GMSR) for signal temporal logic (STL) specifications. In conjunction with its C1-smoothness, D-GMSR is proven to be both sound and complete. Furthermore, it demonstrates favorable gradient properties and addresses locality and masking problems, which are critical for numerical optimization. T… ▽ More This paper introduces a generalized mean-based C^1-smooth robustness measure over discrete-time signals (D-GMSR) for signal temporal logic (STL) specifications. In conjunction with its C1-smoothness, D-GMSR is proven to be both sound and complete. Furthermore, it demonstrates favorable gradient properties and addresses locality and masking problems, which are critical for numerical optimization. The C^1-smoothness of the proposed formulations enables the implementation of robust and efficient numerical optimization algorithms to solve problems with STL specifications while preserving their theoretical guarantees. The practical utility of the proposed robustness measure is demonstrated on two real-world trajectory optimization problems: i) quadrotor flight, and ii) autonomous rocket landing. A sequential convex programming (SCP) framework, incorporating a convergence-guaranteed optimization algorithm (the prox-linear method) is used to solve inherently non-convex trajectory optimization problems with STL specifications. The implementation is available at https://github.com/UW-ACL/D-GMSR △ Less

Submitted 16 May, 2024; originally announced May 2024.

arXiv:2405.00061 [pdf, other]

Successive Convexification for Nonlinear Model Predictive Control with Continuous-Time Constraint Satisfaction

Authors: Samet Uzun, Purnanand Elango, Abhinav G. Kamath, Taewan Kim, Behcet Acikmese

Abstract: We propose a nonlinear model predictive control (NMPC) framework based on a direct optimal control method that ensures continuous-time constraint satisfaction and accurate evaluation of the running cost, without compromising computational efficiency. We leverage the recently proposed successive convexification framework for trajectory optimization, where: (1) the path constraints and running cost… ▽ More We propose a nonlinear model predictive control (NMPC) framework based on a direct optimal control method that ensures continuous-time constraint satisfaction and accurate evaluation of the running cost, without compromising computational efficiency. We leverage the recently proposed successive convexification framework for trajectory optimization, where: (1) the path constraints and running cost are equivalently reformulated by augmenting the system dynamics, (2) multiple shooting is used for exact discretization, and (3) a convergence-guaranteed sequential convex programming (SCP) algorithm, the prox-linear method, is used to solve the discretized receding-horizon optimal control problems. The resulting NMPC framework is computationally efficient, owing to its support for warm-starting and premature termination of SCP, and its reliance on first-order information only. We demonstrate the effectiveness of the proposed NMPC framework by means of a numerical example with reference-tracking and obstacle avoidance. The implementation is available at https://github.com/UW-ACL/nmpc-ctcs △ Less

Submitted 26 April, 2024; originally announced May 2024.

arXiv:2404.18034 [pdf, other]

Fast Monte Carlo Analysis for 6-DoF Powered-Descent Guidance via GPU-Accelerated Sequential Convex Programming

Authors: Govind M. Chari, Abhinav G. Kamath, Purnanand Elango, Behçet Açıkmeşe

Abstract: We introduce a GPU-accelerated Monte Carlo framework for nonconvex, free-final-time trajectory optimization problems. This framework makes use of the prox-linear method, which belongs to the larger family of sequential convex programming (SCP) algorithms, in conjunction with a constraint reformulation that guarantees inter-sample constraint satisfaction. Key features of this framework are: (1) con… ▽ More We introduce a GPU-accelerated Monte Carlo framework for nonconvex, free-final-time trajectory optimization problems. This framework makes use of the prox-linear method, which belongs to the larger family of sequential convex programming (SCP) algorithms, in conjunction with a constraint reformulation that guarantees inter-sample constraint satisfaction. Key features of this framework are: (1) continuous-time constraint satisfaction; (2) a matrix-inverse-free solution method; (3) the use of the proportional-integral projected gradient (PIPG) method, a first-order convex optimization solver, customized to the convex subproblem at hand; and, (4) an end-to-end, library-free implementation of the algorithm. We demonstrate this GPU-based framework on the 6-DoF powered-descent guidance problem, and show that it is faster than an equivalent serial CPU implementation for Monte Carlo simulations with over 1000 runs. To the best of our knowledge, this is the first GPU-based implementation of a general-purpose nonconvex trajectory optimization solver. △ Less

Submitted 27 April, 2024; originally announced April 2024.

Comments: AIAA SciTech 2024 Forum

arXiv:2404.16826 [pdf, other]

Successive Convexification for Trajectory Optimization with Continuous-Time Constraint Satisfaction

Authors: Purnanand Elango, Dayou Luo, Abhinav G. Kamath, Samet Uzun, Taewan Kim, Behçet Açıkmeşe

Abstract: We present successive convexification, a real-time-capable solution method for nonconvex trajectory optimization, with continuous-time constraint satisfaction and guaranteed convergence, that only requires first-order information. The proposed framework combines several key methods to solve a large class of nonlinear optimal control problems: (i) exterior penalty-based reformulation of the path co… ▽ More We present successive convexification, a real-time-capable solution method for nonconvex trajectory optimization, with continuous-time constraint satisfaction and guaranteed convergence, that only requires first-order information. The proposed framework combines several key methods to solve a large class of nonlinear optimal control problems: (i) exterior penalty-based reformulation of the path constraints; (ii) generalized time-dilation; (iii) multiple-shooting discretization; (iv) $\ell_1$ exact penalization of the nonconvex constraints; and (v) the prox-linear method, a sequential convex programming (SCP) algorithm for convex-composite minimization. The reformulation of the path constraints enables continuous-time constraint satisfaction even on sparse discretization grids and obviates the need for mesh refinement heuristics. Through the prox-linear method, we guarantee convergence of the solution method to stationary points of the penalized problem and guarantee that the converged solutions that are feasible with respect to the discretized and control-parameterized optimal control problem are also Karush-Kuhn-Tucker (KKT) points. Furthermore, we highlight the specialization of this property to global minimizers of convex optimal control problems, wherein the reformulated path constraints cannot be represented by canonical cones, i.e., in the form required by existing convex optimization solvers. In addition to theoretical analysis, we demonstrate the effectiveness and real-time capability of the proposed framework with numerical examples based on popular optimal control applications: dynamic obstacle avoidance and rocket landing. △ Less

Submitted 25 April, 2024; originally announced April 2024.

arXiv:2404.16269 [pdf, other]

Expected Time-Optimal Control: a Particle MPC-based Approach via Sequential Convex Programming

Authors: Kazuya Echigo, Abhishek Cauligi, Behçet Açıkmeşe

Abstract: In this paper, we consider the problem of minimum-time optimal control for a dynamical system with initial state uncertainties and propose a sequential convex programming (SCP) solution framework. We seek to minimize the expected terminal (mission) time, which is an essential capability for planetary exploration missions where ground rovers have to carry out scientific tasks efficiently within the… ▽ More In this paper, we consider the problem of minimum-time optimal control for a dynamical system with initial state uncertainties and propose a sequential convex programming (SCP) solution framework. We seek to minimize the expected terminal (mission) time, which is an essential capability for planetary exploration missions where ground rovers have to carry out scientific tasks efficiently within the mission timelines in uncertain environments. Our main contribution is to convert the underlying stochastic optimal control problem into a deterministic, numerically tractable, optimal control problem. To this end, the proposed solution framework combines two strategies from previous methods: i) a partial model predictive control with consensus horizon approach and ii) a sum-of-norm cost, a temporally strictly increasing weighted-norm, promoting minimum-time trajectories. Our contribution is to adopt these formulations into an SCP solution framework and obtain a numerically tractable stochastic control algorithm. We then demonstrate the resulting control method in multiple applications: i) a closed-loop linear system as a representative result (a spacecraft double integrator model), ii) an open-loop linear system (the same model), and then iii) a nonlinear system (Dubin's car). △ Less

Submitted 24 April, 2024; originally announced April 2024.

Comments: submitted to CDC 2024

arXiv:2403.15656 [pdf, ps, other]

Constraint Preconditioning and Parameter Selection for a First-Order Primal-Dual Method applied to Model Predictive Control

Authors: Govind M. Chari, Yue Yu, Behçet Açıkmeşe

Abstract: Many techniques for real-time trajectory optimization and control require the solution of optimization problems at high frequencies. However, ill-conditioning in the optimization problem can significantly reduce the speed of first-order primal-dual optimization algorithms. We introduce a preconditioning technique and step-size heuristic for Proportional-Integral Projected Gradient (PIPG), a first-… ▽ More Many techniques for real-time trajectory optimization and control require the solution of optimization problems at high frequencies. However, ill-conditioning in the optimization problem can significantly reduce the speed of first-order primal-dual optimization algorithms. We introduce a preconditioning technique and step-size heuristic for Proportional-Integral Projected Gradient (PIPG), a first-order primal-dual algorithm. The preconditioning technique, based on the QR factorization, aims to reduce the condition number of the KKT matrix associated with the optimization problem. Our step-size selection heuristic chooses step-sizes to minimize the upper bound on the convergence of the primal-dual gap for the optimization problem. These algorithms are tested on two model predictive control problem examples and show a solve-time reduction of at least 3.6x. △ Less

Submitted 22 March, 2024; originally announced March 2024.

arXiv:2403.00733 [pdf, ps, other]

Remarks on "Successive Convexification: A Superlinearly Convergent Algorithm for Non-convex Optimal Control Problems"

Authors: Dayou Luo, Purnanand Elango, Behcet Acikmese

Abstract: The purpose of this note is to highlight and address inaccuracies in the convergence guarantees of SCvx, a nonconvex trajectory optimization algorithm proposed by Mao et al. (arXiv:1804.06539), and make connections to relevant prior work. Specifically, we identify errors in the convergence proof within Mao et al. (arXiv:1804.06539) and reestablish the proof of convergence by employing a new method… ▽ More The purpose of this note is to highlight and address inaccuracies in the convergence guarantees of SCvx, a nonconvex trajectory optimization algorithm proposed by Mao et al. (arXiv:1804.06539), and make connections to relevant prior work. Specifically, we identify errors in the convergence proof within Mao et al. (arXiv:1804.06539) and reestablish the proof of convergence by employing a new method under stricter assumptions. △ Less

Submitted 13 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

arXiv:2402.15629 [pdf, other]

Sufficient LMI Copositivity Conditions for Funnel Synthesis of Uncertain Nonlinear Systems

Authors: Taewan Kim, Behçet Açıkmeşe

Abstract: Funnel synthesis refers to a procedure for synthesizing a time-varying controlled invariant set and an associated control law around a nominal trajectory. The computation of the funnel involves solving a continuous-time differential equation or inequality, ensuring the invariance of the funnel. Previous approaches often compromise the invariance property of the funnel; for example, they may enforc… ▽ More Funnel synthesis refers to a procedure for synthesizing a time-varying controlled invariant set and an associated control law around a nominal trajectory. The computation of the funnel involves solving a continuous-time differential equation or inequality, ensuring the invariance of the funnel. Previous approaches often compromise the invariance property of the funnel; for example, they may enforce the equation or the inequality only at discrete temporal nodes and not having formal guarantee of invariance at all times. This paper proposes a computational funnel synthesis method that can satisfy the invariance of the funnel without such compromises. We derive a finite number of linear matrix inequalities (LMIs) that imply the satifaction of a continuous-time differential linear matrix inequality guaranteeing the invariance of the funnel at all times from the initial to the final time. To this end, we utilize LMI conditions ensuring matrix copositivity, which then imply continuous-time invariance. The primary contribution of the paper is to prove that the resulting funnel is indeed invariant over a finite time horizon. We validate the proposed method via a three-dimensional trajectory planning and control problem with obstacle avoidance constraints. △ Less

Submitted 23 February, 2024; originally announced February 2024.

Comments: submitted to LCSS

arXiv:2402.06517 [pdf]

UV Irradiation Facility for Solar Effects Simulations

Authors: Ali Behcet Alpat, Giovanni Bartolini, Sarah Bollanti, Paolo Di Lazzaro, Daniele Murra, Talifujiang Wusimanjiang

Abstract: We describe an experimental setup developed aiming to irradiate samples under UV radiation for accelerated test for solar effects according to the relevant ECSS-ESA standards. This facility has been already used for projects belonging to large space programs (Cosmic Vision, Artes) for simulations up to 3500 equivalent sun hours. In particular, we detail the calculation of the UV dose delivered by… ▽ More We describe an experimental setup developed aiming to irradiate samples under UV radiation for accelerated test for solar effects according to the relevant ECSS-ESA standards. This facility has been already used for projects belonging to large space programs (Cosmic Vision, Artes) for simulations up to 3500 equivalent sun hours. In particular, we detail the calculation of the UV dose delivered by Sun, the calibration of the detectors, the spatial distribution of the UV radiation on samples, the remote control of both samples temperature and lamp radiation, the samples heat dissipation and operation in a helium atmosphere. △ Less

Submitted 9 February, 2024; originally announced February 2024.

arXiv:2402.06027 [pdf]

Low Energy, High Flux, Uniform and Large Field Size Electron Beam Facility

Authors: Ali Behcet Alpat, Giovanni Bartolini, Talifujiang Wusimanjiang, Goesta Mattausch, Tobias Teichmann, Ralf Bluethner, Michael Thomas Müller, Carsten Zschech, Abdullah Coban, Arca Bozkurt

Abstract: Materials to be deployed in space applications have to undergo a variety of different test scenarios, simulating actual space conditions. Among these materials solar photovoltaic cells, optics, meta-materials and more will be directly exposed to space radiation and must be tested accordingly. From the design phase of such target materials to the final production, it is important to obtain informat… ▽ More Materials to be deployed in space applications have to undergo a variety of different test scenarios, simulating actual space conditions. Among these materials solar photovoltaic cells, optics, meta-materials and more will be directly exposed to space radiation and must be tested accordingly. From the design phase of such target materials to the final production, it is important to obtain information about their behavior and performance in defined irradiation scenarios and qualify them following the directions of relevant ECSS/ESA standards. An excellent method to cover part of these tests in a laboratory scale is the high-flux, high-fluence electron irradiation with the help of industrial electron beam generators. The present collaboration has developed different irradiation test setups and procedures, adapted to the existing electron beam generators at IPF in Dresden, Germany covering an energy range between 100 keV to 1.5 MeV. The test setups, aiming for high electron flux, uniform and large irradiation field sizes, and their capabilities for use in irradiation qualification tests for space applications are described in this paper. △ Less

Submitted 8 February, 2024; originally announced February 2024.

Comments: To be published on RADECS 2023 Data Workshop.It contains 4 pages, 7 Figures, 2 Tables, 7 References

arXiv:2402.04561 [pdf, other]

Spacecraft Rendezvous Guidance via Factorization-Free Sequential Convex Programming using a First-Order Method

Authors: Govind M. Chari, Behçet Açıkmeşe

Abstract: We implement a fully factorization-free algorithm for nonconvex, free-final-time trajectory optimization. This algorithm is based on sequential convex programming and utilizes an inverse-free, exact discretization procedure to ensure dynamic feasibility of the converged trajectory and PIPG, a fast, first-order conic optimization algorithm as the subproblem solver. Although PIPG requires the tuning… ▽ More We implement a fully factorization-free algorithm for nonconvex, free-final-time trajectory optimization. This algorithm is based on sequential convex programming and utilizes an inverse-free, exact discretization procedure to ensure dynamic feasibility of the converged trajectory and PIPG, a fast, first-order conic optimization algorithm as the subproblem solver. Although PIPG requires the tuning of a hyperparameter to achieve fastest convergence, we show that PIPG can be tuned to a nominal trajectory optimization problem and it is robust to variations in initial condition. We demonstrate this with a monte carlo simulation of the free-final-time rendezvous problem, using Clohessy-Wiltshire dynamics, an impulsive thrust model, and various state and control constraints including a spherical keepout zone. △ Less

Submitted 6 February, 2024; originally announced February 2024.

Comments: AAS Rocky Mountain Guidance, Navigation and Control Conference, 2024

arXiv:2401.16438 [pdf, other]

Do deep neural networks utilize the weight space efficiently?

Authors: Onur Can Koyun, Behçet Uğur Töreyin

Abstract: Deep learning models like Transformers and Convolutional Neural Networks (CNNs) have revolutionized various domains, but their parameter-intensive nature hampers deployment in resource-constrained settings. In this paper, we introduce a novel concept utilizes column space and row space of weight matrices, which allows for a substantial reduction in model parameters without compromising performance… ▽ More Deep learning models like Transformers and Convolutional Neural Networks (CNNs) have revolutionized various domains, but their parameter-intensive nature hampers deployment in resource-constrained settings. In this paper, we introduce a novel concept utilizes column space and row space of weight matrices, which allows for a substantial reduction in model parameters without compromising performance. Leveraging this paradigm, we achieve parameter-efficient deep learning models.. Our approach applies to both Bottleneck and Attention layers, effectively halving the parameters while incurring only minor performance degradation. Extensive experiments conducted on the ImageNet dataset with ViT and ResNet50 demonstrate the effectiveness of our method, showcasing competitive performance when compared to traditional models. This approach not only addresses the pressing demand for parameter efficient deep learning solutions but also holds great promise for practical deployment in real-world scenarios. △ Less

Submitted 26 January, 2024; originally announced January 2024.

arXiv:2305.19022 [pdf, other]

PDOZ: innovative personal electronic dosimeter for electron and gamma H*(d) dosimetry

Authors: Lucia Salvi, Giulia Rossi, Giovanni Bartolini, Ali Behcet Alpat, Arca Bozkurt, Mustafa Dogukan Cegil, Ahmet Talha Guleryuz

Abstract: The personal (or active) electronic dosimeters (PEDs) are devices used to determine the individual exposure to ionizing radiations and they are employed in hospitals, research laboratories and nuclear power plants. The PDOZ project is a personal electronic dosimeter able to detect, discriminate and measure the delivered dose by beta particles and gamma rays. In this paper, several Monte Carlo simu… ▽ More The personal (or active) electronic dosimeters (PEDs) are devices used to determine the individual exposure to ionizing radiations and they are employed in hospitals, research laboratories and nuclear power plants. The PDOZ project is a personal electronic dosimeter able to detect, discriminate and measure the delivered dose by beta particles and gamma rays. In this paper, several Monte Carlo simulations are described. The first one is regarding the ICRU sphere implemented to evaluate the ambient dose equivalent, H*(10), and the fluence-to-dose equivalent conversion coefficients for gamma rays and beta particles. The second simulation is carried out to study the prototype dosimeter response to gamma rays and beta particles and, also thanks to previous one, to obtain the conversion curve necessary to calculate the ambient dose equivalent from the silicon photomultipliers counts. In the last one, instead, the performance of a prototype dosimeter, composed by a small plastic scintillator coupled to two SiPMs, is evaluated and a simulation with different radioactive sources is made whose results are compared with the experimental measurements. All simulations are carried out by Geant4 including the optical photon transport. All simulations are carried out by Geant4 including the optical photon transport. △ Less

Submitted 26 July, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

arXiv:2305.08265 [pdf, other]

Vehicle Detection and Classification without Residual Calculation: Accelerating HEVC Image Decoding with Random Perturbation Injection

Authors: Muhammet Sebul Beratoğlu, Behçet Uğur Töreyin

Abstract: In the field of video analytics, particularly traffic surveillance, there is a growing need for efficient and effective methods for processing and understanding video data. Traditional full video decoding techniques can be computationally intensive and time-consuming, leading researchers to explore alternative approaches in the compressed domain. This study introduces a novel random perturbation-b… ▽ More In the field of video analytics, particularly traffic surveillance, there is a growing need for efficient and effective methods for processing and understanding video data. Traditional full video decoding techniques can be computationally intensive and time-consuming, leading researchers to explore alternative approaches in the compressed domain. This study introduces a novel random perturbation-based compressed domain method for reconstructing images from High Efficiency Video Coding (HEVC) bitstreams, specifically designed for traffic surveillance applications. To the best of our knowledge, our method is the first to propose substituting random perturbations for residual values, creating a condensed representation of the original image while retaining information relevant to video understanding tasks, particularly focusing on vehicle detection and classification as key use cases. By not using residual data, our proposed method significantly reduces the data needed in the image reconstruction process, allowing for more efficient storage and transmission of information. This is particularly important when considering the vast amount of video data involved in surveillance applications. Applied to the public BIT-Vehicle dataset, we demonstrate a significant increase in the reconstruction speed compared to the traditional full decoding approach, with our proposed method being approximately 56% faster than the pixel domain method. Additionally, we achieve a detection accuracy of 99.9%, on par with the pixel domain method, and a classification accuracy of 96.84%, only 0.98% lower than the pixel domain method. Furthermore, we showcase the significant reduction in data size, leading to more efficient storage and transmission. Our research establishes the potential of compressed domain methods in traffic surveillance applications, where speed and data size are critical factors. △ Less

Submitted 5 August, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

Comments: 10 pages 4 figures

MSC Class: 68T20 ACM Class: E.4; I.4.5; H.3.3; I.5.4

arXiv:2305.00902 [pdf]

doi 10.1016/j.cpc.2023.108688

MRADSIM-Converter: A new software for STEP to GDML conversion

Authors: Ali Behcet Alpat, Abdullah Coban, Hakan Kaya, Giovanni Bartolini

Abstract: Radiation effects analysis of instruments operative in harsh radiation environment is crucial for performance and functionality of electronic devices and components. Engineering design of instruments is usually carried out in Computer Aided Design (CAD) engineering software. Geant4-based Monte Carlo codes are extensively used for particle transport simulation and analysis. However, Geant4 is not p… ▽ More Radiation effects analysis of instruments operative in harsh radiation environment is crucial for performance and functionality of electronic devices and components. Engineering design of instruments is usually carried out in Computer Aided Design (CAD) engineering software. Geant4-based Monte Carlo codes are extensively used for particle transport simulation and analysis. However, Geant4 is not prepared to accept CAD Standard for The Exchange of Product data (STEP) format. MRADSIM-Converter is a new software for STEP to Geometry Description Markup Language (GDML) format conversion, readable by Geant4-based Monte Carlo codes. Its validation with two different converters confirms its higher speed for importing CAD geometries with arbitrary size and complexity having a user-friendly interface for modifying volumes properties. △ Less

Submitted 25 April, 2023; originally announced May 2023.

Comments: 7 pages, 4 figures, 2 tables

Journal ref: Computer Physics Communications, Volume 286, May 2023, 108688

arXiv:2304.06625 [pdf, other]

Blameless and Optimal Control under Prioritized Safety Constraints

Authors: Natalia Pavlasek, Sarah H. Q. Li, Behçet Açıkmeşe, Meeko Oishi, Claus Danielson

Abstract: In many resource-limited optimal control problems, multiple constraints may be enforced that are jointly infeasible due to external factors such as subsystem failures, unexpected disturbances, or fuel limitations. In this manuscript, we introduce the concept of blameless optimality to characterize control actions that a) satisfy the highest prioritized and feasible constraints and b) remain optima… ▽ More In many resource-limited optimal control problems, multiple constraints may be enforced that are jointly infeasible due to external factors such as subsystem failures, unexpected disturbances, or fuel limitations. In this manuscript, we introduce the concept of blameless optimality to characterize control actions that a) satisfy the highest prioritized and feasible constraints and b) remain optimal with respect to a mission objective. For a general optimal control problem with jointly infeasible constraints, we prove that a single optimization problem cannot find a blamelessly optimal control sequence. Instead, finding blamelessly optimal control actions requires sequentially solving at least two optimal control problems: one to determine the highest priority level of constraints that is feasible and another to determine the optimal control action with respect to these constraints. We apply our results to a rocket landing scenario in which violating at least one safety-induced landing constraint is unavoidable. Leveraging the concept of blameless optimality, we formulate blamelessly optimal controllers that can autonomously prioritize the constraints most critical to a mission. △ Less

Submitted 3 November, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

arXiv:2304.01583 [pdf, other]

HALO: Hazard-Aware Landing Optimization for Autonomous Systems

Authors: Christopher R. Hayner, Samuel C. Buckner, Daniel Broyles, Evelyn Madewell, Karen Leung, Behcet Acikmese

Abstract: With autonomous aerial vehicles enacting safety-critical missions, such as the Mars Science Laboratory Curiosity rover's landing on Mars, the tasks of automatically identifying and reasoning about potentially hazardous landing sites is paramount. This paper presents a coupled perception-planning solution which addresses the hazard detection, optimal landing trajectory generation, and contingency p… ▽ More With autonomous aerial vehicles enacting safety-critical missions, such as the Mars Science Laboratory Curiosity rover's landing on Mars, the tasks of automatically identifying and reasoning about potentially hazardous landing sites is paramount. This paper presents a coupled perception-planning solution which addresses the hazard detection, optimal landing trajectory generation, and contingency planning challenges encountered when landing in uncertain environments. Specifically, we develop and combine two novel algorithms, Hazard-Aware Landing Site Selection (HALSS) and Adaptive Deferred-Decision Trajectory Optimization (Adaptive-DDTO), to address the perception and planning challenges, respectively. The HALSS framework processes point cloud information to identify feasible safe landing zones, while Adaptive-DDTO is a multi-target contingency planner that adaptively replans as new perception information is received. We demonstrate the efficacy of our approach using a simulated Martian environment and show that our coupled perception-planning method achieves greater landing success whilst being more fuel efficient compared to a nonadaptive DDTO approach. △ Less

Submitted 4 April, 2023; originally announced April 2023.

Comments: The first two authors have contributed equally to this work. This work is to be published in the proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA)

arXiv:2303.10504 [pdf, other]

doi 10.1109/LCSYS.2023.3290229

Optimization-based Constrained Funnel Synthesis for Systems with Lipschitz Nonlinearities via Numerical Optimal Control

Authors: Taewan Kim, Purnanand Elango, Taylor P. Reynolds, Behçet Açıkmeşe, Mehran Mesbahi

Abstract: This paper presents a funnel synthesis algorithm for computing controlled invariant sets and feedback control gains around a given nominal trajectory for dynamical systems with locally Lipschitz nonlinearities and bounded disturbances. The resulting funnel synthesis problem involves a differential linear matrix inequality (DLMI) whose solution satisfies a Lyapunov condition that implies invariance… ▽ More This paper presents a funnel synthesis algorithm for computing controlled invariant sets and feedback control gains around a given nominal trajectory for dynamical systems with locally Lipschitz nonlinearities and bounded disturbances. The resulting funnel synthesis problem involves a differential linear matrix inequality (DLMI) whose solution satisfies a Lyapunov condition that implies invariance and attractivity properties. Due to these properties, the proposed method can balance maximization of initial invariant funnel size, i.e., size of the funnel entry, and minimization of the size of the attractive funnel for attenuating the effect of disturbance. To solve the resulting funnel synthesis problem with the DLMI as constraints, we employ a numerical optimal control approach that uses a multiple shooting method to convert the problem into a finite dimensional semidefinite programming problem. This framework does not require piecewise linear system matrices and funnel parameters, which is typically assumed in recent related work. We illustrate the proposed funnel synthesis method with a numerical example. △ Less

Submitted 1 July, 2023; v1 submitted 18 March, 2023; originally announced March 2023.

Comments: 6 pages, 3 figures, accepted to LCSS

arXiv:2212.04122 [pdf, other]

Reducing Collision Risk in Multi-Agent Path Planning: Application to Air traffic Management

Authors: Sarah H. Q. Li, Avi Mittal, Pierre-Loïc Garoche, Açıkmeşe, Behçet

Abstract: To minimize collision risks in the multi-agent path planning problem with stochastic transition dynamics, we formulate a Markov decision process congestion game with a multi-linear congestion cost. Players within the game complete individual tasks while minimizing their own collision risks. We show that the set of Nash equilibria coincides with the first-order KKT points of a non-convex optimizati… ▽ More To minimize collision risks in the multi-agent path planning problem with stochastic transition dynamics, we formulate a Markov decision process congestion game with a multi-linear congestion cost. Players within the game complete individual tasks while minimizing their own collision risks. We show that the set of Nash equilibria coincides with the first-order KKT points of a non-convex optimization problem. Our game is applied to a historical flight plan over France to reduce collision risks between commercial aircraft. △ Less

Submitted 10 December, 2022; v1 submitted 8 December, 2022; originally announced December 2022.

Comments: 6 pages, 2 figures

arXiv:2212.00375 [pdf, other]

Real-Time Sequential Conic Optimization for Multi-Phase Rocket Landing Guidance

Authors: Abhinav G. Kamath, Purnanand Elango, Yue Yu, Skye Mceowen, Govind M. Chari, John M. Carson III, Behçet Açıkmeşe

Abstract: We introduce a multi-phase rocket landing guidance framework that can handle nonlinear dynamics and does not mandate any additional mixed-integer or nonconvex constraints to handle discrete temporal events/switching. To achieve this, we first introduce sequential conic optimization (SeCO), a new paradigm for solving nonconvex optimal control problems that is entirely devoid of matrix factorization… ▽ More We introduce a multi-phase rocket landing guidance framework that can handle nonlinear dynamics and does not mandate any additional mixed-integer or nonconvex constraints to handle discrete temporal events/switching. To achieve this, we first introduce sequential conic optimization (SeCO), a new paradigm for solving nonconvex optimal control problems that is entirely devoid of matrix factorizations and inversions. This framework combines sequential convex programming (SCP) and first-order conic optimization and can solve unified multi-phase trajectory optimization problems in real-time. The novel features of this framework are: (1) time-interval dilation, which enables multi-phase trajectory optimization with free-transition-time; (2) single-crossing compound state-triggered constraints, which are entirely convex if the trigger and constraint conditions are convex; (3) virtual state, which is a new approach to handling artificial infeasibility in SCP methods that preserves the shapes of the constraint sets; and, (4) the use of the proportional-integral projected gradient method (PIPG), a high-performance first-order conic optimization solver, in tandem with the penalized trust region (PTR) SCP algorithm. We demonstrate the efficacy and real-time capability of SeCO by solving a relevant multi-phase rocket landing guidance problem with nonlinear dynamics and convex constraints only, and observe that our solver is 2.7 times faster than a state-of-the-art convex optimization solver. △ Less

Submitted 29 May, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

arXiv:2209.03535 [pdf, other]

Joint Synthesis of Trajectory and Controlled Invariant Funnel for Discrete-time Systems with Locally Lipschitz Nonlinearities

Authors: Taewan Kim, Purnanand Elango, Behcet Acikmese

Abstract: This paper presents a joint synthesis algorithm of trajectory and controlled invariant funnel (CIF) for locally Lipschitz nonlinear systems subject to bounded disturbances. The CIF synthesis refers to a procedure of computing controlled invariance sets and corresponding feedback gains. In contrast to existing CIF synthesis methods that compute the CIF with a pre-defined nominal trajectory, our wor… ▽ More This paper presents a joint synthesis algorithm of trajectory and controlled invariant funnel (CIF) for locally Lipschitz nonlinear systems subject to bounded disturbances. The CIF synthesis refers to a procedure of computing controlled invariance sets and corresponding feedback gains. In contrast to existing CIF synthesis methods that compute the CIF with a pre-defined nominal trajectory, our work aims to optimize the nominal trajectory and the CIF jointly to satisfy feasibility conditions without the relaxation of constraints and obtain a more cost-optimal nominal trajectory. The proposed work has a recursive scheme that mainly optimize trajectory update and funnel update. The trajectory update step optimizes the nominal trajectory while ensuring the feasibility of the CIF. Then, the funnel update step computes the funnel around the nominal trajectory so that the CIF guarantees an invariance property. As a result, with the optimized trajectory and CIF, any resulting trajectory propagated from an initial set by the control law with the computed feedback gain remains within the feasible region around the nominal trajectory under the presence of bounded disturbances. We validate the proposed method via two applications from robotics. △ Less

Submitted 12 January, 2024; v1 submitted 7 September, 2022; originally announced September 2022.

Comments: This paper has been accepted to International Journal of Robust and Nonlinear Control

arXiv:2208.07259 [pdf, other]

Real-Time Quadrotor Trajectory Optimization with Time-Triggered Corridor Constraints

Authors: Yue Yu, Kartik Nagpal, Skye Mceowen, Behçet Açıkmeşe, Ufuk Topcu

Abstract: One of the keys to flying quadrotors is to optimize their trajectories within the set of collision-free corridors. These corridors impose nonconvex constraints on the trajectories, making real-time trajectory optimization challenging. We introduce a novel numerical method that approximates the nonconvex corridor constraints with time-triggered convex corridor constraints. This method combines bise… ▽ More One of the keys to flying quadrotors is to optimize their trajectories within the set of collision-free corridors. These corridors impose nonconvex constraints on the trajectories, making real-time trajectory optimization challenging. We introduce a novel numerical method that approximates the nonconvex corridor constraints with time-triggered convex corridor constraints. This method combines bisection search and repeated infeasibility detection. We further develop a customized C++ implementation of the proposed method, based on a first-order conic optimization method that detects infeasibility and exploits problem structure. We demonstrate the efficiency and effectiveness of the proposed method using numerical simulation on randomly generated problem instances as well as indoor flight experiments with hoop obstacles. Compared with mixed integer programming, the proposed method is about 50--200 times faster. △ Less

Submitted 15 August, 2022; originally announced August 2022.

arXiv:2207.07271 [pdf, other]

Set-based value operators for non-stationary Markovian environments

Authors: Sarah H. Q. Li, Assalé Adjé, Pierre-Loïc Garoche, Behçet Açıkmeşe

Abstract: This paper analyzes finite state Markov Decision Processes (MDPs) with uncertain parameters in compact sets and re-examines results from robust MDP via set-based fixed point theory. To this end, we generalize the Bellman and policy evaluation operators to contracting operators on the value function space and denote them as \emph{value operators}. We lift these value operators to act on \emph{sets}… ▽ More This paper analyzes finite state Markov Decision Processes (MDPs) with uncertain parameters in compact sets and re-examines results from robust MDP via set-based fixed point theory. To this end, we generalize the Bellman and policy evaluation operators to contracting operators on the value function space and denote them as \emph{value operators}. We lift these value operators to act on \emph{sets} of value functions and denote them as \emph{set-based value operators}. We prove that the set-based value operators are \emph{contractions} in the space of compact value function sets. Leveraging insights from set theory, we generalize the rectangularity condition in classic robust MDP literature to a containment condition for all value operators, which is weaker and can be applied to a larger set of parameter-uncertain MDPs and contracting operators in dynamic programming. We prove that both the rectangularity condition and the containment condition sufficiently ensure that the set-based value operator's fixed point set contains its own extrema elements. For convex and compact sets of uncertain MDP parameters, we show equivalence between the classic robust value function and the supremum of the fixed point set of the set-based Bellman operator. Under dynamically changing MDP parameters in compact sets, we prove a set convergence result for value iteration, which otherwise may not converge to a single value function. Finally, we derive novel guarantees for probabilistic path-planning problems in planet exploration and stratospheric station-kee**. △ Less

Submitted 8 August, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

Comments: 17 pages, 11 figures, 1 table

arXiv:2203.12976 [pdf, other]

doi 10.1016/j.image.2022.116675

Focus-and-Detect: A Small Object Detection Framework for Aerial Images

Authors: Onur Can Koyun, Reyhan Kevser Keser, İbrahim Batuhan Akkaya, Behçet Uğur Töreyin

Abstract: Despite recent advances, object detection in aerial images is still a challenging task. Specific problems in aerial images makes the detection problem harder, such as small objects, densely packed objects, objects in different sizes and with different orientations. To address small object detection problem, we propose a two-stage object detection framework called "Focus-and-Detect". The first stag… ▽ More Despite recent advances, object detection in aerial images is still a challenging task. Specific problems in aerial images makes the detection problem harder, such as small objects, densely packed objects, objects in different sizes and with different orientations. To address small object detection problem, we propose a two-stage object detection framework called "Focus-and-Detect". The first stage which consists of an object detector network supervised by a Gaussian Mixture Model, generates clusters of objects constituting the focused regions. The second stage, which is also an object detector network, predicts objects within the focal regions. Incomplete Box Suppression (IBS) method is also proposed to overcome the truncation effect of region search approach. Results indicate that the proposed two-stage framework achieves an AP score of 42.06 on VisDrone validation dataset, surpassing all other state-of-the-art small object detection methods reported in the literature, to the best of authors' knowledge. △ Less

Submitted 24 March, 2022; originally announced March 2022.

Comments: 12 pages, 6 figures

Journal ref: Signal Processing: Image Communication, Volume 104, May 2022, 116675

arXiv:2203.12133 [pdf, other]

Congestion-aware path coordination game with Markov decision process dynamics

Authors: Sarah H. Q. Li, Dan Calderone, Behcet Acikmese

Abstract: Inspired by the path coordination problem arising from robo-taxis, warehouse management, and mixed-vehicle routing problems, we model a group of heterogeneous players responding to stochastic demands as a congestion game under Markov decision process dynamics. Players share a common state-action space but have unique transition dynamics, and each player's unique cost is a {function} of the joint s… ▽ More Inspired by the path coordination problem arising from robo-taxis, warehouse management, and mixed-vehicle routing problems, we model a group of heterogeneous players responding to stochastic demands as a congestion game under Markov decision process dynamics. Players share a common state-action space but have unique transition dynamics, and each player's unique cost is a {function} of the joint state-action probability distribution. For a class of player cost functions, we formulate the player-specific optimization problem, prove the equivalence between the Nash equilibrium and the solution of a potential minimization problem, and derive dynamic programming approaches to solve the Nash equilibrium. We apply this game to model multi-agent path coordination and introduce congestion-based cost functions that enable players to complete individual tasks while avoiding congestion with their opponents. Finally, we present a learning algorithm for finding the Nash equilibrium that has linear complexity in the number of players. We demonstrate our game model on a multi-robot warehouse \change{path coordination problem}, in which robots autonomously retrieve and deliver packages while avoiding congested paths. △ Less

Submitted 5 July, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

Comments: 6 pages, 4 figures

arXiv:2203.04188 [pdf, other]

Extrapolated Proportional-Integral Projected Gradient Method for Conic Optimization

Authors: Yue Yu, Purnanand Elango, Behçet Açıkmeşe, Ufuk Topcu

Abstract: Conic optimization is the minimization of a convex quadratic function subject to conic constraints. We introduce a novel first-order method for conic optimization, named \emph{extrapolated proportional-integral projected gradient method (xPIPG)}, that automatically detects infeasibility. The iterates of xPIPG either asymptotically satisfy a set of primal-dual optimality conditions, or generate a p… ▽ More Conic optimization is the minimization of a convex quadratic function subject to conic constraints. We introduce a novel first-order method for conic optimization, named \emph{extrapolated proportional-integral projected gradient method (xPIPG)}, that automatically detects infeasibility. The iterates of xPIPG either asymptotically satisfy a set of primal-dual optimality conditions, or generate a proof of primal or dual infeasibility. We demonstrate the application of xPIPG using benchmark problems in model predictive control. xPIPG outperforms many state-of-the-art conic optimization solvers, especially when solving large-scale problems. △ Less

Submitted 23 June, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

arXiv:2202.02056 [pdf, other]

Unsupervised Behaviour Analysis of News Consumption in Turkish Media

Authors: Didem Makaroglu, Altan Cakir, Behcet Ugur Toreyin

Abstract: Clickstream data, which come with a massive volume generated by human activities on websites, have become a prominent feature for identifying readers' characteristics by newsrooms after the digitization of news outlets. Although the nature of clickstream data has a similar logic within websites, it has inherent limitations in recognizing human behaviours when looking from a broad perspective, whic… ▽ More Clickstream data, which come with a massive volume generated by human activities on websites, have become a prominent feature for identifying readers' characteristics by newsrooms after the digitization of news outlets. Although the nature of clickstream data has a similar logic within websites, it has inherent limitations in recognizing human behaviours when looking from a broad perspective, which brings the need to limit the problem in niche areas. This study investigates the anonymized readers' click activities on the organizations' websites to identify news consumption patterns following referrals from Twitter,who incidentally reach but propensity is mainly routed news content. Methodologies for ensemble cluster analysis with mixed-type embedding strategies are applied and compared to find similar reader groups and interests independent of time. Various internal validation perspectives are used to determine the optimality of the quality of clusters, where the Calinski Harabasz Index (CHI) is found to give a generalizable result. Our findings demonstrate that clustering a mixed-type dataset approaches the optimal internal validation scores, which we define to discriminate the clusters and algorithms considering applied strategies when embedded by Uniform Manifold Approximation and Projection (UMAP) and using a consensus function as a key to access the most applicable hyperparameter configurations in the given ensemble rather than using consensus function results directly. Evaluation of the resulting clusters highlights specific clusters repeatedly present in the separated monthly samples by Adjusted Mutual Information scores greater than 0.5, which provide insights to the news organizations and overcome the degradation of the modeling behaviours due to the change in the interest over time. △ Less

Submitted 8 October, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

Comments: Submitted to Big Data Research

arXiv:2112.00108 [pdf, other]

SCvx-fast: A Superlinearly Convergent Algorithm for A Class of Non-Convex Optimal Control Problems

Authors: Yuanqi Mao, Behcet Acikmese

Abstract: In this paper, we extend our previous results and formally propose the SCvx-fast algorithm, a new addition to the Successive Convexification algorithmic framework. The said algorithm solves non-convex optimal control problems with specific types of state constraints (i.e. union of convex keep-out zones) and is faster to converge than SCvx, its predecessor. In order to preserve more feasibility, th… ▽ More In this paper, we extend our previous results and formally propose the SCvx-fast algorithm, a new addition to the Successive Convexification algorithmic framework. The said algorithm solves non-convex optimal control problems with specific types of state constraints (i.e. union of convex keep-out zones) and is faster to converge than SCvx, its predecessor. In order to preserve more feasibility, the proposed algorithm uses a novel project-and convexify procedure to successively convexify both state constraints and system dynamics, and thus a finite dimensional convex programming subproblem is solved at each succession. It also gets rid of the dependency on trust regions, gaining the ability to take larger steps and thus ultimately attaining faster convergence. The extension is in three folds as follows. i) We can now initialize the algorithm from an infeasible starting point, and regain feasibility in just one step; ii) We get rid of the smoothness conditions on the constraints so that a broader range of "obstacles" can be included. Significant changes are made to adjust the algorithm accordingly; iii) We obtain a proof of superlinear rate of convergence, a new theoretical result for SCvx-fast. Benefiting from its specific problem setup and the project-and convexify procedure, the SCvx-fast algorithm is particularly suitable for solving trajectory planning problems with collision avoidance constraints. Numerical simulations are performed, affirming the fast convergence rate. With powerful convex programming solvers, the algorithm can be implemented onboard for real-time autonomous guidance applications. △ Less

Submitted 30 November, 2021; originally announced December 2021.

Comments: 22 pages, 7 figures. arXiv admin note: substantial text overlap with arXiv:1701.00558

arXiv:2110.06975 [pdf, ps, other]

doi 10.23919/ACC53348.2022.9867151

Guided Policy Search using Sequential Convex Programming for Initialization of Trajectory Optimization Algorithms

Authors: Taewan Kim, Purnanand Elango, Danylo Malyuta, Behcet Acikmese

Abstract: Nonlinear trajectory optimization algorithms have been developed to handle optimal control problems with nonlinear dynamics and nonconvex constraints in trajectory planning. The performance and computational efficiency of many trajectory optimization methods are sensitive to the initial guess, i.e., the trajectory guess needed by the recursive trajectory optimization algorithm. Motivated by this o… ▽ More Nonlinear trajectory optimization algorithms have been developed to handle optimal control problems with nonlinear dynamics and nonconvex constraints in trajectory planning. The performance and computational efficiency of many trajectory optimization methods are sensitive to the initial guess, i.e., the trajectory guess needed by the recursive trajectory optimization algorithm. Motivated by this observation, we tackle the initialization problem for trajectory optimization via policy optimization. To optimize a policy, we propose a guided policy search method that has two key components: i) Trajectory update; ii) Policy update. The trajectory update involves offline solutions of a large number of trajectory optimization problems from different initial states via Sequential Convex Programming (SCP). Here we take a single SCP step to generate the trajectory iterate for each problem. In conjunction with these iterates, we also generate additional trajectories around each iterate via a feedback control law. Then all these trajectories are used by a stochastic gradient descent algorithm to update the neural network policy, i.e., the policy update step. As a result, the trained policy makes it possible to generate trajectory candidates that are close to the optimality and feasibility and that provide excellent initial guesses for the trajectory optimization methods. We validate the proposed method via a real-world 6-degree-of-freedom powered descent guidance problem for a reusable rocket. △ Less

Submitted 19 May, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

Comments: Presented in American Control Conference (ACC) 2022

arXiv:2109.02347 [pdf, ps, other]

Discrete-Time Linear-Quadratic Regulation via Optimal Transport

Authors: Mathias Hudoba de Badyn, Erik Miehling, Dylan Janak, Behçet Açıkmeşe, Mehran Mesbahi, Tamer Başar, John Lygeros, Roy S. Smith

Abstract: In this paper, we consider a discrete-time stochastic control problem with uncertain initial and target states. We first discuss the connection between optimal transport and stochastic control problems of this form. Next, we formulate a linear-quadratic regulator problem where the initial and terminal states are distributed according to specified probability densities. A closed-form solution for t… ▽ More In this paper, we consider a discrete-time stochastic control problem with uncertain initial and target states. We first discuss the connection between optimal transport and stochastic control problems of this form. Next, we formulate a linear-quadratic regulator problem where the initial and terminal states are distributed according to specified probability densities. A closed-form solution for the optimal transport map in the case of linear-time varying systems is derived, along with an algorithm for computing the optimal map. Two numerical examples pertaining to swarm deployment demonstrate the practical applicability of the model, and performance of the numerical method. △ Less

Submitted 6 September, 2021; originally announced September 2021.

Comments: 8 pages, 6 figures. To be included in the Proceedings of the 60th Conference on Decision and Control. This version includes proofs

arXiv:2108.10260 [pdf, other]

Proportional-Integral Projected Gradient Method for Conic Optimization

Authors: Yue Yu, Purnanand Elango, Ufuk Topcu, Behçet Açıkmeşe

Abstract: Conic optimization is the minimization of a differentiable convex objective function subject to conic constraints. We propose a novel primal-dual first-order method for conic optimization, named proportional-integral projected gradient method (PIPG). PIPG ensures that both the primal-dual gap and the constraint violation converge to zero at the rate of $O(1/k)$, where $k$ is the number of iter… ▽ More Conic optimization is the minimization of a differentiable convex objective function subject to conic constraints. We propose a novel primal-dual first-order method for conic optimization, named proportional-integral projected gradient method (PIPG). PIPG ensures that both the primal-dual gap and the constraint violation converge to zero at the rate of $O(1/k)$, where $k$ is the number of iterations. If the objective function is strongly convex, PIPG improves the convergence rate of the primal-dual gap to $O(1/k^2)$. Further, unlike any existing first-order methods, PIPG also improves the convergence rate of the constraint violation to $O(1/k^3)$. We demonstrate the application of PIPG in constrained optimal control problems. △ Less

Submitted 13 December, 2021; v1 submitted 23 August, 2021; originally announced August 2021.

arXiv:2108.02335 [pdf, other]

Advances in Trajectory Optimization for Space Vehicle Control

Authors: Danylo Malyuta, Yue Yu, Purnanand Elango, Behcet Acikmese

Abstract: Space mission design places a premium on cost and operational efficiency. The search for new science and life beyond Earth calls for spacecraft that can deliver scientific payloads to geologically rich yet hazardous landing sites. At the same time, the last four decades of optimization research have put a suite of powerful optimization tools at the fingertips of the controls engineer. As we enter… ▽ More Space mission design places a premium on cost and operational efficiency. The search for new science and life beyond Earth calls for spacecraft that can deliver scientific payloads to geologically rich yet hazardous landing sites. At the same time, the last four decades of optimization research have put a suite of powerful optimization tools at the fingertips of the controls engineer. As we enter the new decade, optimization theory, algorithms, and software tooling have reached a critical mass to start seeing serious application in space vehicle guidance and control systems. This survey paper provides a detailed overview of recent advances, successes, and promising directions for optimization-based space vehicle control. The considered applications include planetary landing, rendezvous and proximity operations, small body landing, constrained attitude reorientation, endo-atmospheric flight including ascent and reentry, and orbit transfer and injection. The primary focus is on the last ten years of progress, which have seen a veritable rise in the number of applications using three core technologies: lossless convexification, sequential convex programming, and model predictive control. The reader will come away with a well-rounded understanding of the state-of-the-art in each space vehicle control application, and will be well positioned to tackle important current open problems using convex optimization as a core technology. △ Less

Submitted 23 August, 2021; v1 submitted 4 August, 2021; originally announced August 2021.

Comments: 100 pages, 18 figures, 1 table; accepted in Elsevier Annual Reviews in Control

arXiv:2107.07001 [pdf, other]

Fast Homotopy for Spacecraft Rendezvous Trajectory Optimization with Discrete Logic

Authors: Danylo Malyuta, Behcet Acikmese

Abstract: This paper presents a computationally efficient optimization algorithm for solving nonconvex optimal control problems that involve discrete logic constraints. Traditional solution methods for these constraints require binary variables and mixed-integer programming, which is prohibitively slow and computationally expensive. This paper targets a fast solution that is capable of real-time implementat… ▽ More This paper presents a computationally efficient optimization algorithm for solving nonconvex optimal control problems that involve discrete logic constraints. Traditional solution methods for these constraints require binary variables and mixed-integer programming, which is prohibitively slow and computationally expensive. This paper targets a fast solution that is capable of real-time implementation onboard spacecraft. To do so, a novel algorithm is developed that blends sequential convex programming and numerical continuation into a single iterative solution process. Inside the algorithm, discrete logic constraints are approximated by smooth functions, and a homotopy parameter governs the accuracy of this approximation. As the algorithm converges, the homotopy parameter is updated such that the smooth approximations enforce the exact discrete logic. The effectiveness of this approach is numerically demonstrated for a realistic rendezvous scenario inspired by the Apollo Transposition and Docking maneuver. In under 15 seconds of cumulative solver time, the algorithm is able to reliably find difficult fuel-optimal trajectories that obey the following discrete logic constraints: thruster minimum impulse-bit, range-triggered approach cone, and range-triggered plume im**ement. The optimized trajectory uses significantly less fuel than reported NASA design targets. △ Less

Submitted 14 July, 2021; originally announced July 2021.

Comments: 40 pages, 19 figures; submitted to the AIAA Journal of Guidance, Control, and Dynamics

arXiv:2106.09125 [pdf, other]

Convex Optimization for Trajectory Generation

Authors: Danylo Malyuta, Taylor P. Reynolds, Michael Szmuk, Thomas Lew, Riccardo Bonalli, Marco Pavone, Behcet Acikmese

Abstract: Reliable and efficient trajectory generation methods are a fundamental need for autonomous dynamical systems of tomorrow. The goal of this article is to provide a comprehensive tutorial of three major convex optimization-based trajectory generation methods: lossless convexification (LCvx), and two sequential convex programming algorithms known as SCvx and GuSTO. In this article, trajectory generat… ▽ More Reliable and efficient trajectory generation methods are a fundamental need for autonomous dynamical systems of tomorrow. The goal of this article is to provide a comprehensive tutorial of three major convex optimization-based trajectory generation methods: lossless convexification (LCvx), and two sequential convex programming algorithms known as SCvx and GuSTO. In this article, trajectory generation is the computation of a dynamically feasible state and control signal that satisfies a set of constraints while optimizing key mission objectives. The trajectory generation problem is almost always nonconvex, which typically means that it is not readily amenable to efficient and reliable solution onboard an autonomous vehicle. The three algorithms that we discuss use problem reformulation and a systematic algorithmic strategy to nonetheless solve nonconvex trajectory generation tasks through the use of a convex optimizer. The theoretical guarantees and computational speed offered by convex optimization have made the algorithms popular in both research and industry circles. To date, the list of applications includes rocket landing, spacecraft hypersonic reentry, spacecraft rendezvous and docking, aerial motion planning for fixed-wing and quadrotor vehicles, robot motion planning, and more. Among these applications are high-profile rocket flights conducted by organizations like NASA, Masten Space Systems, SpaceX, and Blue Origin. This article aims to give the reader the tools and understanding necessary to work with each algorithm, and to know what each method can and cannot do. A publicly available source code repository supports the provided numerical examples. By the end of the article, the reader should be ready to use the methods, to extend them, and to contribute to their many exciting modern applications. △ Less

Submitted 16 June, 2021; originally announced June 2021.

Comments: 68 pages, 42 figures, 5 tables. This work has been submitted to the IEEE for possible publication

arXiv:2105.00695 [pdf, other]

ResVGAE: Going Deeper with Residual Modules for Link Prediction

Authors: Indrit Nallbani, Reyhan Kevser Keser, Aydin Ayanzadeh, Nurullah Çalık, Behçet Uğur Töreyin

Abstract: Graph autoencoders are efficient at embedding graph-based data sets. Most graph autoencoder architectures have shallow depths which limits their ability to capture meaningful relations between nodes separated by multi-hops. In this paper, we propose Residual Variational Graph Autoencoder, ResVGAE, a deep variational graph autoencoder model with multiple residual modules. We show that our multiple… ▽ More Graph autoencoders are efficient at embedding graph-based data sets. Most graph autoencoder architectures have shallow depths which limits their ability to capture meaningful relations between nodes separated by multi-hops. In this paper, we propose Residual Variational Graph Autoencoder, ResVGAE, a deep variational graph autoencoder model with multiple residual modules. We show that our multiple residual modules, a convolutional layer with residual connection, improve the average precision of the graph autoencoders. Experimental results suggest that our proposed model with residual modules outperforms the models without residual modules and achieves similar results when compared with other state-of-the-art methods. △ Less

Submitted 4 August, 2022; v1 submitted 3 May, 2021; originally announced May 2021.

arXiv:2103.00053 [pdf, other]

doi 10.1016/j.eswa.2022.119040

PURSUhInT: In Search of Informative Hint Points Based on Layer Clustering for Knowledge Distillation

Authors: Reyhan Kevser Keser, Aydin Ayanzadeh, Omid Abdollahi Aghdam, Caglar Kilcioglu, Behcet Ugur Toreyin, Nazim Kemal Ure

Abstract: One of the most efficient methods for model compression is hint distillation, where the student model is injected with information (hints) from several different layers of the teacher model. Although the selection of hint points can drastically alter the compression performance, conventional distillation approaches overlook this fact and use the same hint points as in the early studies. Therefore,… ▽ More One of the most efficient methods for model compression is hint distillation, where the student model is injected with information (hints) from several different layers of the teacher model. Although the selection of hint points can drastically alter the compression performance, conventional distillation approaches overlook this fact and use the same hint points as in the early studies. Therefore, we propose a clustering based hint selection methodology, where the layers of teacher model are clustered with respect to several metrics and the cluster centers are used as the hint points. Our method is applicable for any student network, once it is applied on a chosen teacher network. The proposed approach is validated in CIFAR-100 and ImageNet datasets, using various teacher-student pairs and numerous hint distillation methods. Our results show that hint points selected by our algorithm results in superior compression performance compared to state-of-the-art knowledge distillation algorithms on the same student models and datasets. △ Less

Submitted 3 November, 2022; v1 submitted 26 February, 2021; originally announced March 2021.

Comments: Our codes are published on Code Ocean, where the link to our codes is: https://codeocean.com/capsule/4245746/tree/v1

Journal ref: Expert Systems with Applications, Volume 213, Part B, March 2023, 119040

arXiv:2102.02953 [pdf, other]

On Controllability and Persistency of Excitation in Data-Driven Control: Extensions of Willems' Fundamental Lemma

Authors: Yue Yu, Shahriar Talebi, Henk J. van Waarde, Ufuk Topcu, Mehran Mesbahi, Behçet Açıkmeşe

Abstract: Willems' fundamental lemma asserts that all trajectories of a linear time-invariant system can be obtained from a finite number of measured ones, assuming that controllability and a persistency of excitation condition hold. We show that these two conditions can be relaxed. First, we prove that the controllability condition can be replaced by a condition on the controllable subspace, unobservable s… ▽ More Willems' fundamental lemma asserts that all trajectories of a linear time-invariant system can be obtained from a finite number of measured ones, assuming that controllability and a persistency of excitation condition hold. We show that these two conditions can be relaxed. First, we prove that the controllability condition can be replaced by a condition on the controllable subspace, unobservable subspace, and a certain subspace associated with the measured trajectories. Second, we prove that the persistency of excitation requirement can be relaxed if the degree of a certain minimal polynomial is tightly bounded. Our results show that data-driven predictive control using online data is equivalent to model predictive control, even for uncontrollable systems. Moreover, our results significantly reduce the amount of data needed in identifying homogeneous multi-agent systems. △ Less

Submitted 9 April, 2021; v1 submitted 4 February, 2021; originally announced February 2021.

arXiv:2012.02303 [pdf, other]

Decentralized State-Dependent Markov Chain Synthesis with an Application to Swarm Guidance

Authors: Samet Uzun, Nazim Kemal Ure, Behcet Acikmese

Abstract: This paper introduces a decentralized state-dependent Markov chain synthesis (DSMC) algorithm for finite-state Markov chains. We present a state-dependent consensus protocol that achieves exponential convergence under mild technical conditions, without relying on any connectivity assumptions regarding the dynamic network topology. Utilizing the proposed consensus protocol, we develop the DSMC algo… ▽ More This paper introduces a decentralized state-dependent Markov chain synthesis (DSMC) algorithm for finite-state Markov chains. We present a state-dependent consensus protocol that achieves exponential convergence under mild technical conditions, without relying on any connectivity assumptions regarding the dynamic network topology. Utilizing the proposed consensus protocol, we develop the DSMC algorithm, updating the Markov matrix based on the current state while ensuring the convergence conditions of the consensus protocol. This result establishes the desired steady-state distribution for the resulting Markov chain, ensuring exponential convergence from all initial distributions while adhering to transition constraints and minimizing state transitions. The DSMC's performance is demonstrated through a probabilistic swarm guidance example, which interprets the spatial distribution of a swarm comprising a large number of mobile agents as a probability distribution and utilizes the Markov chain to compute transition probabilities between states. Simulation results demonstrate faster convergence for the DSMC based algorithm when compared to the previous Markov chain based swarm guidance algorithms. △ Less

Submitted 26 April, 2024; v1 submitted 4 December, 2020; originally announced December 2020.

Comments: arXiv admin note: text overlap with arXiv:2012.01928

arXiv:2011.05562 [pdf, other]

Stability of Gradient Learning Dynamics in Continuous Games: Vector Action Spaces

Authors: Benjamin J. Chasnov, Daniel Calderone, Behçet Açıkmeşe, Samuel A. Burden, Lillian J. Ratliff

Abstract: Towards characterizing the optimization landscape of games, this paper analyzes the stability of gradient-based dynamics near fixed points of two-player continuous games. We introduce the quadratic numerical range as a method to characterize the spectrum of game dynamics and prove the robustness of equilibria to variations in learning rates. By decomposing the game Jacobian into symmetric and skew… ▽ More Towards characterizing the optimization landscape of games, this paper analyzes the stability of gradient-based dynamics near fixed points of two-player continuous games. We introduce the quadratic numerical range as a method to characterize the spectrum of game dynamics and prove the robustness of equilibria to variations in learning rates. By decomposing the game Jacobian into symmetric and skew-symmetric components, we assess the contribution of a vector field's potential and rotational components to the stability of differential Nash equilibria. Our results show that in zero-sum games, all Nash are stable and robust; in potential games, all stable points are Nash. For general-sum games, we provide a sufficient condition for instability. We conclude with a numerical example in which learning with timescale separation results in faster convergence. △ Less

Submitted 13 January, 2021; v1 submitted 6 November, 2020; originally announced November 2020.

Comments: extension of arXiv:2011.03650 to vector action spaces. Submitted to IEEE L-CSS

arXiv:2011.04093 [pdf, ps, other]

Synthesis of Interval Observers for Nonlinear Discrete-Time Systems

Authors: Adam M Tahir, Behcet Acikmese

Abstract: A systematic procedure to synthesize interval observers for nonlinear discrete-time systems is proposed. The feedback gains and other matrices are found from the solutions to semidefinite feasibility programs. Two cases are considered: (1) the interval observer is in the same coordinate frame as the given system, and (2) the interval observer uses a coordinate transformation. The conditions where… ▽ More A systematic procedure to synthesize interval observers for nonlinear discrete-time systems is proposed. The feedback gains and other matrices are found from the solutions to semidefinite feasibility programs. Two cases are considered: (1) the interval observer is in the same coordinate frame as the given system, and (2) the interval observer uses a coordinate transformation. The conditions where coordinate transformations are necessary are detailed. Numerical examples are provided to showcase the effectiveness of the interval observers and demonstrate their application to sampled-data systems. △ Less

Submitted 8 November, 2020; originally announced November 2020.

arXiv:2011.03650 [pdf, other]

Stability of Gradient Learning Dynamics in Continuous Games: Scalar Action Spaces

Authors: Benjamin J. Chasnov, Daniel Calderone, Behçet Açıkmeşe, Samuel A. Burden, Lillian J. Ratliff

Abstract: Learning processes in games explain how players grapple with one another in seeking an equilibrium. We study a natural model of learning based on individual gradients in two-player continuous games. In such games, the arguably natural notion of a local equilibrium is a differential Nash equilibrium. However, the set of locally exponentially stable equilibria of the learning dynamics do not necessa… ▽ More Learning processes in games explain how players grapple with one another in seeking an equilibrium. We study a natural model of learning based on individual gradients in two-player continuous games. In such games, the arguably natural notion of a local equilibrium is a differential Nash equilibrium. However, the set of locally exponentially stable equilibria of the learning dynamics do not necessarily coincide with the set of differential Nash equilibria of the corresponding game. To characterize this gap, we provide formal guarantees for the stability or instability of such fixed points by leveraging the spectrum of the linearized game dynamics. We provide a comprehensive understanding of scalar games and find that equilibria that are both stable and Nash are robust to variations in learning rates. △ Less

Submitted 6 November, 2020; originally announced November 2020.

Comments: Accepted to 2020 IEEE Conference on Decision and Control

arXiv:2010.16251 [pdf, other]

The FragmentatiOn Of Target Experiment (FOOT) and its DAQ system

Authors: Silvia Biondi, Andrey Alexandrov, Behcet Alpat, Giovanni Ambrosi, Stefano Argirò, Rau Arteche Diaz, Nazarm Bartosik, Giuseppe Battistoni, Nicola Belcari, Elettra Bellinzona, Maria Giuseppina Bisogni, Graziano Bruni, Pietro Carra, Piergiorgio Cerello, Esther Ciarrocchi, Alberto Clozza, Sofia Colombi, Giovanni De Lellis, Alberto Del Guerra, Micol De Simoni, Antonia Di Crescenzo, Benedetto Di Ruzza, Marco Donetti, Yunsheng Dong, Marco Durante , et al. (70 additional authors not shown)

Abstract: The FragmentatiOn Of Target (FOOT) experiment aims to provide precise nuclear cross-section measurements for two different fields: hadrontherapy and radio-protection in space. The main reason is the important role the nuclear fragmentation process plays in both fields, where the health risks caused by radiation are very similar and mainly attributable to the fragmentation process. The FOOT experim… ▽ More The FragmentatiOn Of Target (FOOT) experiment aims to provide precise nuclear cross-section measurements for two different fields: hadrontherapy and radio-protection in space. The main reason is the important role the nuclear fragmentation process plays in both fields, where the health risks caused by radiation are very similar and mainly attributable to the fragmentation process. The FOOT experiment has been developed in such a way that the experimental setup is easily movable and fits the space limitations of the experimental and treatment rooms available in hadrontherapy treatment centers, where most of the data takings are carried out. The Trigger and Data Acquisition system needs to follow the same criteria and it should work in different laboratories and in different conditions. It has been designed to acquire the largest sample size with high accuracy in a controlled and online-monitored environment. The data collected are processed in real-time for quality assessment and are available to the DAQ crew and detector experts during data taking. △ Less

Submitted 29 October, 2020; originally announced October 2020.

arXiv:2009.06980 [pdf, other]

Proportional-Integral Projected Gradient Method for Model Predictive Control

Authors: Yue Yu, Purnanand Elango, Behçet Açikmeşe

Abstract: Recently there has been an increasing interest in primal-dual methods for model predictive control (MPC), which require minimizing the (augmented) Lagrangian at each iteration. We propose a novel first order primal-dual method, termed \emph{proportional-integral projected gradient method}, for MPC where the underlying finite horizon optimal control problem has both state and input constraints. Ins… ▽ More Recently there has been an increasing interest in primal-dual methods for model predictive control (MPC), which require minimizing the (augmented) Lagrangian at each iteration. We propose a novel first order primal-dual method, termed \emph{proportional-integral projected gradient method}, for MPC where the underlying finite horizon optimal control problem has both state and input constraints. Instead of minimizing the (augmented) Lagrangian, each iteration of our method only computes a single projection onto the state and input constraint set. Our method ensures that, along a sequence of averaged iterates, both the distance to optimum and the constraint violation converge to zero at a rate of $O(1/k)$ if the objective function is convex, where $k$ is the iteration number. If the objective function is strongly convex, this rate can be improved to $O(1/k^2)$ for the distance to optimum and $O(1/k^3)$ for the constraint violation. We compare our method against existing methods via a trajectory-planning example with convexified keep-out-zone constraints. △ Less

Submitted 18 December, 2020; v1 submitted 15 September, 2020; originally announced September 2020.

Comments: Julia code available at: https://github.com/purnanandelango/pi-projgrad-demo

arXiv:2007.07228 [pdf, other]

doi 10.1109/LCSYS.2020.3001240

Disturbance Decoupling for Gradient-based Multi-Agent Learning with Quadratic Costs

Authors: Sarah H. Q. Li, Lillian Ratliff, Behçet Açıkmeşe

Abstract: Motivated by applications of multi-agent learning in noisy environments, this paper studies the robustness of gradient-based learning dynamics with respect to disturbances. While disturbances injected along a coordinate corresponding to any individual player's actions can always affect the overall learning dynamics, a subset of players can be disturbance decoupled---i.e., such players' actions are… ▽ More Motivated by applications of multi-agent learning in noisy environments, this paper studies the robustness of gradient-based learning dynamics with respect to disturbances. While disturbances injected along a coordinate corresponding to any individual player's actions can always affect the overall learning dynamics, a subset of players can be disturbance decoupled---i.e., such players' actions are completely unaffected by the injected disturbance. We provide necessary and sufficient conditions to guarantee this property for games with quadratic cost functions, which encompass quadratic one-shot continuous games, finite-horizon linear quadratic (LQ) dynamic games, and bilinear games. Specifically, disturbance decoupling is characterized by both algebraic and graph-theoretic conditions on the learning dynamics, the latter is obtained by constructing a game graph based on gradients of players' costs. For LQ games, we show that disturbance decoupling imposes constraints on the controllable and unobservable subspaces of players. For two player bilinear games, we show that disturbance decoupling within a player's action coordinates imposes constraints on the payoff matrices. Illustrative numerical examples are provided. △ Less

Submitted 10 October, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

Journal ref: IEEE Control Systems Letters, vol. 5, no. 1, pp. 223-228, Jan. 2021

arXiv:2003.06949 [pdf, other]

RC Circuits based Distributed Conditional Gradient Method

Authors: Yue Yu, Behçet Açıkmeşe

Abstract: We consider distributed optimization on undirected connected graphs. We propose a novel distributed conditional gradient method with (O(1/\sqrt{k})) convergence. Compared with existing methods, each iteration of our method uses both communication and linear minimization step only once rather than multiple times. We further extend our results to cases with composite local constraints. We demonstrat… ▽ More We consider distributed optimization on undirected connected graphs. We propose a novel distributed conditional gradient method with (O(1/\sqrt{k})) convergence. Compared with existing methods, each iteration of our method uses both communication and linear minimization step only once rather than multiple times. We further extend our results to cases with composite local constraints. We demonstrate our results via examples on distributed matrix completion problem. △ Less

Submitted 15 March, 2020; originally announced March 2020.

arXiv:2001.07889 [pdf, other]

doi 10.1016/j.automatica.2021.109685

Bounding Fixed Points of Set-Based Bellman Operator and Nash Equilibria of Stochastic Games

Authors: Sarah H. Q. Li, Assalé, Adjé, Pierre-Loïc Garoche, Behçet Açıkmeşe

Abstract: Motivated by uncertain parameters encountered in Markov decision processes (MDPs) and stochastic games, we study the effect of parameter uncertainty on Bellman operator-based algorithms under a set-based framework. Specifically, we first consider a family of MDPs where the cost parameters are in a given compact set; we then define a Bellman operator acting on a set of value functions to produce a… ▽ More Motivated by uncertain parameters encountered in Markov decision processes (MDPs) and stochastic games, we study the effect of parameter uncertainty on Bellman operator-based algorithms under a set-based framework. Specifically, we first consider a family of MDPs where the cost parameters are in a given compact set; we then define a Bellman operator acting on a set of value functions to produce a new set of value functions as the output under all possible variations in the cost parameter. We prove the existence of a fixed point of this set-based Bellman operator by showing that it is contractive on a complete metric space, and explore its relationship with the corresponding family of MDPs and stochastic games. Additionally, we show that given interval set bounded cost parameters, we can form exact bounds on the set of optimal value functions. Finally, we utilize our results to bound the value function trajectory of a player in a stochastic game. △ Less

Submitted 10 October, 2020; v1 submitted 22 January, 2020; originally announced January 2020.

Comments: 15 pages, 4 figures

arXiv:2001.04535 [pdf, ps, other]

Fixed Points of the Set-Based Bellman Operator

Authors: Sarah H. Q. Li, Assalé Adjé, Pierre-Loïc Garoche, Behçet Açıkmeşe

Abstract: Motivated by uncertain parameters encountered in Markov decision processes (MDPs), we study the effect of parameter uncertainty on Bellman operator-based methods. Specifically, we consider a family of MDPs where the cost parameters are from a given compact set. We then define a Bellman operator acting on an input set of value functions to produce a new set of value functions as the output under al… ▽ More Motivated by uncertain parameters encountered in Markov decision processes (MDPs), we study the effect of parameter uncertainty on Bellman operator-based methods. Specifically, we consider a family of MDPs where the cost parameters are from a given compact set. We then define a Bellman operator acting on an input set of value functions to produce a new set of value functions as the output under all possible variations in the cost parameters. Finally we prove the existence of a fixed point of this set-based Bellman operator by showing that it is a contractive operator on a complete metric space. △ Less

Submitted 29 February, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

Comments: 4 pages

arXiv:1911.09013 [pdf, other]

Lossless Convexification of Optimal Control Problems with Semi-continuous Inputs

Authors: Danylo Malyuta, Behcet Acikmese

Abstract: This paper presents a novel convex optimization-based method for finding the globally optimal solutions of a class of mixed-integer non-convex optimal control problems. We consider problems with non-convex constraints that restrict the input norms to be either zero or lower- and upper-bounded. The non-convex problem is relaxed to a convex one whose optimal solution is proved to be optimal almost e… ▽ More This paper presents a novel convex optimization-based method for finding the globally optimal solutions of a class of mixed-integer non-convex optimal control problems. We consider problems with non-convex constraints that restrict the input norms to be either zero or lower- and upper-bounded. The non-convex problem is relaxed to a convex one whose optimal solution is proved to be optimal almost everywhere for the original problem, a procedure known as lossless convexification. This paper is the first to allow individual input sets to overlap and to have different norm bounds, integral input and state costs, and convex state constraints that can be activated at discrete time instances. The solution relies on second-order cone programming and demonstrates that a meaningful class of optimal control problems with binary variables can be solved reliably and in polynomial time. A rocket landing example with a coupled thrust-gimbal constraint corroborates the effectiveness of the approach. △ Less

Submitted 18 November, 2019; originally announced November 2019.

Comments: 8 pages, 4 figures; submitted to 21st IFAC World Congress, 2020. arXiv admin note: substantial text overlap with arXiv:1902.02726

arXiv:1911.06273 [pdf, other]

RLC Circuits based Distributed Mirror Descent Method

Authors: Yue Yu, Behçet Açıkmeşe

Abstract: We consider distributed optimization with smooth convex objective functions defined on an undirected connected graph. Inspired by mirror descent mehod and RLC circuits, we propose a novel distributed mirror descent method. Compared with mirror-prox method, our algorithm achieves the same $\mathcal{O}(1/k)$ iteration complexity with only half the computation cost per iteration. We further extend… ▽ More We consider distributed optimization with smooth convex objective functions defined on an undirected connected graph. Inspired by mirror descent mehod and RLC circuits, we propose a novel distributed mirror descent method. Compared with mirror-prox method, our algorithm achieves the same $\mathcal{O}(1/k)$ iteration complexity with only half the computation cost per iteration. We further extend our results to cases where a) gradients are corrupted by stochastic noise, and b) objective function is composed of both smooth and non-smooth terms. We demonstrate our theoretical results via numerical experiments. △ Less

Submitted 22 February, 2020; v1 submitted 14 November, 2019; originally announced November 2019.

Showing 1–50 of 91 results for author: Behçet