Skip to main content

Showing 1–27 of 27 results for author: Gattami, A

.
  1. arXiv:2402.19212  [pdf, ps, other

    math.OC cs.LG

    Deep Reinforcement Learning: A Convex Optimization Approach

    Authors: Ather Gattami

    Abstract: In this paper, we consider reinforcement learning of nonlinear systems with continuous state and action spaces. We present an episodic learning algorithm, where we for each episode use convex optimization to find a two-layer neural network approximation of the optimal $Q$-function. The convex optimization approach guarantees that the weights calculated at each episode are optimal, with respect to… ▽ More

    Submitted 24 June, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

  2. arXiv:2301.11802  [pdf, ps, other

    cs.LG cs.GT

    Decentralized Online Bandit Optimization on Directed Graphs with Regret Bounds

    Authors: Johan Östman, Ather Gattami, Daniel Gillblad

    Abstract: We consider a decentralized multiplayer game, played over $T$ rounds, with a leader-follower hierarchy described by a directed acyclic graph. For each round, the graph structure dictates the order of the players and how players observe the actions of one another. By the end of each round, all players receive a joint bandit-reward based on their joint action that is used to update the player strate… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

  3. arXiv:2212.11567  [pdf, other

    math.OC

    Learning Team Decisions

    Authors: Olle Kjellqvist, Ather Gattami

    Abstract: In this paper, we treat linear quadratic team decision problems, where a team of agents minimizes a convex quadratic cost function over $T$ time steps subject to possibly distinct linear measurements of the state of nature. We assume that the state of nature is a Gaussian random variable and that the agents do not know the cost function nor the linear functions map** the state of nature to their… ▽ More

    Submitted 22 December, 2022; originally announced December 2022.

    Comments: Accepted and presented at IEEE CDC 2022. A few typos have been corrected

  4. arXiv:2006.05961   

    cs.LG cs.NI eess.SY math.OC stat.ML

    Model-Free Algorithm and Regret Analysis for MDPs with Long-Term Constraints

    Authors: Qinbo Bai, Vaneet Aggarwal, Ather Gattami

    Abstract: In the optimization of dynamical systems, the variables typically have constraints. Such problems can be modeled as a constrained Markov Decision Process (CMDP). This paper considers a model-free approach to the problem, where the transition probabilities are not known. In the presence of long-term (or average) constraints, the agent has to choose a policy that maximizes the long-term average rewa… ▽ More

    Submitted 30 January, 2021; v1 submitted 10 June, 2020; originally announced June 2020.

    Comments: The result has error

  5. arXiv:2003.05555  [pdf, other

    math.OC cs.LG eess.SY stat.ML

    Provably Efficient Model-Free Algorithm for MDPs with Peak Constraints

    Authors: Qinbo Bai, Vaneet Aggarwal, Ather Gattami

    Abstract: In the optimization of dynamic systems, the variables typically have constraints. Such problems can be modeled as a Constrained Markov Decision Process (CMDP). This paper considers the peak Constrained Markov Decision Process (PCMDP), where the agent chooses the policy to maximize total reward in the finite horizon as well as satisfy constraints at each epoch with probability 1. We propose a model… ▽ More

    Submitted 13 June, 2022; v1 submitted 11 March, 2020; originally announced March 2020.

  6. arXiv:2002.07638  [pdf, other

    cs.LG stat.ML

    Conditional Mutual information-based Contrastive Loss for Financial Time Series Forecasting

    Authors: Hanwei Wu, Ather Gattami, Markus Flierl

    Abstract: We present a representation learning framework for financial time series forecasting. One challenge of using deep learning models for finance forecasting is the shortage of available training data when using small datasets. Direct trend classification using deep neural networks trained on small datasets is susceptible to the overfitting problem. In this paper, we propose to first learn compact rep… ▽ More

    Submitted 7 May, 2021; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: Published in ICAIF 2020 : ACM International Conference on AI in Finance

  7. arXiv:1901.08978  [pdf, other

    math.OC

    Reinforcement Learning for Multi-Objective and Constrained Markov Decision Processes

    Authors: Ather Gattami, Qinbo Bai, Vaneet Agarwal

    Abstract: In this paper, we consider the problem of optimization and learning for constrained and multi-objective Markov decision processes, for both discounted rewards and expected average rewards. We formulate the problems as zero-sum games where one player (the agent) solves a Markov decision problem and its opponent solves a bandit optimization problem, which we here call Markov-Bandit games. We extend… ▽ More

    Submitted 4 March, 2021; v1 submitted 23 January, 2019; originally announced January 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1901.07839

  8. arXiv:1901.07839  [pdf, ps, other

    math.OC cs.LG

    Reinforcement Learning of Markov Decision Processes with Peak Constraints

    Authors: Ather Gattami

    Abstract: In this paper, we consider reinforcement learning of Markov Decision Processes (MDP) with peak constraints, where an agent chooses a policy to optimize an objective and at the same time satisfy additional constraints. The agent has to take actions based on the observed states, reward outputs, and constraint-outputs, without any knowledge about the dynamics, reward functions, and/or the knowledge o… ▽ More

    Submitted 6 December, 2019; v1 submitted 23 January, 2019; originally announced January 2019.

  9. arXiv:1605.04579  [pdf, other

    cs.IT

    Communicating One Bit over a Delay Constrained Gaussian MIMO Channel with Feedback

    Authors: Bo Bernhardsson, Ather Gattami

    Abstract: The energy-optimal scheme is found for communicating one bit over a memoryless Gaussian channel with an ideal feedback channel. It is assumed that the channel is allowed to be used at most N times before decoding. The optimal coding/decoding strategy is derived by dynamic programming. It is found that feedback gives a significant performance gain and that the optimal strategies are discontinuous.… ▽ More

    Submitted 15 May, 2016; originally announced May 2016.

    Comments: Submitted for publication

  10. arXiv:1511.06866  [pdf, other

    cs.IT math.OC

    Feedback Capacity of Gaussian Channels Revisited

    Authors: Ather Gattami

    Abstract: In this paper, we revisit the problem of finding the average capacity of the Gaussian feedback channel. First, we consider the problem of finding the average capacity of the analog Gaussian noise channel where the noise has an arbitrary spectral density. We introduce a new approach to the problem where we solve the problem over a finite number of transmissions and then consider the limit of an inf… ▽ More

    Submitted 23 January, 2019; v1 submitted 21 November, 2015; originally announced November 2015.

  11. arXiv:1506.00777  [pdf, other

    math.OC

    Team Decision Problems with Convex Quadratic Constraints

    Authors: Ather Gattami

    Abstract: In this paper, we consider linear quadratic team problems with an arbitrary number of quadratic constraints in both stochastic and deterministic settings. The team consists of players with different measurements about the state of nature. The objective of the team is to minimize a quadratic cost subject to additional finite number of quadratic constraints. We first consider the problem of countabl… ▽ More

    Submitted 2 June, 2015; originally announced June 2015.

    Comments: arXiv admin note: substantial text overlap with arXiv:1209.2551

  12. arXiv:1506.00484  [pdf, other

    cs.IT

    Optimal Communication of States of Dynamical Systems over Gaussian Channels with Noisy Feedback: The Scalar Case

    Authors: Ather Gattami

    Abstract: We consider the problem of communicating the state of a dynamical system via a Shannon Gaussian channel. The receiver, which acts as both a decoder and estimator, observes the noisy measurement of the channel output and makes an optimal estimate of the state of the dynamical system in the minimum mean square sense. Noisy feedback from the receiver to the transmitter is present. The transmitter obs… ▽ More

    Submitted 1 June, 2015; originally announced June 2015.

    Comments: arXiv admin note: substantial text overlap with arXiv:1404.4350

  13. arXiv:1505.03309  [pdf, other

    cs.IT

    Time Localization and Capacity of Faster-Than-Nyquist Signaling

    Authors: Ather Gattami, Emil Ringh, Johan Karlsson

    Abstract: In this paper, we consider communication over the bandwidth limited analog white Gaussian noise channel using non-orthogonal pulses. In particular, we consider non-orthogonal transmission by signaling samples at a rate higher than the Nyquist rate. Using the faster-than-Nyquist (FTN) framework, Mazo showed that one may transmit symbols carried by sinc pulses at a higher rate than that dictated by… ▽ More

    Submitted 7 December, 2015; v1 submitted 13 May, 2015; originally announced May 2015.

  14. arXiv:1505.02997  [pdf, other

    cs.IT

    Optimal Data and Training Symbol Ratio for Communication over Uncertain Channels

    Authors: Ather Gattami

    Abstract: We consider the problem of determining the power ratio between the training symbols and data symbols in order to maximize the channel capacity for transmission over uncertain channels with a channel estimate available at both the transmitter and receiver. The receiver makes an estimate of the channel by using a known sequence of training symbols. This channel estimate is then transmitted back to t… ▽ More

    Submitted 12 May, 2015; originally announced May 2015.

  15. arXiv:1503.07561  [pdf, ps, other

    eess.SY math.OC

    Primal robustness and semidefinite cones

    Authors: Seungil You, Ather Gattami, John C. Doyle

    Abstract: This paper reformulates and streamlines the core tools of robust stability and performance for LTI systems using now-standard methods in convex optimization. In particular, robustness analysis can be formulated directly as a primal convex (semidefinite program or SDP) optimization problem using sets of gramians whose closure is a semidefinite cone. This allows various constraints such as structure… ▽ More

    Submitted 25 March, 2015; originally announced March 2015.

    Comments: A shorter version submitted to CDC 15

  16. arXiv:1412.6160  [pdf, ps, other

    math.OC eess.SY

    H infinity Analysis Revisited

    Authors: Seungil You, Ather Gattami

    Abstract: This paper proposes a direct, and simple approach to the H infinity norm calculation in more general settings. In contrast to the method based on the Kalman-Yakubovich-Popov lemma, our approach does not require a controllability assumption, and returns a sinusoidal input that achieves the H infinity norm of the system including its frequency. In addition, using a semidefinite programming duality,… ▽ More

    Submitted 15 December, 2014; originally announced December 2014.

    Comments: Submitted to IEEE Transactions on Automatic Control

  17. arXiv:1404.4350  [pdf, other

    cs.IT math.OC

    Kalman meets Shannon

    Authors: Ather Gattami

    Abstract: We consider the problem of communicating the state of a dynamical system via a Shannon Gaussian channel. The receiver, which acts as both a decoder and estimator, observes the noisy measurement of the channel output and makes an optimal estimate of the state of the dynamical system in the minimum mean square sense. The transmitter observes a possibly noisy measurement of the state of the dynamical… ▽ More

    Submitted 12 May, 2015; v1 submitted 16 April, 2014; originally announced April 2014.

  18. arXiv:1402.3402  [pdf, ps, other

    math.OC

    Multi-Objective Optimal Control with Arbitrary Additive and Multiplicative Noise

    Authors: Ather Gattami

    Abstract: In this paper, we consider the problem of multi-objective optimal control of a dynamical system with additive and multiplicative noises with given second moments and arbitrary probability distributions. The objectives are given by quadratic constraints in the state and controller, where the quadratic forms maybe indefinite and thus not necessarily convex. We show that the problem can be transforme… ▽ More

    Submitted 14 February, 2014; originally announced February 2014.

  19. Optimal Distributed Controller Design with Communication Delays: Application to Vehicle Formations

    Authors: Hamid Reza Feyzmahdavian, Assad Alam, Ather Gattami

    Abstract: This paper develops a controller synthesis algorithm for distributed LQG control problems under output feedback. We consider a system consisting of three interconnected linear subsystems with a delayed information sharing structure. While the state-feedback case of this problem has previously been solved, the extension to output-feedback is nontrivial, as the classical separation principle fails.… ▽ More

    Submitted 17 September, 2013; originally announced September 2013.

    Comments: Submitted to the 51nd IEEE Conference on Decision and Control, 2012

  20. arXiv:1209.3135  [pdf, ps, other

    math.OC

    Deterministic Team Problems with Signaling Incentive

    Authors: Ather Gattami

    Abstract: This paper considers linear quadratic team decision problems where the players in the team affect each other's information structure through their decisions. Whereas the stochastic version of the problem is well known to be complex with nonlinear optimal solutions that are hard to find, the deterministic counterpart is shown to be tractable. We show that under some assumptions on the weight matrix… ▽ More

    Submitted 3 February, 2013; v1 submitted 14 September, 2012; originally announced September 2012.

    Comments: Submitted for publication

  21. arXiv:1209.2551  [pdf, ps, other

    math.OC

    Multi-Objective Linear Quadratic Team Optimization

    Authors: Ather Gattami

    Abstract: In this paper, we consider linear quadratic team problems with an arbitrary number of quadratic constraints in both stochastic and deterministic settings. The team consists of players with different measurements about the state of nature. The objective of the team is to minimize a quadratic cost subject to additional finite number of quadratic constraints. We will first consider the Gaussian case,… ▽ More

    Submitted 12 September, 2012; originally announced September 2012.

    Comments: Submitted for publication

  22. arXiv:1205.4563  [pdf, ps, other

    math.OC

    Iterative Source-Channel Coding Approach to Witsenhausen's Counterexample

    Authors: Johannes Kron, Ather Gattami, Tobias J. Oechtering, Mikael Skoglund

    Abstract: In 1968, Witsenhausen introduced his famous counterexample where he showed that even in the simple linear quadratic static team decision problem, complex nonlinear decisions could outperform any given linear decision. This problem has served as a benchmark problem for decades where researchers try to achieve the optimal solution. This paper introduces a systematic iterative source--channel coding… ▽ More

    Submitted 21 May, 2012; originally announced May 2012.

  23. arXiv:1205.1907  [pdf, ps, other

    math.OC

    Optimal Control and Estimation for Partially Nested Interconnected Systems

    Authors: Ather Gattami, Sanjoy Mitter

    Abstract: In this paper, we study distributed estimation and control problems over graphs under partially nested information patterns. We show a duality result that is very similar to the classical duality result between state estimation and state feedback control with a classical information pattern, under the condition that the disturbances entering different systems on the graph are uncorrelated. The dis… ▽ More

    Submitted 14 September, 2012; v1 submitted 9 May, 2012; originally announced May 2012.

    Comments: Submitted for publication

  24. arXiv:1204.6178  [pdf, other

    eess.SY

    Distributed Output-Feedback LQG Control with Delayed Information Sharing

    Authors: Hamid Reza Feyzmahdavian, Ather Gattami, Mikael Johansson

    Abstract: This paper develops a controller synthesis method for distributed LQG control problems under output-feedback. We consider a system consisting of three interconnected linear subsystems with a delayed information sharing structure. While the state-feedback case has previously been solved, the extension to output-feedback is nontrivial as the classical separation principle fails. To find the optimal… ▽ More

    Submitted 17 September, 2013; v1 submitted 27 April, 2012; originally announced April 2012.

    Comments: 25 pages, 3 figures

  25. arXiv:1204.3876  [pdf, ps, other

    math.OC

    On Optimal Distributed Output-Feedback Control over Acyclic Graphs

    Authors: Ather Gattami, Omid Khorsand

    Abstract: In this paper, we consider the problem of distributed optimal control of linear dynamical systems with a quadratic cost criterion. We study the case of output feedback control for two interconnected dynamical systems, and show that the linear optimal solution can be obtained from a combination of two uncoupled Riccati equations and two coupled Riccati equations.

    Submitted 17 April, 2012; originally announced April 2012.

  26. arXiv:1204.1869  [pdf, other

    math.OC

    Optimal Distributed Controller Synthesis for Chain Structures: Applications to Vehicle Formations

    Authors: Omid Khorsand, Assad Alam, Ather Gattami

    Abstract: We consider optimal distributed controller synthesis for an interconnected system subject to communication constraints, in linear quadratic settings. Motivated by the problem of finite heavy duty vehicle platooning, we study systems composed of interconnected subsystems over a chain graph. By decomposing the system into orthogonal modes, the cost function can be separated into individual component… ▽ More

    Submitted 9 April, 2012; originally announced April 2012.

  27. Converging an Overlay Network to a Gradient Topology

    Authors: Håkan Terelius, Guodong Shi, Jim Dowling, Amir Payberah, Ather Gattami, Karl Henrik Johansson

    Abstract: In this paper, we investigate the topology convergence problem for the gossip-based Gradient overlay network. In an overlay network where each node has a local utility value, a Gradient overlay network is characterized by the properties that each node has a set of neighbors with the same utility value (a similar view) and a set of neighbors containing higher utility values (gradient neighbor set),… ▽ More

    Submitted 29 March, 2011; originally announced March 2011.

    Comments: Submitted to 50th IEEE Conference on Decision and Control (CDC 2011)