Skip to main content

Showing 1–25 of 25 results for author: Kalathil, D

Searching in archive eess. Search in all archives.
.
  1. arXiv:2404.07315  [pdf, other

    eess.SY cs.AI cs.LG

    Structured Reinforcement Learning for Media Streaming at the Wireless Edge

    Authors: Archana Bura, Sarat Chandra Bobbili, Shreyas Rameshkumar, Desik Rengarajan, Dileep Kalathil, Srinivas Shakkottai

    Abstract: Media streaming is the dominant application over wireless edge (access) networks. The increasing softwarization of such networks has led to efforts at intelligent control, wherein application-specific actions may be dynamically taken to enhance the user experience. The goal of this work is to develop and demonstrate learning-based policies for optimal decision making to determine which clients to… ▽ More

    Submitted 16 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: 15 pages, 14 figures

  2. arXiv:2312.15340  [pdf, other

    eess.SY cs.LG

    Meta-Learning-Based Adaptive Stability Certificates for Dynamical Systems

    Authors: Amit Jena, Dileep Kalathil, Le Xie

    Abstract: This paper addresses the problem of Neural Network (NN) based adaptive stability certification in a dynamical system. The state-of-the-art methods, such as Neural Lyapunov Functions (NLFs), use NN-based formulations to assess the stability of a non-linear dynamical system and compute a Region of Attraction (ROA) in the state space. However, under parametric uncertainty, if the values of system par… ▽ More

    Submitted 23 December, 2023; originally announced December 2023.

    Comments: This article has been accepted for AAAI-24 (The 38th Annual AAAI Conference on Artificial Intelligence)

  3. arXiv:2311.00226  [pdf, other

    eess.SP cs.LG

    Transformers are Provably Optimal In-context Estimators for Wireless Communications

    Authors: Vishnu Teja Kunde, Vicram Rajagopalan, Chandra Shekhara Kaushik Valmeekam, Krishna Narayanan, Srinivas Shakkottai, Dileep Kalathil, Jean-Francois Chamberland

    Abstract: Pre-trained transformers exhibit the capability of adapting to new tasks through in-context learning (ICL), where they efficiently utilize a limited set of prompts without explicit model optimization. The canonical communication problem of estimating transmitted symbols from received observations can be modelled as an in-context learning problem: Received observations are essentially a noisy fun… ▽ More

    Submitted 14 June, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

    Comments: 13 pages, 2 figures, 2 tables, preprint; abstract, references, theory updated

  4. arXiv:2302.12320  [pdf, other

    math.OC cs.LG eess.SY

    Dynamic Regret Analysis of Safe Distributed Online Optimization for Convex and Non-convex Problems

    Authors: Ting-Jui Chang, Sapana Chaudhary, Dileep Kalathil, Shahin Shahrampour

    Abstract: This paper addresses safe distributed online optimization over an unknown set of linear safety constraints. A network of agents aims at jointly minimizing a global, time-varying function, which is only partially observable to each individual agent. Therefore, agents must engage in local communications to generate a safe sequence of actions competitive with the best minimizer sequence in hindsight,… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

  5. arXiv:2210.06734  [pdf, other

    eess.SY

    Optimal Control of Material Micro-Structures

    Authors: Aayushman Sharma, Zirui Mao, Haiying Yang, Suman Chakravorty, Michael J Demkowicz, Dileep Kalathil

    Abstract: In this paper, we consider the optimal control of material micro-structures. Such material micro-structures are modeled by the so-called phase field model. We study the underlying physical structure of the model and propose a data based approach for its optimal control, along with a comparison to the control using a state of the art Reinforcement Learning (RL) algorithm. Simulation results show th… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

  6. arXiv:2208.10259  [pdf, ps, other

    cs.LG eess.SY

    Meta-Learning Online Control for Linear Dynamical Systems

    Authors: Deepan Muthirayan, Dileep Kalathil, Pramod P. Khargonekar

    Abstract: In this paper, we consider the problem of finding a meta-learning online control algorithm that can learn across the tasks when faced with a sequence of $N$ (similar) control tasks. Each task involves controlling a linear dynamical system for a finite horizon of $T$ time steps. The cost function and system noise at each time step are adversarial and unknown to the controller before taking the cont… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

  7. arXiv:2207.07731  [pdf, other

    eess.SY cs.LG

    Distributed Learning of Neural Lyapunov Functions for Large-Scale Networked Dissipative Systems

    Authors: Amit Jena, Tong Huang, S. Sivaranjani, Dileep Kalathil, Le Xie

    Abstract: This paper considers the problem of characterizing the stability region of a large-scale networked system comprised of dissipative nonlinear subsystems, in a distributed and computationally tractable way. One standard approach to estimate the stability region of a general nonlinear system is to first find a Lyapunov function for the system and characterize its region of attraction as the stability… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

  8. arXiv:2203.04430  [pdf, other

    eess.SY physics.soc-ph

    The Impact of Heavy-Duty Vehicle Electrification on Large Power Grids: a Synthetic Texas Case Study

    Authors: Rayan El Helou, S. Sivaranjani, Dileep Kalathil, Andrew Schaper, Le Xie

    Abstract: The electrification of heavy-duty vehicles (HDEVs) is a nascent and rapidly emerging avenue for decarbonization of the transportation sector. In this paper, we examine the impacts of increased vehicle electrification on the power grid infrastructure, with particular focus on HDEVs. We utilize a synthetic representation of the 2000-bus Texas transmission grid, and realistic representations of multi… ▽ More

    Submitted 8 March, 2022; originally announced March 2022.

  9. arXiv:2111.15063  [pdf, ps, other

    math.OC eess.SY

    Online Robust Control of Linear Dynamical Systems with Limited Prediction

    Authors: Deepan Muthirayan, Dileep Kalathil, Pramod P. Khargonekar

    Abstract: We study the online robust control problem for linear dynamical systems with disturbances and uncertainties in the cost functions, with limited preview of the future disturbances and the cost functions, $N$. Our goal is to find an online control policy that can minimize the disturbance gain, defined as the ratio of the cumulative cost and the cumulative energy in the disturbances over a period of… ▽ More

    Submitted 30 October, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

  10. arXiv:2111.15041  [pdf, ps, other

    cs.LG eess.SY

    Online Learning for Predictive Control with Provable Regret Guarantees

    Authors: Deepan Muthirayan, Jianjun Yuan, Dileep Kalathil, Pramod P. Khargonekar

    Abstract: We study the problem of online learning in predictive control of an unknown linear dynamical system with time varying cost functions which are unknown apriori. Specifically, we study the online learning problem where the control algorithm does not know the true system model and has only access to a fixed-length (that does not grow with the control horizon) preview of the future cost functions. The… ▽ More

    Submitted 31 October, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

  11. arXiv:2109.05802  [pdf, other

    eess.SY

    PyProD: A Machine Learning-Friendly Platform for Protection Analytics in Distribution Systems

    Authors: Dongqi Wu, Dileep Kalathil, Miroslav Begovic, Le Xie

    Abstract: This paper introduces PyProD, a Python-based machine learning (ML)-compatible test-bed for evaluating the efficacy of protection schemes in electric distribution grids. This testbed is designed to bridge the gap between conventional power distribution grid analysis and growing capability of ML-based decision making algorithms, in particular in the context of protection system design and configurat… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: This paper has been accepted for HICSS 2022 and will appear in the conference proceedings

  12. arXiv:2008.05699  [pdf, other

    cs.RO eess.SY

    A Vision-Based Control Method for Autonomous Landing of Vertical Flight Aircraft On a Moving Platform Without Using GPS

    Authors: Bochan Lee, Vishnu Saj, Moble Benedict, Dileep Kalathil

    Abstract: The paper discusses a novel vision-based estimation and control approach to enable fully autonomous tracking and landing of vertical take-off and landing (VTOL) capable unmanned aerial vehicles (UAVs) on moving platforms without relying on a GPS signal. A unique feature of the present method is that it accomplishes this task without tracking the landing pad itself; however, by utilizing a standard… ▽ More

    Submitted 16 August, 2020; v1 submitted 13 August, 2020; originally announced August 2020.

    Comments: Presented at the VFS International 76th Annual Forum & Technology Display, October 6-8, 2020. Submitted to the Journal of Guidance, Control, and Dynamics(under review)

  13. arXiv:2008.01231  [pdf, other

    eess.SY

    Fully Decentralized Reinforcement Learning-based Control of Photovoltaics in Distribution Grids for Joint Provision of Real and Reactive Power

    Authors: Rayan El Helou, Dileep Kalathil, Le Xie

    Abstract: In this paper, we introduce a new framework to address the problem of voltage regulation in unbalanced distribution grids with deep photovoltaic penetration. In this framework, both real and reactive power setpoints are explicitly controlled at each solar panel smart inverter, and the objective is to simultaneously minimize system-wide voltage deviation and maximize solar power output. We formulat… ▽ More

    Submitted 29 April, 2021; v1 submitted 3 August, 2020; originally announced August 2020.

  14. arXiv:2006.11608  [pdf, other

    cs.LG eess.SY stat.ML

    Robust Reinforcement Learning using Least Squares Policy Iteration with Provable Performance Guarantees

    Authors: Kishan Panaganti, Dileep Kalathil

    Abstract: This paper addresses the problem of model-free reinforcement learning for Robust Markov Decision Process (RMDP) with large state spaces. The goal of the RMDP framework is to find a policy that is robust against the parameter uncertainties due to the mismatch between the simulator model and real-world settings. We first propose the Robust Least Squares Policy Evaluation algorithm, which is a multi-… ▽ More

    Submitted 11 February, 2021; v1 submitted 20 June, 2020; originally announced June 2020.

    Comments: 26 pages, 12 figures, 2 tables

  15. arXiv:2004.00472  [pdf, other

    cs.NI eess.SY

    Learning to Cache and Caching to Learn: Regret Analysis of Caching Algorithms

    Authors: Archana Bura, Desik Rengarajan, Dileep Kalathil, Srinivas Shakkottai, Jean-Francois Chamberland-Tremblay

    Abstract: Crucial performance metrics of a caching algorithm include its ability to quickly and accurately learn a popularity distribution of requests. However, a majority of work on analytical performance analysis focuses on hit probability after an asymptotically large time has elapsed. We consider an online learning viewpoint, and characterize the "regret" in terms of the finite time difference between t… ▽ More

    Submitted 1 April, 2020; originally announced April 2020.

  16. arXiv:2003.02422  [pdf, other

    eess.SY

    Deep Reinforcement Learning-BasedRobust Protection in DER-Rich Distribution Grids

    Authors: Dongqi Wu, Dileep Kalathil, Miroslav Begovic, Le Xie

    Abstract: This paper introduces the concept of Deep Reinforcement Learning based architecture for protective relay design in power distribution systems with many distributed energy resources (DERs). The performance of widely-used overcurrent protection scheme is hindered by the presence of distributed generation, power electronic interfaced devices and fault impedance. In this paper, a reinforcement learnin… ▽ More

    Submitted 1 June, 2021; v1 submitted 4 March, 2020; originally announced March 2020.

    Comments: Submitted to IEEE Transactions of Smart Grid, under review

  17. arXiv:1908.08180  [pdf, ps, other

    eess.SP

    Creation of Synthetic Networked PMU Data: A Generative Adversarial Network Approach

    Authors: Xiangtian Zheng, Bin Wang, Dileep Kalathil, Le Xie

    Abstract: This paper introduces a machine learning-based approach to synthetically creating realistic phasor measurement unit (PMU) data streams of multiple transient types. In contrast to the existing literature of transient simulation-based data generation methods, we propose a generative adversarial network (GAN) based approach to learning directly from the historical data and simultaneously reproduce mu… ▽ More

    Submitted 6 April, 2020; v1 submitted 21 August, 2019; originally announced August 2019.

    Comments: This manuscript has been submitted to IEEE Transactions on Power Systems

  18. arXiv:1906.10815  [pdf, other

    eess.SY

    Nested Reinforcement Learning Based Control for Protective Relays in Power Distribution Systems

    Authors: Dongqi Wu, Xiangtian Zheng, Dileep Kalathil, Le Xie

    Abstract: This paper envisions a new control architecture for the protective relay setting in future power distribution systems. With deepening penetration of distributed energy resources at the end users level, it has been recognized as a key engineering challenge to redesign the protective relays in the future distribution system. Conceptually, these protective relays are the discrete ON/OFF control devic… ▽ More

    Submitted 25 June, 2019; originally announced June 2019.

  19. arXiv:1906.01069  [pdf, ps, other

    eess.SY

    Selling Demand Response Using Options

    Authors: Deepan Muthirayan, Dileep Kalathil, Sen Li, Kameshwar Poolla, Pravin Varaiya

    Abstract: Wholesale electricity markets in many jurisdictions use a two-settlement structure: a day-ahead market for bulk power transactions and a real-time market for fine-grain supply-demand balancing. This paper explores trading demand response assets within this two-settlement market structure. We consider two approaches for trading demand response assets: (a) an intermediate spot market with contingent… ▽ More

    Submitted 2 August, 2020; v1 submitted 3 June, 2019; originally announced June 2019.

  20. arXiv:1904.08361  [pdf, other

    cs.LG cs.RO eess.SY stat.ML

    Decoupled Data Based Approach for Learning to Control Nonlinear Dynamical Systems

    Authors: Ran Wang, Karthikeya Parunandi, Dan Yu, Dileep Kalathil, Suman Chakravorty

    Abstract: This paper addresses the problem of learning the optimal control policy for a nonlinear stochastic dynamical system with continuous state space, continuous action space and unknown dynamics. This class of problems are typically addressed in stochastic adaptive control and reinforcement learning literature using model-based and model-free approaches respectively. Both methods rely on solving a dyna… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.

  21. arXiv:1901.00959  [pdf, other

    cs.LG eess.IV stat.ML

    QFlow: A Learning Approach to High QoE Video Streaming at the Wireless Edge

    Authors: Rajarshi Bhattacharyya, Archana Bura, Desik Rengarajan, Mason Rumuly, Bainan Xia, Srinivas Shakkottai, Dileep Kalathil, Ricky K. P. Mok, Amogh Dhamdhere

    Abstract: The predominant use of wireless access networks is for media streaming applications, which are only gaining popularity as ever more devices become available for this purpose. However, current access networks treat all packets identically, and lack the agility to determine which clients are most in need of service at a given time. Software reconfigurability of networking devices has seen wide adopt… ▽ More

    Submitted 13 May, 2020; v1 submitted 3 January, 2019; originally announced January 2019.

    Comments: Submitted to ToN in May, 2020

  22. arXiv:1710.05394  [pdf, other

    stat.AP eess.SP

    Estimating Phase Duration for SPaT Messages

    Authors: Shahana Ibrahim, Dileep Kalathil, Rene O. Sanchez, Pravin Varaiya

    Abstract: A SPaT (Signal Phase and Timing) message describes for each lane the current phase at a signalized intersection together with an estimate of the residual time of that phase. Accurate SPaT messages can be used to construct a speed profile for a vehicle that reduces its fuel consumption as it approaches or leaves an intersection. This paper presents SPaT estimation algorithms at an intersection with… ▽ More

    Submitted 10 January, 2018; v1 submitted 15 October, 2017; originally announced October 2017.

    Comments: 9 Pages, 13 Figures, Under review

  23. arXiv:1608.06990  [pdf, other

    eess.SY

    The Sharing Economy for the Smart Grid

    Authors: Dileep Kalathil, Chenye Wu, Kameshwar Poolla, Pravin Varaiya

    Abstract: The sharing economy has disrupted housing and transportation sectors. Homeowners can rent out their property when they are away on vacation, car owners can offer ride sharing services. These sharing economy business models are based on monetizing under-utilized infrastructure. They are enabled by peer-to-peer platforms that match eager sellers with willing buyers. Are there compelling sharing ec… ▽ More

    Submitted 5 September, 2016; v1 submitted 24 August, 2016; originally announced August 2016.

    Comments: 11 pages, 11 figures

  24. arXiv:1411.0728  [pdf, ps, other

    cs.LG cs.GT eess.SY math.OC

    Approachability in Stackelberg Stochastic Games with Vector Costs

    Authors: Dileep Kalathil, Vivek Borkar, Rahul Jain

    Abstract: The notion of approachability was introduced by Blackwell [1] in the context of vector-valued repeated games. The famous Blackwell's approachability theorem prescribes a strategy for approachability, i.e., for `steering' the average cost of a given agent towards a given target set, irrespective of the strategies of the other agents. In this paper, motivated by the multi-objective optimization/deci… ▽ More

    Submitted 20 June, 2016; v1 submitted 3 November, 2014; originally announced November 2014.

    Comments: 18 Pages, Submitted to Dynamic Games and Applications

  25. arXiv:1206.3582  [pdf, other

    math.OC cs.LG eess.SY

    Decentralized Learning for Multi-player Multi-armed Bandits

    Authors: Dileep Kalathil, Naumaan Nayyar, Rahul Jain

    Abstract: We consider the problem of distributed online learning with multiple players in multi-armed bandits (MAB) models. Each player can pick among multiple arms. When a player picks an arm, it gets a reward. We consider both i.i.d. reward model and Markovian reward model. In the i.i.d. model each arm is modelled as an i.i.d. process with an unknown distribution with an unknown mean. In the Markovian mod… ▽ More

    Submitted 14 June, 2012; originally announced June 2012.

    Comments: 33 pages, 3 figures. Submitted to IEEE Transactions on Information Theory