Showing 1–12 of 12 results for author: Paruchuri, P

  1. arXiv:2407.01310

    cs.LG cs.CV

    Multi-State-Action Tokenisation in Decision Transformers for Multi-Discrete Action Spaces

    Authors: Perusha Moodley, Pramod Kaushik, Dhillu Thambi, Mark Trovinger, Praveen Paruchuri, Xia Hong, Benjamin Rosman

    Abstract: Decision Transformers, in their vanilla form, struggle to perform on image-based environments with multi-discrete action spaces. Although enhanced Decision Transformer architectures have been developed to improve performance, these methods have not specifically addressed this problem of multi-discrete action spaces which hampers existing Decision Transformer architectures from learning good repres…

    Submitted 1 July, 2024; originally announced July 2024.

  2. arXiv:2406.19626

    cs.AI

    Safety through feedback in Constrained RL

    Authors: Shashank Reddy Chirra, Pradeep Varakantham, Praveen Paruchuri

    Abstract: In safety-critical RL settings, the inclusion of an additional cost function is often favoured over the arduous task of modifying the reward function to ensure the agent's safe behaviour. However, designing or evaluating such a cost function can be prohibitively expensive. For instance, in the domain of self-driving, designing a cost function that encompasses all unsafe behaviours (e.g. aggressive…

    Submitted 27 June, 2024; originally announced June 2024.

  3. arXiv:2307.12661

    eess.SY math.OC

    Algorithmic construction of Lyapunov functions for continuous vector fields via convex semi-infinite programs

    Authors: Raavi Gupta, Sameep Chattopadhyay, Pradyumna Paruchuri, Debasish Chatterjee

    Abstract: This article presents a novel numerically tractable technique for synthesizing Lyapunov functions for equilibria of nonlinear vector fields. In broad strokes, corresponding to an isolated equilibrium point of a given vector field, a selection is made of a compact neighborhood of the equilibrium and a dictionary of functions in which a Lyapunov function is expected to lie. Then an algorithmic proce…

    Submitted 25 August, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: 29 pages. Submitted

    MSC Class: 93D05; 93D20; 65K05; 65P40

  4. arXiv:2307.01304

    math.OC cs.LG eess.SP eess.SY

    A numerical algorithm for attaining the Chebyshev bound in optimal learning

    Authors: Pradyumna Paruchuri, Debasish Chatterjee

    Abstract: Given a compact subset of a Banach space, the Chebyshev center problem consists of finding a minimal circumscribing ball containing the set. In this article we establish a numerically tractable algorithm for solving the Chebyshev center problem in the context of optimal learning from a finite set of data points. For a hypothesis space realized as a compact but not necessarily convex subset of a fi…

    Submitted 3 July, 2023; originally announced July 2023.

    Comments: 22 pages, 16 figures

  5. arXiv:2302.14442

    cs.AI

    City-scale Pollution Aware Traffic Routing by Sampling Max Flows using MCMC

    Authors: Shreevignesh Suriyanarayanan, Praveen Paruchuri, Girish Varma

    Abstract: A significant cause of air pollution in urban areas worldwide is the high volume of road traffic. Long-term exposure to severe pollution can cause serious health issues. One approach towards tackling this problem is to design a pollution-aware traffic routing policy that balances multiple objectives of i) avoiding extreme pollution in any area ii) enabling short transit times, and iii) making effe…

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: Accepted in AAAI 2023 (AI for Social Impact Track)

  6. arXiv:2301.09892

    cs.GT cs.CR

    Learning Effective Strategies for Moving Target Defense with Switching Costs

    Authors: Vignesh Viswanathan, Megha Bose, Praveen Paruchuri

    Abstract: Moving Target Defense (MTD) has emerged as a key technique in various security applications as it takes away the attacker's ability to perform reconnaissance for exploiting a system's vulnerabilities. However, most of the existing research in the field assumes unrealistic access to information about the attacker's motivations and/or actions when developing MTD strategies. Many of the existing appr…

    Submitted 24 January, 2023; originally announced January 2023.

  7. arXiv:2201.10127

    cs.GT econ.TH

    Multi-unit Double Auctions: Equilibrium Analysis and Bidding Strategy using DDPG in Smart-grids

    Authors: Sanjay Chandlekar, Easwar Subramanian, Sanjay Bhat, Praveen Paruchuri, Sujit Gujar

    Abstract: Periodic double auctions (PDA) have applications in many areas such as in e-commerce, intra-day equity markets, and day-ahead energy markets in smart-grids. While the trades accomplished using PDAs are worth trillions of dollars, finding a reliable bidding strategy in such auctions is still a challenge as it requires the consideration of future auctions. A participating buyer in a PDA has to desig…

    Submitted 22 February, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: Accepted for publication in the proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems (AAMAS-22)

  8. arXiv:2112.05495

    cs.LG cs.AI

    How Private Is Your RL Policy? An Inverse RL Based Analysis Framework

    Authors: Kritika Prakash, Fiza Husain, Praveen Paruchuri, Sujit P. Gujar

    Abstract: Reinforcement Learning (RL) enables agents to learn how to perform various tasks from scratch. In domains like autonomous driving, recommendation systems, and more, optimal RL policies learned could cause a privacy breach if the policies memorize any part of the private reward. We study the set of existing differentially-private RL policies derived from various RL algorithms such as Value Iteratio…

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: 15 pages, 7 figures, 5 tables, version accepted at AAAI 2022

  9. arXiv:1911.08260

    cs.GT cs.MA q-fin.TR

    Bidding in Smart Grid PDAs: Theory, Analysis and Strategy (Extended Version)

    Authors: Susobhan Ghosh, Sujit Gujar, Praveen Paruchuri, Easwar Subramanian, Sanjay P. Bhat

    Abstract: Periodic Double Auctions (PDAs) are commonly used in the real world for trading, e.g. in stock markets to determine stock opening prices, and energy markets to trade energy in order to balance net demand in smart grids, involving trillions of dollars in the process. A bidder, participating in such PDAs, has to plan for bids in the current auction as well as for the future auctions, which highlight…

    Submitted 23 November, 2019; v1 submitted 19 November, 2019; originally announced November 2019.

    Comments: Accepted for publication in the proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence (AAAI-20)

  10. Discrete time optimal control with frequency constraints for non-smooth systems

    Authors: Shruti Kotpalliwar, Pradyumna Paruchuri, Debasish Chatterjee, Ravi Banavar

    Abstract: We present a Pontryagin maximum principle for discrete time optimal control problems with (a) pointwise constraints on the control actions and the states, (b) frequency constraints on the control and the state trajectories, and (c) nonsmooth dynamical systems. Pointwise constraints on the states and the control actions represent desired and/or physical limitations on the states and the control val…

    Submitted 27 March, 2019; v1 submitted 19 January, 2019; originally announced January 2019.

  11. arXiv:1803.03052

    eess.SY math.OC

    A frequency-constrained geometric Pontryagin maximum principle on matrix Lie groups

    Authors: Shruti Kotpalliwar, Pradyumna Paruchuri, Karmvir Singh Phogat, Debasish Chatterjee, Ravi Banavar

    Abstract: In this article we present a geometric discrete-time Pontryagin maximum principle (PMP) on matrix Lie groups that incorporates frequency constraints on the controls in addition to pointwise constraints on the states and control actions directly at the stage of the problem formulation. This PMP gives first order necessary conditions for optimality, and leads to two-point boundary value problems tha…

    Submitted 27 March, 2019; v1 submitted 8 March, 2018; originally announced March 2018.

  12. Discrete time Pontryagin maximum principle for optimal control problems under state-action-frequency constraints

    Authors: Pradyumna Paruchuri, Debasish Chatterjee

    Abstract: We establish a Pontryagin maximum principle for discrete time optimal control problems under the following three types of constraints: a) constraints on the states pointwise in time, b) constraints on the control actions pointwise in time, and c) constraints on the frequency spectrum of the optimal control trajectories. While the first two types of constraints are already included in the existing…

    Submitted 15 August, 2017; originally announced August 2017.

    Comments: 31 pages

    MSC Class: 49K21