Showing 1–19 of 19 results for author: Chandak, S

  1. arXiv:2407.00575  [pdf, other]

    cs.MA cs.LG eess.SY

    Learning to Control Unknown Strongly Monotone Games

    Authors: Siddharth Chandak, Ilai Bistritz, Nicholas Bambos

    Abstract: Consider $N$ players each with a $d$-dimensional action set. Each of the players' utility functions includes their reward function and a linear term for each dimension, with coefficients that are controlled by the manager. We assume that the game is strongly monotone, so if each player runs gradient descent, the dynamics converge to a unique Nash equilibrium (NE). The NE is typically inefficient i…

    Submitted 29 June, 2024; originally announced July 2024.

    Comments: Submitted to IEEE Transactions on Automatic Control

  2. arXiv:2406.18000  [pdf, other]

    eess.SY

    Tiered Service Architecture for Remote Patient Monitoring

    Authors: Siddharth Chandak, Isha Thapa, Nicholas Bambos, David Scheinker

    Abstract: We develop a remote patient monitoring (RPM) service architecture, which has two tiers of monitoring: ordinary and intensive. The patient's health state improves or worsens in each time period according to certain probabilities, which depend on the monitoring tier. The patient incurs a "loss of quality of life" cost or an "invasiveness" cost, which is higher under intensive monitoring than under o…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Submitted to IEEE Healthcom 2024. 7 pages

  3. arXiv:2312.10424  [pdf, other]

    cs.LG eess.SY stat.ML

    A Concentration Bound for TD(0) with Function Approximation

    Authors: Siddharth Chandak, Vivek S. Borkar

    Abstract: We derive a concentration bound of the type `for all $n \geq n_0$ for some $n_0$' for TD(0) with linear function approximation. We work with online TD learning with samples from a single sample path of the underlying Markov chain. This makes our analysis significantly different from offline TD learning or TD learning with access to independent samples from the stationary distribution of the Markov…

    Submitted 16 December, 2023; originally announced December 2023.

    Comments: Submitted to Stochastic Systems

  4. arXiv:2302.13653  [pdf, other]

    cs.LG cs.MA

    Equilibrium Bandits: Learning Optimal Equilibria of Unknown Dynamics

    Authors: Siddharth Chandak, Ilai Bistritz, Nicholas Bambos

    Abstract: Consider a decision-maker that can pick one out of $K$ actions to control an unknown system, for $T$ turns. The actions are interpreted as different configurations or policies. Holding the same action fixed, the system asymptotically converges to a unique equilibrium, as a function of this action. The dynamics of the system are unknown to the decision-maker, which can only observe a noisy reward a…

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted at the 22nd International Conference on Autonomous Agents and Multiagent Systems (2023)

  5. arXiv:2211.01595  [pdf, other]

    eess.SY cs.LG

    Reinforcement Learning in Non-Markovian Environments

    Authors: Siddharth Chandak, Pratik Shah, Vivek S Borkar, Parth Dodhia

    Abstract: Motivated by the novel paradigm developed by Van Roy and coauthors for reinforcement learning in arbitrary non-Markovian environments, we propose a related formulation and explicitly pin down the error caused by non-Markovianity of observations when the Q-learning algorithm is applied on this formulation. Based on this observation, we propose that the criterion for agent design should be to seek g…

    Submitted 13 February, 2024; v1 submitted 3 November, 2022; originally announced November 2022.

    Comments: 19 pages, accepted for publication at Systems and Control Letters

  6. arXiv:2202.00401  [pdf, ps, other]

    cs.NI cs.MA

    Learning to Speak on Behalf of a Group: Medium Access Control for Sending a Shared Message

    Authors: Shaan ul Haque, Siddharth Chandak, Federico Chiariotti, Deniz Gunduz, Petar Popovski

    Abstract: The rapid development of Industrial Internet of Things (IIoT) technologies has not only enabled new applications, but also presented new challenges for reliable communication with limited resources. In this work, we define a deceptively simple novel problem that can arise in these scenarios, in which a set of sensors need to communicate a joint observation. This observation is shared by a random s…

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: Submitted to IEEE INFOCOM Workshops 2022

  7. arXiv:2111.02644  [pdf, ps, other]

    cs.LG eess.SY

    A Concentration Bound for LSPE($λ$)

    Authors: Siddharth Chandak, Vivek S. Borkar, Harsh Dolhare

    Abstract: The popular LSPE($λ$) algorithm for policy evaluation is revisited to derive a concentration bound that gives high probability performance guarantees from some time on.

    Submitted 30 November, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: 17 pages, accepted for publication in Systems and Control Letters

  8. arXiv:2109.12075  [pdf, other]

    cs.AI cs.LG

    Towards A Measure Of General Machine Intelligence

    Authors: Gautham Venkatasubramanian, Sibesh Kar, Abhimanyu Singh, Shubham Mishra, Dushyant Yadav, Shreyansh Chandak

    Abstract: To build general-purpose artificial intelligence systems that can deal with unknown variables across unknown domains, we need benchmarks that measure how well these systems perform on tasks they have never seen before. A prerequisite for this is a measure of a task's generalization difficulty, or how dissimilar it is from the system's prior knowledge and experience. If the skill of an intelligence…

    Submitted 24 May, 2022; v1 submitted 24 September, 2021; originally announced September 2021.

    Comments: 31 pages, 15 Figures, 3 Tables; Sample Data and g-index Reference Code at https://github.com/mayahq/g-index-benchmark; g-index toy environment at https://github.com/mayahq/flatland; version 2 added a section about the toy environment; version 3 compressed images to reduce file size; version 4 updated description of flatland toy environment

    ACM Class: I.2.2; I.2.5; I.2.7

  9. arXiv:2106.14308  [pdf, other]

    cs.LG eess.SY

    Concentration of Contractive Stochastic Approximation and Reinforcement Learning

    Authors: Siddharth Chandak, Vivek S. Borkar, Parth Dodhia

    Abstract: Using a martingale concentration inequality, concentration bounds `from time $n_0$ on' are derived for stochastic approximation algorithms with contractive maps and both martingale difference and Markov noises. These are applied to reinforcement learning algorithms, in particular to asynchronous Q-learning and TD(0).

    Submitted 11 June, 2022; v1 submitted 27 June, 2021; originally announced June 2021.

    Comments: 20 pages, Accepted for publication in Stochastic Systems

  10. arXiv:2106.14014  [pdf, other]

    eess.IV cs.MM

    Txt2Vid: Ultra-Low Bitrate Compression of Talking-Head Videos via Text

    Authors: Pulkit Tandon, Shubham Chandak, Pat Pataranutaporn, Yimeng Liu, Anesu M. Mapuranga, Pattie Maes, Tsachy Weissman, Misha Sra

    Abstract: Video represents the majority of internet traffic today, driving a continual race between the generation of higher quality content, transmission of larger file sizes, and the development of network infrastructure. In addition, the recent COVID-19 pandemic fueled a surge in the use of video conferencing tools. Since videos take up considerable bandwidth (~100 Kbps to a few Mbps), improved video com…

    Submitted 2 April, 2022; v1 submitted 26 June, 2021; originally announced June 2021.

    Comments: 11 pages, 8 figures, 2 tables. Addition of statistical analysis of results. Reorganization and rewriting of text to make it clearer

  11. Prospect-theoretic Q-learning

    Authors: Vivek S. Borkar, Siddharth Chandak

    Abstract: We consider a prospect theoretic version of the classical Q-learning algorithm for discounted reward Markov decision processes, wherein the controller perceives a distorted and noisy future reward, modeled by a nonlinearity that accentuates gains and underrepresents losses relative to a reference point. We analyze the asymptotic behavior of the scheme by analyzing its limiting differential equatio…

    Submitted 1 September, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Published in Systems and Control Letters. 16 pages, 8 figures

  12. arXiv:2102.01839  [pdf, ps, other]

    cs.IT

    On Coding for an Abstracted Nanopore Channel for DNA Storage

    Authors: Reyna Hulett, Shubham Chandak, Mary Wootters

    Abstract: In the emerging field of DNA storage, data is encoded as DNA sequences and stored. The data is read out again by sequencing the stored DNA. Nanopore sequencing is a new sequencing technology that has many advantages over other methods; in particular, it is cheap, portable, and can support longer reads. While several practical coding schemes have been developed for DNA storage with nanopore sequenc…

    Submitted 2 February, 2021; originally announced February 2021.

  13. Hidden Markov Model-Based Encoding for Time-Correlated IoT Sources

    Authors: Siddharth Chandak, Federico Chiariotti, Petar Popovski

    Abstract: As the use of Internet of Things (IoT) devices for monitoring purposes becomes ubiquitous, the efficiency of sensor communication is a major issue for the modern Internet. Channel coding is less efficient for extremely short packets, and traditional techniques that rely on source compression require extensive signaling or pre-existing knowledge of the source dynamics. In this work, we propose an e…

    Submitted 20 January, 2021; v1 submitted 19 January, 2021; originally announced January 2021.

    Comments: Preprint version of the paper published in IEEE Communications Letters

    MSC Class: 94A05 (Primary); 94B35; 62M05 (Secondary) ACM Class: E.4; H.1.1

  14. arXiv:2011.03800  [pdf, other]

    eess.IV

    Reducing latency and bandwidth for video streaming using keypoint extraction and digital puppetry

    Authors: Roshan Prabhakar, Shubham Chandak, Carina Chiu, Renee Liang, Huong Nguyen, Kedar Tatwawadi, Tsachy Weissman

    Abstract: COVID-19 has made video communication one of the most important modes of information exchange. While extensive research has been conducted on the optimization of the video streaming pipeline, in particular the development of novel video codecs, further improvement in the video quality and latency is required, especially under poor network conditions. This paper proposes an alternative to the conve…

    Submitted 8 January, 2021; v1 submitted 7 November, 2020; originally announced November 2020.

    Comments: 10 pages, 5 figures, 1-page summary to be published at DCC 2021. Revision: added references

  15. arXiv:1911.03572  [pdf, other]

    cs.LG cs.IT stat.ML

    DZip: improved general-purpose lossless compression based on novel neural network modeling

    Authors: Mohit Goyal, Kedar Tatwawadi, Shubham Chandak, Idoia Ochoa

    Abstract: We consider lossless compression based on statistical data modeling followed by prediction-based encoding, where an accurate statistical model for the input data leads to substantial improvements in compression. We propose DZip, a general-purpose compressor for sequential data that exploits the well-known modeling capabilities of neural networks (NNs) for prediction, followed by arithmetic coding.…

    Submitted 18 September, 2020; v1 submitted 8 November, 2019; originally announced November 2019.

    Comments: Updated manuscript and an efficient implementation added

  16. arXiv:1911.00208  [pdf, other]

    eess.SP cs.LG

    LFZip: Lossy compression of multivariate floating-point time series data via improved prediction

    Authors: Shubham Chandak, Kedar Tatwawadi, Chengtao Wen, Lingyun Wang, Juan Aparicio, Tsachy Weissman

    Abstract: Time series data compression is emerging as an important problem with the growth in IoT devices and sensors. Due to the presence of noise in these datasets, lossy compression can often provide significant compression gains without impacting the performance of downstream applications. In this work, we propose an error-bounded lossy compressor, LFZip, for multivariate floating-point time series data…

    Submitted 13 January, 2020; v1 submitted 1 November, 2019; originally announced November 2019.

  17. arXiv:1906.07887  [pdf, ps, other]

    cs.DS cs.IT eess.SP

    Tutorial on algebraic deletion correction codes

    Authors: Kedar Tatwawadi, Shubham Chandak

    Abstract: The deletion channel is known to be a notoriously difficult channel to design error-correction codes for. In spite of this difficulty, there are some beautiful code constructions which give some intuition about the channel and about what good deletion codes look like. In this tutorial we will take a look at some of them. This document is a transcript of my talk at the coding theory reading group on…

    Submitted 18 June, 2019; originally announced June 2019.

  18. arXiv:1811.08162  [pdf, other]

    cs.CL eess.SP q-bio.GN

    DeepZip: Lossless Data Compression using Recurrent Neural Networks

    Authors: Mohit Goyal, Kedar Tatwawadi, Shubham Chandak, Idoia Ochoa

    Abstract: Sequential data is being generated at an unprecedented pace in various forms, including text and genomic data. This creates the need for efficient compression mechanisms to enable better storage, transmission and processing of such data. To solve this problem, many of the existing compressors attempt to learn models for the data and perform prediction-based compression. Since neural networks are k…

    Submitted 20 November, 2018; originally announced November 2018.

  19. arXiv:1810.11137  [pdf, other]

    eess.IV cs.CV cs.IT cs.MM

    Towards improved lossy image compression: Human image reconstruction with public-domain images

    Authors: Ashutosh Bhown, Soham Mukherjee, Sean Yang, Shubham Chandak, Irena Fischer-Hwang, Kedar Tatwawadi, Judith Fan, Tsachy Weissman

    Abstract: Lossy image compression has been studied extensively in the context of typical loss functions such as RMSE, MS-SSIM, etc. However, compression at low bitrates generally produces unsatisfying results. Furthermore, the availability of massive public image datasets appears to have hardly been exploited in image compression. Here, we present a paradigm for eliciting human image reconstruction in order…

    Submitted 24 June, 2019; v1 submitted 25 October, 2018; originally announced October 2018.