Showing 1–9 of 9 results for author: Rengarajan, D

  1. arXiv:2404.07315  [pdf, other]

    eess.SY cs.AI cs.LG

    Structured Reinforcement Learning for Media Streaming at the Wireless Edge

    Authors: Archana Bura, Sarat Chandra Bobbili, Shreyas Rameshkumar, Desik Rengarajan, Dileep Kalathil, Srinivas Shakkottai

    Abstract: Media streaming is the dominant application over wireless edge (access) networks. The increasing softwarization of such networks has led to efforts at intelligent control, wherein application-specific actions may be dynamically taken to enhance the user experience. The goal of this work is to develop and demonstrate learning-based policies for optimal decision making to determine which clients to…

    Submitted 16 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

    Comments: 15 pages, 14 figures

  2. arXiv:2310.18679  [pdf]

    cs.CL cs.AI cs.LG

    N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics

    Authors: Sajad Mousavi, Ricardo Luna Gutiérrez, Desik Rengarajan, Vineet Gundecha, Ashwin Ramesh Babu, Avisek Naug, Antonio Guillen, Soumyendu Sarkar

    Abstract: We propose a self-correction mechanism for Large Language Models (LLMs) to mitigate issues such as toxicity and fact hallucination. This method involves refining model outputs through an ensemble of critics and the model's own feedback. Drawing inspiration from human behavior, we explore whether LLMs can emulate the self-correction process observed in humans who often engage in self-reflection and…

    Submitted 8 November, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

    Journal ref: NeurIPS 2023 Workshop on Robustness of Few-shot and Zero-shot Learning in Foundation Models, 2023 (NeurIPS 2023)

  3. arXiv:2305.03097  [pdf, other]

    cs.LG cs.AI

    Federated Ensemble-Directed Offline Reinforcement Learning

    Authors: Desik Rengarajan, Nitin Ragothaman, Dileep Kalathil, Srinivas Shakkottai

    Abstract: We consider the problem of federated offline reinforcement learning (RL), a scenario under which distributed learning agents must collaboratively learn a high-quality control policy only using small pre-collected datasets generated according to different unknown behavior policies. Naively combining a standard offline RL approach with a standard federated learning approach to solve this problem can…

    Submitted 4 May, 2023; originally announced May 2023.

  4. arXiv:2209.13048  [pdf, other]

    cs.LG cs.RO

    Enhanced Meta Reinforcement Learning using Demonstrations in Sparse Reward Environments

    Authors: Desik Rengarajan, Sapana Chaudhary, Jaewon Kim, Dileep Kalathil, Srinivas Shakkottai

    Abstract: Meta reinforcement learning (Meta-RL) is an approach wherein the experience gained from solving a variety of tasks is distilled into a meta-policy. The meta-policy, when adapted over only a small (or just a single) number of steps, is able to perform near-optimally on a new, related task. However, a major challenge to adopting this approach to solve real-world problems is that they are often assoc…

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: Accepted to NeurIPS 2022; first two authors contributed equally

  5. arXiv:2202.04628  [pdf, other]

    cs.LG cs.AI

    Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration

    Authors: Desik Rengarajan, Gargi Vaidya, Akshay Sarvesh, Dileep Kalathil, Srinivas Shakkottai

    Abstract: A major challenge in real-world reinforcement learning (RL) is the sparsity of reward feedback. Often, what is available is an intuitive but sparse reward function that only indicates whether the task is completed partially or fully. However, the lack of carefully designed, fine grain feedback implies that most existing RL algorithms fail to learn an acceptable policy in a reasonable time frame. T…

    Submitted 13 February, 2022; v1 submitted 9 February, 2022; originally announced February 2022.

  6. arXiv:2006.11683  [pdf, other]

    math.OC cs.GT cs.LG cs.MA

    Reinforcement Learning for Mean Field Games with Strategic Complementarities

    Authors: Kiyeob Lee, Desik Rengarajan, Dileep Kalathil, Srinivas Shakkottai

    Abstract: Mean Field Games (MFG) are the class of games with a very large number of agents and the standard equilibrium concept is a Mean Field Equilibrium (MFE). Algorithms for learning MFE in dynamic MFGs are unknown in general. Our focus is on an important subclass that possesses a monotonicity property called Strategic Complementarities (MFG-SC). We introduce a natural refinement to the equilibrium concep…

    Submitted 1 February, 2021; v1 submitted 20 June, 2020; originally announced June 2020.

  7. arXiv:2004.00472  [pdf, other]

    cs.NI eess.SY

    Learning to Cache and Caching to Learn: Regret Analysis of Caching Algorithms

    Authors: Archana Bura, Desik Rengarajan, Dileep Kalathil, Srinivas Shakkottai, Jean-Francois Chamberland-Tremblay

    Abstract: Crucial performance metrics of a caching algorithm include its ability to quickly and accurately learn a popularity distribution of requests. However, a majority of work on analytical performance analysis focuses on hit probability after an asymptotically large time has elapsed. We consider an online learning viewpoint, and characterize the "regret" in terms of the finite time difference between t…

    Submitted 1 April, 2020; originally announced April 2020.

  8. arXiv:1901.00959  [pdf, other]

    cs.LG eess.IV stat.ML

    QFlow: A Learning Approach to High QoE Video Streaming at the Wireless Edge

    Authors: Rajarshi Bhattacharyya, Archana Bura, Desik Rengarajan, Mason Rumuly, Bainan Xia, Srinivas Shakkottai, Dileep Kalathil, Ricky K. P. Mok, Amogh Dhamdhere

    Abstract: The predominant use of wireless access networks is for media streaming applications, which are only gaining popularity as ever more devices become available for this purpose. However, current access networks treat all packets identically, and lack the agility to determine which clients are most in need of service at a given time. Software reconfigurability of networking devices has seen wide adopt…

    Submitted 13 May, 2020; v1 submitted 3 January, 2019; originally announced January 2019.

    Comments: Submitted to ToN in May, 2020

  9. arXiv:1801.00825  [pdf, other]

    cs.NI

    FlowBazaar: A Market-Mediated Software Defined Communications Ecosystem at the Wireless Edge

    Authors: Rajarshi Bhattacharyya, Bainan Xia, Desik Rengarajan, Srinivas Shakkottai, Dileep Kalathil

    Abstract: The predominant use of wireless access networks is for media streaming applications, which are only gaining popularity as ever more devices become available for this purpose. However, current access networks treat all packets identically, and lack the agility to determine which clients are most in need of service at a given time. Software reconfigurability of networking devices has seen wide adopt…

    Submitted 23 January, 2019; v1 submitted 2 January, 2018; originally announced January 2018.

    Comments: Submitted to WiOpt, 2019