Skip to main content

Showing 1–5 of 5 results for author: Agarwal, M

Searching in archive eess. Search in all archives.
.
  1. arXiv:2402.13301  [pdf, other

    cs.SD cs.AI eess.AS

    Structure-informed Positional Encoding for Music Generation

    Authors: Manvi Agarwal, Changhong Wang, Gaël Richard

    Abstract: Music generated by deep learning methods often suffers from a lack of coherence and long-term organization. Yet, multi-scale hierarchical structure is a distinctive feature of music signals. To leverage this information, we propose a structure-informed positional encoding framework for music generation with Transformers. We design three variants in terms of absolute, relative and non-stationary po… ▽ More

    Submitted 28 February, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Apr 2024, Seoul, South Korea

  2. arXiv:2106.06680  [pdf, other

    cs.LG cs.AI eess.SY

    Markov Decision Processes with Long-Term Average Constraints

    Authors: Mridul Agarwal, Qinbo Bai, Vaneet Aggarwal

    Abstract: We consider the problem of constrained Markov Decision Process (CMDP) where an agent interacts with a unichain Markov Decision Process. At every interaction, the agent obtains a reward. Further, there are $K$ cost functions. The agent aims to maximize the long-term average reward while simultaneously kee** the $K$ long-term average costs lower than a certain threshold. In this paper, we propose… ▽ More

    Submitted 20 June, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

  3. arXiv:2105.14125  [pdf, other

    cs.LG cs.AI eess.SY

    Joint Optimization of Multi-Objective Reinforcement Learning with Policy Gradient Based Algorithm

    Authors: Qinbo Bai, Mridul Agarwal, Vaneet Aggarwal

    Abstract: Many engineering problems have multiple objectives, and the overall aim is to optimize a non-linear function of these objectives. In this paper, we formulate the problem of maximizing a non-linear concave function of multiple long-term objectives. A policy-gradient based model-free algorithm is proposed for the problem. To compute an estimate of the gradient, a biased estimator is proposed. The pr… ▽ More

    Submitted 28 May, 2021; originally announced May 2021.

  4. arXiv:2007.08637  [pdf, other

    eess.IV cs.CV cs.LG

    COV-ELM classifier: An Extreme Learning Machine based identification of COVID-19 using Chest X-Ray Images

    Authors: Sheetal Rajpal, Manoj Agarwal, Ankit Rajpal, Navin Lakhyani, Arpita Saggar, Naveen Kumar

    Abstract: Coronaviruses constitute a family of viruses that gives rise to respiratory diseases. As COVID-19 is highly contagious, early diagnosis of COVID-19 is crucial for an effective treatment strategy. However, the RT-PCR test which is considered to be a gold standard in the diagnosis of COVID-19 suffers from a high false-negative rate. Chest X-ray (CXR) image analysis has emerged as a feasible and effe… ▽ More

    Submitted 28 September, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

  5. arXiv:2006.16498  [pdf, other

    cs.NE cs.LG eess.SP

    Accelerating Reinforcement Learning Agent with EEG-based Implicit Human Feedback

    Authors: Duo Xu, Mohit Agarwal, Ekansh Gupta, Faramarz Fekri, Raghupathy Sivakumar

    Abstract: Providing Reinforcement Learning (RL) agents with human feedback can dramatically improve various aspects of learning. However, previous methods require human observer to give inputs explicitly (e.g., press buttons, voice interface), burdening the human in the loop of RL agent's learning process. Further, it is sometimes difficult or impossible to obtain the explicit human advise (feedback), e.g.,… ▽ More

    Submitted 14 October, 2020; v1 submitted 29 June, 2020; originally announced June 2020.