Showing 1–9 of 9 results for author: Satsangi, Y

  1. arXiv:2305.15016  [pdf, other]

    cs.LG

    Estimating class separability of text embeddings with persistent homology

    Authors: Kostis Gourgoulias, Najah Ghalyan, Maxime Labonne, Yash Satsangi, Sean Moran, Joseph Sabelja

    Abstract: This paper introduces an unsupervised method to estimate the class separability of text datasets from a topological point of view. Using persistent homology, we demonstrate how tracking the evolution of embedding manifolds during training can inform about class separability. More specifically, we show how this technique can be applied to detect when the training process stops improving the separab…

    Submitted 18 June, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Updated version of the article; pre-print of the version published at Transactions of Machine Learning Research, https://openreview.net/forum?id=8DWrIMuLya

  2. arXiv:2304.07163  [pdf, other]

    cs.AI cs.LG

    Bandit-Based Policy Invariant Explicit Shaping for Incorporating External Advice in Reinforcement Learning

    Authors: Yash Satsangi, Paniz Behboudian

    Abstract: A key challenge for a reinforcement learning (RL) agent is to incorporate external/expert advice in its learning. The desired goals of an algorithm that can shape the learning of an RL agent with external advice include (a) maintaining policy invariance; (b) accelerating the learning of the agent; and (c) learning from arbitrary advice [3]. To address this challenge, this paper formulates the prob…

    Submitted 18 September, 2023; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: ALA workshop, AAMAS 2023

  3. arXiv:2208.09495  [pdf, other]

    cs.SE cs.AI

    Topical: Learning Repository Embeddings from Source Code using Attention

    Authors: Agathe Lherondelle, Varun Babbar, Yash Satsangi, Fran Silavong, Shaltiel Eloul, Sean Moran

    Abstract: This paper presents Topical, a novel deep neural network for repository level embeddings. Existing methods, reliant on natural language documentation or naive aggregation techniques, are outperformed by Topical's utilization of an attention mechanism. This mechanism generates repository-level representations from source code, full dependency graphs, and script level textual data. Trained on public…

    Submitted 4 November, 2023; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: Pre-print, under review

  4. arXiv:2110.15907  [pdf, other]

    cs.AI cs.LG

    Learning to Be Cautious

    Authors: Montaser Mohammedalamen, Dustin Morrill, Alexander Sieusahai, Yash Satsangi, Michael Bowling

    Abstract: A key challenge in the field of reinforcement learning is to develop agents that behave cautiously in novel situations. It is generally impossible to anticipate all situations that an autonomous system may face or what behavior would best avoid bad outcomes. An agent that could learn to be cautious would overcome this challenge by discovering for itself when and how to behave cautiously. In contra…

    Submitted 29 October, 2021; originally announced October 2021.

  5. arXiv:2011.01297  [pdf, other]

    cs.LG cs.AI

    Useful Policy Invariant Shaping from Arbitrary Advice

    Authors: Paniz Behboudian, Yash Satsangi, Matthew E. Taylor, Anna Harutyunyan, Michael Bowling

    Abstract: Reinforcement learning is a powerful learning paradigm in which agents can learn to maximize sparse and delayed reward signals. Although RL has had many impressive successes in complex domains, learning can take hours, days, or even years of training data. A major challenge of contemporary RL research is to discover how to learn with less data. Previous work has shown that domain information can b…

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: 9 pages, 6 figures, Adaptive and Learning Agents (ALA) 2020 Workshop

  6. arXiv:2010.03024  [pdf, other]

    cs.CV

    Real-Time Resource Allocation for Tracking Systems

    Authors: Yash Satsangi, Shimon Whiteson, Frans A. Oliehoek, Henri Bouma

    Abstract: Automated tracking is key to many computer vision applications. However, many tracking systems struggle to perform in real-time due to the high computational cost of detecting people, especially in ultra high resolution images. We propose a new algorithm called PartiMax that greatly reduces this cost by applying the person detector only to the relevant parts of the image. PartiMax exploits…

    Submitted 21 September, 2020; originally announced October 2020.

    Comments: http://auai.org/uai2017/proceedings/papers/130.pdf

    Journal ref: UAI 2017

  7. Exploiting Submodular Value Functions For Scaling Up Active Perception

    Authors: Yash Satsangi, Shimon Whiteson, Frans A. Oliehoek, Matthijs T. J. Spaan

    Abstract: In active perception tasks, an agent aims to select sensory actions that reduce its uncertainty about one or more hidden variables. While partially observable Markov decision processes (POMDPs) provide a natural model for such problems, reward functions that directly penalize uncertainty in the agent's belief can remove the piecewise-linear and convex property of the value function required by mos…

    Submitted 21 September, 2020; originally announced September 2020.

    Journal ref: Autonomous Robots 42, 2018. Original article available via Springer journal open access: https://link.springer.com/article/10.1007/s10514-017-9666-5

  8. arXiv:2005.04912  [pdf, other]

    cs.AI cs.LG

    Maximizing Information Gain in Partially Observable Environments via Prediction Reward

    Authors: Yash Satsangi, Sungsu Lim, Shimon Whiteson, Frans Oliehoek, Martha White

    Abstract: Information gathering in a partially observable environment can be formulated as a reinforcement learning (RL) problem where the reward depends on the agent's uncertainty. For example, the reward can be the negative entropy of the agent's belief over an unknown (or hidden) variable. Typically, the rewards of an RL agent are defined as a function of the state-action pairs and not as a function of…

    Submitted 11 May, 2020; originally announced May 2020.

    Journal ref: AAMAS 2020

  9. arXiv:1602.07860  [pdf, other]

    cs.AI cs.LG stat.ML

    Probably Approximately Correct Greedy Maximization with Efficient Bounds on Information Gain for Sensor Selection

    Authors: Yash Satsangi, Shimon Whiteson, Frans A. Oliehoek

    Abstract: Submodular function maximization finds application in a variety of real-world decision-making problems. However, most existing methods, based on greedy maximization, assume it is computationally feasible to evaluate F, the function being maximized. Unfortunately, in many realistic settings F is too expensive to evaluate exactly even once. We present probably approximately correct greedy maximizati…

    Submitted 10 August, 2020; v1 submitted 25 February, 2016; originally announced February 2016.