Showing 1–3 of 3 results for author: Ghosal, G R

  1. arXiv:2403.06003  [pdf, other]

    cs.RO cs.AI cs.LG

    A Generalized Acquisition Function for Preference-based Reward Learning

    Authors: Evan Ellis, Gaurav R. Ghosal, Stuart J. Russell, Anca Dragan, Erdem Bıyık

    Abstract: Preference-based reward learning is a popular technique for teaching robots and autonomous systems how a human user wants them to perform a task. Previous works have shown that actively synthesizing preference queries to maximize information gain about the reward function parameters improves data efficiency. The information gain criterion focuses on precisely identifying all parameters of the rewa…

    Submitted 9 March, 2024; originally announced March 2024.

  2. arXiv:2208.10687  [pdf, other]

    cs.LG cs.AI

    The Effect of Modeling Human Rationality Level on Learning Rewards from Multiple Feedback Types

    Authors: Gaurav R. Ghosal, Matthew Zurek, Daniel S. Brown, Anca D. Dragan

    Abstract: When inferring reward functions from human behavior (be it demonstrations, comparisons, physical corrections, or e-stops), it has proven useful to model the human as making noisy-rational choices, with a "rationality coefficient" capturing how much noise or entropy we expect to see in the human behavior. Prior work typically sets the rationality level to a constant value, regardless of the type, o…

    Submitted 9 March, 2023; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: Published at AAAI 2023; 10 pages, 5 figures plus appendices

  3. arXiv:2106.09636  [pdf, other]

    cs.LG

    Multi-Modal Prototype Learning for Interpretable Multivariable Time Series Classification

    Authors: Gaurav R. Ghosal, Reza Abbasi-Asl

    Abstract: Multivariable time series classification problems are increasing in prevalence and complexity in a variety of domains, such as biology and finance. While deep learning methods are an effective tool for these problems, they often lack interpretability. In this work, we propose a novel modular prototype learning framework for multivariable time series classification. In the first stage of our framew…

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 14 pages, 6 figures