Skip to main content

Showing 1–7 of 7 results for author: Gervasio, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.12510  [pdf, other

    cs.LG

    Confidence Calibration for Systems with Cascaded Predictive Modules

    Authors: Yunye Gong, Yi Yao, Xiao Lin, Ajay Divakaran, Melinda Gervasio

    Abstract: Existing conformal prediction algorithms estimate prediction intervals at target confidence levels to characterize the performance of a regression model on new test samples. However, considering an autonomous system consisting of multiple modules, prediction intervals constructed for individual modules fall short of accommodating uncertainty propagation over different modules and thus cannot provi… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  2. arXiv:2307.08933  [pdf, other

    cs.AI cs.HC cs.LG

    IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit based on Analyses of Interestingness

    Authors: Pedro Sequeira, Melinda Gervasio

    Abstract: In recent years, advances in deep learning have resulted in a plethora of successes in the use of reinforcement learning (RL) to solve complex sequential decision tasks with high-dimensional inputs. However, existing systems lack the necessary mechanisms to provide humans with a holistic view of their competence, presenting an impediment to their adoption, particularly in critical applications whe… ▽ More

    Submitted 17 July, 2023; originally announced July 2023.

    Comments: To be published in the Proceedings of the 1st World Conference on eXplainable Artificial Intelligence (xAI 2023). arXiv admin note: substantial text overlap with arXiv:2211.06376

  3. arXiv:2211.06376  [pdf, other

    cs.AI cs.LG

    Global and Local Analysis of Interestingness for Competency-Aware Deep Reinforcement Learning

    Authors: Pedro Sequeira, Jesse Hostetler, Melinda Gervasio

    Abstract: In recent years, advances in deep learning have resulted in a plethora of successes in the use of reinforcement learning (RL) to solve complex sequential decision tasks with high-dimensional inputs. However, existing systems lack the necessary mechanisms to provide humans with a holistic view of their competence, presenting an impediment to their adoption, particularly in critical applications whe… ▽ More

    Submitted 11 November, 2022; originally announced November 2022.

    Comments: Appears in Proceedings of AAAI FSS-22 Symposium "Lessons Learned for Autonomous Assessment of Machine Abilities (LLAAMA)"

  4. arXiv:2208.08552  [pdf, other

    cs.AI cs.HC cs.LG cs.LO

    A Framework for Understanding and Visualizing Strategies of RL Agents

    Authors: Pedro Sequeira, Daniel Elenius, Jesse Hostetler, Melinda Gervasio

    Abstract: Recent years have seen significant advances in explainable AI as the need to understand deep learning models has gained importance with the increased emphasis on trust and ethics in AI. Comprehensible models for sequential decision tasks are a particular challenge as they require understanding not only individual predictions but a series of predictions that interact with environmental dynamics. We… ▽ More

    Submitted 17 August, 2022; originally announced August 2022.

  5. arXiv:2207.07710  [pdf, other

    cs.AI cs.LG

    Outcome-Guided Counterfactuals for Reinforcement Learning Agents from a Jointly Trained Generative Latent Space

    Authors: Eric Yeh, Pedro Sequeira, Jesse Hostetler, Melinda Gervasio

    Abstract: We present a novel generative method for producing unseen and plausible counterfactual examples for reinforcement learning (RL) agents based upon outcome variables that characterize agent behavior. Our approach uses a variational autoencoder to train a latent space that jointly encodes information about the observations and outcome variables pertaining to an agent's behavior. Counterfactuals are g… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

  6. arXiv:2104.00742  [pdf, other

    cs.LG cs.CV

    Confidence Calibration for Domain Generalization under Covariate Shift

    Authors: Yunye Gong, Xiao Lin, Yi Yao, Thomas G. Dietterich, Ajay Divakaran, Melinda Gervasio

    Abstract: Existing calibration algorithms address the problem of covariate shift via unsupervised domain adaptation. However, these methods suffer from the following limitations: 1) they require unlabeled data from the target domain, which may not be available at the stage of calibration in real-world applications and 2) their performance depends heavily on the disparity between the distributions of the sou… ▽ More

    Submitted 19 August, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 8958-8967

  7. arXiv:1912.09007  [pdf, other

    cs.LG cs.AI cs.HC stat.ML

    Interestingness Elements for Explainable Reinforcement Learning: Understanding Agents' Capabilities and Limitations

    Authors: Pedro Sequeira, Melinda Gervasio

    Abstract: We propose an explainable reinforcement learning (XRL) framework that analyzes an agent's history of interaction with the environment to extract interestingness elements that help explain its behavior. The framework relies on data readily available from standard RL algorithms, augmented with data that can easily be collected by the agent while learning. We describe how to create visual summaries o… ▽ More

    Submitted 18 August, 2020; v1 submitted 18 December, 2019; originally announced December 2019.

    Comments: To appear in: Artificial Intelligence