Skip to main content

Showing 1–29 of 29 results for author: Dietterich, T G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2303.12957  [pdf, other

    cs.LG cs.AI stat.ML

    Reinforcement Learning with Exogenous States and Rewards

    Authors: George Trimponias, Thomas G. Dietterich

    Abstract: Exogenous state variables and rewards can slow reinforcement learning by injecting uncontrolled variation into the reward signal. This paper formalizes exogenous state variables and rewards and shows that if the reward function decomposes additively into endogenous and exogenous components, the MDP can be decomposed into an exogenous Markov Reward Process (based on the exogenous reward) and an end… ▽ More

    Submitted 22 March, 2023; originally announced March 2023.

    Comments: Greatly extends the initial work reported in 1806.01584

  2. arXiv:2211.16462  [pdf, other

    cs.LG stat.ME stat.ML

    Will My Robot Achieve My Goals? Predicting the Probability that an MDP Policy Reaches a User-Specified Behavior Target

    Authors: Alexander Guyer, Thomas G. Dietterich

    Abstract: As an autonomous system performs a task, it should maintain a calibrated estimate of the probability that it will achieve the user's goal. If that probability falls below some desired level, it should alert the user so that appropriate interventions can be made. This paper considers settings where the user's goal is specified as a target interval for a real-valued performance summary, such as the… ▽ More

    Submitted 2 April, 2024; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: 12 pages, 4 figures. Appears in Proceedings of AAAI FSS-22 Symposium "Lessons Learned for Autonomous Assessment of Machine Abilities (LLAAMA)" The original submission had an error in theorem 2. Moreover, the stated guarantees for PCQR^{-1} were incorrect. This revision states the correct guarantees and corresponding theorem

  3. arXiv:2206.04860  [pdf, other

    cs.LG stat.ML

    Conformal Prediction Intervals for Markov Decision Process Trajectories

    Authors: Thomas G. Dietterich, Jesse Hostetler

    Abstract: Before delegating a task to an autonomous system, a human operator may want a guarantee about the behavior of the system. This paper extends previous work on conformal prediction for functional data and conformalized quantile regression to provide conformal prediction intervals over the future behavior of an autonomous system executing a fixed control policy on a Markov Decision Process (MDP). The… ▽ More

    Submitted 21 June, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 25 pages, 15 figures, 2 tables. Fixed typos and an error in the appendix plots

  4. The Familiarity Hypothesis: Explaining the Behavior of Deep Open Set Methods

    Authors: Thomas G. Dietterich, Alexander Guyer

    Abstract: In many object recognition applications, the set of possible categories is an open set, and the deployed recognition system will encounter novel objects belonging to categories unseen during training. Detecting such "novel category" objects is usually formulated as an anomaly detection problem. Anomaly detection algorithms for feature-vector data identify anomalies as outliers, but outlier detecti… ▽ More

    Submitted 27 July, 2022; v1 submitted 4 March, 2022; originally announced March 2022.

    Comments: Accepted for publication in Pattern Recognition. This version corrects minor typos

  5. arXiv:2202.01840  [pdf, other

    cs.LG

    Hidden Heterogeneity: When to Choose Similarity-Based Calibration

    Authors: Kiri L. Wagstaff, Thomas G. Dietterich

    Abstract: Trustworthy classifiers are essential to the adoption of machine learning predictions in many real-world settings. The predicted probability of possible outcomes can inform high-stakes decision making, particularly when assessing the expected value of alternative decisions or the risk of bad outcomes. These decisions require well-calibrated probabilities, not just the correct prediction of the mos… ▽ More

    Submitted 20 February, 2023; v1 submitted 3 February, 2022; originally announced February 2022.

    Comments: 22 pages, 8 figures

    ACM Class: I.2.6

    Journal ref: Transactions on Machine Learning Research, January 2023

  6. arXiv:2105.00137  [pdf, other

    cs.LG

    Deep Convolution for Irregularly Sampled Temporal Point Clouds

    Authors: Erich Merrill, Stefan Lee, Li Fuxin, Thomas G. Dietterich, Alan Fern

    Abstract: We consider the problem of modeling the dynamics of continuous spatial-temporal processes represented by irregular samples through both space and time. Such processes occur in sensor networks, citizen science, multi-robot systems, and many others. We propose a new deep model that is able to directly learn and predict over this irregularly sampled data, without voxelization, by leveraging a recent… ▽ More

    Submitted 30 April, 2021; originally announced May 2021.

    Comments: 12 pages, submitted to ICLR 2021

  7. arXiv:2104.00742  [pdf, other

    cs.LG cs.CV

    Confidence Calibration for Domain Generalization under Covariate Shift

    Authors: Yunye Gong, Xiao Lin, Yi Yao, Thomas G. Dietterich, Ajay Divakaran, Melinda Gervasio

    Abstract: Existing calibration algorithms address the problem of covariate shift via unsupervised domain adaptation. However, these methods suffer from the following limitations: 1) they require unlabeled data from the target domain, which may not be available at the stage of calibration in real-world applications and 2) their performance depends heavily on the disparity between the distributions of the sou… ▽ More

    Submitted 19 August, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2021, pp. 8958-8967

  8. arXiv:2101.00074  [pdf, other

    stat.ME cs.LG

    Three-quarter Sibling Regression for Denoising Observational Data

    Authors: Shiv Shankar, Daniel Sheldon, Tao Sun, John Pickering, Thomas G. Dietterich

    Abstract: Many ecological studies and conservation policies are based on field observations of species, which can be affected by systematic variability introduced by the observation process. A recently introduced causal modeling technique called 'half-sibling regression' can detect and correct for systematic errors in measurements of multiple independent random variables. However, it will remove intrinsic v… ▽ More

    Submitted 31 December, 2020; originally announced January 2021.

    Journal ref: IJCAI 2019

  9. arXiv:2009.11732  [pdf, other

    cs.LG cs.AI stat.ML

    A Unifying Review of Deep and Shallow Anomaly Detection

    Authors: Lukas Ruff, Jacob R. Kauffmann, Robert A. Vandermeulen, Grégoire Montavon, Wojciech Samek, Marius Kloft, Thomas G. Dietterich, Klaus-Robert Müller

    Abstract: Deep learning approaches to anomaly detection have recently improved the state of the art in detection performance on complex datasets such as large collections of images or text. These results have sparked a renewed interest in the anomaly detection problem and led to the introduction of a great variety of new methods. With the emergence of numerous such methods, including approaches based on gen… ▽ More

    Submitted 8 February, 2021; v1 submitted 24 September, 2020; originally announced September 2020.

    Comments: 40 pages; accepted for publication in the Proceedings of the IEEE;

    Journal ref: Proceedings of the IEEE (2021) 1-40

  10. arXiv:1811.10840  [pdf, ps, other

    cs.AI cs.CY

    Robust Artificial Intelligence and Robust Human Organizations

    Authors: Thomas G. Dietterich

    Abstract: Every AI system is deployed by a human organization. In high risk applications, the combined human plus AI system must function as a high-reliability organization in order to avoid catastrophic errors. This short note reviews the properties of high-reliability organizations and draws implications for the development of AI technology and the safe application of that technology.

    Submitted 27 November, 2018; originally announced November 2018.

    Comments: To appear as a Perspective in Frontiers in Computer Science

  11. arXiv:1809.03680  [pdf, other

    cs.CL

    Learning Scripts as Hidden Markov Models

    Authors: J. Walker Orr, Prasad Tadepalli, Janardhan Rao Doppa, Xiaoli Fern, Thomas G. Dietterich

    Abstract: Scripts have been proposed to model the stereotypical event sequences found in narratives. They can be applied to make a variety of inferences including filling gaps in the narratives and resolving ambiguous references. This paper proposes the first formal framework for scripts based on Hidden Markov Models (HMMs). Our framework supports robust inference and learning algorithms, which are lacking… ▽ More

    Submitted 11 September, 2018; originally announced September 2018.

    Comments: 7 pages, AAAI 2014

  12. arXiv:1809.01605  [pdf, other

    cs.LG stat.ML

    Anomaly Detection in the Presence of Missing Values

    Authors: Thomas G. Dietterich, Tadesse Zemicheal

    Abstract: Standard methods for anomaly detection assume that all features are observed at both learning time and prediction time. Such methods cannot process data containing missing values. This paper studies five strategies for handling missing values in test queries: (a) mean imputation, (b) MAP imputation, (c) reduction (reduced-dimension anomaly detectors via feature bagging), (d) marginalization (for d… ▽ More

    Submitted 5 September, 2018; originally announced September 2018.

  13. arXiv:1808.00529  [pdf, other

    cs.LG stat.ML

    Open Category Detection with PAC Guarantees

    Authors: Si Liu, Risheek Garrepalli, Thomas G. Dietterich, Alan Fern, Dan Hendrycks

    Abstract: Open category detection is the problem of detecting "alien" test instances that belong to categories or classes that were not present in the training data. In many applications, reliably detecting such aliens is central to ensuring the safety and accuracy of test set predictions. Unfortunately, there are no algorithms that provide theoretical guarantees on their ability to detect aliens under gene… ▽ More

    Submitted 1 August, 2018; originally announced August 2018.

  14. arXiv:1807.01697  [pdf, other

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Benchmarking Neural Network Robustness to Common Corruptions and Surface Variations

    Authors: Dan Hendrycks, Thomas G. Dietterich

    Abstract: In this paper we establish rigorous benchmarks for image classifier robustness. Our first benchmark, ImageNet-C, standardizes and expands the corruption robustness topic, while showing which classifiers are preferable in safety-critical applications. Unlike recent robustness research, this benchmark evaluates performance on commonplace corruptions not worst-case adversarial corruptions. We find th… ▽ More

    Submitted 27 April, 2019; v1 submitted 4 July, 2018; originally announced July 2018.

    Comments: Superseded by _Benchmarking Neural Network Robustness to Common Corruptions and Perturbations_ arXiv:1903.12261

  15. arXiv:1806.01584  [pdf, other

    cs.LG stat.ML

    Discovering and Removing Exogenous State Variables and Rewards for Reinforcement Learning

    Authors: Thomas G. Dietterich, George Trimponias, Zhitang Chen

    Abstract: Exogenous state variables and rewards can slow down reinforcement learning by injecting uncontrolled variation into the reward signal. We formalize exogenous state variables and rewards and identify conditions under which an MDP with exogenous state can be decomposed into an exogenous Markov Reward Process involving only the exogenous state+reward and an endogenous Markov Decision Process defined… ▽ More

    Submitted 5 June, 2018; originally announced June 2018.

    Comments: To appear at ICML 2018

  16. arXiv:1708.09441  [pdf, other

    cs.LG cs.AI stat.ML

    Incorporating Feedback into Tree-based Anomaly Detection

    Authors: Shubhomoy Das, Weng-Keen Wong, Alan Fern, Thomas G. Dietterich, Md Amran Siddiqui

    Abstract: Anomaly detectors are often used to produce a ranked list of statistical anomalies, which are examined by human analysts in order to extract the actual anomalies of interest. Unfortunately, in realworld applications, this process can be exceedingly difficult for the analyst since a large fraction of high-ranking anomalies are false positives and not interesting from the application perspective. In… ▽ More

    Submitted 30 August, 2017; originally announced August 2017.

    Comments: 8 Pages, KDD 2017 Workshop on Interactive Data Exploration and Analytics (IDEA'17), August 14th, 2017, Halifax, Nova Scotia, Canada

    ACM Class: I.2.6; I.5.5

  17. arXiv:1703.09391  [pdf, other

    cs.LG stat.ML

    Fast Optimization of Wildfire Suppression Policies with SMAC

    Authors: Sean McGregor, Rachel Houtman, Claire Montgomery, Ronald Metoyer, Thomas G. Dietterich

    Abstract: Managers of US National Forests must decide what policy to apply for dealing with lightning-caused wildfires. Conflicts among stakeholders (e.g., timber companies, home owners, and wildlife biologists) have often led to spirited political debates and even violent eco-terrorism. One way to transform these conflicts into multi-stakeholder negotiations is to provide a high-fidelity simulation environ… ▽ More

    Submitted 27 March, 2017; originally announced March 2017.

  18. arXiv:1703.09390  [pdf, other

    cs.LG stat.ML

    Factoring Exogenous State for Model-Free Monte Carlo

    Authors: Sean McGregor, Rachel Houtman, Claire Montgomery, Ronald Metoyer, Thomas G. Dietterich

    Abstract: Policy analysts wish to visualize a range of policies for large simulator-defined Markov Decision Processes (MDPs). One visualization approach is to invoke the simulator to generate on-policy trajectories and then visualize those trajectories. When the simulator is expensive, this is not practical, and some method is required for generating trajectories for new policies without invoking the simula… ▽ More

    Submitted 3 November, 2017; v1 submitted 27 March, 2017; originally announced March 2017.

    Comments: 9 pages, 5 figures. Corrected equation 4

  19. arXiv:1510.05976  [pdf, other

    cs.LG

    Transductive Optimization of Top k Precision

    Authors: Li-** Liu, Thomas G. Dietterich, Nan Li, Zhi-Hua Zhou

    Abstract: Consider a binary classification problem in which the learner is given a labeled training set, an unlabeled test set, and is restricted to choosing exactly $k$ test points to output as positive predictions. Problems of this kind---{\it transductive precision@$k$}---arise in information retrieval, digital advertising, and reserve design for endangered species. Previous methods separate the training… ▽ More

    Submitted 20 October, 2015; originally announced October 2015.

  20. arXiv:1503.00038  [pdf, other

    cs.AI cs.LG stat.ML

    Sequential Feature Explanations for Anomaly Detection

    Authors: Md Amran Siddiqui, Alan Fern, Thomas G. Dietterich, Weng-Keen Wong

    Abstract: In many applications, an anomaly detection system presents the most anomalous data instance to a human analyst, who then must determine whether the instance is truly of interest (e.g. a threat in a security setting). Unfortunately, most anomaly detectors provide no explanation about why an instance was considered anomalous, leaving the analyst with no guidance about where to begin the investigatio… ▽ More

    Submitted 27 February, 2015; originally announced March 2015.

    Comments: 9 pages, 4 figures and submitted to KDD 2015

  21. arXiv:1405.5156  [pdf, other

    cs.LG cs.AI stat.ML

    Gaussian Approximation of Collective Graphical Models

    Authors: Li-** Liu, Daniel Sheldon, Thomas G. Dietterich

    Abstract: The Collective Graphical Model (CGM) models a population of independent and identically distributed individuals when only collective statistics (i.e., counts of individuals) are observed. Exact inference in CGMs is intractable, and previous work has explored Markov Chain Monte Carlo (MCMC) and MAP approximations for learning and inference. This paper studies Gaussian approximations to the CGM. As… ▽ More

    Submitted 20 May, 2014; originally announced May 2014.

    Comments: Accepted by ICML 2014. 10 page version with appendix

  22. arXiv:1210.4880  [pdf

    cs.AI cs.GT cs.LG

    Inferring Strategies from Limited Reconnaissance in Real-time Strategy Games

    Authors: Jesse Hostetler, Ethan W. Dereszynski, Thomas G. Dietterich, Alan Fern

    Abstract: In typical real-time strategy (RTS) games, enemy units are visible only when they are within sight range of a friendly unit. Knowledge of an opponent's disposition is limited to what can be observed through scouting. Information is costly, since units dedicated to scouting are unavailable for other purposes, and the enemy will resist scouting attempts. It is important to infer as much as possible… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-367-376

  23. arXiv:1210.4876  [pdf

    cs.LG stat.ML

    Active Imitation Learning via Reduction to I.I.D. Active Learning

    Authors: Kshitij Judah, Alan Fern, Thomas G. Dietterich

    Abstract: In standard passive imitation learning, the goal is to learn a target policy by passively observing full execution trajectories of it. Unfortunately, generating such trajectories can require substantial expert effort and be impractical in some cases. In this paper, we consider active imitation learning with the goal of reducing this effort by querying the expert about the desired action at individ… ▽ More

    Submitted 16 October, 2012; originally announced October 2012.

    Comments: Appears in Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence (UAI2012)

    Report number: UAI-P-2012-PG-428-437

  24. arXiv:1207.1364  [pdf

    cs.LG stat.ML

    Learning from Sparse Data by Exploiting Monotonicity Constraints

    Authors: Eric E. Altendorf, Angelo C. Restificar, Thomas G. Dietterich

    Abstract: When training data is sparse, more domain knowledge must be incorporated into the learning algorithm in order to reduce the effective size of the hypothesis space. This paper builds on previous work in which knowledge about qualitative monotonicities was formally represented and incorporated into learning algorithms (e.g., Clark & Matwin's work with the CN2 rule learning algorithm). We show how to… ▽ More

    Submitted 4 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

    Report number: UAI-P-2005-PG-18-26

  25. arXiv:1206.5250  [pdf

    cs.AI stat.AP

    Probabilistic Models for Anomaly Detection in Remote Sensor Data Streams

    Authors: Ethan W. Dereszynski, Thomas G. Dietterich

    Abstract: Remote sensors are becoming the standard for observing and recording ecological data in the field. Such sensors can record data at fine temporal resolutions, and they can operate under extreme conditions prohibitive to human access. Unfortunately, sensor data streams exhibit many kinds of errors ranging from corrupt communications to partial or total sensor failures. This means that the raw data s… ▽ More

    Submitted 20 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence (UAI2007)

    Report number: UAI-P-2007-PG-75-82

  26. Integrating Learning from Examples into the Search for Diagnostic Policies

    Authors: V. Bayer-Zubek, T. G. Dietterich

    Abstract: This paper studies the problem of learning diagnostic policies from training examples. A diagnostic policy is a complete description of the decision-making actions of a diagnostician (i.e., tests followed by a diagnostic decision) for all possible combinations of test results. An optimal diagnostic policy is one that minimizes the expected total cost, which is the sum of measurement costs and mis… ▽ More

    Submitted 9 September, 2011; originally announced September 2011.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 24, pages 263-303, 2005

  27. arXiv:cs/9905015  [pdf, ps, other

    cs.LG

    State Abstraction in MAXQ Hierarchical Reinforcement Learning

    Authors: Thomas G. Dietterich

    Abstract: Many researchers have explored methods for hierarchical reinforcement learning (RL) with temporal abstractions, in which abstract actions are defined that can perform many primitive actions before terminating. However, little is known about learning with state abstractions, in which aspects of the state space are ignored. In previous work, we developed the MAXQ method for hierarchical RL. In thi… ▽ More

    Submitted 21 May, 1999; originally announced May 1999.

    Comments: 7 pages, 2 figures

    ACM Class: I.2.6

  28. arXiv:cs/9905014  [pdf, ps, other

    cs.LG

    Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition

    Authors: Thomas G. Dietterich

    Abstract: This paper presents the MAXQ approach to hierarchical reinforcement learning based on decomposing the target Markov decision process (MDP) into a hierarchy of smaller MDPs and decomposing the value function of the target MDP into an additive combination of the value functions of the smaller MDPs. The paper defines the MAXQ hierarchy, proves formal results on its representational power, and estab… ▽ More

    Submitted 21 May, 1999; originally announced May 1999.

    Comments: 63 pages, 15 figures

    ACM Class: I.2.6

  29. arXiv:cs/9501101  [pdf, ps

    cs.AI

    Solving Multiclass Learning Problems via Error-Correcting Output Codes

    Authors: T. G. Dietterich, G. Bakiri

    Abstract: Multiclass learning problems involve finding a definition for an unknown function f(x) whose range is a discrete set containing k &gt 2 values (i.e., k ``classes''). The definition is acquired by studying collections of training examples of the form [x_i, f (x_i)]. Existing approaches to multiclass learning problems include direct application of multiclass algorithms such as the decision-tree al… ▽ More

    Submitted 31 December, 1994; originally announced January 1995.

    Comments: See http://www.jair.org/ for any accompanying files

    Journal ref: Journal of Artificial Intelligence Research, Vol 2, (1995), 263-286