-
A Graphical Comparison of Screening Designs using Support Recovery Probabilities
Authors:
Kade Young,
Maria L. Weese,
Jonathan W. Stallrich,
Byran J. Smucker,
David J. Edwards
Abstract:
A screening experiment attempts to identify a subset of important effects using a relatively small number of experimental runs. Given the limited run size and a large number of possible effects, penalized regression is a popular tool used to analyze screening designs. In particular, an automated implementation of the Gauss-Dantzig selector has been widely recommended to compare screening design co…
▽ More
A screening experiment attempts to identify a subset of important effects using a relatively small number of experimental runs. Given the limited run size and a large number of possible effects, penalized regression is a popular tool used to analyze screening designs. In particular, an automated implementation of the Gauss-Dantzig selector has been widely recommended to compare screening design construction methods. Here, we illustrate potential reproducibility issues that arise when comparing screening designs via simulation, and recommend a graphical method, based on screening probabilities, which compares designs by evaluating them along the penalized regression solution path. This method can be implemented using simulation, or, in the case of lasso, by using exact local lasso sign recovery probabilities. Our approach circumvents the need to specify tuning parameters associated with regularization methods, leading to more reliable design comparisons. This article contains supplementary materials including code to implement the proposed methods.
△ Less
Submitted 21 November, 2023;
originally announced November 2023.
-
Optimal simulation-based Bayesian decisions
Authors:
Justin Alsing,
Thomas D. P. Edwards,
Benjamin Wandelt
Abstract:
We present a framework for the efficient computation of optimal Bayesian decisions under intractable likelihoods, by learning a surrogate model for the expected utility (or its distribution) as a function of the action and data spaces. We leverage recent advances in simulation-based inference and Bayesian optimization to develop active learning schemes to choose where in parameter and action space…
▽ More
We present a framework for the efficient computation of optimal Bayesian decisions under intractable likelihoods, by learning a surrogate model for the expected utility (or its distribution) as a function of the action and data spaces. We leverage recent advances in simulation-based inference and Bayesian optimization to develop active learning schemes to choose where in parameter and action spaces to simulate. This allows us to learn the optimal action in as few simulations as possible. The resulting framework is extremely simulation efficient, typically requiring fewer model calls than the associated posterior inference task alone, and a factor of $100-1000$ more efficient than Monte-Carlo based methods. Our framework opens up new capabilities for performing Bayesian decision making, particularly in the previously challenging regime where likelihoods are intractable, and simulations expensive.
△ Less
Submitted 9 November, 2023;
originally announced November 2023.
-
Graphing methods for Kendall's τ
Authors:
Nicholas D. Edwards,
Enzo de Jong,
Stephen T. Ferguson
Abstract:
Ranked data is commonly used in research across many fields of study including medicine, biology, psychology, and economics. One common statistic used for analyzing ranked data is Kendall's τ coefficient, a non-parametric measure of rank correlation which describes the strength of the association between two monotonic continuous or ordinal variables. While the mathematics involved in calculating K…
▽ More
Ranked data is commonly used in research across many fields of study including medicine, biology, psychology, and economics. One common statistic used for analyzing ranked data is Kendall's τ coefficient, a non-parametric measure of rank correlation which describes the strength of the association between two monotonic continuous or ordinal variables. While the mathematics involved in calculating Kendall's τ is well-established, there are relatively few graphing methods available to visualize the results. Here, we describe a visualization method and provide an interactive app for graphing Kendall's τ which uses a series of rigid Euclidean transformations along a Cartesian plane to map rank pairs into discrete quadrants. The resulting graph provides a visualization of rank correlation which helps display the proportion of concordant and discordant pairs. Moreover, this method highlights other key features of the data which are not represented by Kendall's τ alone but may nevertheless be meaningful, such as the relationship between discrete pairs of observations. We demonstrate the effectiveness of our approach through several examples and compare our results to other visualization methods.
△ Less
Submitted 1 September, 2023; v1 submitted 16 August, 2023;
originally announced August 2023.
-
Conjecturing-Based Discovery of Patterns in Data
Authors:
J. P. Brooks,
D. J. Edwards,
C. E. Larson,
N. Van Cleemput
Abstract:
We propose the use of a conjecturing machine that suggests feature relationships in the form of bounds involving nonlinear terms for numerical features and boolean expressions for categorical features. The proposed Conjecturing framework recovers known nonlinear and boolean relationships among features from data. In both settings, true underlying relationships are revealed. We then compare the met…
▽ More
We propose the use of a conjecturing machine that suggests feature relationships in the form of bounds involving nonlinear terms for numerical features and boolean expressions for categorical features. The proposed Conjecturing framework recovers known nonlinear and boolean relationships among features from data. In both settings, true underlying relationships are revealed. We then compare the method to a previously-proposed framework for symbolic regression on the ability to recover equations that are satisfied among features in a dataset. The framework is then applied to patient-level data regarding COVID-19 outcomes to suggest possible risk factors that are confirmed in the medical literature.
△ Less
Submitted 14 July, 2023; v1 submitted 23 November, 2020;
originally announced November 2020.
-
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
Authors:
Ashley D. Edwards,
Himanshu Sahni,
Rosanne Liu,
Jane Hung,
Ankit Jain,
Rui Wang,
Adrien Ecoffet,
Thomas Miconi,
Charles Isbell,
Jason Yosinski
Abstract:
In this paper, we introduce a novel form of value function, $Q(s, s')$, that expresses the utility of transitioning from a state $s$ to a neighboring state $s'$ and then acting optimally thereafter. In order to derive an optimal policy, we develop a forward dynamics model that learns to make next-state predictions that maximize this value. This formulation decouples actions from values while still…
▽ More
In this paper, we introduce a novel form of value function, $Q(s, s')$, that expresses the utility of transitioning from a state $s$ to a neighboring state $s'$ and then acting optimally thereafter. In order to derive an optimal policy, we develop a forward dynamics model that learns to make next-state predictions that maximize this value. This formulation decouples actions from values while still learning off-policy. We highlight the benefits of this approach in terms of value function transfer, learning within redundant action spaces, and learning off-policy from state observations generated by sub-optimal or completely random policies. Code and videos are available at http://sites.google.com/view/qss-paper.
△ Less
Submitted 25 August, 2020; v1 submitted 21 February, 2020;
originally announced February 2020.
-
Monotonic Nonparametric Dose Response Model
Authors:
Faten S. Alamri,
Edward L. Boone,
David J. Edwards
Abstract:
Toxicologists are often concerned with determining the dosage to which an individual can be exposed with an acceptable risk of adverse effect. These types of studies have been conducted widely in the past, and many novel approaches have been developed. Parametric techniques utilizing ANOVA and nonlinear regression models are well represented in the literature. The biggest drawback of parametric ap…
▽ More
Toxicologists are often concerned with determining the dosage to which an individual can be exposed with an acceptable risk of adverse effect. These types of studies have been conducted widely in the past, and many novel approaches have been developed. Parametric techniques utilizing ANOVA and nonlinear regression models are well represented in the literature. The biggest drawback of parametric approaches is the need to specify the correct model. Recently, there has been an interest in nonparametric approaches to tolerable dosage estimation. In this work, we focus on the monotonically decreasing dose response model where the response is a percent to control. This poses two constraints to the nonparametric approach. The doseresponse function must be one at control (dose = 0), and the function must always be positive. Here we propose a Bayesian solution to this problem using a novel class of nonparametric models. A basis function developed in this research is the Alamri Monotonic spline (AM-spline). Our approach is illustrated using both simulated data and an experimental dataset from pesticide related research at the US Environmental Protection Agency.
△ Less
Submitted 11 November, 2019; v1 submitted 30 September, 2019;
originally announced October 2019.
-
Perceptual Values from Observation
Authors:
Ashley D. Edwards,
Charles L. Isbell
Abstract:
Imitation by observation is an approach for learning from expert demonstrations that lack action information, such as videos. Recent approaches to this problem can be placed into two broad categories: training dynamics models that aim to predict the actions taken between states, and learning rewards or features for computing them for Reinforcement Learning (RL). In this paper, we introduce a novel…
▽ More
Imitation by observation is an approach for learning from expert demonstrations that lack action information, such as videos. Recent approaches to this problem can be placed into two broad categories: training dynamics models that aim to predict the actions taken between states, and learning rewards or features for computing them for Reinforcement Learning (RL). In this paper, we introduce a novel approach that learns values, rather than rewards, directly from observations. We show that by using values, we can significantly speed up RL by removing the need to bootstrap action-values, as compared to sparse-reward specifications.
△ Less
Submitted 19 May, 2019;
originally announced May 2019.
-
Imitating Latent Policies from Observation
Authors:
Ashley D. Edwards,
Himanshu Sahni,
Yannick Schroecker,
Charles L. Isbell
Abstract:
In this paper, we describe a novel approach to imitation learning that infers latent policies directly from state observations. We introduce a method that characterizes the causal effects of latent actions on observations while simultaneously predicting their likelihood. We then outline an action alignment procedure that leverages a small amount of environment interactions to determine a map** b…
▽ More
In this paper, we describe a novel approach to imitation learning that infers latent policies directly from state observations. We introduce a method that characterizes the causal effects of latent actions on observations while simultaneously predicting their likelihood. We then outline an action alignment procedure that leverages a small amount of environment interactions to determine a map** between the latent and real-world actions. We show that this corrected labeling can be used for imitating the observed behavior, even though no expert actions are given. We evaluate our approach within classic control environments and a platform game and demonstrate that it performs better than standard approaches. Code for this work is available at https://github.com/ashedwards/ILPO.
△ Less
Submitted 13 May, 2019; v1 submitted 21 May, 2018;
originally announced May 2018.
-
Forward-Backward Reinforcement Learning
Authors:
Ashley D. Edwards,
Laura Downs,
James C. Davidson
Abstract:
Goals for reinforcement learning problems are typically defined through hand-specified rewards. To design such problems, developers of learning algorithms must inherently be aware of what the task goals are, yet we often require agents to discover them on their own without any supervision beyond these sparse rewards. While much of the power of reinforcement learning derives from the concept that a…
▽ More
Goals for reinforcement learning problems are typically defined through hand-specified rewards. To design such problems, developers of learning algorithms must inherently be aware of what the task goals are, yet we often require agents to discover them on their own without any supervision beyond these sparse rewards. While much of the power of reinforcement learning derives from the concept that agents can learn with little guidance, this requirement greatly burdens the training process. If we relax this one restriction and endow the agent with knowledge of the reward function, and in particular of the goal, we can leverage backwards induction to accelerate training. To achieve this, we propose training a model to learn to take imagined reversal steps from known goal states. Rather than training an agent exclusively to determine how to reach a goal while moving forwards in time, our approach travels backwards to jointly predict how we got there. We evaluate our work in Gridworld and Towers of Hanoi and empirically demonstrate that it yields better performance than standard DDQN.
△ Less
Submitted 27 March, 2018;
originally announced March 2018.
-
Modeling goal chances in soccer: a Bayesian inference approach
Authors:
Gavin A. Whitaker,
Ricardo Silva,
Daniel Edwards
Abstract:
We consider the task of determining the number of chances a soccer team creates, along with the composite nature of each chance-the players involved and the locations on the pitch of the assist and the chance. We propose an interpretable Bayesian inference approach and implement a Poisson model to capture chance occurrences, from which we infer team abilities. We then use a Gaussian mixture model…
▽ More
We consider the task of determining the number of chances a soccer team creates, along with the composite nature of each chance-the players involved and the locations on the pitch of the assist and the chance. We propose an interpretable Bayesian inference approach and implement a Poisson model to capture chance occurrences, from which we infer team abilities. We then use a Gaussian mixture model to capture the areas on the pitch a player makes an assist/takes a chance. This approach allows the visualization of differences between players in the way they approach attacking play (making assists/taking chances). We apply the resulting scheme to the 2016/2017 English Premier League, capturing team abilities to create chances, before highlighting key areas where players have most impact.
△ Less
Submitted 23 February, 2018;
originally announced February 2018.
-
Transferring Agent Behaviors from Videos via Motion GANs
Authors:
Ashley D. Edwards,
Charles L. Isbell Jr
Abstract:
A major bottleneck for develo** general reinforcement learning agents is determining rewards that will yield desirable behaviors under various circumstances. We introduce a general mechanism for automatically specifying meaningful behaviors from raw pixels. In particular, we train a generative adversarial network to produce short sub-goals represented through motion templates. We demonstrate tha…
▽ More
A major bottleneck for develo** general reinforcement learning agents is determining rewards that will yield desirable behaviors under various circumstances. We introduce a general mechanism for automatically specifying meaningful behaviors from raw pixels. In particular, we train a generative adversarial network to produce short sub-goals represented through motion templates. We demonstrate that this approach generates visually meaningful behaviors in unknown environments with novel agents and describe how these motions can be used to train reinforcement learning agents.
△ Less
Submitted 21 November, 2017;
originally announced November 2017.
-
A Bayesian inference approach for determining player abilities in football
Authors:
Gavin A. Whitaker,
Ricardo Silva,
Daniel Edwards,
Ioannis Kosmidis
Abstract:
We consider the task of determining a football player's ability for a given event type, for example, scoring a goal. We propose an interpretable Bayesian model which is fit using variational inference methods. We implement a Poisson model to capture occurrences of event types, from which we infer player abilities. Our approach also allows the visualisation of differences between players, for a spe…
▽ More
We consider the task of determining a football player's ability for a given event type, for example, scoring a goal. We propose an interpretable Bayesian model which is fit using variational inference methods. We implement a Poisson model to capture occurrences of event types, from which we infer player abilities. Our approach also allows the visualisation of differences between players, for a specific ability, through the marginal posterior variational densities. We then use these inferred player abilities to extend the Bayesian hierarchical model of Baio and Blangiardo (2010) which captures a team's scoring rate (the rate at which they score goals). We apply the resulting scheme to the English Premier League, capturing player abilities over the 2013/2014 season, before using output from the hierarchical model to predict whether over or under 2.5 goals will be scored in a given game in the 2014/2015 season. This validates our model as a way of providing insights into team formation and the individual success of sports teams.
△ Less
Submitted 23 September, 2020; v1 submitted 25 September, 2017;
originally announced October 2017.
-
Characterising brain network topologies: a dynamic analysis approach using heat kernels
Authors:
A. W. Chung,
M. D. Schirmer,
M. L. Krishna,
G. Ball,
P. Aljabar,
A. D. Edwards,
G. Montana
Abstract:
Network theory provides a principled abstraction of the human brain: reducing a complex system into a simpler representation from which to investigate brain organisation. Recent advancement in the neuroimaging field are towards representing brain connectivity as a dynamic process in order to gain a deeper understanding of how the brain is organised for information transport. In this paper we propo…
▽ More
Network theory provides a principled abstraction of the human brain: reducing a complex system into a simpler representation from which to investigate brain organisation. Recent advancement in the neuroimaging field are towards representing brain connectivity as a dynamic process in order to gain a deeper understanding of how the brain is organised for information transport. In this paper we propose a network modelling approach based on the heat kernel to capture the process of heat diffusion in complex networks. By applying the heat kernel to structural brain networks, we define new features which quantify change in energy flow. Identifying suitable features which can classify networks between cohorts is useful towards understanding the effect of disease on brain architecture. We demonstrate the discriminative power of heat kernel features in both synthetic and clinical preterm data. By generating an extensive range of synthetic networks with varying density and randomisation, we investigate how heat flows in the networks in relation to changes in network topology. We demonstrate that our proposed features provide a metric of network efficiency and may be indicative of organisational principles commonly associated with, for example, small-world architecture. In addition, we show the potential of these features to characterise and classify between network topologies. We further demonstrate our methodology in a clinical setting by applying it to a large cohort of preterm babies scanned at term equivalent age from which diffusion networks were computed. We show that our heat kernel features are able to successfully predict motor function measured at two years of age (sensitivity, specificity, F-score, accuracy = 75.0, 82.5, 78.6, 82.3%, respectively.
△ Less
Submitted 22 March, 2016;
originally announced March 2016.
-
High-dimensional Graphical Model Search with gRapHD R Package
Authors:
Gabriel C. G. de Abreu,
Rodrigo Labouriau,
David Edwards
Abstract:
This paper presents the R package gRapHD for efficient selection of high-dimensional undirected graphical models. The package provides tools for selecting trees, forests and decomposable models minimizing information criteria such as AIC or BIC, and for displaying the independence graphs of the models. It has also some useful tools for analysing graphical structures. It supports the use of discret…
▽ More
This paper presents the R package gRapHD for efficient selection of high-dimensional undirected graphical models. The package provides tools for selecting trees, forests and decomposable models minimizing information criteria such as AIC or BIC, and for displaying the independence graphs of the models. It has also some useful tools for analysing graphical structures. It supports the use of discrete, continuous, or both types of variables simultaneously.
△ Less
Submitted 22 September, 2010; v1 submitted 7 September, 2009;
originally announced September 2009.
-
Interpolating fields of carbon monoxide data using a hybrid statistical-physical model
Authors:
Anders Malmberg,
Avelino Arellano,
David P. Edwards,
Natasha Flyer,
Doug Nychka,
Christopher Wikle
Abstract:
Atmospheric Carbon Monoxide (CO) provides a window on the chemistry of the atmosphere since it is one of few chemical constituents that can be remotely sensed, and it can be used to determine budgets of other greenhouse gases such as ozone and OH radicals. Remote sensing platforms in geostationary Earth orbit will soon provide regional observations of CO at several vertical layers with high spat…
▽ More
Atmospheric Carbon Monoxide (CO) provides a window on the chemistry of the atmosphere since it is one of few chemical constituents that can be remotely sensed, and it can be used to determine budgets of other greenhouse gases such as ozone and OH radicals. Remote sensing platforms in geostationary Earth orbit will soon provide regional observations of CO at several vertical layers with high spatial and temporal resolution. However, cloudy locations cannot be observed and estimates of the complete CO concentration fields have to be estimated based on the cloud-free observations. The current state-of-the-art solution of this interpolation problem is to combine cloud-free observations with prior information, computed by a deterministic physical model, which might introduce uncertainties that do not derive from data. While sharing features with the physical model, this paper suggests a Bayesian hierarchical model to estimate the complete CO concentration fields. The paper also provides a direct comparison to state-of-the-art methods. To our knowledge, such a model and comparison have not been considered before.
△ Less
Submitted 23 January, 2009;
originally announced January 2009.