-
A Graphical Comparison of Screening Designs using Support Recovery Probabilities
Authors:
Kade Young,
Maria L. Weese,
Jonathan W. Stallrich,
Byran J. Smucker,
David J. Edwards
Abstract:
A screening experiment attempts to identify a subset of important effects using a relatively small number of experimental runs. Given the limited run size and a large number of possible effects, penalized regression is a popular tool used to analyze screening designs. In particular, an automated implementation of the Gauss-Dantzig selector has been widely recommended to compare screening design construction methods. Here, we illustrate potential reproducibility issues that arise when comparing screening designs via simulation, and recommend a graphical method, based on screening probabilities, which compares designs by evaluating them along the penalized regression solution path. This method can be implemented using simulation, or, in the case of lasso, by using exact local lasso sign recovery probabilities. Our approach circumvents the need to specify tuning parameters associated with regularization methods, leading to more reliable design comparisons. This article contains supplementary materials including code to implement the proposed methods.
Submitted 21 November, 2023;
originally announced November 2023.
-
Conjecturing-Based Discovery of Patterns in Data
Authors:
J. P. Brooks,
D. J. Edwards,
C. E. Larson,
N. Van Cleemput
Abstract:
We propose the use of a conjecturing machine that suggests feature relationships in the form of bounds involving nonlinear terms for numerical features and boolean expressions for categorical features. The proposed Conjecturing framework recovers known nonlinear and boolean relationships among features from data. In both settings, true underlying relationships are revealed. We then compare the method to a previously-proposed framework for symbolic regression on the ability to recover equations that are satisfied among features in a dataset. The framework is then applied to patient-level data regarding COVID-19 outcomes to suggest possible risk factors that are confirmed in the medical literature.
Submitted 14 July, 2023; v1 submitted 23 November, 2020;
originally announced November 2020.
-
Monotonic Nonparametric Dose Response Model
Authors:
Faten S. Alamri,
Edward L. Boone,
David J. Edwards
Abstract:
Toxicologists are often concerned with determining the dosage to which an individual can be exposed with an acceptable risk of adverse effect. These types of studies have been conducted widely in the past, and many novel approaches have been developed. Parametric techniques utilizing ANOVA and nonlinear regression models are well represented in the literature. The biggest drawback of parametric approaches is the need to specify the correct model. Recently, there has been an interest in nonparametric approaches to tolerable dosage estimation. In this work, we focus on the monotonically decreasing dose-response model where the response is a percent to control. This imposes two constraints on the nonparametric approach: the dose-response function must equal one at control (dose = 0), and the function must always be positive. Here we propose a Bayesian solution to this problem using a novel class of nonparametric models. A basis function developed in this research is the Alamri Monotonic spline (AM-spline). Our approach is illustrated using both simulated data and an experimental dataset from pesticide-related research at the US Environmental Protection Agency.
Submitted 11 November, 2019; v1 submitted 30 September, 2019;
originally announced October 2019.