Expert-Augmented Machine Learning
Authors:
E. D. Gennatas,
J. H. Friedman,
L. H. Ungar,
R. Pirracchio,
E. Eaton,
L. Reichman,
Y. Interian,
C. B. Simone,
A. Auerbach,
E. Delgado,
M. J. Van der Laan,
T. D. Solberg,
G. Valdes
Abstract:
Machine learning is proving invaluable across disciplines. However, its success is often limited by the quality and quantity of available data, while its adoption is limited by the level of trust that models afford users. Human vs. machine performance is commonly compared empirically to decide whether a certain task should be performed by a computer or an expert. In reality, the optimal learning strategy may involve combining the complementary strengths of man and machine. Here we present Expert-Augmented Machine Learning (EAML), an automated method that guides the extraction of expert knowledge and its integration into machine-learned models. We use a large dataset of intensive care patient data to predict mortality and show that we can extract expert knowledge using an online platform, help reveal hidden confounders, improve generalizability to a different population, and learn using less data. EAML presents a novel framework for high-performance, dependable machine learning in critical applications.
Submitted 5 January, 2021; v1 submitted 22 March, 2019;
originally announced March 2019.
Tree-Structured Boosting: Connections Between Gradient Boosted Stumps and Full Decision Trees
Authors:
José Marcio Luna,
Eric Eaton,
Lyle H. Ungar,
Eric Diffenderfer,
Shane T. Jensen,
Efstathios D. Gennatas,
Mateo Wirth,
Charles B. Simone II,
Timothy D. Solberg,
Gilmer Valdes
Abstract:
Additive models, such as those produced by gradient boosting, and full interaction models, such as classification and regression trees (CART), are widely used algorithms that have been investigated largely in isolation. We show that these models exist along a spectrum, revealing previously unknown connections between the two approaches. This paper introduces a novel technique called tree-structured boosting for creating a single decision tree, and shows that this method can produce models equivalent to CART or gradient boosted stumps at the extremes by varying a single parameter. Although tree-structured boosting is designed primarily to provide both the model interpretability and predictive performance needed for high-stakes applications like medicine, it can also produce decision trees that are hybrids between CART and boosted stumps and that can outperform either of these approaches.
Submitted 17 November, 2017;
originally announced November 2017.