Skip to main content

Showing 1–17 of 17 results for author: van der Laan, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.09847  [pdf, other

    stat.ML cs.CY cs.LG stat.ME

    Statistical learning for constrained functional parameters in infinite-dimensional models with applications in fair machine learning

    Authors: Razieh Nabi, Nima S. Hejazi, Mark J. van der Laan, David Benkeser

    Abstract: Constrained learning has become increasingly important, especially in the realm of algorithmic fairness and machine learning. In these settings, predictive models are developed specifically to satisfy pre-defined notions of fairness. Here, we study the general problem of constrained statistical machine learning through a statistical functional lens. We consider learning a function-valued parameter… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  2. arXiv:2404.04399  [pdf, other

    stat.ML cs.AI cs.LG stat.AP stat.ME

    Longitudinal Targeted Minimum Loss-based Estimation with Temporal-Difference Heterogeneous Transformer

    Authors: Toru Shirakawa, Yi Li, Yulun Wu, Sky Qiu, Yuxuan Li, Mingduo Zhao, Hiroyasu Iso, Mark van der Laan

    Abstract: We propose Deep Longitudinal Targeted Minimum Loss-based Estimation (Deep LTMLE), a novel approach to estimate the counterfactual mean of outcome under dynamic treatment policies in longitudinal problem settings. Our approach utilizes a transformer architecture with heterogeneous type embedding trained using temporal-difference learning. After obtaining an initial estimate using the transformer, f… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  3. arXiv:2308.14895  [pdf, other

    cs.LG

    Conformal Meta-learners for Predictive Inference of Individual Treatment Effects

    Authors: Ahmed Alaa, Zaid Ahmad, Mark van der Laan

    Abstract: We investigate the problem of machine learning-based (ML) predictive inference on individual treatment effects (ITEs). Previous work has focused primarily on develo** ML-based meta-learners that can provide point estimates of the conditional average treatment effect (CATE); these are model-agnostic approaches for combining intermediate nuisance estimates to produce estimates of CATE. In this pap… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  4. arXiv:2301.12029  [pdf, other

    stat.ML cs.LG stat.ME

    Multi-task Highly Adaptive Lasso

    Authors: Ivana Malenica, Rachael V. Phillips, Daniel Lazzareschi, Jeremy R. Coyle, Romain Pirracchio, Mark J. van der Laan

    Abstract: We propose a novel, fully nonparametric approach for the multi-task learning, the Multi-task Highly Adaptive Lasso (MT-HAL). MT-HAL simultaneously learns features, samples and task associations important for the common model, while imposing a shared sparse structure among similar tasks. Given multiple tasks, our approach automatically finds a sparse sharing structure. The proposed MTL algorithm at… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

  5. arXiv:2205.10697  [pdf, other

    stat.ML cs.LG math.ST

    Lassoed Tree Boosting

    Authors: Alejandro Schuler, Yi Li, Mark van der Laan

    Abstract: Gradient boosting performs exceptionally in most prediction problems and scales well to large datasets. In this paper we prove that a ``lassoed'' gradient boosted tree algorithm with early stop** achieves faster than $n^{-1/4}$ L2 convergence in the large nonparametric space of cadlag functions of bounded sectional variation. This rate is remarkable because it does not depend on the dimension, s… ▽ More

    Submitted 8 December, 2023; v1 submitted 21 May, 2022; originally announced May 2022.

  6. arXiv:2110.12112  [pdf, ps, other

    math.ST cs.LG stat.ML

    Why Machine Learning Cannot Ignore Maximum Likelihood Estimation

    Authors: Mark J. van der Laan, Sherri Rose

    Abstract: The growth of machine learning as a field has been accelerating with increasing interest and publications across fields, including statistics, but predominantly in computer science. How can we parse this vast literature for developments that exemplify the necessary rigor? How many of these manuscripts incorporate foundational theory to allow for statistical inference? Which advances have the great… ▽ More

    Submitted 22 October, 2021; originally announced October 2021.

    Comments: 30 pages. Forthcoming as a chapter in the Handbook of Matching and Weighting in Causal Inference

  7. arXiv:2109.10452  [pdf, other

    stat.ML cs.LG

    Personalized Online Machine Learning

    Authors: Ivana Malenica, Rachael V. Phillips, Romain Pirracchio, Antoine Chambaz, Alan Hubbard, Mark J. van der Laan

    Abstract: In this work, we introduce the Personalized Online Super Learner (POSL) -- an online ensembling algorithm for streaming data whose optimization procedure accommodates varying degrees of personalization. Namely, POSL optimizes predictions with respect to baseline covariates, so personalization can vary from completely individualized (i.e., optimization with respect to baseline covariate subject ID)… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

  8. arXiv:2106.01723  [pdf, other

    stat.ML cs.LG math.ST

    Risk Minimization from Adaptively Collected Data: Guarantees for Supervised and Policy Learning

    Authors: Aurélien Bibaut, Antoine Chambaz, Maria Dimakopoulou, Nathan Kallus, Mark van der Laan

    Abstract: Empirical risk minimization (ERM) is the workhorse of machine learning, whether for classification and regression or for off-policy policy learning, but its model-agnostic guarantees can fail when we use adaptively collected data, such as the result of running a contextual bandit algorithm. We study a generic importance sampling weighted ERM algorithm for using adaptively collected data to minimiz… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

  9. arXiv:2106.00418  [pdf, other

    stat.ML cs.LG math.ST

    Post-Contextual-Bandit Inference

    Authors: Aurélien Bibaut, Antoine Chambaz, Maria Dimakopoulou, Nathan Kallus, Mark van der Laan

    Abstract: Contextual bandit algorithms are increasingly replacing non-adaptive A/B tests in e-commerce, healthcare, and policymaking because they can both improve outcomes for study participants and increase the chance of identifying good or even best policies. To support credible inference on novel interventions at the end of the study, nonetheless, we still want to construct valid confidence intervals on… ▽ More

    Submitted 1 June, 2021; originally announced June 2021.

  10. arXiv:2102.00102  [pdf, other

    math.ST cs.LG stat.ME stat.ML

    Adaptive Sequential Design for a Single Time-Series

    Authors: Ivana Malenica, Aurelien Bibaut, Mark J. van der Laan

    Abstract: The current work is motivated by the need for robust statistical methods for precision medicine; as such, we address the need for statistical methods that provide actionable inference for a single unit at any point in time. We aim to learn an optimal, unknown choice of the controlled components of the design in order to optimize the expected outcome; with that, we adapt the randomization mechanism… ▽ More

    Submitted 1 July, 2021; v1 submitted 29 January, 2021; originally announced February 2021.

    Comments: arXiv admin note: text overlap with arXiv:1809.00734

  11. arXiv:2006.03632  [pdf, other

    cs.LG stat.ML

    Rate-adaptive model selection over a collection of black-box contextual bandit algorithms

    Authors: Aurélien F. Bibaut, Antoine Chambaz, Mark J. van der Laan

    Abstract: We consider the model selection task in the stochastic contextual bandit setting. Suppose we are given a collection of base contextual bandit algorithms. We provide a master algorithm that combines them and achieves the same performance, up to constants, as the best base algorithm would, if it had been run on its own. Our approach only requires that each algorithm satisfy a high probability regret… ▽ More

    Submitted 5 June, 2020; originally announced June 2020.

  12. arXiv:2003.02873  [pdf, other

    cs.LG stat.ML

    Generalized Policy Elimination: an efficient algorithm for Nonparametric Contextual Bandits

    Authors: Aurélien F. Bibaut, Antoine Chambaz, Mark J. van der Laan

    Abstract: We propose the Generalized Policy Elimination (GPE) algorithm, an oracle-efficient contextual bandit (CB) algorithm inspired by the Policy Elimination algorithm of \cite{dudik2011}. We prove the first regret optimality guarantee theorem for an oracle-efficient CB algorithm competing against a nonparametric class with infinite VC-dimension. Specifically, we show that GPE is regret-optimal (up to lo… ▽ More

    Submitted 5 March, 2020; originally announced March 2020.

  13. arXiv:1912.06675  [pdf, other

    stat.ML cs.LG

    Conditional Super Learner

    Authors: Gilmer Valdes, Yannet Interian, Efstathios D. Gennatas Mark J. Van der Laan

    Abstract: In this article we consider the Conditional Super Learner (CSL), an algorithm which selects the best model candidate from a library conditional on the covariates. The CSL expands the idea of using cross-validation to select the best model and merges it with meta learning. Here we propose a specific algorithm that finds a local minimum to the problem posed, proof that it converges at a rate faster… ▽ More

    Submitted 13 December, 2019; originally announced December 2019.

  14. arXiv:1912.06292  [pdf, other

    cs.LG stat.ME stat.ML

    More Efficient Off-Policy Evaluation through Regularized Targeted Learning

    Authors: Aurélien F. Bibaut, Ivana Malenica, Nikos Vlassis, Mark J. van der Laan

    Abstract: We study the problem of off-policy evaluation (OPE) in Reinforcement Learning (RL), where the aim is to estimate the performance of a new policy given historical data that may have been generated by a different policy, or policies. In particular, we introduce a novel doubly-robust estimator for the OPE problem in RL, based on the Targeted Maximum Likelihood Estimation principle from the statistica… ▽ More

    Submitted 12 December, 2019; originally announced December 2019.

    Comments: We are uploading the full paper with the appendix as of 12/12/2019, as we noticed that, unlike the main text, the appendix has not been made available on PMLR's website. The version of the appendix in this document is the same that we have been sending by email since June 2019 to readers who solicited it

    Journal ref: Proceedings of the 36th International Conference on Machine Learning, PMLR 97:654-663, 2019

  15. Expert-Augmented Machine Learning

    Authors: E. D. Gennatas, J. H. Friedman, L. H. Ungar, R. Pirracchio, E. Eaton, L. Reichman, Y. Interian, C. B. Simone, A. Auerbach, E. Delgado, M. J. Van der Laan, T. D. Solberg, G. Valdes

    Abstract: Machine Learning is proving invaluable across disciplines. However, its success is often limited by the quality and quantity of available data, while its adoption by the level of trust that models afford users. Human vs. machine performance is commonly compared empirically to decide whether a certain task should be performed by a computer or an expert. In reality, the optimal learning strategy may… ▽ More

    Submitted 5 January, 2021; v1 submitted 22 March, 2019; originally announced March 2019.

  16. arXiv:1809.00734  [pdf, other

    math.ST cs.LG stat.AP stat.ME stat.ML

    Robust Estimation of Data-Dependent Causal Effects based on Observing a Single Time-Series

    Authors: Mark J. van der Laan, Ivana Malenica

    Abstract: Consider the case that one observes a single time-series, where at each time t one observes a data record O(t) involving treatment nodes A(t), possible covariates L(t) and an outcome node Y(t). The data record at time t carries information for an (potentially causal) effect of the treatment A(t) on the outcome Y(t), in the context defined by a fixed dimensional summary measure Co(t). We are concer… ▽ More

    Submitted 3 September, 2018; originally announced September 2018.

  17. arXiv:1704.01664  [pdf, other

    stat.ML cs.CV cs.LG stat.ME

    The Relative Performance of Ensemble Methods with Deep Convolutional Neural Networks for Image Classification

    Authors: Cheng Ju, Aurélien Bibaut, Mark J. van der Laan

    Abstract: Artificial neural networks have been successfully applied to a variety of machine learning tasks, including image recognition, semantic segmentation, and machine translation. However, few studies fully investigated ensembles of artificial neural networks. In this work, we investigated multiple widely used ensemble methods, including unweighted averaging, majority voting, the Bayes Optimal Classifi… ▽ More

    Submitted 5 April, 2017; originally announced April 2017.