Skip to main content

Showing 1–14 of 14 results for author: Li, M L

Searching in archive stat. Search in all archives.
.
  1. arXiv:2404.17019  [pdf, other

    stat.ME cs.LG stat.ML

    Neyman Meets Causal Machine Learning: Experimental Evaluation of Individualized Treatment Rules

    Authors: Michael Lingzhi Li, Kosuke Imai

    Abstract: A century ago, Neyman showed how to evaluate the efficacy of treatment using a randomized experiment under a minimal set of assumptions. This classical repeated sampling framework serves as a basis of routine experimental analyses conducted by today's scientists across disciplines. In this paper, we demonstrate that Neyman's methodology can also be used to experimentally evaluate the efficacy of i… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  2. arXiv:2403.07031  [pdf, other

    cs.LG stat.CO stat.ME stat.ML

    The Cram Method for Efficient Simultaneous Learning and Evaluation

    Authors: Zeyang Jia, Kosuke Imai, Michael Lingzhi Li

    Abstract: We introduce the "cram" method, a general and efficient approach to simultaneous learning and evaluation using a generic machine learning (ML) algorithm. In a single pass of batched data, the proposed method repeatedly trains an ML algorithm and tests its empirical performance. Because it utilizes the entire sample for both learning and evaluation, cramming is significantly more data-efficient tha… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  3. arXiv:2310.07973  [pdf, other

    stat.ME math.OC stat.AP stat.ML

    Statistical Performance Guarantee for Subgroup Identification with Generic Machine Learning

    Authors: Michael Lingzhi Li, Kosuke Imai

    Abstract: Across a wide array of disciplines, many researchers use machine learning (ML) algorithms to identify a subgroup of individuals who are likely to benefit from a treatment the most (``exceptional responders'') or those who are harmed by it. A common approach to this subgroup identification problem consists of two steps. First, researchers estimate the conditional average treatment effect (CATE) usi… ▽ More

    Submitted 20 December, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

  4. arXiv:2210.08326  [pdf, ps, other

    stat.ME cs.LG math.OC stat.ML

    Distributionally Robust Causal Inference with Observational Data

    Authors: Dimitris Bertsimas, Kosuke Imai, Michael Lingzhi Li

    Abstract: We consider the estimation of average treatment effects in observational studies and propose a new framework of robust causal inference with unobserved confounders. Our approach is based on distributionally robust optimization and proceeds in two steps. We first specify the maximal degree to which the distribution of unobserved potential outcomes may deviate from that of observed outcomes. We then… ▽ More

    Submitted 2 February, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

  5. arXiv:2203.14511  [pdf, ps, other

    stat.ME stat.AP stat.ML

    Statistical Inference for Heterogeneous Treatment Effects Discovered by Generic Machine Learning in Randomized Experiments

    Authors: Kosuke Imai, Michael Lingzhi Li

    Abstract: Researchers are increasingly turning to machine learning (ML) algorithms to investigate causal heterogeneity in randomized experiments. Despite their promise, ML algorithms may fail to accurately ascertain heterogeneous treatment effects under practical settings with many covariates and small sample size. In addition, the quantification of estimation uncertainty remains a challenge. We develop a g… ▽ More

    Submitted 20 April, 2024; v1 submitted 28 March, 2022; originally announced March 2022.

  6. arXiv:2103.02506  [pdf, ps, other

    math.OC stat.CO stat.ML

    Stochastic Cutting Planes for Data-Driven Optimization

    Authors: Dimitris Bertsimas, Michael Lingzhi Li

    Abstract: We introduce a stochastic version of the cutting-plane method for a large class of data-driven Mixed-Integer Nonlinear Optimization (MINLO) problems. We show that under very weak assumptions the stochastic algorithm is able to converge to an $ε$-optimal solution with high probability. Numerical experiments on several problems show that stochastic cutting planes is able to deliver a multiple order-… ▽ More

    Submitted 3 March, 2021; originally announced March 2021.

  7. arXiv:2102.10773  [pdf, other

    cs.LG math.OC stat.CO stat.ML

    Slowly Varying Regression under Sparsity

    Authors: Dimitris Bertsimas, Vassilis Digalakis Jr, Michael Linghzi Li, Omar Skali Lami

    Abstract: We present the framework of slowly varying regression under sparsity, allowing sparse regression models to exhibit slow and sparse variations. The problem of parameter estimation is formulated as a mixed-integer optimization problem. We demonstrate that it can be precisely reformulated as a binary convex optimization problem through a novel relaxation technique. This relaxation involves a new equa… ▽ More

    Submitted 11 November, 2023; v1 submitted 21 February, 2021; originally announced February 2021.

    Comments: Submitted to Operations Research. First submission: 02/2021

  8. arXiv:2006.16509  [pdf, other

    stat.AP math.OC q-bio.PE stat.ML

    From predictions to prescriptions: A data-driven response to COVID-19

    Authors: Dimitris Bertsimas, Léonard Boussioux, Ryan Cory Wright, Arthur Delarue, Vassilis Digalakis Jr., Alexandre Jacquillat, Driss Lahlou Kitane, Galit Lukin, Michael Lingzhi Li, Luca Mingardi, Omid Nohadani, Agni Orfanoudaki, Theodore Papalexopoulos, Ivan Paskov, Jean Pauphilet, Omar Skali Lami, Bartolomeo Stellato, Hamza Tazi Bouardi, Kimberly Villalobos Carballo, Holly Wiberg, Cynthia Zeng

    Abstract: The COVID-19 pandemic has created unprecedented challenges worldwide. Strained healthcare providers make difficult decisions on patient triage, treatment and care management on a daily basis. Policy makers have imposed social distancing measures to slow the disease, at a steep economic price. We design analytical tools to support these decisions and combat the pandemic. Specifically, we propose a… ▽ More

    Submitted 29 June, 2020; originally announced June 2020.

    Comments: Submitted to PNAS

  9. arXiv:1911.05256  [pdf, other

    cs.LG cs.CV stat.ML

    A Hierarchy of Graph Neural Networks Based on Learnable Local Features

    Authors: Michael Lingzhi Li, Meng Dong, Jiawei Zhou, Alexander M. Rush

    Abstract: Graph neural networks (GNNs) are a powerful tool to learn representations on graphs by iteratively aggregating features from node neighbourhoods. Many variant models have been proposed, but there is limited understanding on both how to compare different architectures and how to construct GNNs systematically. Here, we propose a hierarchy of GNNs based on their aggregation regions. We derive theoret… ▽ More

    Submitted 12 November, 2019; originally announced November 2019.

  10. arXiv:1910.09092  [pdf, ps, other

    cs.LG math.OC stat.ME stat.ML

    Fast Exact Matrix Completion: A Unified Optimization Framework for Matrix Completion

    Authors: Dimitris Bertsimas, Michael Lingzhi Li

    Abstract: We formulate the problem of matrix completion with and without side information as a non-convex optimization problem. We design fastImpute based on non-convex gradient descent and show it converges to a global minimum that is guaranteed to recover closely the underlying matrix while it scales to matrices of sizes beyond $10^5 \times 10^5$. We report experiments on both synthetic and real-world dat… ▽ More

    Submitted 31 December, 2020; v1 submitted 20 October, 2019; originally announced October 2019.

    Journal ref: Journal of Machine Learning Research 21 (2020) 1-43

  11. arXiv:1905.05389  [pdf, other

    stat.AP stat.ME stat.ML

    Experimental Evaluation of Individualized Treatment Rules

    Authors: Kosuke Imai, Michael Lingzhi Li

    Abstract: The increasing availability of individual-level data has led to numerous applications of individualized (or personalized) treatment rules (ITRs). Policy makers often wish to empirically evaluate ITRs and compare their relative performance before implementing them in a target population. We propose a new evaluation metric, the population average prescriptive effect (PAPE). The PAPE compares the per… ▽ More

    Submitted 5 May, 2021; v1 submitted 14 May, 2019; originally announced May 2019.

    Comments: Accepted at JASA

  12. arXiv:1903.05063  [pdf, other

    cs.LG stat.ML

    Duration-of-Stay Storage Assignment under Uncertainty

    Authors: Michael Lingzhi Li, Elliott Wolf, Daniel Wintz

    Abstract: Optimizing storage assignment is a central problem in warehousing. Past literature has shown the superiority of the Duration-of-Stay (DoS) method in assigning pallets, but the methodology requires perfect prior knowledge of DoS for each pallet, which is unknown and uncertain under realistic conditions. The dynamic nature of a warehouse further complicates the validity of synthetic data testing tha… ▽ More

    Submitted 31 January, 2020; v1 submitted 12 March, 2019; originally announced March 2019.

    Comments: 15 pages, 4 figures. Accepted at ICLR 2020

  13. arXiv:1902.03272  [pdf, ps, other

    stat.ML cs.LG math.OC

    Scalable Holistic Linear Regression

    Authors: Dimitris Bertsimas, Michael Lingzhi Li

    Abstract: We propose a new scalable algorithm for holistic linear regression building on Bertsimas & King (2016). Specifically, we develop new theory to model significance and multicollinearity as lazy constraints rather than checking the conditions iteratively. The resulting algorithm scales with the number of samples $n$ in the 10,000s, compared to the low 100s in the previous framework. Computational res… ▽ More

    Submitted 3 March, 2020; v1 submitted 8 February, 2019; originally announced February 2019.

    Comments: Accepted by Operation Research Letters

  14. arXiv:1812.06647  [pdf, ps, other

    math.OC cs.LG stat.ML

    Interpretable Matrix Completion: A Discrete Optimization Approach

    Authors: Dimitris Bertsimas, Michael Lingzhi Li

    Abstract: We consider the problem of matrix completion on an $n \times m$ matrix. We introduce the problem of Interpretable Matrix Completion that aims to provide meaningful insights for the low-rank matrix using side information. We show that the problem can be reformulated as a binary convex optimization problem. We design OptComplete, based on a novel concept of stochastic cutting planes to enable effici… ▽ More

    Submitted 3 March, 2020; v1 submitted 17 December, 2018; originally announced December 2018.

    Comments: Submitted to Operational Research