
Showing 1–25 of 25 results for author: Spindler, M

Searching in archive econ.
  1. arXiv:2406.18936  [pdf, other]

    econ.GN stat.AP

    Credit Ratings: Heterogeneous Effect on Capital Structure

    Authors: Helmut Wasserbacher, Martin Spindler

    Abstract: Why do companies choose particular capital structures? A compelling answer to this question remains elusive despite extensive research. In this article, we use double machine learning to examine the heterogeneous causal effect of credit ratings on leverage. Taking advantage of the flexibility of random forests within the double machine learning framework, we model the relationship between variable…

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: 288 pages, 13 figures

  2. arXiv:2406.11308  [pdf, other]

    cs.LG cs.AI econ.EM stat.ML

    Management Decisions in Manufacturing using Causal Machine Learning -- To Rework, or not to Rework?

    Authors: Philipp Schwarz, Oliver Schacht, Sven Klaassen, Daniel Grünbaum, Sebastian Imhof, Martin Spindler

    Abstract: In this paper, we present a data-driven model for estimating optimal rework policies in manufacturing systems. We consider a single production stage within a multistage, lot-based system that allows for optional rework steps. While the rework decision depends on an intermediate state of the lot and system, the final product inspection, and thus the assessment of the actual yield, is delayed until…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 30 pages, 10 figures

  3. arXiv:2403.02467  [pdf]

    econ.EM cs.LG stat.ME stat.ML

    Applied Causal Inference Powered by ML and AI

    Authors: Victor Chernozhukov, Christian Hansen, Nathan Kallus, Martin Spindler, Vasilis Syrgkanis

    Abstract: An introduction to the emerging fusion of machine learning and causal inference. The book presents ideas from classical structural equation models (SEMs) and their modern AI equivalent, directed acyclic graphs (DAGs) and structural causal models (SCMs), and covers Double/Debiased Machine Learning methods to do inference in such models using modern predictive tools.

    Submitted 4 March, 2024; originally announced March 2024.

  4. arXiv:2402.04674  [pdf, other]

    econ.EM stat.ML

    Hyperparameter Tuning for Causal Inference with Double Machine Learning: A Simulation Study

    Authors: Philipp Bach, Oliver Schacht, Victor Chernozhukov, Sven Klaassen, Martin Spindler

    Abstract: Proper hyperparameter tuning is essential for achieving optimal performance of modern machine learning (ML) methods in predictive tasks. While there is an extensive literature on tuning ML learners for prediction, there is only little guidance available on tuning ML learners for causal machine learning and how to select among different ML learners. In this paper, we empirically assess the relation…

    Submitted 7 February, 2024; originally announced February 2024.

  5. arXiv:2402.01785  [pdf, other]

    cs.LG cs.AI econ.EM stat.ME stat.ML

    DoubleMLDeep: Estimation of Causal Effects with Multimodal Data

    Authors: Sven Klaassen, Jan Teichert-Kluge, Philipp Bach, Victor Chernozhukov, Martin Spindler, Suhas Vijaykumar

    Abstract: This paper explores the use of unstructured, multimodal data, namely text and images, in causal inference and treatment effect estimation. We propose a neural network architecture that is adapted to the double machine learning (DML) framework, specifically the partially linear model. An additional contribution of our paper is a new method to generate a semi-synthetic dataset which can be used to e…

    Submitted 1 February, 2024; originally announced February 2024.

    MSC Class: 62; 91 ACM Class: I.2.0

  6. arXiv:2107.04851  [pdf, other]

    econ.EM cs.AI

    Machine Learning for Financial Forecasting, Planning and Analysis: Recent Developments and Pitfalls

    Authors: Helmut Wasserbacher, Martin Spindler

    Abstract: This article is an introduction to machine learning for financial forecasting, planning and analysis (FP&A). Machine learning appears well suited to support FP&A with the highly automated extraction of information from large amounts of data. However, because most traditional machine learning techniques focus on forecasting (prediction), we discuss the particular care that must be taken to avoid…

    Submitted 10 July, 2021; originally announced July 2021.

    Comments: 31 pages, 3 figures, 4 tables

  7. arXiv:2104.03220  [pdf, other]

    stat.ML cs.LG econ.EM

    DoubleML -- An Object-Oriented Implementation of Double Machine Learning in Python

    Authors: Philipp Bach, Victor Chernozhukov, Malte S. Kurz, Martin Spindler

    Abstract: DoubleML is an open-source Python library implementing the double machine learning framework of Chernozhukov et al. (2018) for a variety of causal models. It contains functionalities for valid statistical inference on causal parameters when the estimation of nuisance parameters is based on machine learning methods. The object-oriented implementation of DoubleML provides a high flexibility in terms…

    Submitted 20 December, 2021; v1 submitted 7 April, 2021; originally announced April 2021.

    Comments: 6 pages, 2 figures

    MSC Class: 62-04

    Journal ref: Journal of Machine Learning Research 23 (53), 2022, 1-6

  8. arXiv:2103.09603  [pdf, other]

    stat.ML cs.LG econ.EM

    DoubleML -- An Object-Oriented Implementation of Double Machine Learning in R

    Authors: Philipp Bach, Victor Chernozhukov, Malte S. Kurz, Martin Spindler, Sven Klaassen

    Abstract: The R package DoubleML implements the double/debiased machine learning framework of Chernozhukov et al. (2018). It provides functionalities to estimate parameters in causal models based on machine learning methods. The double machine learning framework consists of three key ingredients: Neyman orthogonality, high-quality machine learning estimation and sample splitting. Estimation of nuisance compo…

    Submitted 5 June, 2024; v1 submitted 17 March, 2021; originally announced March 2021.

    Comments: 56 pages, 8 Figures, 1 Table; Updated version for DoubleML 1.0.0; Updated version due to changes in R package paradox (for parameter tuning with mlr3)

    MSC Class: 62-04

    Journal ref: Journal of Statistical Software 2024

  9. arXiv:2102.08994  [pdf, other]

    stat.ME econ.EM stat.OT

    Big Data meets Causal Survey Research: Understanding Nonresponse in the Recruitment of a Mixed-mode Online Panel

    Authors: Barbara Felderer, Jannis Kueck, Martin Spindler

    Abstract: Survey scientists increasingly face the problem of high-dimensionality in their research as digitization makes it much easier to construct high-dimensional (or "big") data sets through tools such as online surveys and mobile applications. Machine learning methods are able to handle such data, and they have been successfully applied to solve predictive problems. However, in many situations,…

    Submitted 17 February, 2021; originally announced February 2021.

    Comments: 33 pages, 3 figures, 3 tables

  10. arXiv:2011.01092  [pdf, other]

    econ.GN physics.soc-ph stat.AP

    Insights from Optimal Pandemic Shielding in a Multi-Group SEIR Framework

    Authors: Philipp Bach, Victor Chernozhukov, Martin Spindler

    Abstract: The COVID-19 pandemic constitutes one of the largest threats in recent decades to the health and economic welfare of populations globally. In this paper, we analyze different types of policy measures designed to fight the spread of the virus and minimize economic losses. Our analysis builds on a multi-group SEIR model, which extends the multi-group SIR model introduced by Acemoglu et al. (2020). W…

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: 39 pages, 23 figures

  11. arXiv:2004.01623  [pdf, other]

    stat.ME econ.EM stat.ML

    Estimation and Uniform Inference in Sparse High-Dimensional Additive Models

    Authors: Philipp Bach, Sven Klaassen, Jannis Kueck, Martin Spindler

    Abstract: We develop a novel method to construct uniformly valid confidence bands for a nonparametric component $f_1$ in the sparse additive model $Y=f_1(X_1)+\ldots + f_p(X_p) + \varepsilon$ in a high-dimensional setting. Our method integrates sieve estimation into a high-dimensional Z-estimation framework, facilitating the construction of uniformly valid confidence bands for the target component $f_1$. To…

    Submitted 23 April, 2024; v1 submitted 3 April, 2020; originally announced April 2020.

    MSC Class: 62G08; 62-07

  12. arXiv:2002.12710  [pdf, ps, other]

    econ.EM

    Causal mediation analysis with double machine learning

    Authors: Helmut Farbmacher, Martin Huber, Lukáš Lafférs, Henrika Langen, Martin Spindler

    Abstract: This paper combines causal mediation analysis with double machine learning to control for observed confounders in a data-driven way under a selection-on-observables assumption in a high-dimensional setting. We consider the average indirect effect of a binary treatment operating through an intermediate variable (or mediator) on the causal path between the treatment and the outcome, as well as the u…

    Submitted 16 February, 2021; v1 submitted 28 February, 2020; originally announced February 2020.

  13. arXiv:1912.12867  [pdf, other]

    stat.ME cs.LG econ.EM stat.ML

    Adaptive Discrete Smoothing for High-Dimensional and Nonlinear Panel Data

    Authors: Xi Chen, Ye Luo, Martin Spindler

    Abstract: In this paper we develop a data-driven smoothing technique for high-dimensional and non-linear panel data models. We allow for individual specific (non-linear) functions and estimation with econometric or machine learning methods by using weighted observations from other individuals. The weights are determined in a data-driven way and depend on the similarity between the corresponding functions an…

    Submitted 3 January, 2020; v1 submitted 30 December, 2019; originally announced December 2019.

    Comments: 18 pages, 1 figure, 6 tables

    MSC Class: I.2.6; G.3 ACM Class: I.2.6; G.3

  14. arXiv:1812.04345  [pdf, other]

    econ.EM stat.AP stat.ML

    Closing the U.S. gender wage gap requires understanding its heterogeneity

    Authors: Philipp Bach, Victor Chernozhukov, Martin Spindler

    Abstract: In 2016, the majority of full-time employed women in the U.S. earned significantly less than comparable men. The extent to which women were affected by gender inequality in earnings, however, depended greatly on socio-economic characteristics, such as marital status or educational attainment. In this paper, we analyzed data from the 2016 American Community Survey using a high-dimensional wage regr…

    Submitted 7 June, 2021; v1 submitted 11 December, 2018; originally announced December 2018.

    Comments: Main text: 8 pages, 3 figures; Supplementary Material available online

  15. arXiv:1809.04951  [pdf, other]

    econ.EM stat.ML

    Valid Simultaneous Inference in High-Dimensional Settings (with the hdm package for R)

    Authors: Philipp Bach, Victor Chernozhukov, Martin Spindler

    Abstract: Due to the increasing availability of high-dimensional empirical applications in many research disciplines, valid simultaneous inference becomes more and more important. For instance, high-dimensional settings might arise in economic studies due to very rich data sets with many potential covariates or in the analysis of treatment heterogeneities. Also the evaluation of potentially more complicated…

    Submitted 13 September, 2018; originally announced September 2018.

    Comments: 25 pages, 2 figures, 4 tables

  16. arXiv:1808.10543  [pdf, other]

    cs.LG econ.EM stat.ML

    A Self-Attention Network for Hierarchical Data Structures with an Application to Claims Management

    Authors: Leander Löw, Martin Spindler, Eike Brechmann

    Abstract: Insurance companies must manage millions of claims per year. While most of these claims are non-fraudulent, fraud detection is core for insurance companies. The ultimate goal is a predictive model to single out the fraudulent claims and pay out the non-fraudulent ones immediately. Modern machine learning methods are well suited for this kind of problem. Health care claims often have a data structu…

    Submitted 30 August, 2018; originally announced August 2018.

    Comments: 7 pages, 6 figures, 2 tables

  17. arXiv:1808.10532  [pdf, other]

    stat.ME cs.LG econ.EM stat.ML

    Uniform Inference in High-Dimensional Gaussian Graphical Models

    Authors: Sven Klaassen, Jannis Kück, Martin Spindler, Victor Chernozhukov

    Abstract: Graphical models have become a very popular tool for representing dependencies within a large set of variables and are key for representing causal structures. We provide results for uniform inference on high-dimensional graphical models with the number of target parameters $d$ being possibly much larger than sample size. This is in particular important when certain features or structures of a caus…

    Submitted 3 December, 2018; v1 submitted 30 August, 2018; originally announced August 2018.

    Comments: 59 pages, 2 figures, 6 tables

    MSC Class: 62H15; 62J07

  18. arXiv:1801.00364  [pdf, other]

    stat.ML econ.EM stat.ME

    Estimation and Inference of Treatment Effects with $L_2$-Boosting in High-Dimensional Settings

    Authors: Jannis Kueck, Ye Luo, Martin Spindler, Zigan Wang

    Abstract: Empirical researchers are increasingly faced with rich data sets containing many controls or instrumental variables, making it essential to choose an appropriate approach to variable selection. In this paper, we provide results for valid inference after post- or orthogonal $L_2$-Boosting is used for variable selection. We consider treatment effects after selecting among many control variables and…

    Submitted 1 July, 2021; v1 submitted 31 December, 2017; originally announced January 2018.

    Comments: 17 pages, 1 figure

    MSC Class: 62J07; 62F12

  19. arXiv:1712.07364  [pdf, other]

    stat.ME econ.EM math.ST stat.ML

    Transformation Models in High-Dimensions

    Authors: Sven Klaassen, Jannis Kueck, Martin Spindler

    Abstract: Transformation models are a very important tool for applied statisticians and econometricians. In many applications, the dependent variable is transformed so that homogeneity or normal distribution of the error holds. In this paper, we analyze transformation models in a high-dimensional setting, where the set of potential covariates is large. We propose an estimator for the transformation paramete…

    Submitted 20 December, 2017; originally announced December 2017.

    Comments: 63 pages, 4 figures

    MSC Class: 62H; 62F

  20. arXiv:1702.03244  [pdf, ps, other]

    stat.ML econ.EM stat.ME

    $L_2$Boosting for Economic Applications

    Authors: Ye Luo, Martin Spindler

    Abstract: In recent years more and more high-dimensional data sets, where the number of parameters $p$ is high compared to the number of observations $n$ or even larger, are available for applied researchers. Boosting algorithms represent one of the major advances in machine learning and statistics in recent years and are suitable for the analysis of such data sets. While Lasso has been applied very suc…

    Submitted 10 February, 2017; originally announced February 2017.

    Comments: Submitted to American Economic Review, Papers and Proceedings 2017. arXiv admin note: text overlap with arXiv:1602.08927

  21. arXiv:1608.00354  [pdf, ps, other]

    stat.ME econ.EM stat.ML

    hdm: High-Dimensional Metrics

    Authors: Victor Chernozhukov, Chris Hansen, Martin Spindler

    Abstract: In this article the package High-dimensional Metrics (hdm) is introduced. It is a collection of statistical methods for estimation and quantification of uncertainty in high-dimensional approximately sparse models. It focuses on providing confidence intervals and significance testing for (possibly many) low-dimensional subcomponents of the high-dimensional parameter vector. Efficient estim…

    Submitted 1 August, 2016; originally announced August 2016.

    Comments: arXiv admin note: substantial text overlap with arXiv:1603.01700

  22. arXiv:1603.01700  [pdf, ps, other]

    stat.ML econ.EM stat.ME

    High-Dimensional Metrics in R

    Authors: Victor Chernozhukov, Chris Hansen, Martin Spindler

    Abstract: The package High-dimensional Metrics (hdm) is an evolving collection of statistical methods for estimation and quantification of uncertainty in high-dimensional approximately sparse models. It focuses on providing confidence intervals and significance testing for (possibly many) low-dimensional subcomponents of the high-dimensional parameter vector. Efficient estimators and uniformly va…

    Submitted 1 August, 2016; v1 submitted 5 March, 2016; originally announced March 2016.

    Comments: 34 pages; vignette for the R package hdm, available at http://cran.r-project.org/web/packages/hdm/ and http://r-forge.r-project.org/R/?group_id=2084 (development version)

    MSC Class: 62-01; 62-04; 62J07; 62G05

  23. arXiv:1602.08927  [pdf, other]

    stat.ML cs.LG econ.EM math.ST stat.ME

    High-Dimensional $L_2$Boosting: Rate of Convergence

    Authors: Ye Luo, Martin Spindler, Jannis Kück

    Abstract: Boosting is one of the most significant developments in machine learning. This paper studies the rate of convergence of $L_2$Boosting, which is tailored for regression, in a high-dimensional setting. Moreover, we introduce so-called "post-Boosting". This is a post-selection estimator which applies ordinary least squares to the variables selected in the first stage…

    Submitted 21 July, 2022; v1 submitted 29 February, 2016; originally announced February 2016.

    Comments: 19 pages, 4 tables; AMS 2000 subject classifications: Primary 62J05, 62J07, 41A25; secondary 49M15, 68Q32

    MSC Class: 62J05; 62J07; 41A25; 49M15; 68Q32

  24. Valid Post-Selection and Post-Regularization Inference: An Elementary, General Approach

    Authors: Victor Chernozhukov, Christian Hansen, Martin Spindler

    Abstract: Here we present an expository, general analysis of valid post-selection or post-regularization inference about a low-dimensional target parameter, $α$, in the presence of a very high-dimensional nuisance parameter, $η$, which is estimated using modern selection or regularization methods. Our analysis relies on high-level, easy-to-interpret conditions that allow one to clearly see the structures ne…

    Submitted 18 August, 2015; v1 submitted 14 January, 2015; originally announced January 2015.

    Comments: 47 pages

    Journal ref: Annual Review of Economics, Vol. 7: 649-688 (August 2015)

  25. arXiv:1501.03185  [pdf, ps, other]

    stat.AP econ.EM

    Post-Selection and Post-Regularization Inference in Linear Models with Many Controls and Instruments

    Authors: Victor Chernozhukov, Christian Hansen, Martin Spindler

    Abstract: In this note, we offer an approach to estimating causal/structural parameters in the presence of many instruments and controls based on methods for estimating sparse high-dimensional models. We use these high-dimensional methods to select both which instruments and which control variables to use. The approach we take extends BCCH2012, which covers selection of instruments for IV models with a smal…

    Submitted 13 January, 2015; originally announced January 2015.

    Comments: American Economic Review 2015, Papers and Proceedings