-
Management Decisions in Manufacturing using Causal Machine Learning -- To Rework, or not to Rework?
Authors:
Philipp Schwarz,
Oliver Schacht,
Sven Klaassen,
Daniel Grünbaum,
Sebastian Imhof,
Martin Spindler
Abstract:
In this paper, we present a data-driven model for estimating optimal rework policies in manufacturing systems. We consider a single production stage within a multistage, lot-based system that allows for optional rework steps. While the rework decision depends on an intermediate state of the lot and system, the final product inspection, and thus the assessment of the actual yield, is delayed until…
▽ More
In this paper, we present a data-driven model for estimating optimal rework policies in manufacturing systems. We consider a single production stage within a multistage, lot-based system that allows for optional rework steps. While the rework decision depends on an intermediate state of the lot and system, the final product inspection, and thus the assessment of the actual yield, is delayed until production is complete. Repair steps are applied uniformly to the lot, potentially improving some of the individual items while degrading others. The challenge is thus to balance potential yield improvement with the rework costs incurred. Given the inherently causal nature of this decision problem, we propose a causal model to estimate yield improvement. We apply methods from causal machine learning, in particular double/debiased machine learning (DML) techniques, to estimate conditional treatment effects from data and derive policies for rework decisions. We validate our decision model using real-world data from opto-electronic semiconductor manufacturing, achieving a yield improvement of 2 - 3% during the color-conversion process of white light-emitting diodes (LEDs).
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
DoubleMLDeep: Estimation of Causal Effects with Multimodal Data
Authors:
Sven Klaassen,
Jan Teichert-Kluge,
Philipp Bach,
Victor Chernozhukov,
Martin Spindler,
Suhas Vijaykumar
Abstract:
This paper explores the use of unstructured, multimodal data, namely text and images, in causal inference and treatment effect estimation. We propose a neural network architecture that is adapted to the double machine learning (DML) framework, specifically the partially linear model. An additional contribution of our paper is a new method to generate a semi-synthetic dataset which can be used to e…
▽ More
This paper explores the use of unstructured, multimodal data, namely text and images, in causal inference and treatment effect estimation. We propose a neural network architecture that is adapted to the double machine learning (DML) framework, specifically the partially linear model. An additional contribution of our paper is a new method to generate a semi-synthetic dataset which can be used to evaluate the performance of causal effect estimation in the presence of text and images as confounders. The proposed methods and architectures are evaluated on the semi-synthetic dataset and compared to standard approaches, highlighting the potential benefit of using text and images directly in causal studies. Our findings have implications for researchers and practitioners in economics, marketing, finance, medicine and data science in general who are interested in estimating causal quantities using non-traditional data.
△ Less
Submitted 1 February, 2024;
originally announced February 2024.
-
Causally Learning an Optimal Rework Policy
Authors:
Oliver Schacht,
Sven Klaassen,
Philipp Schwarz,
Martin Spindler,
Daniel Grünbaum,
Sebastian Imhof
Abstract:
In manufacturing, rework refers to an optional step of a production process which aims to eliminate errors or remedy products that do not meet the desired quality standards. Reworking a production lot involves repeating a previous production stage with adjustments to ensure that the final product meets the required specifications. While offering the chance to improve the yield and thus increase th…
▽ More
In manufacturing, rework refers to an optional step of a production process which aims to eliminate errors or remedy products that do not meet the desired quality standards. Reworking a production lot involves repeating a previous production stage with adjustments to ensure that the final product meets the required specifications. While offering the chance to improve the yield and thus increase the revenue of a production lot, a rework step also incurs additional costs. Additionally, the rework of parts that already meet the target specifications may damage them and decrease the yield. In this paper, we apply double/debiased machine learning (DML) to estimate the conditional treatment effect of a rework step during the color conversion process in opto-electronic semiconductor manufacturing on the final product yield. We utilize the implementation DoubleML to develop policies for the rework of components and estimate their value empirically. From our causal machine learning analysis we derive implications for the coating of monochromatic LEDs with conversion layers.
△ Less
Submitted 7 June, 2023;
originally announced June 2023.
-
Incorporating Ethics in Computing Courses: Perspectives from Educators
Authors:
Jessie J. Smith,
Blakeley H. Payne,
Shamika Klassen,
Dylan Thomas Doyle,
Casey Fiesler
Abstract:
Incorporating ethics into computing education has become a priority for the SIGCSE community. Many computing departments and educators have contributed to this endeavor by creating standalone computing ethics courses or integrating ethics modules and discussions into preexisting curricula. In this study, we hope to support this effort by reporting on computing educators' attitudes toward including…
▽ More
Incorporating ethics into computing education has become a priority for the SIGCSE community. Many computing departments and educators have contributed to this endeavor by creating standalone computing ethics courses or integrating ethics modules and discussions into preexisting curricula. In this study, we hope to support this effort by reporting on computing educators' attitudes toward including ethics in their computing classroom, with a special focus on the structures that hinder or help this endeavor. We surveyed 138 higher education computing instructors to understand their attitudes toward including ethics in their classes, what barriers might be preventing them from doing so, and which structures best support them. We found that even though instructors were generally positive about ethics as a component of computing education, there are specific barriers preventing ethics from being included in some computing courses. In this work, we explore how to alleviate these barriers and outline support structures that could encourage further integration of ethics and computing in higher education.
△ Less
Submitted 26 January, 2023; v1 submitted 12 December, 2022;
originally announced December 2022.
-
DoubleML -- An Object-Oriented Implementation of Double Machine Learning in R
Authors:
Philipp Bach,
Victor Chernozhukov,
Malte S. Kurz,
Martin Spindler,
Sven Klaassen
Abstract:
The R package DoubleML implements the double/debiased machine learning framework of Chernozhukov et al. (2018). It provides functionalities to estimate parameters in causal models based on machine learning methods. The double machine learning framework consist of three key ingredients: Neyman orthogonality, high-quality machine learning estimation and sample splitting. Estimation of nuisance compo…
▽ More
The R package DoubleML implements the double/debiased machine learning framework of Chernozhukov et al. (2018). It provides functionalities to estimate parameters in causal models based on machine learning methods. The double machine learning framework consist of three key ingredients: Neyman orthogonality, high-quality machine learning estimation and sample splitting. Estimation of nuisance components can be performed by various state-of-the-art machine learning methods that are available in the mlr3 ecosystem. DoubleML makes it possible to perform inference in a variety of causal models, including partially linear and interactive regression models and their extensions to instrumental variable estimation. The object-oriented implementation of DoubleML enables a high flexibility for the model specification and makes it easily extendable. This paper serves as an introduction to the double machine learning framework and the R package DoubleML. In reproducible code examples with simulated and real data sets, we demonstrate how DoubleML users can perform valid inference based on machine learning methods.
△ Less
Submitted 5 June, 2024; v1 submitted 17 March, 2021;
originally announced March 2021.
-
Uniform Inference in High-Dimensional Gaussian Graphical Models
Authors:
Sven Klaassen,
Jannis Kück,
Martin Spindler,
Victor Chernozhukov
Abstract:
Graphical models have become a very popular tool for representing dependencies within a large set of variables and are key for representing causal structures. We provide results for uniform inference on high-dimensional graphical models with the number of target parameters $d$ being possible much larger than sample size. This is in particular important when certain features or structures of a caus…
▽ More
Graphical models have become a very popular tool for representing dependencies within a large set of variables and are key for representing causal structures. We provide results for uniform inference on high-dimensional graphical models with the number of target parameters $d$ being possible much larger than sample size. This is in particular important when certain features or structures of a causal model should be recovered. Our results highlight how in high-dimensional settings graphical models can be estimated and recovered with modern machine learning methods in complex data sets. To construct simultaneous confidence regions on many target parameters, sufficiently fast estimation rates of the nuisance functions are crucial. In this context, we establish uniform estimation rates and sparsity guarantees of the square-root estimator in a random design under approximate sparsity conditions that might be of independent interest for related problems in high-dimensions. We also demonstrate in a comprehensive simulation study that our procedure has good small sample properties.
△ Less
Submitted 3 December, 2018; v1 submitted 30 August, 2018;
originally announced August 2018.