-
A simulation framework of procurement operations in the container logistics industry
Authors:
George Vassos,
Klaus K. Holst,
Pierre Pinson,
Richard M. Lusby
Abstract:
This study proposes a simulation framework of procurement operations in the container logistics industry that can support the development of dynamic procurement strategies. The idea is inspired by the success of Passenger Origin-Destination Simulator (PODS) in the field of airline revenue management. By and large, research in procurement has focused on the optimisation of purchasing decisions, i.e…
▽ More
This study proposes a simulation framework of procurement operations in the container logistics industry that can support the development of dynamic procurement strategies. The idea is inspired by the success of Passenger Origin-Destination Simulator (PODS) in the field of airline revenue management. By and large, research in procurement has focused on the optimisation of purchasing decisions, i.e., when-to-order and supplier selection, but a principled approach to procurement operations is lacking. We fill this gap by develo** a probabilistic model of a procurement system. A discrete-event simulation logic is used to drive the evolution of the system. In a small case study, we use the simulation to deliver insights by comparing different supplier selection policies in a dynamic spot market environment. Policies based on contextual multi-armed bandits are seen to be robust to limited access to the information that determines the distribution of the outcome. This paper provides a pool of modelling ideas for simulation and observational studies. Moreover, the probabilistic formulation paves the way for advanced machine learning techniques and data-driven optimisation in procurement.
△ Less
Submitted 22 May, 2023; v1 submitted 22 March, 2023;
originally announced March 2023.
-
Policy Learning with the polle package
Authors:
Andreas Nordland,
Klaus K. Holst
Abstract:
The R package polle is a unifying framework for learning and evaluating finite stage policies based on observational data. The package implements a collection of existing and novel methods for causal policy learning including doubly robust restricted Q-learning, policy tree learning, and outcome weighted learning. The package deals with (near) positivity violations by only considering realistic po…
▽ More
The R package polle is a unifying framework for learning and evaluating finite stage policies based on observational data. The package implements a collection of existing and novel methods for causal policy learning including doubly robust restricted Q-learning, policy tree learning, and outcome weighted learning. The package deals with (near) positivity violations by only considering realistic policies. Highly flexible machine learning methods can be used to estimate the nuisance components and valid inference for the policy value is ensured via cross-fitting. The library is built up around a simple syntax with four main functions policy_data(), policy_def(), policy_learn(), and policy_eval() used to specify the data structure, define user-specified policies, specify policy learning methods and evaluate (learned) policies. The functionality of the package is illustrated via extensive reproducible examples.
△ Less
Submitted 3 July, 2024; v1 submitted 5 December, 2022;
originally announced December 2022.
-
A two-stage estimation procedure for non-linear structural equation models
Authors:
Klaus Kähler Holst,
Esben Budtz-Jørgensen
Abstract:
Applications of structural equation models (SEMs) are often restricted to linear associations between variables. Maximum likelihood (ML) estimation in non-linear models may be complex and require numerical integration. Furthermore, ML inference is sensitive to distributional assumptions. In this paper, we introduce a simple two-stage estimation technique for estimation of non-linear associations b…
▽ More
Applications of structural equation models (SEMs) are often restricted to linear associations between variables. Maximum likelihood (ML) estimation in non-linear models may be complex and require numerical integration. Furthermore, ML inference is sensitive to distributional assumptions. In this paper, we introduce a simple two-stage estimation technique for estimation of non-linear associations between latent variables. Here both steps are based on fitting linear SEMs: first a linear model is fitted to data on the latent predictor and terms describing the non-linear effect are predicted by their conditional means. In the second step, the predictions are included in a linear model for the latent outcome variable. We show that this procedure is consistent and identifies its asymptotic distribution. We also illustrate how this framework easily allows the association between latent variables to be modelled using restricted cubic splines and we develop a modified estimator which is robust to non-normality of the latent predictor. In a simulation study, we compare the proposed method to MLE and alternative two-stage estimation techniques.
△ Less
Submitted 9 October, 2018; v1 submitted 17 March, 2018;
originally announced March 2018.
-
A latent variable model with mixed binary and continuous response variables
Authors:
Klaus K. Holst,
Esben Budtz-Jørgensen,
Gitte Moos Knudsen
Abstract:
We propose a method for obtaining maximum likelihood estimates in a model with continuous and binary outcomes. Combinations of left and right censored observations are also naturally modeled in this framework. The model and estimation procedure has been implemented in the R package lava.tobit.
The method is demonstrated on brain imaging and personality data where measurement error on predictor v…
▽ More
We propose a method for obtaining maximum likelihood estimates in a model with continuous and binary outcomes. Combinations of left and right censored observations are also naturally modeled in this framework. The model and estimation procedure has been implemented in the R package lava.tobit.
The method is demonstrated on brain imaging and personality data where measurement error on predictor variables is handled in a latent variable framework. A simulation study is conducted comparing the small sample properties of the MLE with a limited information estimator.
△ Less
Submitted 5 July, 2015;
originally announced July 2015.
-
Model Diagnostics Based on Cumulative Residuals: The R-package gof
Authors:
Klaus K. Holst
Abstract:
The generalized linear model is widely used in all areas of applied statistics and while correct asymptotic inference can be achieved under misspecification of the distributional assumptions, a correctly specified mean structure is crucial to obtain interpretable results. Usually the linearity and functional form of predictors are checked by inspecting various scatter plots of the residuals, howev…
▽ More
The generalized linear model is widely used in all areas of applied statistics and while correct asymptotic inference can be achieved under misspecification of the distributional assumptions, a correctly specified mean structure is crucial to obtain interpretable results. Usually the linearity and functional form of predictors are checked by inspecting various scatter plots of the residuals, however, the subjective task of judging these can be challenging. In this paper we present an implementation of model diagnostics for the generalized linear model as well as structural equation models, based on aggregates of the residuals where the asymptotic behavior under the null is imitated by simulations. A procedure for checking the proportional hazard assumption in the Cox regression is also implemented.
△ Less
Submitted 5 July, 2015;
originally announced July 2015.
-
The Liability Threshold Model for Censored Twin Data
Authors:
Klaus K. Holst,
Thomas H. Scheike,
Jacob B. Hjelmborg
Abstract:
Family studies provide an important tool for understanding etiology of diseases, with the key aim of discovering evidence of family aggregation and to determine if such aggregation can be attributed to genetic components. Heritability and concordance estimates are routinely calculated in twin studies of diseases, as a way of quantifying such genetic contribution. The endpoint in these studies are…
▽ More
Family studies provide an important tool for understanding etiology of diseases, with the key aim of discovering evidence of family aggregation and to determine if such aggregation can be attributed to genetic components. Heritability and concordance estimates are routinely calculated in twin studies of diseases, as a way of quantifying such genetic contribution. The endpoint in these studies are typically defined as occurrence of a disease versus death without the disease. However, a large fraction of the subjects may still be alive at the time of follow-up without having experienced the disease thus still being at risk. Ignoring this right-censoring can lead to severely biased estimates. We propose to extend the classical liability threshold model with inverse probability of censoring weighting of complete observations. This leads to a flexible way of modeling twin concordance and obtaining consistent estimates of heritability. We apply the method in simulations and to data from the population based Danish twin cohort where we describe the dependence in prostate cancer occurrence in twins.
△ Less
Submitted 24 January, 2015; v1 submitted 7 July, 2014;
originally announced July 2014.
-
Linear Latent Variable Models: The lava-package
Authors:
Klaus K. Holst,
Esben Budtz-Jørgensen
Abstract:
An R package for specifying and estimating linear latent variable models is presented. The philosophy of the implementation is to separate the model specification from the actual data, which leads to a dynamic and easy way of modeling complex hierarchical structures. Several advanced features are implemented including robust standard errors for clustered correlated data, multigroup analyses, non-l…
▽ More
An R package for specifying and estimating linear latent variable models is presented. The philosophy of the implementation is to separate the model specification from the actual data, which leads to a dynamic and easy way of modeling complex hierarchical structures. Several advanced features are implemented including robust standard errors for clustered correlated data, multigroup analyses, non-linear parameter constraints, inference with incomplete data, maximum likelihood estimation with censored and binary observations, and instrumental variable estimators. In addition an extensive simulation interface covering a broad range of non-linear generalized structural equation models is described. The model and software are demonstrated in data of measurements of the serotonin transporter in the human brain.
△ Less
Submitted 15 June, 2012;
originally announced June 2012.