-
A Data Fusion Approach for Ride-sourcing Demand Estimation: A Discrete Choice Model with Sampling and Endogeneity Corrections
Authors:
Rico Krueger,
Michel Bierlaire,
Prateek Bansal
Abstract:
Ride-sourcing services offered by companies like Uber and Didi have grown rapidly in the last decade. Understanding the demand for these services is essential for planning and managing modern transportation systems. Existing studies develop statistical models for ride-sourcing demand estimation at an aggregate level due to limited data availability. These models lack foundations in microeconomic theory, ignore competition of ride-sourcing with other travel modes, and cannot be seamlessly integrated into existing individual-level (disaggregate) activity-based models to evaluate system-level impacts of ride-sourcing services. In this paper, we present and apply an approach for estimating ride-sourcing demand at a disaggregate level using discrete choice models and multiple data sources. We first construct a sample of trip-based mode choices in Chicago, USA by enriching a household travel survey with publicly available ride-sourcing and taxi trip records. We then formulate a multivariate extreme value-based discrete choice model with sampling and endogeneity corrections to account for the construction of the estimation sample from multiple data sources and endogeneity biases arising from supply-side constraints and surge pricing mechanisms in ride-sourcing systems. Our analysis of the constructed dataset reveals insights into the influence of various socio-economic, land use and built environment features on ride-sourcing demand. We also derive elasticities of ride-sourcing demand relative to travel cost and time. Finally, we illustrate how the developed model can be employed to quantify the welfare implications of ride-sourcing policies and regulations such as terminating certain types of services and introducing ride-sourcing taxes.
Submitted 5 December, 2022;
originally announced December 2022.
-
Fuel consumption elasticities, rebound effect and feebate effectiveness in the Indian and Chinese new car markets
Authors:
Prateek Bansal,
Rubal Dua
Abstract:
China and India, the world's two most populous developing economies, are also among the world's largest automotive markets and carbon emitters. To reduce carbon emissions from the passenger car sector, both countries have considered various policy levers affecting fuel prices, car prices and fuel economy. This study estimates the responsiveness of new car buyers in China and India to such policy levers and drivers including income. Furthermore, we estimate the potential for rebound effect and the effectiveness of a feebate policy. To accomplish this, we developed a joint discrete-continuous model of car choice and usage based on revealed preference survey data from approximately 8000 new car buyers from India and China who purchased cars in 2016-17. Conditional on buying a new car, the fuel consumption in both markets is found to be relatively unresponsive to fuel price and income, with magnitudes of elasticity estimates ranging from 0.12 to 0.15. For both markets, the mean segment-level direct elasticities of fuel consumption relative to car price and fuel economy range from 0.57 to 0.65. The rebound effect on fuel savings due to cost-free fuel economy improvement is found to be 17.1% for India and 18.8% for China. A revenue-neutral feebate policy, with average rebates and fees of up to around 15% of the retail price, resulted in fuel savings of around 0.7% for both markets. While the feebate policy's rebound effect is low - 7.3% for India and 1.6% for China - it does not appear to be an effective fuel conservation policy.
Submitted 22 January, 2022;
originally announced January 2022.
-
A General Framework to Forecast the Adoption of Novel Products: A Case of Autonomous Vehicles
Authors:
Subodh Dubey,
Ishant Sharma,
Sabyasachee Mishra,
Oded Cats,
Prateek Bansal
Abstract:
Due to the unavailability of prototypes, the early adopters of novel products actively seek information from multiple sources (e.g., media and social networks) to minimize the potential risk. The existing behavior models not only fail to capture the information propagation within the individual's social network, but also do not incorporate the impact of such word-of-mouth (WOM) dissemination on the consumer's risk preferences. Moreover, even cutting-edge forecasting models rely on crude/synthetic consumer behavior models. We propose a general framework to forecast the adoption of novel products by developing a new consumer behavior model and integrating it into a population-level agent-based model. Specifically, we extend the hybrid choice model to estimate consumer behavior, which incorporates social network effects and interplay between WOM and risk aversion. The calibrated consumer behavior model and synthetic population are passed through the agent-based model for forecasting the product market share. We apply the proposed framework to forecast the adoption of autonomous vehicles (AVs) in Nashville, USA. The consumer behavior model is calibrated with stated preference survey data from 1,495 Nashville residents. The output of the agent-based model provides the effect of the purchase price, post-purchase satisfaction, and safety measures/regulations on the forecasted AV market share. With an annual AV price reduction of 5% at the initial purchase price of $40,000 and 90% of satisfied adopters, AVs are forecasted to attain around 85% market share in thirty years. These findings are crucial for policymakers to develop infrastructure plans and manufacturers to conduct an after-sales cost-benefit analysis.
Submitted 7 September, 2021;
originally announced September 2021.
-
Face masks, vaccination rates and low crowding drive the demand for the London Underground during the COVID-19 pandemic
Authors:
Prateek Bansal,
Roselinde Kessels,
Rico Krueger,
Daniel J Graham
Abstract:
The COVID-19 pandemic has drastically impacted people's travel behaviour and out-of-home activity participation. While countermeasures are being eased with increasing vaccination rates, the demand for public transport remains uncertain. To investigate user preferences to travel by London Underground during the pandemic, we conducted a stated choice experiment among its pre-pandemic users (N=961). We analysed the collected data using multinomial and mixed logit models. Our analysis provides insights into the sensitivity of the demand for the London Underground with respect to travel attributes (crowding density and travel time), the epidemic situation (confirmed new COVID-19 cases), and interventions (vaccination rates and mandatory face masks). Mandatory face masks and higher vaccination rates are the top two drivers of travel demand for the London Underground during COVID-19. The positive impact of vaccination rates on the Underground demand increases with crowding density, and the positive effect of mandatory face masks decreases with travel time. Mixed logit reveals substantial preference heterogeneity. For instance, while the average effect of mandatory face masks is positive, preferences of around 20% of the pre-pandemic users to travel by the Underground are negatively affected. The estimated demand sensitivities are relevant for supply-demand management in transit systems and the calibration of advanced epidemiological models.
Submitted 6 July, 2021;
originally announced July 2021.
-
Revisiting the empirical fundamental relationship of traffic flow for highways using a causal econometric approach
Authors:
Anupriya,
Daniel J. Graham,
Daniel Hörcher,
Prateek Bansal
Abstract:
The fundamental relationship of traffic flow is empirically estimated by fitting a regression curve to a cloud of observations of traffic variables. Such estimates, however, may suffer from the confounding/endogeneity bias due to omitted variables such as driving behaviour and weather. To this end, this paper adopts a causal approach to obtain an unbiased estimate of the fundamental flow-density relationship using traffic detector data. In particular, we apply a Bayesian non-parametric spline-based regression approach with instrumental variables to adjust for the aforementioned confounding bias. The proposed approach is benchmarked against standard curve-fitting methods in estimating the flow-density relationship for three highway bottlenecks in the United States. Our empirical results suggest that the saturated (or hypercongested) regime of the estimated flow-density relationship using correlational curve fitting methods may be severely biased, which in turn leads to biased estimates of important traffic control inputs such as capacity and capacity-drop. We emphasise that our causal approach is based on the physical laws of vehicle movement in a traffic stream as opposed to a demand-supply framework adopted in the economics literature. By doing so, we also aim to conciliate the engineering and economics approaches to this empirical problem. Our results, thus, have important implications both for traffic engineers and transport economists.
Submitted 6 April, 2021;
originally announced April 2021.
-
Willingness to Pay and Attitudinal Preferences of Indian Consumers for Electric Vehicles
Authors:
Prateek Bansal,
Rajeev Ranjan Kumar,
Alok Raj,
Subodh Dubey,
Daniel J. Graham
Abstract:
Consumer preference elicitation is critical to devise effective policies for the diffusion of electric vehicles (EVs) in India. This study contributes to the EV demand literature in the Indian context by (a) analysing the EV attributes and attitudinal factors of Indian car buyers that determine consumers' preferences for EVs, (b) estimating Indian consumers' willingness to pay (WTP) to buy EVs with improved attributes, and (c) quantifying how reference dependence affects the WTP estimates. We adopt a hybrid choice modelling approach for the above analysis. The results indicate that accounting for reference dependence provides more realistic WTP estimates than the standard utility estimation approach. Our results suggest that Indian consumers are willing to pay an additional USD 10-34 in the purchase price to reduce the fast charging time by 1 minute, USD 7-40 to add a kilometre to the driving range of EVs at 200 kilometres, and USD 104-692 to save USD 1 per 100 kilometres in operating cost. These estimates and the effect of attitudes on the likelihood to adopt EVs provide insights about EV design, marketing strategies, and pro-EV policies (e.g., specialised lanes and reserved parking for EVs) to expedite the adoption of EVs in India.
Submitted 13 May, 2021; v1 submitted 20 January, 2021;
originally announced January 2021.
-
Robust discrete choice models with t-distributed kernel errors
Authors:
Rico Krueger,
Michel Bierlaire,
Thomas Gasos,
Prateek Bansal
Abstract:
Outliers in discrete choice response data may result from misclassification and misreporting of the response variable and from choice behaviour that is inconsistent with modelling assumptions (e.g. random utility maximisation). In the presence of outliers, standard discrete choice models produce biased estimates and suffer from compromised predictive accuracy. Robust statistical models are less sensitive to outliers than standard non-robust models. This paper analyses two robust alternatives to the multinomial probit (MNP) model. The two models are robit models whose kernel error distributions are heavy-tailed t-distributions to moderate the influence of outliers. The first model is the multinomial robit (MNR) model, in which a generic degrees of freedom parameter controls the heavy-tailedness of the kernel error distribution. The second model, the generalised multinomial robit (Gen-MNR) model, is more flexible than MNR, as it allows for distinct heavy-tailedness in each dimension of the kernel error distribution. For both models, we derive Gibbs samplers for posterior inference. In a simulation study, we illustrate the excellent finite sample properties of the proposed Bayes estimators and show that MNR and Gen-MNR produce more accurate estimates if the choice data contain outliers through the lens of the non-robust MNP model. In a case study on transport mode choice behaviour, MNR and Gen-MNR outperform MNP by substantial margins in terms of in-sample fit and out-of-sample predictive accuracy. The case study also highlights differences in elasticity estimates across models.
Submitted 5 December, 2022; v1 submitted 14 September, 2020;
originally announced September 2020.
-
A Dynamic Choice Model with Heterogeneous Decision Rules: Application in Estimating the User Cost of Rail Crowding
Authors:
Prateek Bansal,
Daniel Hörcher,
Daniel J. Graham
Abstract:
Crowding valuation of subway riders is an important input to various supply-side decisions of transit operators. The crowding cost perceived by a transit rider is generally estimated by capturing the trade-off that the rider makes between crowding and travel time while choosing a route. However, existing studies rely on static compensatory choice models and fail to account for inertia and the learning behaviour of riders. To address these challenges, we propose a new dynamic latent class model (DLCM) which (i) assigns riders to latent compensatory and inertia/habit classes based on different decision rules, (ii) enables transitions between these classes over time, and (iii) adopts instance-based learning theory to account for the learning behaviour of riders. We use the expectation-maximisation algorithm to estimate DLCM, and the most probable sequence of latent classes for each rider is retrieved using the Viterbi algorithm. The proposed DLCM can be applied in any choice context to capture the dynamics of decision rules used by a decision-maker. We demonstrate its practical advantages in estimating the crowding valuation of an Asian metro's riders. To calibrate the model, we recover the daily route preferences and in-vehicle crowding experiences of regular metro riders using two months of smart card and vehicle location data. The results indicate that the average rider follows the compensatory rule on only 25.5% of route choice occasions. DLCM estimates also show an increase of 47% in metro riders' valuation of travel time under extremely crowded conditions relative to that under uncrowded conditions.
Submitted 7 July, 2020;
originally announced July 2020.
-
Variational Bayesian Inference for Mixed Logit Models with Unobserved Inter- and Intra-Individual Heterogeneity
Authors:
Rico Krueger,
Prateek Bansal,
Michel Bierlaire,
Ricardo A. Daziano,
Taha H. Rashidi
Abstract:
Variational Bayes (VB), a method originating from machine learning, enables fast and scalable estimation of complex probabilistic models. Thus far, applications of VB in discrete choice analysis have been limited to mixed logit models with unobserved inter-individual taste heterogeneity. However, such a model formulation may be too restrictive in panel data settings, since tastes may vary both between individuals as well as across choice tasks encountered by the same individual. In this paper, we derive a VB method for posterior inference in mixed logit models with unobserved inter- and intra-individual heterogeneity. In a simulation study, we benchmark the performance of the proposed VB method against maximum simulated likelihood (MSL) and Markov chain Monte Carlo (MCMC) methods in terms of parameter recovery, predictive accuracy and computational efficiency. The simulation study shows that VB can be a fast, scalable and accurate alternative to MSL and MCMC estimation, especially in applications in which fast predictions are paramount. VB is observed to be between 2.8 and 17.7 times faster than the two competing methods, while affording comparable or superior accuracy. Besides, the simulation study demonstrates that a parallelised implementation of the MSL estimator with analytical gradients is a viable alternative to MCMC in terms of both estimation accuracy and computational efficiency, as the MSL estimator is observed to be between 0.9 and 2.1 times faster than MCMC.
Submitted 16 January, 2020; v1 submitted 1 May, 2019;
originally announced May 2019.
-
A Multicriteria Decision Making Approach to Study the Barriers to the Adoption of Autonomous Vehicles
Authors:
Alok Raj,
J Ajith Kumar,
Prateek Bansal
Abstract:
Automation technology is emerging, but the adoption rate of autonomous vehicles (AVs) will largely depend upon how policymakers and the government address various challenges such as public acceptance and infrastructure development. This study proposes a five-step method to understand these barriers to AV adoption. First, based on a literature review followed by discussions with experts, ten barriers are identified. Second, the opinions of eighteen experts from industry and academia regarding inter-relations between these barriers are recorded. Third, a multicriteria decision making (MCDM) technique, the grey-based Decision-making Trial and Evaluation Laboratory (Grey-DEMATEL), is applied to characterize the structure of relationships between the barriers. Fourth, robustness of the results is tested using sensitivity analysis. Fifth, the key results are depicted in a causal loop diagram (CLD), a systems thinking approach, to comprehend cause-and-effect relationships between the barriers. The results indicate that the lack of customer acceptance (LCA) is the most prominent barrier and should be addressed with the highest priority. The CLD suggests that LCA can be mitigated by addressing two other prominent, yet more tangible, barriers: lack of industry standards and the absence of regulations and certifications. The study's overarching contribution thus lies in bringing to the fore multiple barriers to AV adoption and their potential influences on each other. Moreover, the insights from this study can help associations related to AVs prioritize their endeavors to expedite AV adoption. From the methodological perspective, this is the first study in transportation literature that integrates Grey-DEMATEL with systems thinking.
Submitted 27 December, 2019; v1 submitted 26 April, 2019;
originally announced April 2019.
-
A Generalized Continuous-Multinomial Response Model with a t-distributed Error Kernel
Authors:
Subodh Dubey,
Prateek Bansal,
Ricardo A. Daziano,
Erick Guerra
Abstract:
In multinomial response models, idiosyncratic variations in the indirect utility are generally modeled using Gumbel or normal distributions. This study makes a strong case to substitute these thin-tailed distributions with a t-distribution. First, we demonstrate that a model with a t-distributed error kernel better estimates and predicts preferences, especially in class-imbalanced datasets. Our proposed specification also implicitly accounts for decision-uncertainty behavior, i.e. the degree of certainty that decision-makers hold in their choices relative to the variation in the indirect utility of any alternative. Second, after applying a t-distributed error kernel in a multinomial response model for the first time, we extend this specification to a generalized continuous-multinomial (GCM) model and derive its full-information maximum likelihood estimator. The likelihood involves an open-form expression of the cumulative distribution function of the multivariate t-distribution, which we propose to compute using a combination of the composite marginal likelihood method and the separation-of-variables approach. Third, we establish finite sample properties of the GCM model with a t-distributed error kernel (GCM-t) and highlight its superiority over the GCM model with a normally-distributed error kernel (GCM-N) in a Monte Carlo study. Finally, we compare GCM-t and GCM-N in an empirical setting related to preferences for electric vehicles (EVs). We observe that accounting for decision-uncertainty behavior in GCM-t results in lower elasticity estimates and a higher willingness to pay for improving the EV attributes than those of the GCM-N model. These differences are relevant in making policies to expedite the adoption of EVs.
Submitted 18 January, 2020; v1 submitted 17 April, 2019;
originally announced April 2019.
-
Can Mobility-on-Demand services do better after discerning reliability preferences of riders?
Authors:
Prateek Bansal,
Yang Liu,
Ricardo Daziano,
Samitha Samaranayake
Abstract:
We formalize one aspect of reliability in the context of Mobility-on-Demand (MoD) systems by acknowledging the uncertainty in the pick-up time of these services. This study answers two key questions: (i) how does the difference between the stated and actual pick-up times affect the propensity of a passenger to choose an MoD service? (ii) how can an MoD service provider leverage this information to increase its ridership? We conduct a discrete choice experiment in New York to answer the former question and adopt a micro-simulation-based optimization method to answer the latter question. In our experiments, the ridership of an MoD service could be increased by up to 10% by strategically displaying the predicted wait time.
Submitted 16 April, 2019;
originally announced April 2019.
-
Pólygamma Data Augmentation to address Non-conjugacy in the Bayesian Estimation of Mixed Multinomial Logit Models
Authors:
Prateek Bansal,
Rico Krueger,
Michel Bierlaire,
Ricardo A. Daziano,
Taha H. Rashidi
Abstract:
The standard Gibbs sampler of Mixed Multinomial Logit (MMNL) models involves sampling from conditional densities of utility parameters using the Metropolis-Hastings (MH) algorithm due to the unavailability of a conjugate prior for the logit kernel. To address this non-conjugacy concern, we propose the application of the Pólygamma data augmentation (PG-DA) technique for the MMNL estimation. The posterior estimates of the augmented and the default Gibbs sampler are similar for the two-alternative scenario (binary choice), but we encounter empirical identification issues in the case of more alternatives ($J \geq 3$).
Submitted 13 April, 2019;
originally announced April 2019.
-
Eliciting Preferences of Ridehailing Users and Drivers: Evidence from the United States
Authors:
Prateek Bansal,
Akanksha Sinha,
Rubal Dua,
Ricardo Daziano
Abstract:
Transportation Network Companies (TNCs) are changing the transportation ecosystem, but micro-decisions of drivers and users need to be better understood to assess the system-level impacts of TNCs. In this regard, we contribute to the literature by estimating a) individuals' preferences of being a rider, a driver, or a non-user of TNC services; b) preferences of ridehailing users for ridepooling; c) TNC drivers' choice to switch to vehicles with better fuel economy, and also d) the drivers' decision to buy, rent or lease new vehicles with driving for TNCs being a major consideration. Elicitation of drivers' preferences using a unique sample (N=11,902) of the U.S. population residing in TNC-served areas is the key feature of this study. The statistical analysis indicates that ridehailing services are mainly attracting personal vehicle users as riders, without substantially affecting demand for transit. Moreover, around 10% of ridehailing users reported postponing the purchase of a new car due to the availability of TNC services. The model estimation results indicate that the likelihood of being a TNC user increases with the increase in age for someone younger than 44 years, but the pattern is reversed post 44 years. This change in direction of the marginal effect of age is insightful as the previous studies have reported a negative association. We also find that postgraduate drivers who live in metropolitan regions are more likely to switch to fuel-efficient vehicles. These findings would inform transportation planners and TNCs in developing policies to improve the fuel economy of the fleet.
Submitted 14 April, 2019;
originally announced April 2019.
-
Bayesian Estimation of Mixed Multinomial Logit Models: Advances and Simulation-Based Evaluations
Authors:
Prateek Bansal,
Rico Krueger,
Michel Bierlaire,
Ricardo A. Daziano,
Taha H. Rashidi
Abstract:
Variational Bayes (VB) methods have emerged as a fast and computationally-efficient alternative to Markov chain Monte Carlo (MCMC) methods for scalable Bayesian estimation of mixed multinomial logit (MMNL) models. It has been established that VB is substantially faster than MCMC at practically no compromises in predictive accuracy. In this paper, we address two critical gaps concerning the usage and understanding of VB for MMNL. First, extant VB methods are limited to utility specifications involving only individual-specific taste parameters. Second, the finite-sample properties of VB estimators and the relative performance of VB, MCMC and maximum simulated likelihood estimation (MSLE) are not known. To address the former, this study extends several VB methods for MMNL to admit utility specifications including both fixed and random utility parameters. To address the latter, we conduct an extensive simulation-based evaluation to benchmark the extended VB methods against MCMC and MSLE in terms of estimation times, parameter recovery and predictive accuracy. The results suggest that all VB variants with the exception of the ones relying on an alternative variational lower bound constructed with the help of the modified Jensen's inequality perform as well as MCMC and MSLE at prediction and parameter recovery. In particular, VB with nonconjugate variational message passing and the delta-method (VB-NCVMP-Delta) is up to 16 times faster than MCMC and MSLE. Thus, VB-NCVMP-Delta can be an attractive alternative to MCMC and MSLE for fast, scalable and accurate estimation of MMNL models.
Submitted 12 December, 2019; v1 submitted 7 April, 2019;
originally announced April 2019.