-
Estimating Policy Effects in a Social Network with Independent Set Sampling
Authors:
Eugene Ang,
Prasanta Bhattacharya,
Andrew Lim
Abstract:
Evaluating the impact of policy interventions on respondents who are embedded in a social network is often challenging due to the presence of network interference within the treatment groups, as well as between treatment and non-treatment groups throughout the network. In this paper, we propose a modeling strategy that combines existing work on stochastic actor-oriented models (SAOM) with a novel…
▽ More
Evaluating the impact of policy interventions on respondents who are embedded in a social network is often challenging due to the presence of network interference within the treatment groups, as well as between treatment and non-treatment groups throughout the network. In this paper, we propose a modeling strategy that combines existing work on stochastic actor-oriented models (SAOM) with a novel network sampling method based on the identification of independent sets. By assigning respondents from an independent set to the treatment, we are able to block any spillover of the treatment and network influence, thereby allowing us to isolate the direct effect of the treatment from the indirect network-induced effects, in the immediate term. As a result, our method allows for the estimation of both the \textit{direct} as well as the \textit{net effect} of a chosen policy intervention, in the presence of network effects in the population. We perform a comparative simulation analysis to show that our proposed sampling technique leads to distinct direct and net effects of the policy, as well as significant network effects driven by policy-linked homophily. This study highlights the importance of network sampling techniques in improving policy evaluation studies and has the potential to help researchers and policymakers with better planning, designing, and anticipating policy responses in a networked society.
△ Less
Submitted 25 February, 2024; v1 submitted 25 June, 2023;
originally announced June 2023.
-
Neural Continuous-Discrete State Space Models for Irregularly-Sampled Time Series
Authors:
Abdul Fatir Ansari,
Alvin Heng,
Andre Lim,
Harold Soh
Abstract:
Learning accurate predictive models of real-world dynamic phenomena (e.g., climate, biological) remains a challenging task. One key issue is that the data generated by both natural and artificial processes often comprise time series that are irregularly sampled and/or contain missing observations. In this work, we propose the Neural Continuous-Discrete State Space Model (NCDSSM) for continuous-tim…
▽ More
Learning accurate predictive models of real-world dynamic phenomena (e.g., climate, biological) remains a challenging task. One key issue is that the data generated by both natural and artificial processes often comprise time series that are irregularly sampled and/or contain missing observations. In this work, we propose the Neural Continuous-Discrete State Space Model (NCDSSM) for continuous-time modeling of time series through discrete-time observations. NCDSSM employs auxiliary variables to disentangle recognition from dynamics, thus requiring amortized inference only for the auxiliary variables. Leveraging techniques from continuous-discrete filtering theory, we demonstrate how to perform accurate Bayesian inference for the dynamic states. We propose three flexible parameterizations of the latent dynamics and an efficient training objective that marginalizes the dynamic states during inference. Empirical results on multiple benchmark datasets across various domains show improved imputation and forecasting performance of NCDSSM over existing models.
△ Less
Submitted 18 June, 2023; v1 submitted 26 January, 2023;
originally announced January 2023.
-
An Effective Method for Identifying Clusters of Robot Strengths
Authors:
Jen-Chieh Teng,
Chin-Tsang Chiang,
Alvin Lim
Abstract:
In the analysis of qualification data from the FIRST Robotics Competition, the ratio of the number of observations to the number of parameters has been found to be quite small for the commonly used winning margin power rating (WMPR) model. This usually leads to imprecise estimates and inaccurate predictions in such a three-on-three game. With the finding of a clustering feature in estimated robot…
▽ More
In the analysis of qualification data from the FIRST Robotics Competition, the ratio of the number of observations to the number of parameters has been found to be quite small for the commonly used winning margin power rating (WMPR) model. This usually leads to imprecise estimates and inaccurate predictions in such a three-on-three game. With the finding of a clustering feature in estimated robot strengths, a more flexible model with latent clusters of robots was proposed to alleviate overparameterization of the WMPR model. Since its structure can be regarded as a dimension reduction of the parameter space in the WMPR model, the identification of clusters of robot strengths is naturally transformed into a model selection problem. Instead of comparing a huge number of competing models, we develop an effective method to estimate the number of clusters, clusters of robots, and robot strengths. The new method consists of two parts: (i) a combination of hierarchical and non-hierarchical classifications to determine candidate models; and (ii) variant goodness-of-fit criteria to select optimal models. Different from existing hierarchical classification systems, each step of ours is based on estimated robot strengths from a candidate model in the preceding non-hierarchical classification step. A great advantage of the designed non-hierarchical classification system is to examine the possibility of reassigning robots to other cluster sets of robots. To reduce the overestimation of clusters by the mean squared prediction error criteria, the corresponding BIC are established as alternatives for model selection. By assembling these essential elements into a coherent whole, a systematic procedure is presented to perform the estimation. In addition, we propose two indices to measure the nested relation between cluster sets of two models and monotonic association between robot strengths of two models.
△ Less
Submitted 13 November, 2022; v1 submitted 26 July, 2022;
originally announced July 2022.
-
A data-driven approach to beating SAA out-of-sample
Authors:
Jun-ya Gotoh,
Michael Jong Kim,
Andrew E. B. Lim
Abstract:
While solutions of Distributionally Robust Optimization (DRO) problems can sometimes have a higher out-of-sample expected reward than the Sample Average Approximation (SAA), there is no guarantee. In this paper, we introduce a class of Distributionally Optimistic Optimization (DOO) models, and show that it is always possible to ``beat" SAA out-of-sample if we consider not just worst-case (DRO) mod…
▽ More
While solutions of Distributionally Robust Optimization (DRO) problems can sometimes have a higher out-of-sample expected reward than the Sample Average Approximation (SAA), there is no guarantee. In this paper, we introduce a class of Distributionally Optimistic Optimization (DOO) models, and show that it is always possible to ``beat" SAA out-of-sample if we consider not just worst-case (DRO) models but also best-case (DOO) ones. We also show, however, that this comes at a cost: Optimistic solutions are more sensitive to model error than either worst-case or SAA optimizers, and hence are less robust and calibrating the worst- or best-case model to outperform SAA may be difficult when data is limited.
△ Less
Submitted 11 June, 2023; v1 submitted 26 May, 2021;
originally announced May 2021.
-
Machine Learning Approaches for Type 2 Diabetes Prediction and Care Management
Authors:
Aloysius Lim,
Ashish Singh,
Jody Chiam,
Carly Eckert,
Vikas Kumar,
Muhammad Aurangzeb Ahmad,
Ankur Teredesai
Abstract:
Prediction of diabetes and its various complications has been studied in a number of settings, but a comprehensive overview of problem setting for diabetes prediction and care management has not been addressed in the literature. In this document we seek to remedy this omission in literature with an encompassing overview of diabetes complication prediction as well as situating this problem in the c…
▽ More
Prediction of diabetes and its various complications has been studied in a number of settings, but a comprehensive overview of problem setting for diabetes prediction and care management has not been addressed in the literature. In this document we seek to remedy this omission in literature with an encompassing overview of diabetes complication prediction as well as situating this problem in the context of real world healthcare management. We illustrate various problems encountered in real world clinical scenarios via our own experience with building and deploying such models. In this manuscript we illustrate a Machine Learning (ML) framework for addressing the problem of predicting Type 2 Diabetes Mellitus (T2DM) together with a solution for risk stratification, intervention and management. These ML models align with how physicians think about disease management and mitigation, which comprises these four steps: Identify, Stratify, Engage, Measure.
△ Less
Submitted 28 April, 2021; v1 submitted 15 April, 2021;
originally announced April 2021.
-
Marketing Mix Optimization with Practical Constraints
Authors:
Hsin-Chan Huang,
Jiefeng Xu,
Alvin Lim
Abstract:
In this paper, we address a variant of the marketing mix optimization (MMO) problem which is commonly encountered in many industries, e.g., retail and consumer packaged goods (CPG) industries. This problem requires the spend for each marketing activity, if adjusted, be changed by a non-negligible degree (minimum change) and also the total number of activities with spend change be limited (maximum…
▽ More
In this paper, we address a variant of the marketing mix optimization (MMO) problem which is commonly encountered in many industries, e.g., retail and consumer packaged goods (CPG) industries. This problem requires the spend for each marketing activity, if adjusted, be changed by a non-negligible degree (minimum change) and also the total number of activities with spend change be limited (maximum number of changes). With these two additional practical requirements, the original resource allocation problem is formulated as a mixed integer nonlinear program (MINLP). Given the size of a realistic problem in the industrial setting, the state-of-the-art integer programming solvers may not be able to solve the problem to optimality in a straightforward way within a reasonable amount of time. Hence, we propose a systematic reformulation to ease the computational burden. Computational tests show significant improvements in the solution process.
△ Less
Submitted 10 January, 2021;
originally announced January 2021.
-
Estimating Linear Mixed Effects Models with Truncated Normally Distributed Random Effects
Authors:
Hao Chen,
Lanshan Han,
Alvin Lim
Abstract:
Linear Mixed Effects (LME) models have been widely applied in clustered data analysis in many areas including marketing research, clinical trials, and biomedical studies. Inference can be conducted using maximum likelihood approach if assuming Normal distributions on the random effects. However, in many applications of economy, business and medicine, it is often essential to impose constraints on…
▽ More
Linear Mixed Effects (LME) models have been widely applied in clustered data analysis in many areas including marketing research, clinical trials, and biomedical studies. Inference can be conducted using maximum likelihood approach if assuming Normal distributions on the random effects. However, in many applications of economy, business and medicine, it is often essential to impose constraints on the regression parameters after taking their real-world interpretations into account. Therefore, in this paper we extend the classical (unconstrained) LME models to allow for sign constraints on its overall coefficients. We propose to assume a symmetric doubly truncated Normal (SDTN) distribution on the random effects instead of the unconstrained Normal distribution which is often found in classical literature. With the aforementioned change, difficulty has dramatically increased as the exact distribution of the dependent variable becomes analytically intractable. We then develop likelihood-based approaches to estimate the unknown model parameters utilizing the approximation of its exact distribution. Simulation studies have shown that the proposed constrained model not only improves real-world interpretations of results, but also achieves satisfactory performance on model fits as compared to the existing model.
△ Less
Submitted 30 July, 2021; v1 submitted 9 November, 2020;
originally announced November 2020.
-
Worst-case sensitivity
Authors:
Jun-ya Gotoh,
Michael Jong Kim,
Andrew E. B. Lim
Abstract:
We introduce the notion of Worst-Case Sensitivity, defined as the worst-case rate of increase in the expected cost of a Distributionally Robust Optimization (DRO) model when the size of the uncertainty set vanishes. We show that worst-case sensitivity is a Generalized Measure of Deviation and that a large class of DRO models are essentially mean-(worst-case) sensitivity problems when uncertainty s…
▽ More
We introduce the notion of Worst-Case Sensitivity, defined as the worst-case rate of increase in the expected cost of a Distributionally Robust Optimization (DRO) model when the size of the uncertainty set vanishes. We show that worst-case sensitivity is a Generalized Measure of Deviation and that a large class of DRO models are essentially mean-(worst-case) sensitivity problems when uncertainty sets are small, unifying recent results on the relationship between DRO and regularized empirical optimization with worst-case sensitivity playing the role of the regularizer. More generally, DRO solutions can be sensitive to the family and size of the uncertainty set, and reflect the properties of its worst-case sensitivity. We derive closed-form expressions of worst-case sensitivity for well known uncertainty sets including smooth $φ$-divergence, total variation, "budgeted" uncertainty sets, uncertainty sets corresponding to a convex combination of expected value and CVaR, and the Wasserstein metric. These can be used to select the uncertainty set and its size for a given application.
△ Less
Submitted 21 October, 2020;
originally announced October 2020.
-
An Exponential Factorization Machine with Percentage Error Minimization to Retail Sales Forecasting
Authors:
Chongshou Li,
Brenda Cheang,
Zhixing Luo,
Andrew Lim
Abstract:
This paper proposes a new approach to sales forecasting for new products with long lead time but short product life cycle. These SKUs are usually sold for one season only, without any replenishments. An exponential factorization machine (EFM) sales forecast model is developed to solve this problem which not only considers SKU attributes, but also pairwise interactions. The EFM model is significant…
▽ More
This paper proposes a new approach to sales forecasting for new products with long lead time but short product life cycle. These SKUs are usually sold for one season only, without any replenishments. An exponential factorization machine (EFM) sales forecast model is developed to solve this problem which not only considers SKU attributes, but also pairwise interactions. The EFM model is significantly different from the original Factorization Machines (FM) from two-fold: (1) the attribute-level formulation for explanatory variables and (2) exponential formulation for the positive response variable. The attribute-level formation excludes infeasible intra-attribute interactions and results in more efficient feature engineering comparing with the conventional one-hot encoding, while the exponential formulation is demonstrated more effective than the log-transformation for the positive but not skewed distributed responses. In order to estimate the parameters, percentage error squares (PES) and error squares (ES) are minimized by a proposed adaptive batch gradient descent method over the training set. Real-world data provided by a footwear retailer in Singapore is used for testing the proposed approach. The forecasting performance in terms of both mean absolute percentage error (MAPE) and mean absolute error (MAE) compares favourably with not only off-the-shelf models but also results reported by extant sales and demand forecasting studies. The effectiveness of the proposed approach is also demonstrated by two external public datasets. Moreover, we prove the theoretical relationships between PES and ES minimization, and present an important property of the PES minimization for regression models; that it trains models to underestimate data. This property fits the situation of sales forecasting where unit-holding cost is much greater than the unit-shortage cost.
△ Less
Submitted 22 September, 2020;
originally announced September 2020.
-
Hierarchical Marketing Mix Models with Sign Constraints
Authors:
Hao Chen,
Minguang Zhang,
Lanshan Han,
Alvin Lim
Abstract:
Marketing mix models (MMMs) are statistical models for measuring the effectiveness of various marketing activities such as promotion, media advertisement, etc. In this research, we propose a comprehensive marketing mix model that captures the hierarchical structure and the carryover, shape and scale effects of certain marketing activities, as well as sign restrictions on certain coefficients that…
▽ More
Marketing mix models (MMMs) are statistical models for measuring the effectiveness of various marketing activities such as promotion, media advertisement, etc. In this research, we propose a comprehensive marketing mix model that captures the hierarchical structure and the carryover, shape and scale effects of certain marketing activities, as well as sign restrictions on certain coefficients that are consistent with common business sense. In contrast to commonly adopted approaches in practice, which estimate parameters in a multi-stage process, the proposed approach estimates all the unknown parameters/coefficients simultaneously using a constrained maximum likelihood approach and solved with the Hamiltonian Monte Carlo algorithm. We present results on real datasets to illustrate the use of the proposed solution algorithm.
△ Less
Submitted 28 August, 2020;
originally announced August 2020.
-
A Note on the Sum of Non-Identically Distributed Doubly Truncated Normal Distributions
Authors:
Hao Chen,
Lanshan Han,
Alvin Lim
Abstract:
It is proved that the sum of n independent but non-identically distributed doubly truncated Normal distributions converges in distribution to a Normal distribution. It is also shown how the result can be applied in estimating a constrained mixed effects model.
It is proved that the sum of n independent but non-identically distributed doubly truncated Normal distributions converges in distribution to a Normal distribution. It is also shown how the result can be applied in estimating a constrained mixed effects model.
△ Less
Submitted 28 July, 2021; v1 submitted 18 August, 2020;
originally announced August 2020.
-
Directed Graph Convolutional Network
Authors:
Zekun Tong,
Yuxuan Liang,
Changsheng Sun,
David S. Rosenblum,
Andrew Lim
Abstract:
Graph Convolutional Networks (GCNs) have been widely used due to their outstanding performance in processing graph-structured data. However, the undirected graphs limit their application scope. In this paper, we extend spectral-based graph convolution to directed graphs by using first- and second-order proximity, which can not only retain the connection properties of the directed graph, but also e…
▽ More
Graph Convolutional Networks (GCNs) have been widely used due to their outstanding performance in processing graph-structured data. However, the undirected graphs limit their application scope. In this paper, we extend spectral-based graph convolution to directed graphs by using first- and second-order proximity, which can not only retain the connection properties of the directed graph, but also expand the receptive field of the convolution operation. A new GCN model, called DGCN, is then designed to learn representations on the directed graph, leveraging both the first- and second-order proximity information. We empirically show the fact that GCNs working only with DGCNs can encode more useful information from graph and help achieve better performance when generalized to other models. Moreover, extensive experiments on citation networks and co-purchase datasets demonstrate the superiority of our model against the state-of-the-art methods.
△ Less
Submitted 29 April, 2020;
originally announced April 2020.
-
On Isometry Robustness of Deep 3D Point Cloud Models under Adversarial Attacks
Authors:
Yue Zhao,
Yuwei Wu,
Caihua Chen,
Andrew Lim
Abstract:
While deep learning in 3D domain has achieved revolutionary performance in many tasks, the robustness of these models has not been sufficiently studied or explored. Regarding the 3D adversarial samples, most existing works focus on manipulation of local points, which may fail to invoke the global geometry properties, like robustness under linear projection that preserves the Euclidean distance, i.…
▽ More
While deep learning in 3D domain has achieved revolutionary performance in many tasks, the robustness of these models has not been sufficiently studied or explored. Regarding the 3D adversarial samples, most existing works focus on manipulation of local points, which may fail to invoke the global geometry properties, like robustness under linear projection that preserves the Euclidean distance, i.e., isometry. In this work, we show that existing state-of-the-art deep 3D models are extremely vulnerable to isometry transformations. Armed with the Thompson Sampling, we develop a black-box attack with success rate over 95% on ModelNet40 data set. Incorporating with the Restricted Isometry Property, we propose a novel framework of white-box attack on top of spectral norm based perturbation. In contrast to previous works, our adversarial samples are experimentally shown to be strongly transferable. Evaluated on a sequence of prevailing 3D models, our white-box attack achieves success rates from 98.88% to 100%. It maintains a successful attack rate over 95% even within an imperceptible rotation range $[\pm 2.81^{\circ}]$.
△ Less
Submitted 10 March, 2020; v1 submitted 27 February, 2020;
originally announced February 2020.
-
Quadratic Surface Support Vector Machine with L1 Norm Regularization
Authors:
Ahmad Mousavi,
Zheming Gao,
Lanshan Han,
Alvin Lim
Abstract:
We propose $\ell_1$ norm regularized quadratic surface support vector machine models for binary classification in supervised learning. We establish their desired theoretical properties, including the existence and uniqueness of the optimal solution, reduction to the standard SVMs over (almost) linearly separable data sets, and detection of true sparsity pattern over (almost) quadratically separabl…
▽ More
We propose $\ell_1$ norm regularized quadratic surface support vector machine models for binary classification in supervised learning. We establish their desired theoretical properties, including the existence and uniqueness of the optimal solution, reduction to the standard SVMs over (almost) linearly separable data sets, and detection of true sparsity pattern over (almost) quadratically separable data sets if the penalty parameter of $\ell_1$ norm is large enough. We also demonstrate their promising practical efficiency by conducting various numerical experiments on both synthetic and publicly available benchmark data sets.
△ Less
Submitted 30 January, 2021; v1 submitted 22 August, 2019;
originally announced August 2019.
-
Deep Pattern of Time Series and Its Applications in Estimation, Forecasting, Fault Diagnosis and Target Tracking
Authors:
Shixiong Wang,
Chongshou Li,
Andrew Lim
Abstract:
The information contained in a time series is more than what the values themselves are. In this paper, the Time-variant Local Autocorrelated Polynomial model with Kalman filter is proposed to model the underlying dynamics of a time series (or signal) and mine the deep pattern of it, except estimating the instantaneous mean function (also known as trend function), including: (1) identifying and pre…
▽ More
The information contained in a time series is more than what the values themselves are. In this paper, the Time-variant Local Autocorrelated Polynomial model with Kalman filter is proposed to model the underlying dynamics of a time series (or signal) and mine the deep pattern of it, except estimating the instantaneous mean function (also known as trend function), including: (1) identifying and predicting the peak and valley values of a time series; (2) reporting and forecasting the current changing pattern (increasing or decreasing pattern of the trend, and how fast it changes). We will show that it is this deep pattern that allows us to make higher-accuracy estimation and forecasting for a time series, to easily detect the anomalies (faults) of a sensor, and to track a highly-maneuvering target.
△ Less
Submitted 18 December, 2019; v1 submitted 19 April, 2019;
originally announced April 2019.
-
Why Are the ARIMA and SARIMA not Sufficient
Authors:
Shixiong Wang,
Chongshou Li,
Andrew Lim
Abstract:
The autoregressive moving average (ARMA) model takes the significant position in time series analysis for a wide-sense stationary time series. The difference operator and seasonal difference operator, which are bases of ARIMA and SARIMA (Seasonal ARIMA), respectively, were introduced to remove the trend and seasonal component so that the original non-stationary time series could be transformed int…
▽ More
The autoregressive moving average (ARMA) model takes the significant position in time series analysis for a wide-sense stationary time series. The difference operator and seasonal difference operator, which are bases of ARIMA and SARIMA (Seasonal ARIMA), respectively, were introduced to remove the trend and seasonal component so that the original non-stationary time series could be transformed into a wide-sense stationary one, which could then be handled by Box-Jenkins methodology. However, such difference operators are more practical experiences than exact theories by now. In this paper, we investigate the power of the (resp. seasonal) difference operator from the perspective of spectral analysis, linear system theory and digital filtering, and point out the characteristics and limitations of (resp. seasonal) difference operator. Besides, the general method that transforms a non-stationary (the non-stationarity in the mean sense) stochastic process to be wide-sense stationary will be presented.
△ Less
Submitted 2 March, 2021; v1 submitted 16 April, 2019;
originally announced April 2019.
-
Beyond the EM Algorithm: Constrained Optimization Methods for Latent Class Model
Authors:
Hao Chen,
Lanshan Han,
Alvin Lim
Abstract:
Latent class model (LCM), which is a finite mixture of different categorical distributions, is one of the most widely used models in statistics and machine learning fields. Because of its non-continuous nature and the flexibility in shape, researchers in practice areas such as marketing and social sciences also frequently use LCM to gain insights from their data. One likelihood-based method, the E…
▽ More
Latent class model (LCM), which is a finite mixture of different categorical distributions, is one of the most widely used models in statistics and machine learning fields. Because of its non-continuous nature and the flexibility in shape, researchers in practice areas such as marketing and social sciences also frequently use LCM to gain insights from their data. One likelihood-based method, the Expectation-Maximization (EM) algorithm, is often used to obtain the model estimators. However, the EM algorithm is well-known for its notoriously slow convergence. In this research, we explore alternative likelihood-based methods that can potential remedy the slow convergence of the EM algorithm. More specifically, we regard likelihood-based approach as a constrained nonlinear optimization problem, and apply quasi-Newton type methods to solve them. We examine two different constrained optimization methods to maximize the log likelihood function. We present simulation study results to show that the proposed methods not only converge in less iterations than the EM algorithm but also produce more accurate model estimators.
△ Less
Submitted 19 May, 2020; v1 submitted 9 January, 2019;
originally announced January 2019.
-
Estimating Robot Strengths with Application to Selection of Alliance Members in FIRST Robotics Competitions
Authors:
Alejandro Lim,
Chin-Tsang Chiang,
Jen-Chieh Teng
Abstract:
Since the inception of the FIRST Robotics Competition (FRC) and its special playoff system, robotics teams have longed to appropriately quantify the strengths of their designed robots. The FRC includes a playground draft-like phase (alliance selection), arguably the most game-changing part of the competition, in which the top-8 robotics teams in a tournament based on the FRC's ranking system asses…
▽ More
Since the inception of the FIRST Robotics Competition (FRC) and its special playoff system, robotics teams have longed to appropriately quantify the strengths of their designed robots. The FRC includes a playground draft-like phase (alliance selection), arguably the most game-changing part of the competition, in which the top-8 robotics teams in a tournament based on the FRC's ranking system assess potential alliance members for the opportunity of partnering in a playoff stage. In such a three-versus-three competition, several measures and models have been used to characterize actual or relative robot strengths. However, existing models are found to have poor predictive performance due to their imprecise estimates of robot strengths caused by a small ratio of the number of observations to the number of robots. A more general regression model with latent clusters of robot strengths is, thus, proposed to enhance their predictive capacities. Two effective estimation procedures are further developed to simultaneously estimate the number of clusters, clusters of robots, and robot strengths. Meanwhile, some measures are used to assess the predictive ability of competing models, the agreement between published FRC measures of strength and model-based robot strengths of all, playoff, and FRC top-8 robots, and the agreement between FRC top-8 robots and model-based top robots. Moreover, the stability of estimated robot strengths and accuracies is investigated to determine whether the scheduled matches are excessive or insufficient. In the analysis of qualification data from the 2018 FRC Houston and Detroit championships, the predictive ability of our model is also shown to be significantly better than those of existing models. Teams who adopt the new model can now appropriately rank their preferences for playoff alliance partners with greater predictive capability than before.
△ Less
Submitted 12 January, 2021; v1 submitted 12 October, 2018;
originally announced October 2018.
-
Traffic Density Estimation using a Convolutional Neural Network
Authors:
Julian Nubert,
Nicholas Giai Truong,
Abel Lim,
Herbert Ilhan Tanujaya,
Leah Lim,
Mai Anh Vu
Abstract:
The goal of this project is to introduce and present a machine learning application that aims to improve the quality of life of people in Singapore. In particular, we investigate the use of machine learning solutions to tackle the problem of traffic congestion in Singapore. In layman's terms, we seek to make Singapore (or any other city) a smoother place. To accomplish this aim, we present an end-…
▽ More
The goal of this project is to introduce and present a machine learning application that aims to improve the quality of life of people in Singapore. In particular, we investigate the use of machine learning solutions to tackle the problem of traffic congestion in Singapore. In layman's terms, we seek to make Singapore (or any other city) a smoother place. To accomplish this aim, we present an end-to-end system comprising of 1. A traffic density estimation algorithm at traffic lights/junctions and 2. a suitable traffic signal control algorithms that make use of the density information for better traffic control. Traffic density estimation can be obtained from traffic junction images using various machine learning techniques (combined with CV tools). After research into various advanced machine learning methods, we decided on convolutional neural networks (CNNs). We conducted experiments on our algorithms, using the publicly available traffic camera dataset published by the Land Transport Authority (LTA) to demonstrate the feasibility of this approach. With these traffic density estimates, different traffic algorithms can be applied to minimize congestion at traffic junctions in general.
△ Less
Submitted 5 September, 2018;
originally announced September 2018.
-
Calibration of Distributionally Robust Empirical Optimization Models
Authors:
Jun-Ya Gotoh,
Michael Jong Kim,
Andrew E. B. Lim
Abstract:
We study the out-of-sample properties of robust empirical optimization problems with smooth $φ$-divergence penalties and smooth concave objective functions, and develop a theory for data-driven calibration of the non-negative "robustness parameter" $δ$ that controls the size of the deviations from the nominal model. Building on the intuition that robust optimization reduces the sensitivity of the…
▽ More
We study the out-of-sample properties of robust empirical optimization problems with smooth $φ$-divergence penalties and smooth concave objective functions, and develop a theory for data-driven calibration of the non-negative "robustness parameter" $δ$ that controls the size of the deviations from the nominal model. Building on the intuition that robust optimization reduces the sensitivity of the expected reward to errors in the model by controlling the spread of the reward distribution, we show that the first-order benefit of ``little bit of robustness" (i.e., $δ$ small, positive) is a significant reduction in the variance of the out-of-sample reward while the corresponding impact on the mean is almost an order of magnitude smaller. One implication is that substantial variance (sensitivity) reduction is possible at little cost if the robustness parameter is properly calibrated. To this end, we introduce the notion of a robust mean-variance frontier to select the robustness parameter and show that it can be approximated using resampling methods like the bootstrap. Our examples show that robust solutions resulting from "open loop" calibration methods (e.g., selecting a $90\%$ confidence level regardless of the data and objective function) can be very conservative out-of-sample, while those corresponding to the robustness parameter that optimizes an estimate of the out-of-sample expected reward (e.g., via the bootstrap) with no regard for the variance are often insufficiently robust.
△ Less
Submitted 18 May, 2020; v1 submitted 17 November, 2017;
originally announced November 2017.
-
Performance-based regularization in mean-CVaR portfolio optimization
Authors:
Noureddine El Karoui,
Andrew E. B. Lim,
Gah-Yi Vahn
Abstract:
We introduce performance-based regularization (PBR), a new approach to addressing estimation risk in data-driven optimization, to mean-CVaR portfolio optimization. We assume the available log-return data is iid, and detail the approach for two cases: nonparametric and parametric (the log-return distribution belongs in the elliptical family). The nonparametric PBR method penalizes portfolios with l…
▽ More
We introduce performance-based regularization (PBR), a new approach to addressing estimation risk in data-driven optimization, to mean-CVaR portfolio optimization. We assume the available log-return data is iid, and detail the approach for two cases: nonparametric and parametric (the log-return distribution belongs in the elliptical family). The nonparametric PBR method penalizes portfolios with large variability in mean and CVaR estimations. The parametric PBR method solves the empirical Markowitz problem instead of the empirical mean-CVaR problem, as the solutions of the Markowitz and mean-CVaR problems are equivalent when the log-return distribution is elliptical. We derive the asymptotic behavior of the nonparametric PBR solution, which leads to insight into the effect of penalization, and justification of the parametric PBR method. We also show via simulations that the PBR methods produce efficient frontiers that are, on average, closer to the population efficient frontier than the empirical approach to the mean-CVaR problem, with less variability.
△ Less
Submitted 26 March, 2012; v1 submitted 8 November, 2011;
originally announced November 2011.