Search | arXiv e-print repository

Estimating Policy Effects in a Social Network with Independent Set Sampling

Authors: Eugene Ang, Prasanta Bhattacharya, Andrew Lim

Abstract: Evaluating the impact of policy interventions on respondents who are embedded in a social network is often challenging due to the presence of network interference within the treatment groups, as well as between treatment and non-treatment groups throughout the network. In this paper, we propose a modeling strategy that combines existing work on stochastic actor-oriented models (SAOM) with a novel… ▽ More Evaluating the impact of policy interventions on respondents who are embedded in a social network is often challenging due to the presence of network interference within the treatment groups, as well as between treatment and non-treatment groups throughout the network. In this paper, we propose a modeling strategy that combines existing work on stochastic actor-oriented models (SAOM) with a novel network sampling method based on the identification of independent sets. By assigning respondents from an independent set to the treatment, we are able to block any spillover of the treatment and network influence, thereby allowing us to isolate the direct effect of the treatment from the indirect network-induced effects, in the immediate term. As a result, our method allows for the estimation of both the \textit{direct} as well as the \textit{net effect} of a chosen policy intervention, in the presence of network effects in the population. We perform a comparative simulation analysis to show that our proposed sampling technique leads to distinct direct and net effects of the policy, as well as significant network effects driven by policy-linked homophily. This study highlights the importance of network sampling techniques in improving policy evaluation studies and has the potential to help researchers and policymakers with better planning, designing, and anticipating policy responses in a networked society. △ Less

Submitted 25 February, 2024; v1 submitted 25 June, 2023; originally announced June 2023.

arXiv:2301.11308 [pdf, other]

Neural Continuous-Discrete State Space Models for Irregularly-Sampled Time Series

Authors: Abdul Fatir Ansari, Alvin Heng, Andre Lim, Harold Soh

Abstract: Learning accurate predictive models of real-world dynamic phenomena (e.g., climate, biological) remains a challenging task. One key issue is that the data generated by both natural and artificial processes often comprise time series that are irregularly sampled and/or contain missing observations. In this work, we propose the Neural Continuous-Discrete State Space Model (NCDSSM) for continuous-tim… ▽ More Learning accurate predictive models of real-world dynamic phenomena (e.g., climate, biological) remains a challenging task. One key issue is that the data generated by both natural and artificial processes often comprise time series that are irregularly sampled and/or contain missing observations. In this work, we propose the Neural Continuous-Discrete State Space Model (NCDSSM) for continuous-time modeling of time series through discrete-time observations. NCDSSM employs auxiliary variables to disentangle recognition from dynamics, thus requiring amortized inference only for the auxiliary variables. Leveraging techniques from continuous-discrete filtering theory, we demonstrate how to perform accurate Bayesian inference for the dynamic states. We propose three flexible parameterizations of the latent dynamics and an efficient training objective that marginalizes the dynamic states during inference. Empirical results on multiple benchmark datasets across various domains show improved imputation and forecasting performance of NCDSSM over existing models. △ Less

Submitted 18 June, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

Comments: ICML 2023 Camera Ready Version; Code available at https://github.com/clear-nus/NCDSSM

arXiv:2207.12956 [pdf, other]

An Effective Method for Identifying Clusters of Robot Strengths

Authors: Jen-Chieh Teng, Chin-Tsang Chiang, Alvin Lim

Abstract: In the analysis of qualification data from the FIRST Robotics Competition, the ratio of the number of observations to the number of parameters has been found to be quite small for the commonly used winning margin power rating (WMPR) model. This usually leads to imprecise estimates and inaccurate predictions in such a three-on-three game. With the finding of a clustering feature in estimated robot… ▽ More In the analysis of qualification data from the FIRST Robotics Competition, the ratio of the number of observations to the number of parameters has been found to be quite small for the commonly used winning margin power rating (WMPR) model. This usually leads to imprecise estimates and inaccurate predictions in such a three-on-three game. With the finding of a clustering feature in estimated robot strengths, a more flexible model with latent clusters of robots was proposed to alleviate overparameterization of the WMPR model. Since its structure can be regarded as a dimension reduction of the parameter space in the WMPR model, the identification of clusters of robot strengths is naturally transformed into a model selection problem. Instead of comparing a huge number of competing models, we develop an effective method to estimate the number of clusters, clusters of robots, and robot strengths. The new method consists of two parts: (i) a combination of hierarchical and non-hierarchical classifications to determine candidate models; and (ii) variant goodness-of-fit criteria to select optimal models. Different from existing hierarchical classification systems, each step of ours is based on estimated robot strengths from a candidate model in the preceding non-hierarchical classification step. A great advantage of the designed non-hierarchical classification system is to examine the possibility of reassigning robots to other cluster sets of robots. To reduce the overestimation of clusters by the mean squared prediction error criteria, the corresponding BIC are established as alternatives for model selection. By assembling these essential elements into a coherent whole, a systematic procedure is presented to perform the estimation. In addition, we propose two indices to measure the nested relation between cluster sets of two models and monotonic association between robot strengths of two models. △ Less

Submitted 13 November, 2022; v1 submitted 26 July, 2022; originally announced July 2022.

arXiv:2105.12342 [pdf, ps, other]

A data-driven approach to beating SAA out-of-sample

Authors: Jun-ya Gotoh, Michael Jong Kim, Andrew E. B. Lim

Abstract: While solutions of Distributionally Robust Optimization (DRO) problems can sometimes have a higher out-of-sample expected reward than the Sample Average Approximation (SAA), there is no guarantee. In this paper, we introduce a class of Distributionally Optimistic Optimization (DOO) models, and show that it is always possible to ``beat" SAA out-of-sample if we consider not just worst-case (DRO) mod… ▽ More While solutions of Distributionally Robust Optimization (DRO) problems can sometimes have a higher out-of-sample expected reward than the Sample Average Approximation (SAA), there is no guarantee. In this paper, we introduce a class of Distributionally Optimistic Optimization (DOO) models, and show that it is always possible to ``beat" SAA out-of-sample if we consider not just worst-case (DRO) models but also best-case (DOO) ones. We also show, however, that this comes at a cost: Optimistic solutions are more sensitive to model error than either worst-case or SAA optimizers, and hence are less robust and calibrating the worst- or best-case model to outperform SAA may be difficult when data is limited. △ Less

Submitted 11 June, 2023; v1 submitted 26 May, 2021; originally announced May 2021.

Comments: 25 pages, 2 page bibliography, 2 Figures, 12 page Appendix

MSC Class: 90C17; 90C31; 93B35; 90C47; 90B50; 62G35; 62K25;

arXiv:2104.07820 [pdf]

Machine Learning Approaches for Type 2 Diabetes Prediction and Care Management

Authors: Aloysius Lim, Ashish Singh, Jody Chiam, Carly Eckert, Vikas Kumar, Muhammad Aurangzeb Ahmad, Ankur Teredesai

Abstract: Prediction of diabetes and its various complications has been studied in a number of settings, but a comprehensive overview of problem setting for diabetes prediction and care management has not been addressed in the literature. In this document we seek to remedy this omission in literature with an encompassing overview of diabetes complication prediction as well as situating this problem in the c… ▽ More Prediction of diabetes and its various complications has been studied in a number of settings, but a comprehensive overview of problem setting for diabetes prediction and care management has not been addressed in the literature. In this document we seek to remedy this omission in literature with an encompassing overview of diabetes complication prediction as well as situating this problem in the context of real world healthcare management. We illustrate various problems encountered in real world clinical scenarios via our own experience with building and deploying such models. In this manuscript we illustrate a Machine Learning (ML) framework for addressing the problem of predicting Type 2 Diabetes Mellitus (T2DM) together with a solution for risk stratification, intervention and management. These ML models align with how physicians think about disease management and mitigation, which comprises these four steps: Identify, Stratify, Engage, Measure. △ Less

Submitted 28 April, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

arXiv:2101.03663 [pdf, other]

Marketing Mix Optimization with Practical Constraints

Authors: Hsin-Chan Huang, Jiefeng Xu, Alvin Lim

Abstract: In this paper, we address a variant of the marketing mix optimization (MMO) problem which is commonly encountered in many industries, e.g., retail and consumer packaged goods (CPG) industries. This problem requires the spend for each marketing activity, if adjusted, be changed by a non-negligible degree (minimum change) and also the total number of activities with spend change be limited (maximum… ▽ More In this paper, we address a variant of the marketing mix optimization (MMO) problem which is commonly encountered in many industries, e.g., retail and consumer packaged goods (CPG) industries. This problem requires the spend for each marketing activity, if adjusted, be changed by a non-negligible degree (minimum change) and also the total number of activities with spend change be limited (maximum number of changes). With these two additional practical requirements, the original resource allocation problem is formulated as a mixed integer nonlinear program (MINLP). Given the size of a realistic problem in the industrial setting, the state-of-the-art integer programming solvers may not be able to solve the problem to optimality in a straightforward way within a reasonable amount of time. Hence, we propose a systematic reformulation to ease the computational burden. Computational tests show significant improvements in the solution process. △ Less

Submitted 10 January, 2021; originally announced January 2021.

MSC Class: 90C26; 90C27 ACM Class: I.6

arXiv:2011.04538 [pdf, other]

doi 10.1080/03610918.2022.2066696

Estimating Linear Mixed Effects Models with Truncated Normally Distributed Random Effects

Authors: Hao Chen, Lanshan Han, Alvin Lim

Abstract: Linear Mixed Effects (LME) models have been widely applied in clustered data analysis in many areas including marketing research, clinical trials, and biomedical studies. Inference can be conducted using maximum likelihood approach if assuming Normal distributions on the random effects. However, in many applications of economy, business and medicine, it is often essential to impose constraints on… ▽ More Linear Mixed Effects (LME) models have been widely applied in clustered data analysis in many areas including marketing research, clinical trials, and biomedical studies. Inference can be conducted using maximum likelihood approach if assuming Normal distributions on the random effects. However, in many applications of economy, business and medicine, it is often essential to impose constraints on the regression parameters after taking their real-world interpretations into account. Therefore, in this paper we extend the classical (unconstrained) LME models to allow for sign constraints on its overall coefficients. We propose to assume a symmetric doubly truncated Normal (SDTN) distribution on the random effects instead of the unconstrained Normal distribution which is often found in classical literature. With the aforementioned change, difficulty has dramatically increased as the exact distribution of the dependent variable becomes analytically intractable. We then develop likelihood-based approaches to estimate the unknown model parameters utilizing the approximation of its exact distribution. Simulation studies have shown that the proposed constrained model not only improves real-world interpretations of results, but also achieves satisfactory performance on model fits as compared to the existing model. △ Less

Submitted 30 July, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

arXiv:2010.10794 [pdf, other]

Worst-case sensitivity

Authors: Jun-ya Gotoh, Michael Jong Kim, Andrew E. B. Lim

Abstract: We introduce the notion of Worst-Case Sensitivity, defined as the worst-case rate of increase in the expected cost of a Distributionally Robust Optimization (DRO) model when the size of the uncertainty set vanishes. We show that worst-case sensitivity is a Generalized Measure of Deviation and that a large class of DRO models are essentially mean-(worst-case) sensitivity problems when uncertainty s… ▽ More We introduce the notion of Worst-Case Sensitivity, defined as the worst-case rate of increase in the expected cost of a Distributionally Robust Optimization (DRO) model when the size of the uncertainty set vanishes. We show that worst-case sensitivity is a Generalized Measure of Deviation and that a large class of DRO models are essentially mean-(worst-case) sensitivity problems when uncertainty sets are small, unifying recent results on the relationship between DRO and regularized empirical optimization with worst-case sensitivity playing the role of the regularizer. More generally, DRO solutions can be sensitive to the family and size of the uncertainty set, and reflect the properties of its worst-case sensitivity. We derive closed-form expressions of worst-case sensitivity for well known uncertainty sets including smooth $φ$-divergence, total variation, "budgeted" uncertainty sets, uncertainty sets corresponding to a convex combination of expected value and CVaR, and the Wasserstein metric. These can be used to select the uncertainty set and its size for a given application. △ Less

Submitted 21 October, 2020; originally announced October 2020.

Comments: 27 Pages + 11 page Appendix, 4 Figures

MSC Class: 90C17; 90B35; 90B99; 90C15; 90C99

arXiv:2009.10619 [pdf, other]

An Exponential Factorization Machine with Percentage Error Minimization to Retail Sales Forecasting

Authors: Chongshou Li, Brenda Cheang, Zhixing Luo, Andrew Lim

Abstract: This paper proposes a new approach to sales forecasting for new products with long lead time but short product life cycle. These SKUs are usually sold for one season only, without any replenishments. An exponential factorization machine (EFM) sales forecast model is developed to solve this problem which not only considers SKU attributes, but also pairwise interactions. The EFM model is significant… ▽ More This paper proposes a new approach to sales forecasting for new products with long lead time but short product life cycle. These SKUs are usually sold for one season only, without any replenishments. An exponential factorization machine (EFM) sales forecast model is developed to solve this problem which not only considers SKU attributes, but also pairwise interactions. The EFM model is significantly different from the original Factorization Machines (FM) from two-fold: (1) the attribute-level formulation for explanatory variables and (2) exponential formulation for the positive response variable. The attribute-level formation excludes infeasible intra-attribute interactions and results in more efficient feature engineering comparing with the conventional one-hot encoding, while the exponential formulation is demonstrated more effective than the log-transformation for the positive but not skewed distributed responses. In order to estimate the parameters, percentage error squares (PES) and error squares (ES) are minimized by a proposed adaptive batch gradient descent method over the training set. Real-world data provided by a footwear retailer in Singapore is used for testing the proposed approach. The forecasting performance in terms of both mean absolute percentage error (MAPE) and mean absolute error (MAE) compares favourably with not only off-the-shelf models but also results reported by extant sales and demand forecasting studies. The effectiveness of the proposed approach is also demonstrated by two external public datasets. Moreover, we prove the theoretical relationships between PES and ES minimization, and present an important property of the PES minimization for regression models; that it trains models to underestimate data. This property fits the situation of sales forecasting where unit-holding cost is much greater than the unit-shortage cost. △ Less

Submitted 22 September, 2020; originally announced September 2020.

Comments: Accepted by ACM Transactions on Knowledge Discovery from Data (ACM TKDD)

Journal ref: ACM Transactions on Knowledge Discovery from Data 2020

arXiv:2008.12802 [pdf, other]

doi 10.1080/02664763.2021.1946020

Hierarchical Marketing Mix Models with Sign Constraints

Authors: Hao Chen, Minguang Zhang, Lanshan Han, Alvin Lim

Abstract: Marketing mix models (MMMs) are statistical models for measuring the effectiveness of various marketing activities such as promotion, media advertisement, etc. In this research, we propose a comprehensive marketing mix model that captures the hierarchical structure and the carryover, shape and scale effects of certain marketing activities, as well as sign restrictions on certain coefficients that… ▽ More Marketing mix models (MMMs) are statistical models for measuring the effectiveness of various marketing activities such as promotion, media advertisement, etc. In this research, we propose a comprehensive marketing mix model that captures the hierarchical structure and the carryover, shape and scale effects of certain marketing activities, as well as sign restrictions on certain coefficients that are consistent with common business sense. In contrast to commonly adopted approaches in practice, which estimate parameters in a multi-stage process, the proposed approach estimates all the unknown parameters/coefficients simultaneously using a constrained maximum likelihood approach and solved with the Hamiltonian Monte Carlo algorithm. We present results on real datasets to illustrate the use of the proposed solution algorithm. △ Less

Submitted 28 August, 2020; originally announced August 2020.

Journal ref: Journal of Applied Statistics (2021)

arXiv:2008.07954 [pdf, ps, other]

A Note on the Sum of Non-Identically Distributed Doubly Truncated Normal Distributions

Authors: Hao Chen, Lanshan Han, Alvin Lim

Abstract: It is proved that the sum of n independent but non-identically distributed doubly truncated Normal distributions converges in distribution to a Normal distribution. It is also shown how the result can be applied in estimating a constrained mixed effects model. It is proved that the sum of n independent but non-identically distributed doubly truncated Normal distributions converges in distribution to a Normal distribution. It is also shown how the result can be applied in estimating a constrained mixed effects model. △ Less

Submitted 28 July, 2021; v1 submitted 18 August, 2020; originally announced August 2020.

arXiv:2004.13970 [pdf, other]

Directed Graph Convolutional Network

Authors: Zekun Tong, Yuxuan Liang, Changsheng Sun, David S. Rosenblum, Andrew Lim

Abstract: Graph Convolutional Networks (GCNs) have been widely used due to their outstanding performance in processing graph-structured data. However, the undirected graphs limit their application scope. In this paper, we extend spectral-based graph convolution to directed graphs by using first- and second-order proximity, which can not only retain the connection properties of the directed graph, but also e… ▽ More Graph Convolutional Networks (GCNs) have been widely used due to their outstanding performance in processing graph-structured data. However, the undirected graphs limit their application scope. In this paper, we extend spectral-based graph convolution to directed graphs by using first- and second-order proximity, which can not only retain the connection properties of the directed graph, but also expand the receptive field of the convolution operation. A new GCN model, called DGCN, is then designed to learn representations on the directed graph, leveraging both the first- and second-order proximity information. We empirically show the fact that GCNs working only with DGCNs can encode more useful information from graph and help achieve better performance when generalized to other models. Moreover, extensive experiments on citation networks and co-purchase datasets demonstrate the superiority of our model against the state-of-the-art methods. △ Less

Submitted 29 April, 2020; originally announced April 2020.

arXiv:2002.12222 [pdf, ps, other]

On Isometry Robustness of Deep 3D Point Cloud Models under Adversarial Attacks

Authors: Yue Zhao, Yuwei Wu, Caihua Chen, Andrew Lim

Abstract: While deep learning in 3D domain has achieved revolutionary performance in many tasks, the robustness of these models has not been sufficiently studied or explored. Regarding the 3D adversarial samples, most existing works focus on manipulation of local points, which may fail to invoke the global geometry properties, like robustness under linear projection that preserves the Euclidean distance, i.… ▽ More While deep learning in 3D domain has achieved revolutionary performance in many tasks, the robustness of these models has not been sufficiently studied or explored. Regarding the 3D adversarial samples, most existing works focus on manipulation of local points, which may fail to invoke the global geometry properties, like robustness under linear projection that preserves the Euclidean distance, i.e., isometry. In this work, we show that existing state-of-the-art deep 3D models are extremely vulnerable to isometry transformations. Armed with the Thompson Sampling, we develop a black-box attack with success rate over 95% on ModelNet40 data set. Incorporating with the Restricted Isometry Property, we propose a novel framework of white-box attack on top of spectral norm based perturbation. In contrast to previous works, our adversarial samples are experimentally shown to be strongly transferable. Evaluated on a sequence of prevailing 3D models, our white-box attack achieves success rates from 98.88% to 100%. It maintains a successful attack rate over 95% even within an imperceptible rotation range $[\pm 2.81^{\circ}]$. △ Less

Submitted 10 March, 2020; v1 submitted 27 February, 2020; originally announced February 2020.

Comments: This paper was accepted for presentation at CVPR2020

arXiv:1908.08616 [pdf, other]

doi 10.3934/jimo.2021046

Quadratic Surface Support Vector Machine with L1 Norm Regularization

Authors: Ahmad Mousavi, Zheming Gao, Lanshan Han, Alvin Lim

Abstract: We propose $\ell_1$ norm regularized quadratic surface support vector machine models for binary classification in supervised learning. We establish their desired theoretical properties, including the existence and uniqueness of the optimal solution, reduction to the standard SVMs over (almost) linearly separable data sets, and detection of true sparsity pattern over (almost) quadratically separabl… ▽ More We propose $\ell_1$ norm regularized quadratic surface support vector machine models for binary classification in supervised learning. We establish their desired theoretical properties, including the existence and uniqueness of the optimal solution, reduction to the standard SVMs over (almost) linearly separable data sets, and detection of true sparsity pattern over (almost) quadratically separable data sets if the penalty parameter of $\ell_1$ norm is large enough. We also demonstrate their promising practical efficiency by conducting various numerical experiments on both synthetic and publicly available benchmark data sets. △ Less

Submitted 30 January, 2021; v1 submitted 22 August, 2019; originally announced August 2019.

arXiv:1904.09245 [pdf, other]

doi 10.1109/TIM.2021.3059321

Deep Pattern of Time Series and Its Applications in Estimation, Forecasting, Fault Diagnosis and Target Tracking

Authors: Shixiong Wang, Chongshou Li, Andrew Lim

Abstract: The information contained in a time series is more than what the values themselves are. In this paper, the Time-variant Local Autocorrelated Polynomial model with Kalman filter is proposed to model the underlying dynamics of a time series (or signal) and mine the deep pattern of it, except estimating the instantaneous mean function (also known as trend function), including: (1) identifying and pre… ▽ More The information contained in a time series is more than what the values themselves are. In this paper, the Time-variant Local Autocorrelated Polynomial model with Kalman filter is proposed to model the underlying dynamics of a time series (or signal) and mine the deep pattern of it, except estimating the instantaneous mean function (also known as trend function), including: (1) identifying and predicting the peak and valley values of a time series; (2) reporting and forecasting the current changing pattern (increasing or decreasing pattern of the trend, and how fast it changes). We will show that it is this deep pattern that allows us to make higher-accuracy estimation and forecasting for a time series, to easily detect the anomalies (faults) of a sensor, and to track a highly-maneuvering target. △ Less

Submitted 18 December, 2019; v1 submitted 19 April, 2019; originally announced April 2019.

Journal ref: Published in the IEEE Transactions on Instrumentation and Measurement in Feb 2021, with the adapted version

arXiv:1904.07632 [pdf, other]

Why Are the ARIMA and SARIMA not Sufficient

Authors: Shixiong Wang, Chongshou Li, Andrew Lim

Abstract: The autoregressive moving average (ARMA) model takes the significant position in time series analysis for a wide-sense stationary time series. The difference operator and seasonal difference operator, which are bases of ARIMA and SARIMA (Seasonal ARIMA), respectively, were introduced to remove the trend and seasonal component so that the original non-stationary time series could be transformed int… ▽ More The autoregressive moving average (ARMA) model takes the significant position in time series analysis for a wide-sense stationary time series. The difference operator and seasonal difference operator, which are bases of ARIMA and SARIMA (Seasonal ARIMA), respectively, were introduced to remove the trend and seasonal component so that the original non-stationary time series could be transformed into a wide-sense stationary one, which could then be handled by Box-Jenkins methodology. However, such difference operators are more practical experiences than exact theories by now. In this paper, we investigate the power of the (resp. seasonal) difference operator from the perspective of spectral analysis, linear system theory and digital filtering, and point out the characteristics and limitations of (resp. seasonal) difference operator. Besides, the general method that transforms a non-stationary (the non-stationarity in the mean sense) stochastic process to be wide-sense stationary will be presented. △ Less

Submitted 2 March, 2021; v1 submitted 16 April, 2019; originally announced April 2019.

arXiv:1901.02928 [pdf, other]

doi 10.1080/03610918.2020.1764034

Beyond the EM Algorithm: Constrained Optimization Methods for Latent Class Model

Authors: Hao Chen, Lanshan Han, Alvin Lim

Abstract: Latent class model (LCM), which is a finite mixture of different categorical distributions, is one of the most widely used models in statistics and machine learning fields. Because of its non-continuous nature and the flexibility in shape, researchers in practice areas such as marketing and social sciences also frequently use LCM to gain insights from their data. One likelihood-based method, the E… ▽ More Latent class model (LCM), which is a finite mixture of different categorical distributions, is one of the most widely used models in statistics and machine learning fields. Because of its non-continuous nature and the flexibility in shape, researchers in practice areas such as marketing and social sciences also frequently use LCM to gain insights from their data. One likelihood-based method, the Expectation-Maximization (EM) algorithm, is often used to obtain the model estimators. However, the EM algorithm is well-known for its notoriously slow convergence. In this research, we explore alternative likelihood-based methods that can potential remedy the slow convergence of the EM algorithm. More specifically, we regard likelihood-based approach as a constrained nonlinear optimization problem, and apply quasi-Newton type methods to solve them. We examine two different constrained optimization methods to maximize the log likelihood function. We present simulation study results to show that the proposed methods not only converge in less iterations than the EM algorithm but also produce more accurate model estimators. △ Less

Submitted 19 May, 2020; v1 submitted 9 January, 2019; originally announced January 2019.

arXiv:1810.05763 [pdf, other]

Estimating Robot Strengths with Application to Selection of Alliance Members in FIRST Robotics Competitions

Authors: Alejandro Lim, Chin-Tsang Chiang, Jen-Chieh Teng

Abstract: Since the inception of the FIRST Robotics Competition (FRC) and its special playoff system, robotics teams have longed to appropriately quantify the strengths of their designed robots. The FRC includes a playground draft-like phase (alliance selection), arguably the most game-changing part of the competition, in which the top-8 robotics teams in a tournament based on the FRC's ranking system asses… ▽ More Since the inception of the FIRST Robotics Competition (FRC) and its special playoff system, robotics teams have longed to appropriately quantify the strengths of their designed robots. The FRC includes a playground draft-like phase (alliance selection), arguably the most game-changing part of the competition, in which the top-8 robotics teams in a tournament based on the FRC's ranking system assess potential alliance members for the opportunity of partnering in a playoff stage. In such a three-versus-three competition, several measures and models have been used to characterize actual or relative robot strengths. However, existing models are found to have poor predictive performance due to their imprecise estimates of robot strengths caused by a small ratio of the number of observations to the number of robots. A more general regression model with latent clusters of robot strengths is, thus, proposed to enhance their predictive capacities. Two effective estimation procedures are further developed to simultaneously estimate the number of clusters, clusters of robots, and robot strengths. Meanwhile, some measures are used to assess the predictive ability of competing models, the agreement between published FRC measures of strength and model-based robot strengths of all, playoff, and FRC top-8 robots, and the agreement between FRC top-8 robots and model-based top robots. Moreover, the stability of estimated robot strengths and accuracies is investigated to determine whether the scheduled matches are excessive or insufficient. In the analysis of qualification data from the 2018 FRC Houston and Detroit championships, the predictive ability of our model is also shown to be significantly better than those of existing models. Teams who adopt the new model can now appropriately rank their preferences for playoff alliance partners with greater predictive capability than before. △ Less

Submitted 12 January, 2021; v1 submitted 12 October, 2018; originally announced October 2018.

Comments: 16 pages, 6 tables

MSC Class: 62J05; 62-08; 62P30; 62R07 ACM Class: G.3; I.2; I.6

arXiv:1809.01564 [pdf, other]

Traffic Density Estimation using a Convolutional Neural Network

Authors: Julian Nubert, Nicholas Giai Truong, Abel Lim, Herbert Ilhan Tanujaya, Leah Lim, Mai Anh Vu

Abstract: The goal of this project is to introduce and present a machine learning application that aims to improve the quality of life of people in Singapore. In particular, we investigate the use of machine learning solutions to tackle the problem of traffic congestion in Singapore. In layman's terms, we seek to make Singapore (or any other city) a smoother place. To accomplish this aim, we present an end-… ▽ More The goal of this project is to introduce and present a machine learning application that aims to improve the quality of life of people in Singapore. In particular, we investigate the use of machine learning solutions to tackle the problem of traffic congestion in Singapore. In layman's terms, we seek to make Singapore (or any other city) a smoother place. To accomplish this aim, we present an end-to-end system comprising of 1. A traffic density estimation algorithm at traffic lights/junctions and 2. a suitable traffic signal control algorithms that make use of the density information for better traffic control. Traffic density estimation can be obtained from traffic junction images using various machine learning techniques (combined with CV tools). After research into various advanced machine learning methods, we decided on convolutional neural networks (CNNs). We conducted experiments on our algorithms, using the publicly available traffic camera dataset published by the Land Transport Authority (LTA) to demonstrate the feasibility of this approach. With these traffic density estimates, different traffic algorithms can be applied to minimize congestion at traffic junctions in general. △ Less

Submitted 5 September, 2018; originally announced September 2018.

Comments: Machine Learning Project National University of Singapore. 6 pages, 5 figures

arXiv:1711.06565 [pdf, ps, other]

Calibration of Distributionally Robust Empirical Optimization Models

Authors: Jun-Ya Gotoh, Michael Jong Kim, Andrew E. B. Lim

Abstract: We study the out-of-sample properties of robust empirical optimization problems with smooth $φ$-divergence penalties and smooth concave objective functions, and develop a theory for data-driven calibration of the non-negative "robustness parameter" $δ$ that controls the size of the deviations from the nominal model. Building on the intuition that robust optimization reduces the sensitivity of the… ▽ More We study the out-of-sample properties of robust empirical optimization problems with smooth $φ$-divergence penalties and smooth concave objective functions, and develop a theory for data-driven calibration of the non-negative "robustness parameter" $δ$ that controls the size of the deviations from the nominal model. Building on the intuition that robust optimization reduces the sensitivity of the expected reward to errors in the model by controlling the spread of the reward distribution, we show that the first-order benefit of ``little bit of robustness" (i.e., $δ$ small, positive) is a significant reduction in the variance of the out-of-sample reward while the corresponding impact on the mean is almost an order of magnitude smaller. One implication is that substantial variance (sensitivity) reduction is possible at little cost if the robustness parameter is properly calibrated. To this end, we introduce the notion of a robust mean-variance frontier to select the robustness parameter and show that it can be approximated using resampling methods like the bootstrap. Our examples show that robust solutions resulting from "open loop" calibration methods (e.g., selecting a $90\%$ confidence level regardless of the data and objective function) can be very conservative out-of-sample, while those corresponding to the robustness parameter that optimizes an estimate of the out-of-sample expected reward (e.g., via the bootstrap) with no regard for the variance are often insufficiently robust. △ Less

Submitted 18 May, 2020; v1 submitted 17 November, 2017; originally announced November 2017.

Comments: 51 pages

arXiv:1111.2091 [pdf, ps, other]

Performance-based regularization in mean-CVaR portfolio optimization

Authors: Noureddine El Karoui, Andrew E. B. Lim, Gah-Yi Vahn

Abstract: We introduce performance-based regularization (PBR), a new approach to addressing estimation risk in data-driven optimization, to mean-CVaR portfolio optimization. We assume the available log-return data is iid, and detail the approach for two cases: nonparametric and parametric (the log-return distribution belongs in the elliptical family). The nonparametric PBR method penalizes portfolios with l… ▽ More We introduce performance-based regularization (PBR), a new approach to addressing estimation risk in data-driven optimization, to mean-CVaR portfolio optimization. We assume the available log-return data is iid, and detail the approach for two cases: nonparametric and parametric (the log-return distribution belongs in the elliptical family). The nonparametric PBR method penalizes portfolios with large variability in mean and CVaR estimations. The parametric PBR method solves the empirical Markowitz problem instead of the empirical mean-CVaR problem, as the solutions of the Markowitz and mean-CVaR problems are equivalent when the log-return distribution is elliptical. We derive the asymptotic behavior of the nonparametric PBR solution, which leads to insight into the effect of penalization, and justification of the parametric PBR method. We also show via simulations that the PBR methods produce efficient frontiers that are, on average, closer to the population efficient frontier than the empirical approach to the mean-CVaR problem, with less variability. △ Less

Submitted 26 March, 2012; v1 submitted 8 November, 2011; originally announced November 2011.

MSC Class: 90C20; 62P05 (Primary) 90C90; 91B30 (Secondary)

Showing 1–21 of 21 results for author: Lim, A