Skip to main content

Showing 1–35 of 35 results for author: Tan, L

Searching in archive stat. Search in all archives.
.
  1. arXiv:2306.02813  [pdf, other

    stat.ME

    Variational inference based on a subclass of closed skew normals

    Authors: Linda S. L. Tan

    Abstract: Gaussian distributions are widely used in Bayesian variational inference to approximate intractable posterior densities, but the ability to accommodate skewness can improve approximation accuracy significantly, especially when data or prior information is scarce. We study the properties of a subclass of closed skew normals constructed using affine transformation of independent standardized univari… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: keywords: Closed skew normal; Gaussian variational approximation; natural gradient; centered parametrization; LU decomposition

  2. arXiv:2305.05529  [pdf, other

    stat.CO cs.LG math.PR math.ST stat.ML

    Accelerate Langevin Sampling with Birth-Death process and Exploration Component

    Authors: Lezhi Tan, Jianfeng Lu

    Abstract: Sampling a probability distribution with known likelihood is a fundamental task in computational science and engineering. Aiming at multimodality, we propose a new sampling method that takes advantage of both birth-death process and exploration component. The main idea of this method is \textit{look before you leap}. We keep two sets of samplers, one at warmer temperature and one at original tempe… ▽ More

    Submitted 6 May, 2023; originally announced May 2023.

    Comments: 23 pages, 10 figures

  3. arXiv:2303.16208  [pdf, ps, other

    stat.ML cs.CC cs.DS cs.LG

    Lifting uniform learners via distributional decomposition

    Authors: Guy Blanc, Jane Lange, Ali Malik, Li-Yang Tan

    Abstract: We show how any PAC learning algorithm that works under the uniform distribution can be transformed, in a blackbox fashion, into one that works under an arbitrary and unknown distribution $\mathcal{D}$. The efficiency of our transformation scales with the inherent complexity of $\mathcal{D}$, running in $\mathrm{poly}(n, (md)^d)$ time for distributions over $\{\pm 1\}^n$ whose pmfs are computed by… ▽ More

    Submitted 29 March, 2023; v1 submitted 27 March, 2023; originally announced March 2023.

    Comments: To appear in STOC 2023

  4. arXiv:2302.10175  [pdf, other

    q-fin.PM cs.LG q-fin.TR stat.ML

    Spatio-Temporal Momentum: Jointly Learning Time-Series and Cross-Sectional Strategies

    Authors: Wee Ling Tan, Stephen Roberts, Stefan Zohren

    Abstract: We introduce Spatio-Temporal Momentum strategies, a class of models that unify both time-series and cross-sectional momentum strategies by trading assets based on their cross-sectional momentum features over time. While both time-series and cross-sectional momentum strategies are designed to systematically capture momentum risk premia, these strategies are regarded as distinct implementations and… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

    Journal ref: The Journal of Financial Data Science, Summer 2023

  5. arXiv:2210.10566  [pdf, other

    stat.ME

    Second order stochastic gradient update for Cholesky factor in Gaussian variational approximation from Stein's Lemma

    Authors: Linda S. L. Tan

    Abstract: In stochastic variational inference, use of the reparametrization trick for the multivariate Gaussian gives rise to efficient updates for the mean and Cholesky factor of the covariance matrix, which depend on the first order derivative of the log joint model density. In this article, we show that an alternative unbiased gradient estimate for the Cholesky factor which depends on the second order de… ▽ More

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: 15 pages, 2 figures

  6. arXiv:2206.14431  [pdf, other

    cs.DS cs.LG stat.ML

    Open Problem: Properly learning decision trees in polynomial time?

    Authors: Guy Blanc, Jane Lange, Mingda Qiao, Li-Yang Tan

    Abstract: The authors recently gave an $n^{O(\log\log n)}$ time membership query algorithm for properly learning decision trees under the uniform distribution (Blanc et al., 2021). The previous fastest algorithm for this problem ran in $n^{O(\log n)}$ time, a consequence of Ehrenfeucht and Haussler (1989)'s classic algorithm for the distribution-free setting. In this article we highlight the natural open pr… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: 5 pages, to appear at the Open Problem sessions at COLT 2022

  7. arXiv:2109.00375  [pdf, other

    stat.CO

    Analytic natural gradient updates for Cholesky factor in Gaussian variational approximation

    Authors: Linda S. L. Tan

    Abstract: Natural gradients can improve convergence in stochastic variational inference significantly but inverting the Fisher information matrix is daunting in high dimensions. Moreover, in Gaussian variational approximation, natural gradient updates of the precision matrix do not ensure positive definiteness. To tackle this issue, we derive analytic natural gradient updates of the Cholesky factor of the c… ▽ More

    Submitted 19 May, 2024; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: 47 pages, 10 figures

  8. arXiv:2107.00819  [pdf, other

    cs.LG cs.DS stat.ML

    Decision tree heuristics can fail, even in the smoothed setting

    Authors: Guy Blanc, Jane Lange, Mingda Qiao, Li-Yang Tan

    Abstract: Greedy decision tree learning heuristics are mainstays of machine learning practice, but theoretical justification for their empirical success remains elusive. In fact, it has long been known that there are simple target functions for which they fail badly (Kearns and Mansour, STOC 1996). Recent work of Brutzkus, Daniely, and Malach (COLT 2020) considered the smoothed analysis model as a possibl… ▽ More

    Submitted 2 July, 2021; originally announced July 2021.

    Comments: To appear in RANDOM 2021

  9. arXiv:2105.03594  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Learning stochastic decision trees

    Authors: Guy Blanc, Jane Lange, Li-Yang Tan

    Abstract: We give a quasipolynomial-time algorithm for learning stochastic decision trees that is optimally resilient to adversarial noise. Given an $η$-corrupted set of uniform random samples labeled by a size-$s$ stochastic decision tree, our algorithm runs in time $n^{O(\log(s/\varepsilon)/\varepsilon^2)}$ and returns a hypothesis with error within an additive $2η+ \varepsilon$ of the Bayes optimal. An a… ▽ More

    Submitted 8 May, 2021; originally announced May 2021.

    Comments: To appear in ICALP 2021

  10. arXiv:2101.07392  [pdf

    stat.AP

    Powering population health research: Considerations for plausible and actionable effect sizes

    Authors: Ellicott C. Matthay, Erin Hagan, Laura M. Gottlieb, May Lynn Tan, David Vlahov, Nancy Adler, M. Maria Glymour

    Abstract: Evidence for Action (E4A), a signature program of the Robert Wood Johnson Foundation, funds investigator-initiated research on the impacts of social programs and policies on population health and health inequities. Across thousands of letters of intent and full proposals E4A has received since 2015, one of the most common methodological challenges faced by applicants is selecting realistic effect… ▽ More

    Submitted 18 January, 2021; originally announced January 2021.

    Comments: 24 pages, 1 figure

  11. arXiv:2010.08633  [pdf, ps, other

    cs.LG cs.DS stat.ML

    Universal guarantees for decision tree induction via a higher-order splitting criterion

    Authors: Guy Blanc, Neha Gupta, Jane Lange, Li-Yang Tan

    Abstract: We propose a simple extension of top-down decision tree learning heuristics such as ID3, C4.5, and CART. Our algorithm achieves provable guarantees for all target functions $f: \{-1,1\}^n \to \{-1,1\}$ with respect to the uniform distribution, circumventing impossibility results showing that existing heuristics fare poorly even for simple target functions. The crux of our extension is a new splitt… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

  12. arXiv:2007.07176  [pdf, other

    cs.LG stat.ML

    Robustifying Reinforcement Learning Agents via Action Space Adversarial Training

    Authors: Kai Liang Tan, Yasaman Esfandiari, Xian Yeow Lee, Aakanksha, Soumik Sarkar

    Abstract: Adoption of machine learning (ML)-enabled cyber-physical systems (CPS) are becoming prevalent in various sectors of modern society such as transportation, industrial, and power grids. Recent studies in deep reinforcement learning (DRL) have demonstrated its benefits in a large variety of data-driven decisions and control applications. As reliance on ML-enabled systems grows, it is imperative to st… ▽ More

    Submitted 14 July, 2020; originally announced July 2020.

    Comments: Accepted for publication in American Control Conference 2020, 6 Pages

  13. arXiv:1912.01203  [pdf

    cs.LG eess.AS stat.ML

    Music Style Classification with Compared Methods in XGB and BPNN

    Authors: Lifeng Tan, Cong **, Zhiyuan Cheng, Xin Lv, Leiyu Song

    Abstract: Scientists have used many different classification methods to solve the problem of music classification. But the efficiency of each classification is different. In this paper, we propose two compared methods on the task of music style classification. More specifically, feature extraction for representing timbral texture, rhythmic content and pitch content are proposed. Comparative evaluations on p… ▽ More

    Submitted 3 December, 2019; originally announced December 2019.

    Comments: 5 pages, 1 figures

  14. arXiv:1909.02583  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Spatiotemporally Constrained Action Space Attacks on Deep Reinforcement Learning Agents

    Authors: Xian Yeow Lee, Sambit Ghadai, Kai Liang Tan, Chinmay Hegde, Soumik Sarkar

    Abstract: Robustness of Deep Reinforcement Learning (DRL) algorithms towards adversarial attacks in real world applications such as those deployed in cyber-physical systems (CPS) are of increasing concern. Numerous studies have investigated the mechanisms of attacks on the RL agent's state space. Nonetheless, attacks on the RL agent's action space (AS) (corresponding to actuators in engineering systems) are… ▽ More

    Submitted 18 November, 2019; v1 submitted 5 September, 2019; originally announced September 2019.

    Comments: Version 2 with supplementary materials

  15. arXiv:1905.13409  [pdf, other

    cs.LG cs.CR stat.ML

    Bypassing Backdoor Detection Algorithms in Deep Learning

    Authors: Te Juin Lester Tan, Reza Shokri

    Abstract: Deep learning models are vulnerable to various adversarial manipulations of their training data, parameters, and input sample. In particular, an adversary can modify the training data and model parameters to embed backdoors into the model, so the model behaves according to the adversary's objective if the input contains the backdoor features, referred to as the backdoor trigger (e.g., a stamp on a… ▽ More

    Submitted 6 June, 2020; v1 submitted 31 May, 2019; originally announced May 2019.

    Comments: IEEE European Symposium on Security and Privacy 2020

  16. arXiv:1904.09591  [pdf, other

    stat.CO

    Conditionally structured variational Gaussian approximation with importance weights

    Authors: Linda S. L. Tan, Aishwarya Bhaskaran, David J. Nott

    Abstract: We develop flexible methods of deriving variational inference for models with complex latent variable structure. By splitting the variables in these models into "global" parameters and "local" latent variables, we define a class of variational approximations that exploit this partitioning and go beyond Gaussian variational approximation. This approximation is motivated by the fact that in many hie… ▽ More

    Submitted 21 April, 2019; originally announced April 2019.

    Comments: 18 pages, 7 figures

  17. arXiv:1811.07886  [pdf, other

    physics.chem-ph cs.LG stat.ML

    Chemical Structure Elucidation from Mass Spectrometry by Matching Substructures

    Authors: **g Lim, Joshua Wong, Minn Xuan Wong, Lee Han Eric Tan, Hai Leong Chieu, Davin Choo, Neng Kai Nigel Neo

    Abstract: Chemical structure elucidation is a serious bottleneck in analytical chemistry today. We address the problem of identifying an unknown chemical threat given its mass spectrum and its chemical formula, a task which might take well trained chemists several days to complete. Given a chemical formula, there could be over a million possible candidate structures. We take a data driven approach to rank t… ▽ More

    Submitted 17 November, 2018; originally announced November 2018.

  18. arXiv:1811.06100  [pdf, ps, other

    stat.ML cs.LG

    Newton Methods for Convolutional Neural Networks

    Authors: Chien-Chih Wang, Kent Loong Tan, Chih-Jen Lin

    Abstract: Deep learning involves a difficult non-convex optimization problem, which is often solved by stochastic gradient (SG) methods. While SG is usually effective, it may not be robust in some situations. Recently, Newton methods have been investigated as an alternative optimization technique, but nearly all existing studies consider only fully-connected feedforward neural networks. They do not investig… ▽ More

    Submitted 14 November, 2018; originally announced November 2018.

    Comments: Supplementary materials, experimental code and an efficient MATLAB implementation are available at https://www.csie.ntu.edu.tw/~cjlin/cnn/

  19. arXiv:1811.04249  [pdf, other

    stat.CO

    Bayesian variational inference for exponential random graph models

    Authors: Linda S. L. Tan, Nial Friel

    Abstract: Deriving Bayesian inference for exponential random graph models (ERGMs) is a challenging "doubly intractable" problem as the normalizing constants of the likelihood and posterior density are both intractable. Markov chain Monte Carlo (MCMC) methods which yield Bayesian inference for ERGMs, such as the exchange algorithm, are asymptotically exact but computationally intensive, as a network has to b… ▽ More

    Submitted 23 November, 2019; v1 submitted 10 November, 2018; originally announced November 2018.

    Comments: 45 pages

  20. arXiv:1805.07267  [pdf, ps, other

    stat.ME

    Use of model reparametrization to improve variational Bayes

    Authors: Linda S. L. Tan

    Abstract: We propose using model reparametrization to improve variational Bayes inference for hierarchical models whose variables can be classified as global (shared across observations) or local (observation specific). Posterior dependence between local and global variables is minimized by applying an invertible affine transformation on the local variables. The functional form of this transformation is ded… ▽ More

    Submitted 7 March, 2020; v1 submitted 18 May, 2018; originally announced May 2018.

    Journal ref: JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2020

  21. arXiv:1802.00130  [pdf, other

    stat.ML cs.LG math.OC

    Distributed Newton Methods for Deep Neural Networks

    Authors: Chien-Chih Wang, Kent Loong Tan, Chun-Ting Chen, Yu-Hsiang Lin, S. Sathiya Keerthi, Dhruv Mahajan, S. Sundararajan, Chih-Jen Lin

    Abstract: Deep learning involves a difficult non-convex optimization problem with a large number of weights between any two adjacent layers of a deep structure. To handle large data sets or complicated networks, distributed training is needed, but the calculation of function, gradient, and Hessian is expensive. In particular, the communication and the synchronization cost may become a bottleneck. In this pa… ▽ More

    Submitted 31 January, 2018; originally announced February 2018.

    Comments: Supplementary materials and experimental code are available at https://www.csie.ntu.edu.tw/~cjlin/papers/dnn

  22. arXiv:1712.08887  [pdf, other

    stat.ME

    Efficient data augmentation techniques for some classes of state space models

    Authors: Linda S. L. Tan

    Abstract: Data augmentation improves the convergence of iterative algorithms, such as the EM algorithm and Gibbs sampler by introducing carefully designed latent variables. In this article, we first propose a data augmentation scheme for the first-order autoregression plus noise model, where optimal values of working parameters introduced for recentering and rescaling of the latent states, can be derived an… ▽ More

    Submitted 4 July, 2022; v1 submitted 24 December, 2017; originally announced December 2017.

    Comments: Keywords: Data augmentation, State space model, Stochastic volatility model, EM algorithm, Reparametrization, Markov chain Monte Carlo, Ancillarity-sufficiency interweaving strategy

  23. Dynamic degree-corrected blockmodels for social networks: a nonparametric approach

    Authors: Linda S. L. Tan, Maria De Iorio

    Abstract: A nonparametric approach to the modeling of social networks using degree-corrected stochastic blockmodels is proposed. The model for static network consists of a stochastic blockmodel using a probit regression formulation and popularity parameters are incorporated to account for degree heterogeneity. Dirichlet processes are used to detect community structure as well as induce clustering in the pop… ▽ More

    Submitted 25 May, 2017; originally announced May 2017.

    Journal ref: Statistical Modelling (2019), 19, 386-411

  24. arXiv:1606.04995  [pdf, other

    cs.NI cs.IT math.OC math.ST stat.ML

    Joint Data Compression and MAC Protocol Design for Smartgrids with Renewable Energy

    Authors: Le Thanh Tan, Long Bao Le

    Abstract: In this paper, we consider the joint design of data compression and 802.15.4-based medium access control (MAC) protocol for smartgrids with renewable energy. We study the setting where a number of nodes, each of which comprises electricity load and/or renewable sources, report periodically their injected powers to a data concentrator. Our design exploits the correlation of the reported data in bot… ▽ More

    Submitted 15 June, 2016; originally announced June 2016.

    Comments: https://arxiv.longhoe.net/admin/q/1589135, Wireless Communications and Mobile Computing, 2016. arXiv admin note: substantial text overlap with arXiv:1506.08318

  25. Gaussian variational approximation with sparse precision matrices

    Authors: Linda S. L. Tan, David J. Nott

    Abstract: We consider the problem of learning a Gaussian variational approximation to the posterior distribution for a high-dimensional parameter, where we impose sparsity in the precision matrix to reflect appropriate conditional independence structure in the model. Incorporating sparsity in the precision matrix allows the Gaussian variational distribution to be both flexible and parsimonious, and the spar… ▽ More

    Submitted 12 April, 2017; v1 submitted 18 May, 2016; originally announced May 2016.

    Comments: 18 pages, 9 figures

    Journal ref: Statistics and Computing 28 (2018) 259-275

  26. arXiv:1604.07087  [pdf, other

    stat.ME math.ST

    Optimal Estimation of Slope Vector in High-dimensional Linear Transformation Model

    Authors: Xin Lu Tan

    Abstract: In a linear transformation model, there exists an unknown monotone nonlinear transformation function such that the transformed response variable and the predictor variables satisfy a linear regression model. In this paper, we present CENet, a new method for estimating the slope vector and simultaneously performing variable selection in the high-dimensional sparse linear transformation model. CENet… ▽ More

    Submitted 24 April, 2016; originally announced April 2016.

    Comments: 25 pages, 7 figures, 1 table

  27. Bayesian inference for multiple Gaussian graphical models with application to metabolic association networks

    Authors: Linda S. L. Tan, Ajay Jasra, Maria De Iorio, Timothy M. D. Ebbels

    Abstract: We investigate the effect of cadmium (a toxic environmental pollutant) on the correlation structure of a number of urinary metabolites using Gaussian graphical models (GGMs). The inferred metabolic associations can provide important information on the physiological state of a metabolic system and insights on complex metabolic relationships. Using the fitted GGMs, we construct differential networks… ▽ More

    Submitted 13 April, 2017; v1 submitted 21 March, 2016; originally announced March 2016.

    Journal ref: Ann. Appl. Stat. 11 (2017) 2222-2251

  28. arXiv:1511.06821  [pdf, other

    stat.ME stat.ML

    Kernel Additive Principal Components

    Authors: Xin Lu Tan, Andreas Buja, Zongming Ma

    Abstract: Additive principal components (APCs for short) are a nonlinear generalization of linear principal components. We focus on smallest APCs to describe additive nonlinear constraints that are approximately satisfied by the data. Thus APCs fit data with implicit equations that treat the variables symmetrically, as opposed to regression analyses which fit data with explicit equations that treat the data… ▽ More

    Submitted 20 November, 2015; originally announced November 2015.

    Comments: 54 pages including appendices

  29. arXiv:1502.07190  [pdf, other

    stat.ML cs.LG

    Topic-adjusted visibility metric for scientific articles

    Authors: Linda S. L. Tan, Aik Hui Chan, Tian Zheng

    Abstract: Measuring the impact of scientific articles is important for evaluating the research output of individual scientists, academic institutions and journals. While citations are raw data for constructing impact measures, there exist biases and potential issues if factors affecting citation patterns are not properly accounted for. In this work, we address the problem of field variation and introduce an… ▽ More

    Submitted 16 October, 2015; v1 submitted 25 February, 2015; originally announced February 2015.

    Journal ref: Annals of Applied Statistics, Volume 10, Number 1 (2016), 1-31

  30. Stochastic variational inference for large-scale discrete choice models using adaptive batch sizes

    Authors: Linda S. L. Tan

    Abstract: Discrete choice models describe the choices made by decision makers among alternatives and play an important role in transportation planning, marketing research and other applications. The mixed multinomial logit (MMNL) model is a popular discrete choice model that captures heterogeneity in the preferences of decision makers through random coefficients. While Markov chain Monte Carlo methods provi… ▽ More

    Submitted 8 October, 2015; v1 submitted 21 May, 2014; originally announced May 2014.

    Journal ref: Statistics and Computing (2017) 27 pp 237-257

  31. Variational inference for sparse spectrum Gaussian process regression

    Authors: Linda S. L. Tan, Victor M. H. Ong, David J. Nott, Ajay Jasra

    Abstract: We develop a fast variational approximation scheme for Gaussian process (GP) regression, where the spectrum of the covariance function is subjected to a sparse approximation. Our approach enables uncertainty in covariance function hyperparameters to be treated without using Monte Carlo methods and is robust to overfitting. Our article makes three contributions. First, we present a variational Baye… ▽ More

    Submitted 26 January, 2015; v1 submitted 9 June, 2013; originally announced June 2013.

    Comments: 20 pages, 11 figures, 1 table

    Journal ref: Statistics and Computing (2016) 26 pp 1243-1261

  32. A stochastic variational framework for fitting and diagnosing generalized linear mixed models

    Authors: Linda S. L. Tan, David J. Nott

    Abstract: In stochastic variational inference, the variational Bayes objective function is optimized using stochastic gradient approximation, where gradients computed on small random subsets of data are used to approximate the true gradient over the whole data set. This enables complex models to be fit to large data sets as data can be processed in mini-batches. In this article, we extend stochastic variati… ▽ More

    Submitted 28 March, 2014; v1 submitted 24 August, 2012; originally announced August 2012.

    Comments: 42 pages, 13 figures, 9 tables

    Journal ref: Bayesian Analysis (2014), 9, 963-1004

  33. arXiv:1207.4155  [pdf

    cs.LG stat.ML

    Similarity-Driven Cluster Merging Method for Unsupervised Fuzzy Clustering

    Authors: Xuejian Xiong, Kap Chan, Kian Lee Tan

    Abstract: In this paper, a similarity-driven cluster merging method is proposed for unsuper-vised fuzzy clustering. The cluster merging method is used to resolve the problem of cluster validation. Starting with an overspecified number of clusters in the data, pairs of similar clusters are merged based on the proposed similarity-driven cluster merging criterion. The similarity between clusters is calculated… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-611-618

  34. arXiv:1205.3906  [pdf, ps, other

    stat.CO stat.ME

    Variational Inference for Generalized Linear Mixed Models Using Partially Noncentered Parametrizations

    Authors: Linda S. L. Tan, David J. Nott

    Abstract: The effects of different parametrizations on the convergence of Bayesian computational algorithms for hierarchical models are well explored. Techniques such as centering, noncentering and partial noncentering can be used to accelerate convergence in MCMC and EM algorithms but are still not well studied for variational Bayes (VB) methods. As a fast deterministic approach to posterior approximation,… ▽ More

    Submitted 11 June, 2013; v1 submitted 17 May, 2012; originally announced May 2012.

    Comments: Published in at http://dx.doi.org/10.1214/13-STS418 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-STS-STS418

    Journal ref: Statistical Science 2013, Vol. 28, No. 2, 168-188

  35. Variational approximation for mixtures of linear mixed models

    Authors: Siew Li Tan, David J. Nott

    Abstract: Mixtures of linear mixed models (MLMMs) are useful for clustering grouped data and can be estimated by likelihood maximization through the EM algorithm. The conventional approach to determining a suitable number of components is to compare different mixture models using penalized log-likelihood criteria such as BIC.We propose fitting MLMMs with variational methods which can perform parameter estim… ▽ More

    Submitted 29 August, 2012; v1 submitted 20 December, 2011; originally announced December 2011.

    Comments: 36 pages, 5 figures, 2 tables, submitted to JCGS

    Journal ref: Journal of Computational and Graphical Statistics. Volume 23, Issue 2, 2014, pages 564-585