Showing 1–30 of 30 results for author: Luo, L

Searching in archive stat.

  1. arXiv:2405.18284  [pdf, other]

    stat.ML cs.LG

    Adaptive debiased SGD in high-dimensional GLMs with streaming data

    Authors: Ruijian Han, Lan Luo, Yuanhang Luo, Yuanyuan Lin, Jian Huang

    Abstract: Online statistical inference facilitates real-time analysis of sequentially collected data, making it different from traditional methods that rely on static datasets. This paper introduces a novel approach to online inference in high-dimensional generalized linear models, where we update regression coefficient estimates and their standard errors upon each new data arrival. In contrast to existing…

    Submitted 1 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: 37 pages, 4 figures

  2. arXiv:2310.16203  [pdf, other]

    stat.ME

    Multivariate Dynamic Mediation Analysis under a Reinforcement Learning Framework

    Authors: Lan Luo, Chengchun Shi, Jitao Wang, Zhenke Wu, Lexin Li

    Abstract: Mediation analysis is an important analytic tool commonly used in a broad range of scientific applications. In this article, we study the problem of mediation analysis when there are multivariate and conditionally dependent mediators, and when the variables are observed over multiple time points. The problem is challenging, because the effect of a mediator involves not only the path from the treat…

    Submitted 24 October, 2023; originally announced October 2023.

  3. arXiv:2307.00126  [pdf, other]

    math.OC cs.LG stat.ML

    Accelerating Inexact HyperGradient Descent for Bilevel Optimization

    Authors: Haikuo Yang, Luo Luo, Chris Junchi Li, Michael I. Jordan

    Abstract: We present a method for solving general nonconvex-strongly-convex bilevel optimization problems. Our method -- the \emph{Restarted Accelerated HyperGradient Descent} (\texttt{RAHGD}) method -- finds an $ε$-first-order stationary point of the objective with $\tilde{\mathcal{O}}(κ^{3.25}ε^{-1.75})$ oracle complexity, where $κ$ is the condition number of the lower-level objective and $ε$ is the desir…

    Submitted 30 June, 2023; originally announced July 2023.

  4. arXiv:2212.08761  [pdf]

    stat.AP

    Evaluating the Impact of Automated Vehicles on Residential Location Distribution using Activity-based Accessibility: A Case Study of Japanese Regional Areas

    Authors: Lichen Luo, Giancarlos Parady, Kiyoshi Takami

    Abstract: Automated Vehicles (AVs) are expected to disrupt the transport sector in the future. Extensive research efforts have been dedicated to studying their potential implications. However, the existing literature is still limited regarding the long-term impacts. To fill this gap, this paper estimates and validates a residential location choice model to evaluate the impacts of AVs on residential location dis…

    Submitted 16 December, 2022; originally announced December 2022.

    Comments: Submitted to and accepted by Transportation Research Board Annual Conference, 2023

  5. Statistical Inference for Streamed Longitudinal Data

    Authors: Lan Luo, Jingshen Wang, Emily C. Hector

    Abstract: Modern longitudinal data, for example from wearable devices, measures biological signals on a fixed set of participants at a diverging number of time points. Traditional statistical methods are not equipped to handle the computational burden of repeatedly analyzing the cumulatively growing dataset each time new data is collected. We propose a new estimation and inference framework for dynamic upda…

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: 18 pages, 2 figures. Biometrika (2023)

  6. arXiv:2205.01224  [pdf, other]

    cs.LG stat.ML

    COMET Flows: Towards Generative Modeling of Multivariate Extremes and Tail Dependence

    Authors: Andrew McDonald, Pang-Ning Tan, Lifeng Luo

    Abstract: Normalizing flows, a popular class of deep generative models, often fail to represent extreme phenomena observed in real-world processes. In particular, existing normalizing flow architectures struggle to model multivariate extremes, characterized by heavy-tailed marginal distributions and asymmetric tail dependence among variables. In light of this shortcoming, we propose COMET (COpula Multivaria…

    Submitted 2 May, 2022; originally announced May 2022.

    Comments: 7 pages, 4 figures, accepted to IJCAI'22

  7. arXiv:2111.13775  [pdf, other]

    stat.ME

    Online Causal Inference with Application to Near Real-Time Post-Market Vaccine Safety Surveillance

    Authors: Xu Shi, Lan Luo

    Abstract: Streaming data routinely generated by mobile phones, social networks, e-commerce, and electronic health records present new opportunities for near real-time surveillance of the impact of an intervention on an outcome of interest via causal inference methods. However, as data grow rapidly in volume and velocity, storing and combing data become increasingly challenging. The amount of time and effort…

    Submitted 26 November, 2021; originally announced November 2021.

  8. arXiv:2111.00032  [pdf, other]

    stat.CO stat.AP

    Parallel-and-stream accelerator for computationally fast supervised learning

    Authors: Emily C. Hector, Lan Luo, Peter X.-K. Song

    Abstract: Two dominant distributed computing strategies have emerged to overcome the computational bottleneck of supervised learning with big data: parallel data processing in the MapReduce paradigm and serial data processing in the online streaming paradigm. Despite the two strategies' common divide-and-combine approach, they differ in how they aggregate information, leading to different trade-offs between…

    Submitted 29 October, 2021; originally announced November 2021.

    Comments: 22 pages, 3 figures

  9. arXiv:2106.15794  [pdf, other]

    stat.ME

    Real-Time Regression Analysis of Streaming Clustered Data With Possible Abnormal Data Batches

    Authors: Lan Luo, Ling Zhou, Peter X.-K. Song

    Abstract: This paper develops an incremental learning algorithm based on quadratic inference function (QIF) to analyze streaming datasets with correlated outcomes such as longitudinal data and clustered data. We propose a renewable QIF (RenewQIF) method within a paradigm of renewable estimation and incremental inference, in which parameter estimates are recursively renewed with current data and summary stat…

    Submitted 29 June, 2021; originally announced June 2021.

  10. arXiv:2009.02553  [pdf, other]

    cs.LG cs.DS stat.ML

    Revisiting Co-Occurring Directions: Sharper Analysis and Efficient Algorithm for Sparse Matrices

    Authors: Luo Luo, Cheng Chen, Guangzeng Xie, Haishan Ye

    Abstract: We study the streaming model for approximate matrix multiplication (AMM). We are interested in the scenario that the algorithm can only take one pass over the data with limited memory. The state-of-the-art deterministic sketching algorithm for streaming AMM is the co-occurring directions (COD), which has much smaller approximation errors than randomized algorithms and outperforms other determinist…

    Submitted 17 December, 2020; v1 submitted 5 September, 2020; originally announced September 2020.

  11. arXiv:2007.16031  [pdf, other]

    stat.ME stat.AP

    Decomposition of the Total Effect for Two Mediators: A Natural Counterfactual Interaction Effect Framework

    Authors: Xin Gao, Li Li, Li Luo

    Abstract: Mediation analysis has been used in many disciplines to explain the mechanism or process that underlies an observed relationship between an exposure variable and an outcome variable via the inclusion of mediators. Decompositions of the total causal effect of an exposure variable into effects characterizing mediation pathways and interactions have gained an increasing amount of interest in the last…

    Submitted 30 July, 2020; originally announced July 2020.

    Comments: 112 pages, 6 figures. arXiv admin note: text overlap with arXiv:2004.06054

  12. arXiv:2007.11847  [pdf, other]

    cs.LG stat.ML

    METEOR: Learning Memory and Time Efficient Representations from Multi-modal Data Streams

    Authors: Amila Silva, Shanika Karunasekera, Christopher Leckie, Ling Luo

    Abstract: Many learning tasks involve multi-modal data streams, where continuous data from different modes convey a comprehensive description about objects. A major challenge in this context is how to efficiently interpret multi-modal information in complex environments. This has motivated numerous studies on learning unsupervised representations from multi-modal data streams. These studies aim to understan…

    Submitted 23 July, 2020; originally announced July 2020.

  13. arXiv:2006.10396  [pdf, other]

    cs.LG stat.ML

    OMBA: User-Guided Product Representations for Online Market Basket Analysis

    Authors: Amila Silva, Ling Luo, Shanika Karunasekera, Christopher Leckie

    Abstract: Market Basket Analysis (MBA) is a popular technique to identify associations between products, which is crucial for business decision making. Previous studies typically adopt conventional frequent itemset mining algorithms to perform MBA. However, they generally fail to uncover rarely occurring associations among the products at their most granular level. Also, they have limited ability to capture…

    Submitted 16 February, 2021; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: 16 pages, 4 figures

  14. arXiv:2005.00797  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Multi-consensus Decentralized Accelerated Gradient Descent

    Authors: Haishan Ye, Luo Luo, Ziang Zhou, Tong Zhang

    Abstract: This paper considers the decentralized convex optimization problem, which has a wide range of applications in large-scale machine learning, sensor networks, and control theory. We propose novel algorithms that achieve optimal computation complexity and near optimal communication complexity. Our theoretical results give affirmative answers to the open problem on whether there exists an algorithm th…

    Submitted 10 October, 2023; v1 submitted 2 May, 2020; originally announced May 2020.

  15. arXiv:2004.06054  [pdf, other]

    stat.ME stat.AP stat.OT

    Decomposition of Total Effect with the Notion of Natural Counterfactual Interaction Effect

    Authors: Xin Gao, Li Li, Li Luo

    Abstract: Mediation analysis serves as a crucial tool to obtain causal inference based on directed acyclic graphs, which has been widely employed in the areas of biomedical science, social science, epidemiology and psychology. Decomposition of total effect provides a deep insight to fully understand the causal contribution from each path and interaction term. Since the four-way decomposition method was prop…

    Submitted 13 April, 2020; originally announced April 2020.

    Comments: 72 pages in total, 12 figures

  16. arXiv:2004.00539  [pdf, other]

    stat.AP

    From scenario-based seismic hazard to scenario-based landslide hazard: rewinding to the past via statistical simulations

    Authors: Luguang Luo, Luigi Lombardo, Cees van Westen, Xiangjun Pei, Runqiu Huang

    Abstract: The vast majority of landslide susceptibility studies assumes the slope instability process to be time-invariant under the definition that "the past and present are keys to the future". This assumption may generally be valid. However, the trigger, be it a rainfall or an earthquake event, clearly varies over time. And yet, the temporal component of the trigger is rarely included in landslide suscep…

    Submitted 1 April, 2020; originally announced April 2020.

  17. arXiv:2002.11394  [pdf, other]

    stat.ML cs.LG

    Bayesian Nonparametric Space Partitions: A Survey

    Authors: Xuhui Fan, Bin Li, Ling Luo, Scott A. Sisson

    Abstract: Bayesian nonparametric space partition (BNSP) models provide a variety of strategies for partitioning a $D$-dimensional space into a set of blocks. In this way, the data points that lie in the same block would share certain kinds of homogeneity. BNSP models can be applied to various areas, such as regression/classification trees, random feature construction, relational modeling, etc. In this survey, we…

    Submitted 28 February, 2021; v1 submitted 26 February, 2020; originally announced February 2020.

  18. arXiv:2001.03724  [pdf, other]

    cs.LG math.OC stat.ML

    Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems

    Authors: Luo Luo, Haishan Ye, Zhichao Huang, Tong Zhang

    Abstract: We consider nonconvex-concave minimax optimization problems of the form $\min_{\bf x}\max_{\bf y\in{\mathcal Y}} f({\bf x},{\bf y})$, where $f$ is strongly-concave in $\bf y$ but possibly nonconvex in $\bf x$ and ${\mathcal Y}$ is a convex and compact set. We focus on the stochastic setting, where we can only access an unbiased stochastic gradient estimate of $f$ at each iteration. This formulatio…

    Submitted 23 October, 2020; v1 submitted 11 January, 2020; originally announced January 2020.

  19. arXiv:1910.10335  [pdf, other]

    cs.LG cs.HC cs.SI stat.ML

    USTAR: Online Multimodal Embedding for Modeling User-Guided Spatiotemporal Activity

    Authors: Amila Silva, Shanika Karunasekera, Christopher Leckie, Ling Luo

    Abstract: Building spatiotemporal activity models for people's activities in urban spaces is important for understanding the ever-increasing complexity of urban dynamics. With the emergence of Geo-Tagged Social Media (GTSM) records, previous studies demonstrate the potential of GTSM records for spatiotemporal activity modeling. State-of-the-art methods for this task embed different modalities (location, tim…

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: 10 pages, IEEE International Conference on Big Data 2019 (IEEE Big Data 2019)

  20. arXiv:1909.06946  [pdf, other]

    cs.LG math.OC stat.ML

    A Stochastic Proximal Point Algorithm for Saddle-Point Problems

    Authors: Luo Luo, Cheng Chen, Yujun Li, Guangzeng Xie, Zhihua Zhang

    Abstract: We consider saddle point problems whose objective functions are the average of $n$ strongly convex-concave individual components. Recently, researchers exploit variance reduction methods to solve such problems and achieve linear-convergence guarantees. However, these methods have a slow convergence when the condition number of the problem is very large. In this paper, we propose a stochastic proxi…

    Submitted 13 September, 2019; originally announced September 2019.

  21. arXiv:1908.08394  [pdf, ps, other]

    math.OC cs.LG stat.ML

    A General Analysis Framework of Lower Complexity Bounds for Finite-Sum Optimization

    Authors: Guangzeng Xie, Luo Luo, Zhihua Zhang

    Abstract: This paper studies the lower bound complexity for the optimization problem whose objective function is the average of $n$ individual smooth convex functions. We consider the algorithm which gets access to gradient and proximal oracle for each individual component. For the strongly-convex case, we prove such an algorithm cannot reach an $\varepsilon$-suboptimal point in fewer than…

    Submitted 22 August, 2019; originally announced August 2019.

  22. arXiv:1908.06079  [pdf, other]

    cs.CV stat.ML

    Task-Assisted Domain Adaptation with Anchor Tasks

    Authors: Zhizhong Li, Linjie Luo, Sergey Tulyakov, Qieyun Dai, Derek Hoiem

    Abstract: Some tasks, such as surface normals or single-view depth estimation, require per-pixel ground truth that is difficult to obtain on real images but easy to obtain on synthetic. However, models learned on synthetic images often do not generalize well to real images due to the domain shift. Our key idea to improve domain adaptation is to introduce a separate anchor task (such as facial landmarks) who…

    Submitted 9 November, 2020; v1 submitted 16 August, 2019; originally announced August 2019.

    Comments: In WACV 2021

  23. arXiv:1906.08357  [pdf]

    stat.AP econ.EM stat.ME

    The Age-Period-Cohort-Interaction Model for Describing and Investigating Inter-Cohort Deviations and Intra-Cohort Life-Course Dynamics

    Authors: Liying Luo, James Hodges

    Abstract: Social scientists have frequently sought to understand the distinct effects of age, period, and cohort, but disaggregation of the three dimensions is difficult because cohort = period - age. We argue that this technical difficulty reflects a disconnection between how cohort effect is conceptualized and how it is modeled in the traditional age-period-cohort framework. We propose a new method, calle…

    Submitted 2 June, 2019; originally announced June 2019.

  24. arXiv:1904.07672  [pdf]

    stat.ME stat.AP

    Constraints in Random Effects Age-Period-Cohort Models

    Authors: Liying Luo, James S. Hodges

    Abstract: Random effects (RE) models have been widely used to study the contextual effects of structures such as neighborhood or school. The RE approach has recently been applied to age-period-cohort (APC) models that are unidentified because the predictors are exactly linearly dependent. However, it has not been fully understood how the RE specification identifies these otherwise unidentified APC models. W…

    Submitted 16 April, 2019; originally announced April 2019.

    Comments: Submitted to "Sociological Methodology"

  25. arXiv:1902.09843  [pdf, other]

    cs.LG stat.ML

    Adaptive Gradient Methods with Dynamic Bound of Learning Rate

    Authors: Liangchen Luo, Yuanhao Xiong, Yan Liu, Xu Sun

    Abstract: Adaptive optimization methods such as AdaGrad, RMSprop and Adam have been proposed to achieve a rapid training process with an element-wise scaling term on learning rates. Though prevailing, they are observed to generalize poorly compared with SGD or even fail to converge due to unstable and extreme learning rates. Recent work has put forward some algorithms such as AMSGrad to tackle this issue bu…

    Submitted 26 February, 2019; originally announced February 2019.

    Comments: Accepted to ICLR 2019. arXiv admin note: text overlap with arXiv:1904.09237 by other authors

  26. arXiv:1811.02629  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko, et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem…

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  27. arXiv:1704.04235  [pdf, other]

    cs.LG cs.CV stat.ML

    Close Yet Distinctive Domain Adaptation

    Authors: Lingkun Luo, Xiaofang Wang, Shiqiang Hu, Chao Wang, Yuxing Tang, Liming Chen

    Abstract: Domain adaptation is transfer learning which aims to generalize a learning model across training and testing data with different distributions. Most previous research tackles this problem by seeking a shared feature representation between source and target domains while reducing the mismatch of their data distributions. In this paper, we propose a close yet discriminative domain adaptation method,…

    Submitted 13 April, 2017; originally announced April 2017.

    Comments: 11 pages, 3 figures, ICCV 2017

  28. arXiv:1612.00599  [pdf, ps, other]

    cs.LG stat.ML

    Communication Lower Bounds for Distributed Convex Optimization: Partition Data on Features

    Authors: Zihao Chen, Luo Luo, Zhihua Zhang

    Abstract: Recently, there has been an increasing interest in designing distributed convex optimization algorithms under the setting where the data matrix is partitioned on features. Algorithms under this setting sometimes have many advantages over those under the setting where data is partitioned on samples, especially when the number of features is huge. Therefore, it is important to understand the inheren…

    Submitted 2 December, 2016; originally announced December 2016.

  29. arXiv:1602.00223  [pdf, ps, other]

    cs.LG stat.ML

    A Proximal Stochastic Quasi-Newton Algorithm

    Authors: Luo Luo, Zihao Chen, Zhihua Zhang, Wu-Jun Li

    Abstract: In this paper, we discuss the problem of minimizing the sum of two convex functions: a smooth function plus a non-smooth function. Further, the smooth part can be expressed by the average of a large number of smooth component functions, and the non-smooth part is equipped with a simple proximal mapping. We propose a proximal stochastic second-order method, which is efficient and scalable. It incor…

    Submitted 16 November, 2016; v1 submitted 31 January, 2016; originally announced February 2016.

  30. arXiv:1501.00537  [pdf]

    stat.AP math.ST

    A theoretical foundation of the target-decoy search strategy for false discovery rate control in proteomics

    Authors: Kun He, Yan Fu, Wen-Feng Zeng, Lan Luo, Hao Chi, Chao Liu, Lai-Yun Qing, Rui-Xiang Sun, Si-Min He

    Abstract: Motivation: Target-decoy search (TDS) is currently the most popular strategy for estimating and controlling the false discovery rate (FDR) of peptide identifications in mass spectrometry-based shotgun proteomics. While this strategy is very useful in practice and has been intensively studied empirically, its theoretical foundation has not yet been well established. Result: In this work, we systema…

    Submitted 3 January, 2015; originally announced January 2015.

    Comments: 7 pages, 2 figures