Showing 1–27 of 27 results for author: Zeng, S

Searching in archive stat.
  1. arXiv:2311.03115  [pdf, other]

    cs.CY cs.LG stat.AP

    RELand: Risk Estimation of Landmines via Interpretable Invariant Risk Minimization

    Authors: Mateo Dulce Rubio, Siqi Zeng, Qi Wang, Didier Alvarado, Francisco Moreno, Hoda Heidari, Fei Fang

    Abstract: Landmines remain a threat to war-affected communities for years after conflicts have ended, partly due to the laborious nature of demining tasks. Humanitarian demining operations begin by collecting relevant information from the sites to be cleared, which is then analyzed by human experts to determine the potential risk of remaining landmines. In this paper, we propose RELand system to support the…

    Submitted 6 November, 2023; originally announced November 2023.

  2. arXiv:2309.07001  [pdf, other]

    cs.CE cs.AI stat.AP

    Modeling the Evolutionary Trends in Corporate ESG Reporting: A Study based on Knowledge Management Model

    Authors: Ziyuan Xia, Anchen Sun, Xiaodong Cai, Saixing Zeng

    Abstract: Environmental, social, and governance (ESG) reports are globally recognized as a keystone in sustainable enterprise development. However, current literature has not concluded the development of topics and trends in ESG contexts in the twenty-first century. Therefore, we selected 1114 ESG reports from firms in the technology industry to analyze the evolutionary trends of ESG topics by text mining.…

    Submitted 25 May, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: 29 pages, 10 figures, 3 tables

  3. arXiv:2306.00673  [pdf, ps, other]

    cs.DS cs.LG stat.ML

    Attribute-Efficient PAC Learning of Low-Degree Polynomial Threshold Functions with Nasty Noise

    Authors: Shiwei Zeng, Jie Shen

    Abstract: The concept class of low-degree polynomial threshold functions (PTFs) plays a fundamental role in machine learning. In this paper, we study PAC learning of $K$-sparse degree-$d$ PTFs on $\mathbb{R}^n$, where any such concept depends only on $K$ out of $n$ attributes of the input. Our main contribution is a new algorithm that runs in time $(nd/\epsilon)^{O(d)}$ and under the Gaussian marginal distributi…

    Submitted 19 March, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: ICML 2023. V2 fixed typos

  4. arXiv:2303.04746  [pdf, other]

    stat.ME

    Necessary and sufficient conditions for multiple objective optimal regression designs

    Authors: Lucy L. Gao, Jane J. Ye, Shangzhi Zeng, Julie Zhou

    Abstract: We typically construct optimal designs based on a single objective function. To better capture the breadth of an experiment's goals, we could instead construct a multiple objective optimal design based on multiple objective functions. While algorithms have been developed to find multi-objective optimal designs (e.g. efficiency-constrained and maximin optimal designs), it is far less clear how to v…

    Submitted 8 March, 2023; originally announced March 2023.

  5. arXiv:2210.01808   

    cs.LG stat.ML

    Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees

    Authors: Siliang Zeng, Chenliang Li, Alfredo Garcia, Mingyi Hong

    Abstract: Inverse reinforcement learning (IRL) aims to recover the reward function and the associated optimal policy that best fits observed sequences of states and actions implemented by an expert. Many algorithms for IRL have an inherently nested structure: the inner loop finds the optimal policy given parametrized rewards while the outer loop updates the estimates towards optimizing a measure of fit. For…

    Submitted 31 October, 2022; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: Two different versions (arXiv:2210.01808 and arXiv:2210.01282) of the same paper have been submitted to arXiv. To avoid overlap between the two versions, we withdraw this version. For this paper, readers can refer to arXiv:2210.01282

  6. arXiv:2210.01282  [pdf, other]

    cs.LG cs.AI econ.EM stat.ML

    Structural Estimation of Markov Decision Processes in High-Dimensional State Space with Finite-Time Guarantees

    Authors: Siliang Zeng, Mingyi Hong, Alfredo Garcia

    Abstract: We consider the task of estimating a structural model of dynamic decisions by a human agent based upon the observable history of implemented actions and visited states. This problem has an inherent nested structure: in the inner problem, an optimal policy for a given reward function is identified while in the outer problem, a measure of fit is maximized. Several approaches have been proposed to al…

    Submitted 1 March, 2024; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: This conference version of this paper refers to "Maximum-Likelihood Inverse Reinforcement Learning with Finite-Time Guarantees" in NeurIPS 2022

  7. arXiv:2202.08070  [pdf, other]

    cs.LG stat.ML

    On Measuring Excess Capacity in Neural Networks

    Authors: Florian Graf, Sebastian Zeng, Bastian Rieck, Marc Niethammer, Roland Kwitt

    Abstract: We study the excess capacity of deep networks in the context of supervised classification. That is, given a capacity measure of the underlying hypothesis class - in our case, empirical Rademacher complexity - to what extent can we (a priori) constrain this class while retaining an empirical error on a par with the unconstrained regime? To assess excess capacity in modern architectures (such as res…

    Submitted 19 January, 2023; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: Updated to Neurips 2022 camera-ready version

  8. arXiv:2107.09031  [pdf, other]

    cs.LG stat.ML

    Topological Attention for Time Series Forecasting

    Authors: Sebastian Zeng, Florian Graf, Christoph Hofer, Roland Kwitt

    Abstract: The problem of (point) forecasting $\textit{univariate}$ time series is considered. Most approaches, ranging from traditional statistical methods to recent learning-based techniques with neural networks, directly operate on raw time series observations. As an extension, we study whether $\textit{local topological properties}$, as captured via persistent homology, can serve as a reliable signal t…

    Submitted 19 July, 2021; originally announced July 2021.

  9. arXiv:2107.04661  [pdf, other]

    cs.LG cs.AI stat.ML

    Hölder Bounds for Sensitivity Analysis in Causal Reasoning

    Authors: Serge Assaad, Shuxi Zeng, Henry Pfister, Fan Li, Lawrence Carin

    Abstract: We examine interval estimation of the effect of a treatment T on an outcome Y given the existence of an unobserved confounder U. Using Hölder's inequality, we derive a set of bounds on the confounding bias |E[Y|T=t]-E[Y|do(T=t)]| based on the degree of unmeasured confounding (i.e., the strength of the connection U->T, and the strength of U->Y). These bounds are tight either when U is independent o…

    Submitted 9 July, 2021; originally announced July 2021.

    Comments: Workshop on the Neglected Assumptions in Causal Inference at the International Conference on Machine Learning (ICML), 2021

  10. arXiv:2104.08344  [pdf, other]

    stat.ME stat.AP

    A Causal Mediation Model for Longitudinal Mediators and Survival Outcomes with an Application to Animal Behavior

    Authors: Shuxi Zeng, Elizabeth C. Lange, Elizabeth A. Archie, Fernando A. Campos, Susan C. Alberts, Fan Li

    Abstract: In animal behavior studies, a common goal is to investigate the causal pathways between an exposure and outcome, and a mediator that lies in between. Causal mediation analysis provides a principled approach for such studies. Although many applications involve longitudinal data, the existing causal mediation models are not directly applicable to settings where the mediators are measured on irregula…

    Submitted 12 February, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: 27 pages, 6 figures, 1 table. arXiv admin note: text overlap with arXiv:2007.01796

  11. arXiv:2103.00605  [pdf, other]

    stat.ME

    Propensity Score Weighting Analysis of Survival Outcomes Using Pseudo-observations

    Authors: Shuxi Zeng, Fan Li, Liangyuan Hu, Fan Li

    Abstract: Survival outcomes are common in comparative effectiveness studies and require unique handling because they are usually incompletely observed due to right-censoring. A "once for all" approach for causal inference with survival outcomes constructs pseudo-observations and allows standard methods such as propensity score weighting to proceed as if the outcomes are completely observed. For a general…

    Submitted 18 December, 2021; v1 submitted 28 February, 2021; originally announced March 2021.

    Comments: 40 pages, 2 figures, 1 table

  12. arXiv:2102.07367  [pdf, other]

    math.OC cs.LG stat.ML

    A Near-Optimal Algorithm for Stochastic Bilevel Optimization via Double-Momentum

    Authors: Prashant Khanduri, Siliang Zeng, Mingyi Hong, Hoi-To Wai, Zhaoran Wang, Zhuoran Yang

    Abstract: This paper proposes a new algorithm -- the \underline{S}ingle-timescale Do\underline{u}ble-momentum \underline{St}ochastic \underline{A}pprox\underline{i}matio\underline{n} (SUSTAIN) -- for tackling stochastic unconstrained bilevel optimization problems. We focus on bilevel problems where the lower level subproblem is strongly-convex and the upper level objective function is smooth. Unlike prior w…

    Submitted 15 June, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: 36 Pages, 10 Figures

  13. arXiv:2010.12618  [pdf, other]

    stat.ML cs.LG

    Counterfactual Representation Learning with Balancing Weights

    Authors: Serge Assaad, Shuxi Zeng, Chenyang Tao, Shounak Datta, Nikhil Mehta, Ricardo Henao, Fan Li, Lawrence Carin

    Abstract: A key to causal inference with observational data is achieving balance in predictive features associated with each treatment type. Recent literature has explored representation learning to achieve this goal. In this work, we discuss the pitfalls of these strategies - such as a steep trade-off between achieving balance and predictive power - and present a remedy via the integration of balancing wei…

    Submitted 23 February, 2021; v1 submitted 23 October, 2020; originally announced October 2020.

    Comments: Accepted to International Conference on Artificial Intelligence and Statistics (AISTATS 2021)

  14. arXiv:2010.08710  [pdf, other]

    cs.LG stat.ML

    Causal Transfer Random Forest: Combining Logged Data and Randomized Experiments for Robust Prediction

    Authors: Shuxi Zeng, Murat Ali Bayir, Joesph J. Pfeiffer III, Denis Charles, Emre Kiciman

    Abstract: It is often critical for prediction models to be robust to distributional shifts between training and testing data. From a causal perspective, the challenge is to distinguish the stable causal relationships from the unstable spurious correlations across shifts. We describe a causal transfer random forest (CTRF) that combines existing training data with a small amount of data from a randomized expe…

    Submitted 14 January, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

    Comments: 9 pages, 7 figures, 2 tables, accepted to WSDM 2021

  15. arXiv:2010.07866  [pdf, other]

    stat.ML cs.LG

    Double Robust Representation Learning for Counterfactual Prediction

    Authors: Shuxi Zeng, Serge Assaad, Chenyang Tao, Shounak Datta, Lawrence Carin, Fan Li

    Abstract: Causal inference, or counterfactual prediction, is central to decision making in healthcare, policy and social sciences. To de-bias causal estimators with high-dimensional data in observational studies, recent advances suggest the importance of combining machine learning models for both the propensity score and the outcome function. We propose a novel scalable method to learn double-robust represe…

    Submitted 16 October, 2020; v1 submitted 15 October, 2020; originally announced October 2020.

    Comments: 18 pages, 5 figures, 2 Tables

  16. arXiv:2007.01796  [pdf, other]

    stat.AP

    Causal Mediation Analysis for Sparse and Irregular Longitudinal Data

    Authors: Shuxi Zeng, Stacy Rosenbaum, Elizabeth Archie, Susan Alberts, Fan Li

    Abstract: Causal mediation analysis seeks to investigate how the treatment effect of an exposure on outcomes is mediated through intermediate variables. Although many applications involve longitudinal data, the existing methods are not directly applicable to settings where the mediator and outcome are measured on sparse and irregular time grids. We extend the existing causal mediation framework from a funct…

    Submitted 22 February, 2021; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: 28 pages, 7 figures, 4 tables

  17. Enabling Counterfactual Survival Analysis with Balanced Representations

    Authors: Paidamoyo Chapfuwa, Serge Assaad, Shuxi Zeng, Michael J. Pencina, Lawrence Carin, Ricardo Henao

    Abstract: Balanced representation learning methods have been applied successfully to counterfactual inference from observational data. However, approaches that account for survival outcomes are relatively limited. Survival data are frequently encountered across diverse medical applications, i.e., drug development, risk profiling, and clinical trials, and such data are also relevant in fields like manufactur…

    Submitted 3 March, 2021; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: Accepted at ACM Conference on Health, Inference, and Learning (ACM CHIL 2021). Code at https://github.com/paidamoyo/counterfactual_survival_analysis

  18. arXiv:2006.04338  [pdf, other]

    cs.LG stat.ML

    A Decentralized Policy Gradient Approach to Multi-task Reinforcement Learning

    Authors: Sihan Zeng, Aqeel Anwar, Thinh Doan, Arijit Raychowdhury, Justin Romberg

    Abstract: We develop a mathematical framework for solving multi-task reinforcement learning (MTRL) problems based on a type of policy gradient method. The goal in MTRL is to learn a common policy that operates effectively in different environments; these environments have similar (or overlapping) state spaces, but have different rewards and dynamics. We highlight two fundamental challenges in MTRL that are…

    Submitted 27 May, 2021; v1 submitted 7 June, 2020; originally announced June 2020.

  19. arXiv:2006.04045  [pdf, other]

    cs.LG cs.CV math.DS math.OC stat.ML

    A Generic First-Order Algorithmic Framework for Bi-Level Programming Beyond Lower-Level Singleton

    Authors: Risheng Liu, Pan Mu, Xiaoming Yuan, Shangzhi Zeng, ** Zhang

    Abstract: In recent years, a variety of gradient-based first-order methods have been developed to solve bi-level optimization problems for learning applications. However, theoretical guarantees of these existing approaches heavily rely on the simplification that for each fixed upper-level variable, the lower-level solution must be a singleton (a.k.a., Lower-Level Singleton, LLS). In this work, we first desi…

    Submitted 2 July, 2020; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: Accepted at ICML 2020

  20. arXiv:2006.02804  [pdf, other]

    cs.LG stat.ML

    Exploring the Potential of Low-bit Training of Convolutional Neural Networks

    Authors: Kai Zhong, Xuefei Ning, Guohao Dai, Zhenhua Zhu, Tianchen Zhao, Shulin Zeng, Yu Wang, Huazhong Yang

    Abstract: In this work, we propose a low-bit training framework for convolutional neural networks, which is built around a novel multi-level scaling (MLS) tensor format. Our framework focuses on reducing the energy consumption of convolution operations by quantizing all the convolution operands to low bit-width format. Specifically, we propose the MLS tensor format, in which the element-wise bit-width can b…

    Submitted 14 July, 2021; v1 submitted 4 June, 2020; originally announced June 2020.

    Comments: 13 pages, 7 figures

  21. arXiv:2005.09130  [pdf, other]

    stat.AP econ.EM

    Is being an only child harmful to psychological health?: Evidence from an instrumental variable analysis of China's One-Child Policy

    Authors: Shuxi Zeng, Fan Li, Peng Ding

    Abstract: This paper evaluates the effects of being an only child in a family on psychological health, leveraging data on the One-Child Policy in China. We use an instrumental variable approach to address the potential unmeasured confounding between the fertility decision and psychological health, where the instrumental variable is an index on the intensity of the implementation of the One-Child Policy. We…

    Submitted 11 June, 2020; v1 submitted 18 May, 2020; originally announced May 2020.

    Comments: 33 pages, 6 figures, 1 table

  22. arXiv:2005.06707  [pdf, other]

    cs.LG cs.CV stat.ML

    Noise Homogenization via Multi-Channel Wavelet Filtering for High-Fidelity Sample Generation in GANs

    Authors: Shaoning Zeng, Bob Zhang

    Abstract: In the generator of typical Generative Adversarial Networks (GANs), a noise is inputted to generate fake samples via a series of convolutional operations. However, current noise generation models merely rely on the information from the pixel space, which increases the difficulty to approach the target distribution. Fortunately, the long proven wavelet transformation is able to decompose multiple…

    Submitted 13 May, 2020; originally announced May 2020.

    Comments: 12 pages, 2 figures

  23. arXiv:2004.10075  [pdf, other]

    stat.ME

    Propensity Score Weighting for Covariate Adjustment in Randomized Clinical Trials

    Authors: Shuxi Zeng, Fan Li, Rui Wang, Fan Li

    Abstract: Chance imbalance in baseline characteristics is common in randomized clinical trials. Regression adjustment such as the analysis of covariance (ANCOVA) is often used to account for imbalance and increase precision of the treatment effect estimate. An objective alternative is through inverse probability weighting (IPW) of the propensity scores. Although IPW and ANCOVA are asymptotically equivalent,…

    Submitted 12 August, 2020; v1 submitted 21 April, 2020; originally announced April 2020.

    Comments: 18 pages, 1 figure, 3 tables

  24. arXiv:2003.12101  [pdf, other]

    cs.DC cs.AR cs.LG stat.ML

    Enabling Efficient and Flexible FPGA Virtualization for Deep Learning in the Cloud

    Authors: Shulin Zeng, Guohao Dai, Hanbo Sun, Kai Zhong, Guangjun Ge, Kaiyuan Guo, Yu Wang, Huazhong Yang

    Abstract: FPGAs have shown great potential in providing low-latency and energy-efficient solutions for deep neural network (DNN) inference applications. Currently, the majority of FPGA-based DNN accelerators in the cloud run in a time-division multiplexing way for multiple users sharing a single FPGA, and require re-compilation with $\sim$100 s overhead. Such designs lead to poor isolation and heavy perform…

    Submitted 26 March, 2020; originally announced March 2020.

  25. arXiv:1904.11419  [pdf]

    stat.ML cs.LG eess.IV

    Time Series Simulation by Conditional Generative Adversarial Net

    Authors: Rao Fu, Jie Chen, Shutian Zeng, Yi** Zhuang, Agus Sudjianto

    Abstract: Generative Adversarial Net (GAN) has been proven to be a powerful machine learning tool in image data analysis and generation. In this paper, we propose to use Conditional Generative Adversarial Net (CGAN) to learn and simulate time series data. The conditions can be both categorical and continuous variables containing different kinds of auxiliary information. Our simulation studies show that CGAN…

    Submitted 25 April, 2019; originally announced April 2019.

  26. arXiv:1902.06913  [pdf, other]

    cs.LG cs.CV stat.ML

    Fast Compressive Sensing Recovery Using Generative Models with Structured Latent Variables

    Authors: Shaojie Xu, Sihan Zeng, Justin Romberg

    Abstract: Deep learning models have significantly improved the visual quality and accuracy on compressive sensing recovery. In this paper, we propose an algorithm for signal reconstruction from compressed measurements with image priors captured by a generative model. We search and constrain on latent variable space to make the method stable when the number of compressed measurements is extremely limited. We…

    Submitted 19 March, 2020; v1 submitted 19 February, 2019; originally announced February 2019.

  27. arXiv:1712.03553  [pdf, other]

    stat.ML econ.EM stat.AP

    RNN-based counterfactual prediction, with an application to homestead policy and public schooling

    Authors: Jason Poulos, Shuxi Zeng

    Abstract: This paper proposes a method for estimating the effect of a policy intervention on an outcome over time. We train recurrent neural networks (RNNs) on the history of control unit outcomes to learn a useful representation for predicting future outcomes. The learned representation of control units is then applied to the treated units for predicting counterfactual outcomes. RNNs are specifically struc…

    Submitted 17 May, 2021; v1 submitted 10 December, 2017; originally announced December 2017.

    Journal ref: J. R. Stat. Soc., 70(4):1124-1139 (2021)