Skip to main content

Showing 1–50 of 166 results for author: Wu, X

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.00703  [pdf, other

    stat.CO

    A Partition-insensitive Parallel Framework for Distributed Model Fitting

    Authors: Xiaofei Wu, Rongmei Liang, Fabio Roli, Marcello Pelillo, **g Yuan

    Abstract: Distributed model fitting refers to the process of fitting a mathematical or statistical model to the data using distributed computing resources, such that computing tasks are divided among multiple interconnected computers or nodes, often organized in a cluster or network. Most of the existing methods for distributed model fitting are to formulate it in a consensus optimization problem, and then… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  2. arXiv:2405.18781  [pdf, other

    cs.LG stat.ML

    On the Role of Attention Masks and LayerNorm in Transformers

    Authors: Xinyi Wu, Amir Ajorlou, Yifei Wang, Stefanie Jegelka, Ali Jadbabaie

    Abstract: Self-attention is the key mechanism of transformers, which are the essential building blocks of modern foundation models. Recent studies have shown that pure self-attention suffers from an increasing degree of rank collapse as depth increases, limiting model expressivity and further utilization of model depth. The existing literature on rank collapse, however, has mostly overlooked other critical… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

  3. arXiv:2405.17734  [pdf, other

    cs.LG stat.AP

    Towards Efficient Disaster Response via Cost-effective Unbiased Class Rate Estimation through Neyman Allocation Stratified Sampling Active Learning

    Authors: Yanbing Bai, Xinyi Wu, Lai Xu, Jihan Pei, Erick Mas, Shunichi Koshimura

    Abstract: With the rapid development of earth observation technology, we have entered an era of massively available satellite remote-sensing data. However, a large amount of satellite remote sensing data lacks a label or the label cost is too high to hinder the potential of AI technology mining satellite data. Especially in such an emergency response scenario that uses satellite data to evaluate the degree… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  4. arXiv:2404.18670  [pdf, other

    cs.LG stat.AP

    Enhancing Uncertain Demand Prediction in Hospitals Using Simple and Advanced Machine Learning

    Authors: Annie Hu, Samuel Stockman, Xun Wu, Richard Wood, Bangdong Zhi, Oliver Y. Chén

    Abstract: Early and timely prediction of patient care demand not only affects effective resource allocation but also influences clinical decision-making as well as patient experience. Accurately predicting patient care demand, however, is a ubiquitous challenge for hospitals across the world due, in part, to the demand's time-varying temporal variability, and, in part, to the difficulty in modelling trends… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

  5. arXiv:2404.17735  [pdf, other

    cs.LG cs.AI stat.ME

    Causal Diffusion Autoencoders: Toward Counterfactual Generation via Diffusion Probabilistic Models

    Authors: Aneesh Komanduri, Chen Zhao, Feng Chen, Xintao Wu

    Abstract: Diffusion probabilistic models (DPMs) have become the state-of-the-art in high-quality image generation. However, DPMs have an arbitrary noisy latent space with no interpretable or controllable semantics. Although there has been significant research effort to improve image sample quality, there is little work on representation-controlled generation using diffusion models. Specifically, causal mode… ▽ More

    Submitted 8 May, 2024; v1 submitted 26 April, 2024; originally announced April 2024.

    Comments: Short version accepted to CVPR 2024 Workshop on Generative Models for Computer Vision

  6. arXiv:2401.04603  [pdf, other

    stat.ME stat.AP

    Skewed Pivot-Blend Modeling with Applications to Semicontinuous Outcomes

    Authors: Yiyuan She, Xiaoqiang Wu, Lizhu Tao, Debajyoti Sinha

    Abstract: Skewness is a common occurrence in statistical applications. In recent years, various distribution families have been proposed to model skewed data by introducing unequal scales based on the median or mode. However, we argue that the point at which unbalanced scales occur may be at any quantile and cannot be reparametrized as an ordinary shift parameter in the presence of skewness. In this paper,… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

  7. arXiv:2312.12731  [pdf, other

    cs.LG cs.AI stat.ML

    Robustly Improving Bandit Algorithms with Confounded and Selection Biased Offline Data: A Causal Approach

    Authors: Wen Huang, Xintao Wu

    Abstract: This paper studies bandit problems where an agent has access to offline data that might be utilized to potentially improve the estimation of each arm's reward distribution. A major obstacle in this setting is the existence of compound biases from the observational data. Ignoring these biases and blindly fitting a model with the biased data could even negatively affect the online learning phase. In… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  8. arXiv:2312.08583  [pdf, other

    cs.CL stat.ML

    ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks

    Authors: Xiaoxia Wu, Haojun Xia, Stephen Youn, Zhen Zheng, Shiyang Chen, Arash Bakhtiari, Michael Wyatt, Reza Yazdani Aminabadi, Yuxiong He, Olatunji Ruwase, Leon Song, Zhewei Yao

    Abstract: This study examines 4-bit quantization methods like GPTQ in large language models (LLMs), highlighting GPTQ's overfitting and limited enhancement in Zero-Shot tasks. While prior works merely focusing on zero-shot measurement, we extend task scope to more generative categories such as code generation and abstractive summarization, in which we found that INT4 quantization can significantly underperf… ▽ More

    Submitted 18 December, 2023; v1 submitted 13 December, 2023; originally announced December 2023.

  9. arXiv:2312.04648  [pdf, other

    stat.ML cs.LG

    Enhancing Polynomial Chaos Expansion Based Surrogate Modeling using a Novel Probabilistic Transfer Learning Strategy

    Authors: Wyatt Bridgman, Uma Balakrishnan, Reese Jones, Jiefu Chen, Xuqing Wu, Cosmin Safta, Yueqin Huang, Mohammad Khalil

    Abstract: In the field of surrogate modeling, polynomial chaos expansion (PCE) allows practitioners to construct inexpensive yet accurate surrogates to be used in place of the expensive forward model simulations. For black-box simulations, non-intrusive PCE allows the construction of these surrogates using a set of simulation response evaluations. In this context, the PCE coefficients can be obtained using… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  10. arXiv:2311.14676  [pdf, other

    cs.CY cs.CR cs.HC econ.GN stat.AP

    Decoding Social Sentiment in DAO: A Comparative Analysis of Blockchain Governance Communities

    Authors: Yutong Quan, Xintong Wu, Wanlin Deng, Luyao Zhang

    Abstract: Blockchain technology is leading a revolutionary transformation across diverse industries, with effective governance being critical for the success and sustainability of blockchain projects. Community forums, pivotal in engaging decentralized autonomous organizations (DAOs), significantly impact blockchain governance decisions. Concurrently, Natural Language Processing (NLP), particularly sentimen… ▽ More

    Submitted 25 May, 2024; v1 submitted 31 October, 2023; originally announced November 2023.

  11. arXiv:2311.12319  [pdf, other

    stat.ML math.ST

    A unified consensus-based parallel ADMM algorithm for high-dimensional regression with combined regularizations

    Authors: Xiaofei Wu, Zhimin Zhang, Zhenyu Cui

    Abstract: The parallel alternating direction method of multipliers (ADMM) algorithm is widely recognized for its effectiveness in handling large-scale datasets stored in a distributed manner, making it a popular choice for solving statistical learning models. However, there is currently limited research on parallel algorithms specifically designed for high-dimensional regression with combined (composite) re… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

  12. arXiv:2310.11011  [pdf, other

    cs.LG cs.AI stat.ML

    From Identifiable Causal Representations to Controllable Counterfactual Generation: A Survey on Causal Generative Modeling

    Authors: Aneesh Komanduri, Xintao Wu, Yongkai Wu, Feng Chen

    Abstract: Deep generative models have shown tremendous capability in data density estimation and data generation from finite samples. While these models have shown impressive performance by learning correlations among features in the data, some fundamental shortcomings are their lack of explainability, tendency to induce spurious correlations, and poor out-of-distribution extrapolation. To remedy such chall… ▽ More

    Submitted 23 May, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Published in Transactions on Machine Learning Research (TMLR) (05/2024); 72 pages, 27 figures, 4 tables

    Journal ref: Transactions on Machine Learning Research, 2024

  13. arXiv:2310.09999  [pdf, other

    stat.ML cs.LG eess.SP

    Outlier Detection Using Generative Models with Theoretical Performance Guarantees

    Authors: Jirong Yi, **gchao Gao, Tianming Wang, Xiaodong Wu, Weiyu Xu

    Abstract: This paper considers the problem of recovering signals modeled by generative models from linear measurements contaminated with sparse outliers. We propose an outlier detection approach for reconstructing the ground-truth signals modeled by generative models under sparse outliers. We establish theoretical recovery guarantees for reconstruction of signals using generative models in the presence of o… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:1810.11335

  14. arXiv:2310.00561  [pdf, other

    stat.CO cs.MS econ.EM

    CausalGPS: An R Package for Causal Inference With Continuous Exposures

    Authors: Naeem Khoshnevis, Xiao Wu, Danielle Braun

    Abstract: Quantifying the causal effects of continuous exposures on outcomes of interest is critical for social, economic, health, and medical research. However, most existing software packages focus on binary exposures. We develop the CausalGPS R package that implements a collection of algorithms to provide algorithmic solutions for causal inference with continuous exposures. CausalGPS implements a causal… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

    Comments: 22 pages, 8 figures

  15. arXiv:2309.08043  [pdf, ps, other

    cs.LG stat.ME

    On Prediction Feature Assignment in the Heckman Selection Model

    Authors: Huy Mai, Xintao Wu

    Abstract: Under missing-not-at-random (MNAR) sample selection bias, the performance of a prediction model is often degraded. This paper focuses on one classic instance of MNAR sample selection bias where a subset of samples have non-randomly missing outcomes. The Heckman selection model and its variants have commonly been used to handle this type of sample selection bias. The Heckman model uses two separate… ▽ More

    Submitted 22 April, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: Full version of work accepted to IJCNN 2024

  16. arXiv:2308.09691  [pdf, other

    stat.ML cs.LG

    Reduced Order Modeling of a MOOSE-based Advanced Manufacturing Model with Operator Learning

    Authors: Mahmoud Yaseen, Dewen Yushu, Peter German, Xu Wu

    Abstract: Advanced Manufacturing (AM) has gained significant interest in the nuclear community for its potential application on nuclear materials. One challenge is to obtain desired material properties via controlling the manufacturing process during runtime. Intelligent AM based on deep reinforcement learning (DRL) relies on an automated process-level control mechanism to generate optimal design variables… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 10 Pages, 7 Figures, 2 Tables. arXiv admin note: text overlap with arXiv:2308.02462

    Journal ref: In Proceedings of the 2023 International Conference on Mathematics and Computational Methods Applied to Nuclear Science and Engineering (M&C 2023)

  17. arXiv:2308.09444  [pdf, other

    cs.LG stat.ML

    An Efficient 1 Iteration Learning Algorithm for Gaussian Mixture Model And Gaussian Mixture Embedding For Neural Network

    Authors: Weiguo Lu, Xuan Wu, Deng Ding, Gangnan Yuan

    Abstract: We propose an Gaussian Mixture Model (GMM) learning algorithm, based on our previous work of GMM expansion idea. The new algorithm brings more robustness and simplicity than classic Expectation Maximization (EM) algorithm. It also improves the accuracy and only take 1 iteration for learning. We theoretically proof that this new algorithm is guarantee to converge regardless the parameters initialis… ▽ More

    Submitted 6 September, 2023; v1 submitted 18 August, 2023; originally announced August 2023.

  18. Fast and Accurate Reduced-Order Modeling of a MOOSE-based Additive Manufacturing Model with Operator Learning

    Authors: Mahmoud Yaseen, Dewen Yushu, Peter German, Xu Wu

    Abstract: One predominant challenge in additive manufacturing (AM) is to achieve specific material properties by manipulating manufacturing process parameters during the runtime. Such manipulation tends to increase the computational load imposed on existing simulation tools employed in AM. The goal of the present work is to construct a fast and accurate reduced-order model (ROM) for an AM model developed wi… ▽ More

    Submitted 4 August, 2023; originally announced August 2023.

    Comments: 28 pages, 18 figures, 4 tables

    Journal ref: Int J Adv Manuf Technol (2023)

  19. arXiv:2308.01628  [pdf, other

    stat.ME

    Estimating causal quantile exposure response functions via matching

    Authors: Luca Merlo, Francesca Dominici, Lea Petrella, Nicola Salvati, Xiao Wu

    Abstract: We develop new matching estimators for estimating causal quantile exposure-response functions and quantile exposure effects with continuous treatments. We provide identification results for the parameters of interest and establish the asymptotic properties of the derived estimators. We introduce a two-step estimation procedure. In the first step, we construct a matched data set via generalized pro… ▽ More

    Submitted 3 August, 2023; originally announced August 2023.

  20. arXiv:2308.00812  [pdf, other

    stat.ME

    Causal exposure-response curve estimation with surrogate confounders: a study of air pollution and children's health in Medicaid claims data

    Authors: Jenny J. Lee, Xiao Wu, Francesca Dominici, Rachel C. Nethery

    Abstract: In this paper, we undertake a case study in which interest lies in estimating a causal exposure-response function (ERF) for long-term exposure to fine particulate matter (PM$_{2.5}$) and respiratory hospitalizations in socioeconomically disadvantaged children using nationwide Medicaid claims data. New methods are needed to address the specific challenges the Medicaid data present. First, Medicaid… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 38 pages,5 figures

  21. Functional PCA and Deep Neural Networks-based Bayesian Inverse Uncertainty Quantification with Transient Experimental Data

    Authors: Ziyu Xie, Mahmoud Yaseen, Xu Wu

    Abstract: Inverse UQ is the process to inversely quantify the model input uncertainties based on experimental data. This work focuses on develo** an inverse UQ process for time-dependent responses, using dimensionality reduction by functional principal component analysis (PCA) and deep neural network (DNN)-based surrogate models. The demonstration is based on the inverse UQ of TRACE physical model paramet… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: 31 pages, 21 figures

  22. arXiv:2306.13255  [pdf, other

    cs.LG stat.ML

    Precise Asymptotic Generalization for Multiclass Classification with Overparameterized Linear Models

    Authors: David X. Wu, Anant Sahai

    Abstract: We study the asymptotic generalization of an overparameterized linear model for multiclass classification under the Gaussian covariates bi-level model introduced in Subramanian et al.~'22, where the number of data points, features, and classes all grow together. We fully resolve the conjecture posed in Subramanian et al.~'22, matching the predicted regimes for generalization. Furthermore, our new… ▽ More

    Submitted 5 December, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: NeurIPS 2023, 56 pages

  23. arXiv:2306.01213  [pdf, other

    cs.LG stat.ML

    Learning Causally Disentangled Representations via the Principle of Independent Causal Mechanisms

    Authors: Aneesh Komanduri, Yongkai Wu, Feng Chen, Xintao Wu

    Abstract: Learning disentangled causal representations is a challenging problem that has gained significant attention recently due to its implications for extracting meaningful information for downstream tasks. In this work, we define a new notion of causal disentanglement from the perspective of independent causal mechanisms. We propose ICM-VAE, a framework for learning causally disentangled representation… ▽ More

    Submitted 8 May, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: Accepted to IJCAI 2024

  24. arXiv:2305.16622  [pdf, other

    stat.CO physics.data-an

    Inverse Uncertainty Quantification by Hierarchical Bayesian Modeling and Application in Nuclear System Thermal-Hydraulics Codes

    Authors: Chen Wang, Xu Wu, Tomasz Kozlowski

    Abstract: Inverse Uncertainty Quantification (IUQ) method has been widely used to quantify the uncertainty of Physical Model Parameters (PMPs) in nuclear Thermal Hydraulics (TH) systems. This paper introduces a novel hierarchical Bayesian model which aims to mitigate two existing challenges in IUQ: the high variability of PMPs under varying experimental conditions, and unknown model discrepancies or outlier… ▽ More

    Submitted 25 March, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

  25. arXiv:2305.16102  [pdf, other

    cs.LG cs.SI stat.ML

    Demystifying Oversmoothing in Attention-Based Graph Neural Networks

    Authors: Xinyi Wu, Amir Ajorlou, Zihui Wu, Ali Jadbabaie

    Abstract: Oversmoothing in Graph Neural Networks (GNNs) refers to the phenomenon where increasing network depth leads to homogeneous node representations. While previous work has established that Graph Convolutional Networks (GCNs) exponentially lose expressive power, it remains controversial whether the graph attention mechanism can mitigate oversmoothing. In this work, we provide a definitive answer to th… ▽ More

    Submitted 3 June, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023 spotlight. Fixed an error in the previous version; new results and remarks added

  26. arXiv:2303.14658  [pdf, other

    cs.IT cs.LG stat.ML

    On the tightness of information-theoretic bounds on generalization error of learning algorithms

    Authors: Xuetong Wu, Jonathan H. Manton, Uwe Aickelin, **gge Zhu

    Abstract: A recent line of works, initiated by Russo and Xu, has shown that the generalization error of a learning algorithm can be upper bounded by information measures. In most of the relevant works, the convergence rate of the expected generalization error is in the form of $O(\sqrt{λ/n})$ where $λ$ is some information-theoretic quantities such as the mutual information or conditional mutual information… ▽ More

    Submitted 26 March, 2023; originally announced March 2023.

    Comments: 32 pages, 1 figure. arXiv admin note: substantial text overlap with arXiv:2205.03131

  27. arXiv:2212.10701  [pdf, other

    cs.LG cs.SI stat.ML

    A Non-Asymptotic Analysis of Oversmoothing in Graph Neural Networks

    Authors: Xinyi Wu, Zhengdao Chen, William Wang, Ali Jadbabaie

    Abstract: Oversmoothing is a central challenge of building more powerful Graph Neural Networks (GNNs). While previous works have only demonstrated that oversmoothing is inevitable when the number of graph convolutions tends to infinity, in this paper, we precisely characterize the mechanism behind the phenomenon via a non-asymptotic analysis. Specifically, we distinguish between two different effects when a… ▽ More

    Submitted 28 February, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: Accepted by the 11th International Conference on Learning Representations (ICLR 2023)

  28. arXiv:2212.00985  [pdf, ps, other

    stat.AP

    A comparative analysis of several multivariate zero-inflated and zero-modified models with applications in insurance

    Authors: Pengcheng Zhang, David Pitt, Xueyuan Wu

    Abstract: Claim frequency data in insurance records the number of claims on insurance policies during a finite period of time. Given that insurance companies operate with multiple lines of insurance business where the claim frequencies on different lines of business are often correlated, multivariate count modeling with dependence for claim frequency is therefore essential. Due in part to the operation of b… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

  29. arXiv:2211.08654  [pdf, other

    stat.ML cs.LG physics.comp-ph

    Prediction and Uncertainty Quantification of SAFARI-1 Axial Neutron Flux Profiles with Neural Networks

    Authors: Lesego E. Moloko, Pavel M. Bokov, Xu Wu, Kostadin N. Ivanov

    Abstract: Artificial Neural Networks (ANNs) have been successfully used in various nuclear engineering applications, such as predicting reactor physics parameters within reasonable time and with a high level of accuracy. Despite this success, they cannot provide information about the model prediction uncertainties, making it difficult to assess ANN prediction credibility, especially in extrapolated domains.… ▽ More

    Submitted 15 November, 2022; originally announced November 2022.

    Comments: 34 pages, 12 figures

  30. arXiv:2209.05438  [pdf, other

    cs.CY cs.LG stat.AP stat.CO

    Alcohol Intake Differentiates AD and LATE: A Telltale Lifestyle from Two Large-Scale Datasets

    Authors: Xinxing Wu, Chong Peng, Peter T. Nelson, Qiang Cheng

    Abstract: Alzheimer's disease (AD), as a progressive brain disease, affects cognition, memory, and behavior. Similarly, limbic-predominant age-related TDP-43 encephalopathy (LATE) is a recently defined common neurodegenerative disease that mimics the clinical symptoms of AD. At present, the risk factors implicated in LATE and those distinguishing LATE from AD are largely unknown. We leveraged an integrated… ▽ More

    Submitted 25 August, 2022; originally announced September 2022.

    Comments: 10 pages

    Journal ref: AMIA 2022 Annual Symposium (AMIA 2022)

  31. arXiv:2207.04878  [pdf, other

    q-bio.GN cs.LG stat.AP

    Stacked Autoencoder Based Multi-Omics Data Integration for Cancer Survival Prediction

    Authors: Xing Wu, Qiulian Fang

    Abstract: Cancer survival prediction is important for develo** personalized treatments and inducing disease-causing mechanisms. Multi-omics data integration is attracting widespread interest in cancer research for providing information for understanding cancer progression at multiple genetic levels. Many works, however, are limited because of the high dimensionality and heterogeneity of multi-omics data.… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

  32. arXiv:2206.05253  [pdf, other

    cs.CV cs.AI cs.LG stat.AP

    Rethinking Spatial Invariance of Convolutional Networks for Object Counting

    Authors: Zhi-Qi Cheng, Qi Dai, Hong Li, **gKuan Song, Xiao Wu, Alexander G. Hauptmann

    Abstract: Previous work generally believes that improving the spatial invariance of convolutional networks is the key to object counting. However, after verifying several mainstream counting networks, we surprisingly found too strict pixel-level spatial invariance would cause overfit noise in the density map generation. In this paper, we try to use locally connected Gaussian kernels to replace the original… ▽ More

    Submitted 18 August, 2022; v1 submitted 10 June, 2022; originally announced June 2022.

    Comments: Accepted to CVPR 2022, Code: https://github.com/zhiqic/Rethinking-Counting

  33. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  34. arXiv:2204.10981  [pdf, other

    cs.LG cs.DC stat.ML

    Distributed Dynamic Safe Screening Algorithms for Sparse Regularization

    Authors: Runxue Bao, Xidong Wu, Wenhan Xian, Heng Huang

    Abstract: Distributed optimization has been widely used as one of the most efficient approaches for model training with massive samples. However, large-scale learning problems with both massive samples and high-dimensional features widely exist in the era of big data. Safe screening is a popular technique to speed up high-dimensional models by discarding the inactive features with zero coefficients. Neverth… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

  35. arXiv:2203.09609  [pdf

    stat.AP

    An alternative Interpretation of residual feed intake by phenotypic recursive relationships in dairy cattle

    Authors: Xiao-Lin Wu, Kristen L. Parker Gaddis, Javier Burchard, H. Duane Norman, Ezequiel Nicolazzi, Erin E. Connor, John B. Cole, Joao Durr

    Abstract: There has been an increasing interest in residual feed intake (RFI) as a measure of net feed efficiency in dairy cattle. RFI phenotypes are obtained as residuals from linear regression encompassing relevant factors (i.e., energy sinks) to account for body tissue mobilization. However, fitting energy sink phenotypes as regression variables in standard linear regression was criticized because phenot… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: 12 pages, 2 Tables, and 1 Figure

  36. arXiv:2203.09606  [pdf

    stat.AP

    Daily milk yield correction factors: what are they?

    Authors: Xiao-Lin Wu, George Wiggans, H. Duane Norman, Asha M. Miles, Curt Van Tassell, Ransom L. Baldwin VI, Javier Burchard, Joao Durr

    Abstract: Cows are typically milked two or more times on a test day, but not all these milkings are sampled and weighed. Statistical methods have been proposed to estimate daily yields in dairy cows, centering on various yield correction factors in two broad categories. The initial approach estimated a test-day yield with doubled morning (AM) or evening (PM) yield in the AM-PM milking plans, assuming equal… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: 14 pages, 2 tables, and 1 figure

  37. arXiv:2203.00176  [pdf, other

    cs.LG math.OC stat.ML

    When AUC meets DRO: Optimizing Partial AUC for Deep Learning with Non-Convex Convergence Guarantee

    Authors: Dixian Zhu, Gang Li, Bokun Wang, Xiaodong Wu, Tianbao Yang

    Abstract: In this paper, we propose systematic and efficient gradient-based methods for both one-way and two-way partial AUC (pAUC) maximization that are applicable to deep learning. We propose new formulations of pAUC surrogate objectives by using the distributionally robust optimization (DRO) to define the loss for each individual positive data. We consider two formulations of DRO, one of which is based o… ▽ More

    Submitted 17 September, 2023; v1 submitted 28 February, 2022; originally announced March 2022.

    Comments: 25 pages

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, 2022

  38. arXiv:2202.00206  [pdf

    cs.HC eess.SP q-bio.QM stat.AP

    A pilot study of the Earable device to measure facial muscle and eye movement tasks among healthy volunteers

    Authors: Matthew F. Wipperman, Galen Pogoncheff, Katrina F. Mateo, Xuefang Wu, Yiziying Chen, Oren Levy, Andreja Avbersek, Robin R. Deterding, Sara C. Hamon, Tam Vu, Rinol Alaj, Olivier Harari

    Abstract: Many neuromuscular disorders impair function of cranial nerve enervated muscles. Clinical assessment of cranial muscle function has several limitations. Clinician rating of symptoms suffers from inter-rater variation, qualitative or semi-quantitative scoring, and limited ability to capture infrequent or fluctuating symptoms. Patient-reported outcomes are limited by recall bias and poor precision.… ▽ More

    Submitted 31 January, 2022; originally announced February 2022.

  39. arXiv:2112.02180  [pdf, other

    stat.CO stat.ME

    Generalized Transitional Markov Chain Monte Carlo Sampling Technique for Bayesian Inversion

    Authors: Han Lu, Mohammad Khalil, Thomas Catanach, Jiefu Chen, Xuqing Wu, Xin Fu, Cosmin Safta, Yueqin Huang

    Abstract: In the context of Bayesian inversion for scientific and engineering modeling, Markov chain Monte Carlo sampling strategies are the benchmark due to their flexibility and robustness in dealing with arbitrary posterior probability density functions (PDFs). However, these algorithms been shown to be inefficient when sampling from posterior distributions that are high-dimensional or exhibit multi-moda… ▽ More

    Submitted 3 December, 2021; originally announced December 2021.

  40. arXiv:2111.01387  [pdf, other

    cs.LG stat.ML

    Understanding Entropic Regularization in GANs

    Authors: Daria Reshetova, Yikun Bai, Xiugang Wu, Ayfer Ozgur

    Abstract: Generative Adversarial Networks are a popular method for learning distributions from data by modeling the target distribution as a function of a known distribution. The function, often referred to as the generator, is optimized to minimize a chosen distance measure between the generated and target distributions. One commonly used measure for this purpose is the Wasserstein distance. However, Wasse… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: 29 pages, 7 figures

  41. arXiv:2110.07435  [pdf, other

    cs.LG eess.IV math.OC stat.ML

    Adaptive Differentially Private Empirical Risk Minimization

    Authors: Xiaoxia Wu, Lingxiao Wang, Irina Cristali, Quanquan Gu, Rebecca Willett

    Abstract: We propose an adaptive (stochastic) gradient perturbation method for differentially private empirical risk minimization. At each iteration, the random noise added to the gradient is optimally adapted to the stepsize; we name this process adaptive differentially private (ADP) learning. Given the same privacy budget, we prove that the ADP method considerably improves the utility guarantee compared t… ▽ More

    Submitted 24 October, 2021; v1 submitted 14 October, 2021; originally announced October 2021.

  42. arXiv:2110.00944  [pdf, other

    cs.LG cs.AI stat.ML

    Kalman Bayesian Neural Networks for Closed-form Online Learning

    Authors: Philipp Wagner, Xinyang Wu, Marco F. Huber

    Abstract: Compared to point estimates calculated by standard neural networks, Bayesian neural networks (BNN) provide probability distributions over the output predictions and model parameters, i.e., the weights. Training the weight distribution of a BNN, however, is more involved due to the intractability of the underlying Bayesian inference problem and thus, requires efficient approximations. In this paper… ▽ More

    Submitted 30 November, 2022; v1 submitted 3 October, 2021; originally announced October 2021.

    Comments: 37th AAAI Conference on Artificial Intelligence (AAAI)

  43. arXiv:2110.00921  [pdf, other

    stat.ML cs.LG econ.EM

    Hierarchical Gaussian Process Models for Regression Discontinuity/Kink under Sharp and Fuzzy Designs

    Authors: Ximing Wu

    Abstract: We propose nonparametric Bayesian estimators for causal inference exploiting Regression Discontinuity/Kink (RD/RK) under sharp and fuzzy designs. Our estimators are based on Gaussian Process (GP) regression and classification. The GP methods are powerful probabilistic machine learning approaches that are advantageous in terms of derivative estimation and uncertainty quantification, facilitating RK… ▽ More

    Submitted 28 February, 2022; v1 submitted 3 October, 2021; originally announced October 2021.

  44. Estimating a Causal Exposure Response Function with a Continuous Error-Prone Exposure: A Study of Fine Particulate Matter and All-Cause Mortality

    Authors: Kevin P. Josey, Priyanka deSouza, Xiao Wu, Danielle Braun, Rachel Nethery

    Abstract: Numerous studies have examined the associations between long-term exposure to fine particulate matter (PM2.5) and adverse health outcomes. Recently, many of these studies have begun to employ high-resolution predicted PM2.5 concentrations, which are subject to measurement error. Previous approaches for exposure measurement error correction have either been applied in non-causal settings or have on… ▽ More

    Submitted 28 November, 2022; v1 submitted 30 September, 2021; originally announced September 2021.

  45. arXiv:2109.09711  [pdf, other

    stat.AP

    Quantifying Grid Resilience Against Extreme Weather Using Large-Scale Customer Power Outage Data

    Authors: Shixiang Zhu, Rui Yao, Yao Xie, Feng Qiu, Yueming, Qiu, Xuan Wu

    Abstract: In recent decades, the weather around the world has become more irregular and extreme, often causing large-scale extended power outages. Resilience -- the capability of withstanding, adapting to, and recovering from a large-scale disruption -- has become a top priority for the power sector. However, the understanding of power grid resilience still stays on the conceptual level mostly or focuses on… ▽ More

    Submitted 4 September, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

  46. arXiv:2109.08282  [pdf, other

    stat.ML cs.LG

    AdaLoss: A computationally-efficient and provably convergent adaptive gradient method

    Authors: Xiaoxia Wu, Yuege Xie, Simon Du, Rachel Ward

    Abstract: We propose a computationally-friendly adaptive learning rate schedule, "AdaLoss", which directly uses the information of the loss function to adjust the stepsize in gradient descent methods. We prove that this schedule enjoys linear convergence in linear regression. Moreover, we provide a linear convergence guarantee over the non-convex regime, in the context of two-layer over-parameterized neural… ▽ More

    Submitted 16 September, 2021; originally announced September 2021.

    Comments: arXiv admin note: text overlap with arXiv:1902.07111

  47. arXiv:2106.02197  [pdf, other

    cs.LG stat.ML

    Top-$k$ Regularization for Supervised Feature Selection

    Authors: Xinxing Wu, Qiang Cheng

    Abstract: Feature selection identifies subsets of informative features and reduces dimensions in the original feature space, hel** provide insights into data generation or a variety of domain problems. Existing methods mainly depend on feature scoring functions or sparse regularizations; nonetheless, they have limited ability to reconcile the representativeness and inter-correlations of features. In this… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: 12 pages

  48. arXiv:2105.11069  [pdf, other

    cs.LG cs.IT stat.ML

    InfoFair: Information-Theoretic Intersectional Fairness

    Authors: Jian Kang, Tiankai Xie, Xintao Wu, Ross Maciejewski, Hanghang Tong

    Abstract: Algorithmic fairness is becoming increasingly important in data mining and machine learning. Among others, a foundational notation is group fairness. The vast majority of the existing works on group fairness, with a few exceptions, primarily focus on debiasing with respect to a single sensitive attribute, despite the fact that the co-existence of multiple sensitive attributes (e.g., gender, race,… ▽ More

    Submitted 31 December, 2022; v1 submitted 23 May, 2021; originally announced May 2021.

    Comments: IEEE Big Data 2022

  49. Bayesian Inverse Uncertainty Quantification of a MOOSE-based Melt Pool Model for Additive Manufacturing Using Experimental Data

    Authors: Ziyu Xie, Wen Jiang, Congjian Wang, Xu Wu

    Abstract: Additive manufacturing (AM) technology is being increasingly adopted in a wide variety of application areas due to its ability to rapidly produce, prototype, and customize designs. AM techniques afford significant opportunities in regard to nuclear materials, including an accelerated fabrication process and reduced cost. High-fidelity modeling and simulation (M\&S) of AM processes is being develop… ▽ More

    Submitted 17 May, 2021; v1 submitted 11 May, 2021; originally announced May 2021.

    Comments: 26 pages, 11 figures

  50. arXiv:2105.03454  [pdf, other

    stat.ME stat.AP

    A Bayesian Gaussian Process for Estimating a Causal Exposure Response Curve in Environmental Epidemiology

    Authors: Boyu Ren, Xiao Wu, Danielle Braun, Natesh Pillai, Francesca Dominici

    Abstract: Motivated by environmental policy questions, we address the challenges of estimation, change point detection, and uncertainty quantification of a causal exposure-response function (CERF). Under a potential outcome framework, the CERF describes the relationship between a continuously varying exposure (or treatment) and its causal effect on an outcome. We propose a new Bayesian approach that relies… ▽ More

    Submitted 25 January, 2023; v1 submitted 7 May, 2021; originally announced May 2021.