Skip to main content

Showing 1–50 of 708 results for author: Wang, Z

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.03082  [pdf, other

    cs.LG stat.ML

    Stable Heterogeneous Treatment Effect Estimation across Out-of-Distribution Populations

    Authors: Yuling Zhang, Anpeng Wu, Kun Kuang, Liang Du, Zixun Sun, Zhi Wang

    Abstract: Heterogeneous treatment effect (HTE) estimation is vital for understanding the change of treatment effect across individuals or subgroups. Most existing HTE estimation methods focus on addressing selection bias induced by imbalanced distributions of confounders between treated and control units, but ignore distribution shifts across populations. Thereby, their applicability has been limited to the… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Accepted by ICDE'2024

  2. arXiv:2407.02539  [pdf

    cs.RO cs.AI cs.LG stat.ML

    Research on Autonomous Robots Navigation based on Reinforcement Learning

    Authors: Zixiang Wang, Hao Yan, Yining Wang, Zhengjia Xu, Zhuoyue Wang, Zhizhong Wu

    Abstract: Reinforcement learning continuously optimizes decision-making based on real-time feedback reward signals through continuous interaction with the environment, demonstrating strong adaptive and self-learning capabilities. In recent years, it has become one of the key methods to achieve autonomous navigation of robots. In this work, an autonomous robot navigation method based on reinforcement learnin… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  3. arXiv:2407.02501  [pdf, other

    cs.LG cs.CE eess.SY stat.AP

    Data-driven Power Flow Linearization: Theory

    Authors: Mengshuo Jia, Gabriela Hug, Ning Zhang, Zhaojian Wang, Yi Wang, Chongqing Kang

    Abstract: This two-part tutorial dives into the field of data-driven power flow linearization (DPFL), a domain gaining increased attention. DPFL stands out for its higher approximation accuracy, wide adaptability, and better ability to implicitly incorporate the latest system attributes. This renders DPFL a potentially superior option for managing the significant fluctuations from renewable energy sources,… ▽ More

    Submitted 10 June, 2024; originally announced July 2024.

    Comments: 20 pages

  4. arXiv:2406.15575  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Sketch-GNN: Scalable Graph Neural Networks with Sublinear Training Complexity

    Authors: Mucong Ding, Tahseen Rabbani, Bang An, Evan Z Wang, Furong Huang

    Abstract: Graph Neural Networks (GNNs) are widely applied to graph learning problems such as node classification. When scaling up the underlying graphs of GNNs to a larger size, we are forced to either train on the complete graph and keep the full graph adjacency and node embeddings in memory (which is often infeasible) or mini-batch sample the graph (which results in exponentially growing computational com… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: NeurIPS 2022

  5. arXiv:2406.12017  [pdf, other

    stat.ML cs.LG stat.CO

    Sparsity-Constraint Optimization via Splicing Iteration

    Authors: Zezhi Wang, ** Zhu, Junxian Zhu, Borui Tang, Hongmei Lin, Xueqin Wang

    Abstract: Sparsity-constraint optimization has wide applicability in signal processing, statistics, and machine learning. Existing fast algorithms must burdensomely tune parameters, such as the step size or the implementation of precise stop criteria, which may be challenging to determine in practice. To address this issue, we develop an algorithm named Sparsity-Constraint Optimization via sPlicing itEratio… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 34 pages

  6. arXiv:2406.09564  [pdf, other

    cs.LG cs.AI cs.CE cs.CV stat.ML

    Towards Domain Adaptive Neural Contextual Bandits

    Authors: Ziyan Wang, Hao Wang

    Abstract: Contextual bandit algorithms are essential for solving real-world decision making problems. In practice, collecting a contextual bandit's feedback from different domains may involve different costs. For example, measuring drug reaction from mice (as a source domain) and humans (as a target domain). Unfortunately, adapting a contextual bandit algorithm from a source domain to a target domain with d… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  7. arXiv:2406.06893  [pdf, other

    stat.ML cs.IT cs.LG

    Transformers Provably Learn Sparse Token Selection While Fully-Connected Nets Cannot

    Authors: Zixuan Wang, Stanley Wei, Daniel Hsu, Jason D. Lee

    Abstract: The transformer architecture has prevailed in various deep learning settings due to its exceptional capabilities to select and compose structural information. Motivated by these capabilities, Sanford et al. proposed the sparse token selection task, in which transformers excel while fully-connected networks (FCNs) fail in the worst case. Building upon that, we strengthen the FCN lower bound to an a… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  8. arXiv:2406.06833  [pdf, other

    eess.SY stat.AP

    Data-driven Power Flow Linearization: Simulation

    Authors: Mengshuo Jia, Gabriela Hug, Ning Zhang, Zhaojian Wang, Yi Wang, Chongqing Kang

    Abstract: Building on the theoretical insights of Part I, this paper, as the second part of the tutorial, dives deeper into data-driven power flow linearization (DPFL), focusing on comprehensive numerical testing. The necessity of these simulations stems from the theoretical analysis's inherent limitations, particularly the challenge of identifying the differences in real-world performance among DPFL method… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 26 pages

  9. arXiv:2406.05260  [pdf, other

    stat.ML cs.LG

    Generative modeling of density regression through tree flows

    Authors: Zhuoqun Wang, Naoki Awaya, Li Ma

    Abstract: A common objective in the analysis of tabular data is estimating the conditional distribution (in contrast to only producing predictions) of a set of "outcome" variables given a set of "covariates", which is sometimes referred to as the "density regression" problem. Beyond estimation on the conditional distribution, the generative ability of drawing synthetic samples from the learned conditional d… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 24 pages, 9 figures

  10. arXiv:2406.05225  [pdf, other

    cs.LG stat.ML

    A Manifold Perspective on the Statistical Generalization of Graph Neural Networks

    Authors: Zhiyang Wang, Juan Cervino, Alejandro Ribeiro

    Abstract: Convolutional neural networks have been successfully extended to operate on graphs, giving rise to Graph Neural Networks (GNNs). GNNs combine information from adjacent nodes by successive applications of graph convolutions. GNNs have been implemented successfully in various learning tasks while the theoretical understanding of their generalization capability is still in progress. In this paper, we… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 34 pages,22 figures

  11. arXiv:2406.05213  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    On Subjective Uncertainty Quantification and Calibration in Natural Language Generation

    Authors: Ziyu Wang, Chris Holmes

    Abstract: Applications of large language models often involve the generation of free-form responses, in which case uncertainty quantification becomes challenging. This is due to the need to identify task-specific uncertainties (e.g., about the semantics) which appears difficult to define in general cases. This work addresses these challenges from a perspective of Bayesian decision theory, starting from the… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  12. arXiv:2406.05193  [pdf, ps, other

    stat.ME stat.CO

    Probabilistic Clustering using Shared Latent Variable Model for Assessing Alzheimers Disease Biomarkers

    Authors: Yizhen Xu, Scott Zeger, Zheyu Wang

    Abstract: The preclinical stage of many neurodegenerative diseases can span decades before symptoms become apparent. Understanding the sequence of preclinical biomarker changes provides a critical opportunity for early diagnosis and effective intervention prior to significant loss of patients' brain functions. The main challenge to early detection lies in the absence of direct observation of the disease sta… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  13. arXiv:2406.04575  [pdf, other

    cs.LG cs.AI stat.AP stat.ML

    Optimization of geological carbon storage operations with multimodal latent dynamic model and deep reinforcement learning

    Authors: Zhongzheng Wang, Yuntian Chen, Guodong Chen, Dongxiao Zhang

    Abstract: Maximizing storage performance in geological carbon storage (GCS) is crucial for commercial deployment, but traditional optimization demands resource-intensive simulations, posing computational challenges. This study introduces the multimodal latent dynamic (MLD) model, a deep learning framework for fast flow prediction and well control optimization in GCS. The MLD model includes a representation… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  14. arXiv:2406.04329  [pdf, other

    cs.LG stat.ML

    Simplified and Generalized Masked Diffusion for Discrete Data

    Authors: Jiaxin Shi, Kehang Han, Zhe Wang, Arnaud Doucet, Michalis K. Titsias

    Abstract: Masked (or absorbing) diffusion is actively explored as an alternative to autoregressive models for generative modeling of discrete data. However, existing work in this area has been hindered by unnecessarily complex model formulations and unclear relationships between different perspectives, leading to suboptimal parameterization, training objectives, and ad hoc adjustments to counteract these is… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  15. arXiv:2406.01561  [pdf, other

    cs.CV cs.AI cs.CL cs.LG stat.ML

    Long and Short Guidance in Score identity Distillation for One-Step Text-to-Image Generation

    Authors: Mingyuan Zhou, Zhendong Wang, Huangjie Zheng, Hai Huang

    Abstract: Diffusion-based text-to-image generation models trained on extensive text-image pairs have shown the capacity to generate photorealistic images consistent with textual descriptions. However, a significant limitation of these models is their slow sample generation, which requires iterative refinement through the same network. In this paper, we enhance Score identity Distillation (SiD) by develo**… ▽ More

    Submitted 22 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  16. arXiv:2406.00793  [pdf, other

    stat.ML cs.LG

    Is In-Context Learning in Large Language Models Bayesian? A Martingale Perspective

    Authors: Fabian Falck, Ziyu Wang, Chris Holmes

    Abstract: In-context learning (ICL) has emerged as a particularly remarkable characteristic of Large Language Models (LLM): given a pretrained LLM and an observed dataset, LLMs can make predictions for new data points from the same distribution without fine-tuning. Numerous works have postulated ICL as approximately Bayesian inference, rendering this a natural hypothesis. In this work, we analyse this hypot… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Accepted at International Conference on Machine Learning (ICML) 2024

  17. arXiv:2405.20763  [pdf, other

    cs.LG math.OC stat.ML

    Improving Generalization and Convergence by Enhancing Implicit Regularization

    Authors: Mingze Wang, Haotian He, **bo Wang, Zilin Wang, Guanhua Huang, Feiyu Xiong, Zhiyu Li, Weinan E, Lei Wu

    Abstract: In this work, we propose an Implicit Regularization Enhancement (IRE) framework to accelerate the discovery of flat solutions in deep learning, thereby improving generalization and convergence. Specifically, IRE decouples the dynamics of flat and sharp directions, which boosts the sharpness reduction along flat directions while maintaining the training stability in sharp directions. We show that I… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 35 pages

  18. arXiv:2405.18459  [pdf, other

    cs.IT cs.AI cs.LG stat.ME

    Probing the Information Theoretical Roots of Spatial Dependence Measures

    Authors: Zhangyu Wang, Krzysztof Janowicz, Gengchen Mai, Ivan Majic

    Abstract: Intuitively, there is a relation between measures of spatial dependence and information theoretical measures of entropy. For instance, we can provide an intuition of why spatial data is special by stating that, on average, spatial data samples contain less than expected information. Similarly, spatial data, e.g., remotely sensed imagery, that is easy to compress is also likely to show significant… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: COSIT-2024 Conference Proceedings

  19. arXiv:2405.18395  [pdf, other

    cs.LG cs.AI stat.AP

    MC-GTA: Metric-Constrained Model-Based Clustering using Goodness-of-fit Tests with Autocorrelations

    Authors: Zhangyu Wang, Gengchen Mai, Krzysztof Janowicz, Ni Lao

    Abstract: A wide range of (multivariate) temporal (1D) and spatial (2D) data analysis tasks, such as grou** vehicle sensor trajectories, can be formulated as clustering with given metric constraints. Existing metric-constrained clustering algorithms overlook the rich correlation between feature similarity and metric distance, i.e., metric autocorrelation. The model-based variations of these clustering alg… ▽ More

    Submitted 2 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: ICML-2024 Proceedings

  20. arXiv:2405.16436  [pdf, other

    cs.LG cs.AI stat.ML

    Provably Mitigating Overoptimization in RLHF: Your SFT Loss is Implicitly an Adversarial Regularizer

    Authors: Zhihan Liu, Miao Lu, Shenao Zhang, Boyi Liu, Hongyi Guo, Yingxiang Yang, Jose Blanchet, Zhaoran Wang

    Abstract: Aligning generative models with human preference via RLHF typically suffers from overoptimization, where an imperfectly learned reward model can misguide the generative model to output undesired responses. We investigate this problem in a principled manner by identifying the source of the misalignment as a form of distributional shift and uncertainty in learning human preferences. To mitigate over… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 27 pages, 7 figures

  21. arXiv:2405.07761  [pdf, other

    cs.LG cs.AI cs.SC math-ph stat.AP

    LLM4ED: Large Language Models for Automatic Equation Discovery

    Authors: Mengge Du, Yuntian Chen, Zhongzheng Wang, Longfeng Nie, Dongxiao Zhang

    Abstract: Equation discovery is aimed at directly extracting physical laws from data and has emerged as a pivotal research domain. Previous methods based on symbolic mathematics have achieved substantial advancements, but often require the design of implementation of complex algorithms. In this paper, we introduce a new framework that utilizes natural language-based prompts to guide large language models (L… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  22. arXiv:2405.04393  [pdf, other

    stat.ML cs.LG

    Efficient Online Set-valued Classification with Bandit Feedback

    Authors: Zhou Wang, Xingye Qiao

    Abstract: Conformal prediction is a distribution-free method that wraps a given machine learning model and returns a set of plausible labels that contain the true label with a prescribed coverage rate. In practice, the empirical coverage achieved highly relies on fully observed label information from data both in the training phase for model fitting and the calibration phase for quantile estimation. This de… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  23. arXiv:2405.03329  [pdf, other

    cs.LG stat.ML

    Policy Learning for Balancing Short-Term and Long-Term Rewards

    Authors: Peng Wu, Ziyu Shen, Feng Xie, Zhongyao Wang, Chunchen Liu, Yan Zeng

    Abstract: Empirical researchers and decision-makers spanning various domains frequently seek profound insights into the long-term impacts of interventions. While the significance of long-term outcomes is undeniable, an overemphasis on them may inadvertently overshadow short-term gains. Motivated by this, this paper formalizes a new framework for learning the optimal policy that effectively balances both lon… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  24. arXiv:2404.19292  [pdf, other

    cs.IT cs.LG cs.MA stat.ML

    Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning

    Authors: Qiaosheng Zhang, Chenjia Bai, Shuyue Hu, Zhen Wang, Xuelong Li

    Abstract: This work designs and analyzes a novel set of algorithms for multi-agent reinforcement learning (MARL) based on the principle of information-directed sampling (IDS). These algorithms draw inspiration from foundational concepts in information theory, and are proven to be sample efficient in MARL settings such as two-player zero-sum Markov games (MGs) and multi-player general-sum MGs. For episodic t… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  25. arXiv:2404.12312  [pdf, ps, other

    cs.LG math.OC stat.ML

    A Mean-Field Analysis of Neural Stochastic Gradient Descent-Ascent for Functional Minimiax Optimization

    Authors: Yuchen Zhu, Yufeng Zhang, Zhaoran Wang, Zhuoran Yang, Xiaohong Chen

    Abstract: This paper studies minimax optimization problems defined over infinite-dimensional function classes of overparameterized two-layer neural networks. In particular, we consider the minimax optimization problem stemming from estimating linear functional equations defined by conditional expectations, where the objective functions are quadratic in the functional spaces. We address (i) the convergence o… ▽ More

    Submitted 25 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Submitted

  26. arXiv:2404.08667  [pdf, other

    eess.SY stat.AP

    Traffic State Estimation and Uncertainty Quantification at Signalized Intersections with Low Penetration Rate Vehicle Trajectory Data

    Authors: Xingmin Wang, Zihao Wang, Zachary Jerome, Henry X. Liu

    Abstract: This paper studies the traffic state estimation problem at signalized intersections with low penetration rate vehicle trajectory data. While many existing studies have proposed different methods to estimate unknown traffic states and parameters (e.g., penetration rate, queue length) with this data, most of them only provide a point estimation without knowing the uncertainty of these estimated valu… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  27. arXiv:2404.07323  [pdf, other

    stat.ME math.ST

    Surrogate modeling for probability distribution estimation:uniform or adaptive design?

    Authors: Maijia Su, Ziqi Wang, Oreste Salvatore Bursi, Marco Broccardo

    Abstract: The active learning (AL) technique, one of the state-of-the-art methods for constructing surrogate models, has shown high accuracy and efficiency in forward uncertainty quantification (UQ) analysis. This paper provides a comprehensive study on AL-based global surrogates for computing the full distribution function, i.e., the cumulative distribution function (CDF) and the complementary CDF (CCDF).… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  28. arXiv:2404.06984  [pdf, other

    stat.ME

    Adaptive Strategy of Testing Alphas in High Dimensional Linear Factor Pricing Models

    Authors: Chenxi Zhao, ** Zhao, Long Feng, Zhaojun Wang

    Abstract: In recent years, there has been considerable research on testing alphas in high-dimensional linear factor pricing models. In our study, we introduce a novel max-type test procedure that performs well under sparse alternatives. Furthermore, we demonstrate that this new max-type test procedure is asymptotically independent from the sum-type test procedure proposed by Pesaran and Yamagata (2017). Bui… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  29. arXiv:2404.04057  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation

    Authors: Mingyuan Zhou, Huangjie Zheng, Zhendong Wang, Mingzhang Yin, Hai Huang

    Abstract: We introduce Score identity Distillation (SiD), an innovative data-free method that distills the generative capabilities of pretrained diffusion models into a single-step generator. SiD not only facilitates an exponentially fast reduction in Fréchet inception distance (FID) during distillation but also approaches or even exceeds the FID performance of the original teacher diffusion models. By refo… ▽ More

    Submitted 24 May, 2024; v1 submitted 5 April, 2024; originally announced April 2024.

    Comments: ICML 2024, PyTorch implementation: https://github.com/mingyuanzhou/SiD

  30. arXiv:2403.19629  [pdf, other

    cs.LG stat.ML

    Metric Learning from Limited Pairwise Preference Comparisons

    Authors: Zhi Wang, Geelon So, Ramya Korlakai Vinayak

    Abstract: We study metric learning from preference comparisons under the ideal point model, in which a user prefers an item over another if it is closer to their latent ideal item. These items are embedded into $\mathbb{R}^d$ equipped with an unknown Mahalanobis distance shared across users. While recent work shows that it is possible to simultaneously recover the metric and ideal items given… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  31. arXiv:2403.19381  [pdf, other

    stat.ML cs.LG

    On Uncertainty Quantification for Near-Bayes Optimal Algorithms

    Authors: Ziyu Wang, Chris Holmes

    Abstract: Bayesian modelling allows for the quantification of predictive uncertainty which is crucial in safety-critical applications. Yet for many machine learning (ML) algorithms, it is difficult to construct or implement their Bayesian counterpart. In this work we present a promising approach to address this challenge, based on the hypothesis that commonly used ML algorithms are efficient across a wide v… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  32. arXiv:2403.18540  [pdf, other

    stat.ML cs.LG stat.CO

    skscope: Fast Sparsity-Constrained Optimization in Python

    Authors: Zezhi Wang, ** Zhu, Peng Chen, Huiyang Peng, Xiaoke Zhang, Anran Wang, Yu Zheng, Junxian Zhu, Xueqin Wang

    Abstract: Applying iterative solvers on sparsity-constrained optimization (SCO) requires tedious mathematical deduction and careful programming/debugging that hinders these solvers' broad impact. In the paper, the library skscope is introduced to overcome such an obstacle. With skscope, users can solve the SCO by just programming the objective function. The convenience of skscope is demonstrated through two… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 4 pages

  33. arXiv:2403.16825  [pdf, ps, other

    cs.LG math.OC math.PR stat.ML

    Weak Convergence Analysis of Online Neural Actor-Critic Algorithms

    Authors: Samuel Chun-Hei Lam, Justin Sirignano, Ziheng Wang

    Abstract: We prove that a single-layer neural network trained with the online actor critic algorithm converges in distribution to a random ordinary differential equation (ODE) as the number of hidden units and the number of training steps $\rightarrow \infty$. In the online actor-critic algorithm, the distribution of the data samples dynamically changes as the model is updated, which is a key challenge for… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  34. arXiv:2403.14830  [pdf, other

    stat.ML cs.LG

    Deep Clustering Evaluation: How to Validate Internal Clustering Validation Measures

    Authors: Zeya Wang, Chenglong Ye

    Abstract: Deep clustering, a method for partitioning complex, high-dimensional data using deep neural networks, presents unique evaluation challenges. Traditional clustering validation measures, designed for low-dimensional spaces, are problematic for deep clustering, which involves projecting data into lower-dimensional embeddings before partitioning. Two key issues are identified: 1) the curse of dimensio… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  35. arXiv:2403.13196  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    ADAPT to Robustify Prompt Tuning Vision Transformers

    Authors: Masih Eskandar, Tooba Imtiaz, Zifeng Wang, Jennifer Dy

    Abstract: The performance of deep models, including Vision Transformers, is known to be vulnerable to adversarial attacks. Many existing defenses against these attacks, such as adversarial training, rely on full-model fine-tuning to induce robustness in the models. These defenses require storing a copy of the entire model, that can have billions of parameters, for each task. At the same time, parameter-effi… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  36. arXiv:2403.13081  [pdf, other

    stat.AP math.PR q-bio.PE

    Parameter Estimation from Single Patient, Single Time-Point Sequencing Data of Recurrent Tumors

    Authors: Kevin Leder, Ru** Sun, Zicheng Wang, Xuanming Zhang

    Abstract: In this study, we develop consistent estimators for key parameters that govern the dynamics of tumor cell populations when subjected to pharmacological treatments. While these treatments often lead to an initial reduction in the abundance of drug-sensitive cells, a population of drug-resistant cells frequently emerges over time, resulting in cancer recurrence. Samples from recurrent tumors present… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  37. arXiv:2403.11429  [pdf, other

    stat.AP

    Long-range Ising model for regional-scale seismic risk analysis

    Authors: Sebin Oh, Sang-ri Yi, Ziqi Wang

    Abstract: This study introduces the long-range Ising model from statistical mechanics to the Performance-Based Earthquake Engineering (PBEE) framework for regional seismic damage analysis. The application of the PBEE framework at a regional scale involves estimating the damage states of numerous structures, typically performed using fragility function-based stochastic simulations. However, these simulations… ▽ More

    Submitted 23 May, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  38. arXiv:2403.00283  [pdf, other

    stat.AP

    Risk Twin: Real-time Risk Visualization and Control for Structural Systems

    Authors: Zeyu Wang, Ziqi Wang

    Abstract: Digital twinning in structural engineering is a rapidly evolving technology that aims to eliminate the gap between physical systems and their digital models through real-time sensing, visualization, and control techniques. Although digital twins can offer dynamic insights into physical systems, their accuracy is inevitably compromised by uncertainties in sensing, modeling, simulation, and controll… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

  39. arXiv:2402.10810  [pdf, ps, other

    cs.LG math.OC stat.ML

    Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning

    Authors: Zihao Li, Boyi Liu, Zhuoran Yang, Zhaoran Wang, Mengdi Wang

    Abstract: We study the Constrained Convex Markov Decision Process (MDP), where the goal is to minimize a convex functional of the visitation measure, subject to a convex constraint. Designing algorithms for a constrained convex MDP faces several challenges, including (1) handling the large state space, (2) managing the exploration/exploitation tradeoff, and (3) solving the constrained optimization where the… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  40. arXiv:2402.10127  [pdf, other

    stat.ML cs.LG math.PR math.ST

    Nonlinear spiked covariance matrices and signal propagation in deep neural networks

    Authors: Zhichao Wang, Denny Wu, Zhou Fan

    Abstract: Many recent works have studied the eigenvalue spectrum of the Conjugate Kernel (CK) defined by the nonlinear feature map of a feedforward neural network. However, existing results only establish weak convergence of the empirical eigenvalue distribution, and fall short of providing precise quantitative characterizations of the ''spike'' eigenvalues and eigenvectors that often capture the low-dimens… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 55 pages

  41. arXiv:2402.08539  [pdf

    cs.LG stat.AP

    Intelligent Diagnosis of Alzheimer's Disease Based on Machine Learning

    Authors: Mingyang Li, Hongyu Liu, Yixuan Li, Zejun Wang, Yuan Yuan, Honglin Dai

    Abstract: This study is based on the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset and aims to explore early detection and disease progression in Alzheimer's disease (AD). We employ innovative data preprocessing strategies, including the use of the random forest algorithm to fill missing data and the handling of outliers and invalid data, thereby fully mining and utilizing these limited data re… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  42. arXiv:2402.07227  [pdf, other

    math.DS econ.GN stat.AP

    Time-Delayed Game Strategy Analysis Among Japan, Other Nations, and the International Atomic Energy Agency in the Context of Fukushima Nuclear Wastewater Discharge Decision

    Authors: Mingyang Li, Han Pengsihua, Fujiao Meng, Zejun Wang, Weian Liu

    Abstract: This academic paper examines the strategic interactions between Japan, other nations, and the International Atomic Energy Agency (IAEA) regarding Japan's decision to release treated nuclear wastewater from the Fukushima Daiichi Nuclear Power Plant into the sea. It introduces a payoff matrix and time-delay elements in replicator dynamic equations to mirror real-world decision-making delays. The pap… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  43. arXiv:2402.07210  [pdf, other

    math.DS econ.GN physics.soc-ph stat.AP

    Fukushima Nuclear Wastewater Discharge: An Evolutionary Game Theory Approach to International and Domestic Interaction and Strategic Decision-Making

    Authors: Mingyang Li, Han Pengsihua, Songqing Zhao, Zejun Wang, Limin Yang, Weian Liu

    Abstract: On August 24, 2023, Japan controversially decided to discharge nuclear wastewater from the Fukushima Daiichi Nuclear Power Plant into the ocean, sparking intense domestic and global debates. This study uses evolutionary game theory to analyze the strategic dynamics between Japan, other countries, and the Japan Fisheries Association. By incorporating economic, legal, international aid, and environm… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  44. arXiv:2402.04582  [pdf, other

    stat.AP stat.ML

    Dimensionality reduction can be used as a surrogate model for high-dimensional forward uncertainty quantification

    Authors: Jungho Kim, Sang-ri Yi, Ziqi Wang

    Abstract: We introduce a method to construct a stochastic surrogate model from the results of dimensionality reduction in forward uncertainty quantification. The hypothesis is that the high-dimensional input augmented by the output of a computational model admits a low-dimensional representation. This assumption can be met by numerous uncertainty quantification applications with physics-based computational… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  45. arXiv:2402.03954  [pdf, other

    stat.ME stat.ML

    Mixed Matrix Completion in Complex Survey Sampling under Heterogeneous Missingness

    Authors: Xiaojun Mao, Hengfang Wang, Zhonglei Wang, Shu Yang

    Abstract: Modern surveys with large sample sizes and growing mixed-type questionnaires require robust and scalable analysis methods. In this work, we consider recovering a mixed dataframe matrix, obtained by complex survey sampling, with entries following different canonical exponential distributions and subject to heterogeneous missingness. To tackle this challenging task, we propose a two-stage procedure:… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Journal of Computational and Graphical Statistics, 2023

  46. arXiv:2402.01887  [pdf, other

    stat.ML cs.CV cs.LG

    On f-Divergence Principled Domain Adaptation: An Improved Framework

    Authors: Ziqiao Wang, Yongyi Mao

    Abstract: Unsupervised domain adaptation (UDA) plays a crucial role in addressing distribution shifts in machine learning. In this work, we improve the theoretical foundations of UDA proposed by Acuna et al. (2021) by refining their f-divergence-based discrepancy and additionally introducing a new measure, f-domain discrepancy (f-DD). By removing the absolute value function and incorporating a scaling param… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  47. arXiv:2402.01381  [pdf, other

    stat.ME

    Spatial-Sign based Maxsum Test for High Dimensional Location Parameters

    Authors: Jixuan Liu, Long Feng, Zhaojun Wang

    Abstract: In this study, we explore a robust testing procedure for the high-dimensional location parameters testing problem. Initially, we introduce a spatial-sign based max-type test statistic, which exhibits excellent performance for sparse alternatives. Subsequently, we demonstrate the asymptotic independence between this max-type test statistic and the spatial-sign based sum-type test statistic (Feng an… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  48. arXiv:2401.17585  [pdf, other

    cs.CL cs.AI cs.LG stat.ME

    Propagation and Pitfalls: Reasoning-based Assessment of Knowledge Editing through Counterfactual Tasks

    Authors: Wenyue Hua, Jiang Guo, Mingwen Dong, Henghui Zhu, Patrick Ng, Zhiguo Wang

    Abstract: Current approaches of knowledge editing struggle to effectively propagate updates to interconnected facts. In this work, we delve into the barriers that hinder the appropriate propagation of updated knowledge within these models for accurate reasoning. To support our analysis, we introduce a novel reasoning-based benchmark -- ReCoE (Reasoning-based Counterfactual Editing dataset) -- which covers s… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 22 pages, 14 figures, 5 tables

  49. arXiv:2401.14657  [pdf, other

    stat.AP cs.LG physics.ao-ph stat.ML

    Validating Climate Models with Spherical Convolutional Wasserstein Distance

    Authors: Robert C. Garrett, Trevor Harris, Bo Li, Zhuo Wang

    Abstract: The validation of global climate models is crucial to ensure the accuracy and efficacy of model output. We introduce the spherical convolutional Wasserstein distance to more comprehensively measure differences between climate models and reanalysis data. This new similarity measure accounts for spatial variability using convolutional projections and quantifies local differences in the distribution… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

  50. arXiv:2401.14052  [pdf, ps, other

    stat.ME

    Testing Alpha in High Dimensional Linear Factor Pricing Models with Dependent Observations

    Authors: Huifang Ma, Long Feng, Zhaojun Wang, Jigang Bao

    Abstract: In this study, we introduce three distinct testing methods for testing alpha in high dimensional linear factor pricing model that deals with dependent data. The first method is a sum-type test procedure, which exhibits high performance when dealing with dense alternatives. The second method is a max-type test procedure, which is particularly effective for sparse alternatives. For a broader range o… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.