Skip to main content

Showing 1–50 of 160 results for author: Wang, K

Searching in archive stat. Search in all archives.
.
  1. arXiv:2407.02357  [pdf, other

    math.ST math.AG stat.ML

    Contrastive independent component analysis

    Authors: Kexin Wang, Aida Maraj, Anna Seigal

    Abstract: Visualizing data and finding patterns in data are ubiquitous problems in the sciences. Increasingly, applications seek signal and structure in a contrastive setting: a foreground dataset relative to a background dataset. For this purpose, we propose contrastive independent component analysis (cICA). This generalizes independent component analysis to independent latent variables across a foreground… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 28 pages, 8 figures

    MSC Class: 62R01; 15A69; 90C31

  2. arXiv:2406.16605  [pdf, other

    cs.CL cs.AI cs.LG stat.ME

    CLEAR: Can Language Models Really Understand Causal Graphs?

    Authors: Sirui Chen, Mengying Xu, Kun Wang, Xingyu Zeng, Rui Zhao, Shengjie Zhao, Chaochao Lu

    Abstract: Causal reasoning is a cornerstone of how humans interpret the world. To model and reason about causality, causal graphs offer a concise yet effective solution. Given the impressive advancements in language models, a crucial question arises: can they really understand causal graphs? To this end, we pioneer an investigation into language models' understanding of causal graphs. Specifically, we devel… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.06516  [pdf, other

    stat.ME cs.LG stat.ML

    Distribution-Free Predictive Inference under Unknown Temporal Drift

    Authors: Elise Han, Chengpiao Huang, Kaizheng Wang

    Abstract: Distribution-free prediction sets play a pivotal role in uncertainty quantification for complex statistical models. Their validity hinges on reliable calibration data, which may not be readily available as real-world environments often undergo unknown changes over time. In this paper, we propose a strategy for choosing an adaptive window and use the data therein to construct prediction sets. The w… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 25 pages, 4 figures, 6 tables

  4. arXiv:2404.14446  [pdf, other

    physics.ao-ph stat.ME

    Spatio-temporal Joint Analysis of PM2.5 and Ozone in California with INLA

    Authors: Jianan Pan, Kunyang He, Kai Wang, Qing Mu, Chengxiu Ling

    Abstract: The substantial threat of concurrent air pollutants to public health is increasingly severe under climate change. To identify the common drivers and extent of spatio-temporal similarity of PM2.5 and ozone, this paper proposed a log Gaussian-Gumbel Bayesian hierarchical model allowing for sharing a SPDE-AR(1) spatio-temporal interaction structure. The proposed model outperforms in terms of estimati… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

  5. arXiv:2404.00099  [pdf, other

    cs.AI stat.ML

    Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes

    Authors: Andrew Bennett, Nathan Kallus, Miruna Oprescu, Wen Sun, Kaiwen Wang

    Abstract: We study evaluating a policy under best- and worst-case perturbations to a Markov decision process (MDP), given transition observations from the original MDP, whether under the same or different policy. This is an important problem when there is the possibility of a shift between historical and future environments, due to e.g. unmeasured confounding, distributional shift, or an adversarial environ… ▽ More

    Submitted 29 March, 2024; originally announced April 2024.

    Comments: 40 pages, 1 figure

  6. arXiv:2403.09170  [pdf, other

    math.ST math.NA math.PR stat.ML

    Analysis of singular subspaces under random perturbations

    Authors: Ke Wang

    Abstract: We present a comprehensive analysis of singular vector and singular subspace perturbations in the context of the signal plus random Gaussian noise matrix model. Assuming a low-rank signal matrix, we extend the Davis-Kahan-Wedin theorem in a fully generalized manner, applicable to any unitarily invariant matrix norm, extending previous results of O'Rourke, Vu and the author. We also obtain the fine… ▽ More

    Submitted 19 March, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: Improved the results in the applications and updated the references

  7. arXiv:2402.08672  [pdf, other

    cs.LG cs.AI stat.ME

    Model Assessment and Selection under Temporal Distribution Shift

    Authors: Elise Han, Chengpiao Huang, Kaizheng Wang

    Abstract: We investigate model assessment and selection in a changing environment, by synthesizing datasets from both the current time period and historical epochs. To tackle unknown and potentially arbitrary temporal distribution shift, we develop an adaptive rolling window approach to estimate the generalization error of a given model. This strategy also facilitates the comparison between any two candidat… ▽ More

    Submitted 3 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 26 pages, 6 figures, 4 tables

    MSC Class: 62G05 (Primary); 62J02 (Secondary)

  8. arXiv:2312.03344  [pdf, other

    cs.LG math.DS stat.AP stat.ML

    Interpretable Mechanistic Representations for Meal-level Glycemic Control in the Wild

    Authors: Ke Alexander Wang, Emily B. Fox

    Abstract: Diabetes encompasses a complex landscape of glycemic control that varies widely among individuals. However, current methods do not faithfully capture this variability at the meal level. On the one hand, expert-crafted features lack the flexibility of data-driven methods; on the other hand, learned representations tend to be uninterpretable which hampers clinical adoption. In this paper, we propose… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: Proceedings of Machine Learning for Health (ML4H) 2023. Code available at: https://github.com/KeAWang/interpretable-cgm-representations

  9. arXiv:2311.18294  [pdf, other

    stat.ME math.ST

    Multivariate Unified Skew-t Distributions And Their Properties

    Authors: Kesen Wang, Maicon J. Karling, Reinaldo B. Arellano-Valle, Marc G. Genton

    Abstract: The unified skew-t (SUT) is a flexible parametric multivariate distribution that accounts for skewness and heavy tails in the data. A few of its properties can be found scattered in the literature or in a parameterization that does not follow the original one for unified skew-normal (SUN) distributions, yet a systematic study is lacking. In this work, explicit properties of the multivariate SUT di… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

  10. arXiv:2311.13046  [pdf, ps, other

    econ.GN cs.AI cs.LG stat.AP

    Do we listen to what we are told? An empirical study on human behaviour during the COVID-19 pandemic: neural networks vs. regression analysis

    Authors: Yuxi Heluo, Kexin Wang, Charles W. Robson

    Abstract: In this work, we contribute the first visual open-source empirical study on human behaviour during the COVID-19 pandemic, in order to investigate how compliant a general population is to mask-wearing-related public-health policy. Object-detection-based convolutional neural networks, regression analysis and multilayer perceptrons are combined to analyse visual data of the Viennese public during 202… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  11. arXiv:2310.18304  [pdf, other

    cs.LG cs.AI math.OC stat.ML

    A Stability Principle for Learning under Non-Stationarity

    Authors: Chengpiao Huang, Kaizheng Wang

    Abstract: We develop a versatile framework for statistical learning in non-stationary environments. In each time period, our approach applies a stability principle to select a look-back window that maximizes the utilization of historical data while kee** the cumulative bias within an acceptable range relative to the stochastic error. Our theory showcases the adaptability of this approach to unknown non-st… ▽ More

    Submitted 22 January, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: 48 pages, 1 figure

    MSC Class: 68T05; 90C15

  12. arXiv:2309.12000  [pdf, other

    stat.ME stat.CO

    Which Parameterization of the Matérn Covariance Function?

    Authors: Kesen Wang, Sameh Abdulah, Ying Sun, Marc G. Genton

    Abstract: The Matérn family of covariance functions is currently the most popularly used model in spatial statistics, geostatistics, and machine learning to specify the correlation between two geographical locations based on spatial distance. Compared to existing covariance functions, the Matérn family has more flexibility in data fitting because it allows the control of the field smoothness through a dedic… ▽ More

    Submitted 28 August, 2023; originally announced September 2023.

  13. arXiv:2309.02417  [pdf

    stat.ML cs.LG

    Computing SHAP Efficiently Using Model Structure Information

    Authors: Linwei Hu, Ke Wang

    Abstract: SHAP (SHapley Additive exPlanations) has become a popular method to attribute the prediction of a machine learning model on an input to its features. One main challenge of SHAP is the computation time. An exact computation of Shapley values requires exponential time complexity. Therefore, many approximation methods are proposed in the literature. In this paper, we propose methods that can compute… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: 15 pages

  14. arXiv:2307.05772  [pdf, other

    cs.LG stat.ML

    Random-Set Convolutional Neural Network (RS-CNN) for Epistemic Deep Learning

    Authors: Shireen Kudukkil Manchingal, Muhammad Mubashar, Kaizheng Wang, Keivan Shariatmadar, Fabio Cuzzolin

    Abstract: Machine learning is increasingly deployed in safety-critical domains where robustness against adversarial attacks is crucial and erroneous predictions could lead to potentially catastrophic consequences. This highlights the need for learning systems to be equipped with the means to determine a model's confidence in its prediction and the epistemic uncertainty associated with it, 'to know when a mo… ▽ More

    Submitted 11 July, 2023; originally announced July 2023.

  15. arXiv:2307.03886  [pdf, other

    cs.LG stat.ML

    On Regularization and Inference with Label Constraints

    Authors: Kaifu Wang, Hangfeng He, Tin D. Nguyen, Piyush Kumar, Dan Roth

    Abstract: Prior knowledge and symbolic rules in machine learning are often expressed in the form of label constraints, especially in structured prediction problems. In this work, we compare two common strategies for encoding label constraints in a machine learning pipeline, regularization with constraints and constrained inference, by quantifying their impact on model performance. For regularization, we sho… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  16. arXiv:2307.00205  [pdf, other

    stat.ME

    A Transparent and Nonlinear Method for Variable Selection

    Authors: Keyao Wang, Huiwen Wang, Jichang Zhao, Lihong Wang

    Abstract: Variable selection is a procedure to attain the truly important predictors from inputs. Complex nonlinear dependencies and strong coupling pose great challenges for variable selection in high-dimensional data. In addition, real-world applications have increased demands for interpretability of the selection process. A pragmatic approach should not only attain the most predictive covariates, but als… ▽ More

    Submitted 30 June, 2023; originally announced July 2023.

  17. arXiv:2306.13796  [pdf, ps, other

    cs.LG stat.ML

    On Learning Latent Models with Multi-Instance Weak Supervision

    Authors: Kaifu Wang, Efi Tsamoura, Dan Roth

    Abstract: We consider a weakly supervised learning scenario where the supervision signal is generated by a transition function $σ$ of labels associated with multiple input instances. We formulate this problem as \emph{multi-instance Partial Label Learning (multi-instance PLL)}, which is an extension to the standard PLL problem. Our problem is met in different fields, including latent structural learning and… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

  18. arXiv:2306.05202  [pdf, ps, other

    math.ST stat.ME

    Bayesian Inference for Multivariate Monotone Densities

    Authors: Kang Wang, Subhashis Ghosal

    Abstract: We consider a nonparametric Bayesian approach to estimation and testing for a multivariate monotone density. Instead of following the conventional Bayesian route of putting a prior distribution complying with the monotonicity restriction, we put a prior on the step heights through binning and a Dirichlet distribution. An arbitrary piece-wise constant probability density is converted to a monotone… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  19. arXiv:2306.05173  [pdf, other

    math.ST stat.ME

    Bayesian Inference for $k$-Monotone Densities with Applications to Multiple Testing

    Authors: Kang Wang, Subhashis Ghosal

    Abstract: Shape restriction, like monotonicity or convexity, imposed on a function of interest, such as a regression or density function, allows for its estimation without smoothness assumptions. The concept of $k$-monotonicity encompasses a family of shape restrictions, including decreasing and convex decreasing as special cases corresponding to $k=1$ and $k=2$. We consider Bayesian approaches to estimate… ▽ More

    Submitted 8 June, 2023; originally announced June 2023.

  20. arXiv:2305.15703  [pdf, ps, other

    cs.LG cs.AI math.OC math.ST stat.ML

    The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning

    Authors: Kaiwen Wang, Kevin Zhou, Runzhe Wu, Nathan Kallus, Wen Sun

    Abstract: While distributional reinforcement learning (DistRL) has been empirically effective, the question of when and why it is better than vanilla, non-distributional RL has remained unanswered. This paper explains the benefits of DistRL through the lens of small-loss bounds, which are instance-dependent bounds that scale with optimal achievable cost. Particularly, our bounds converge much faster than th… ▽ More

    Submitted 22 September, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted at NeurIPS 2023

  21. arXiv:2305.01638  [pdf, other

    cs.LG cs.CV stat.ML

    Sequence Modeling with Multiresolution Convolutional Memory

    Authors: Jiaxin Shi, Ke Alexander Wang, Emily B. Fox

    Abstract: Efficiently capturing the long-range patterns in sequential data sources salient to a given task -- such as classification and generative modeling -- poses a fundamental challenge. Popular approaches in the space tradeoff between the memory burden of brute-force enumeration and comparison, as in transformers, the computational burden of complicated sequential dependencies, as in recurrent neural n… ▽ More

    Submitted 1 November, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: ICML 2023, Source code: https://github.com/thjashin/multires-conv

  22. arXiv:2302.10160  [pdf, other

    stat.ME cs.LG math.ST stat.ML

    Pseudo-Labeling for Kernel Ridge Regression under Covariate Shift

    Authors: Kaizheng Wang

    Abstract: We develop and analyze a principled approach to kernel ridge regression under covariate shift. The goal is to learn a regression function with small mean squared error over a target distribution, based on unlabeled data from there and labeled data that may have a different feature distribution. We propose to split the labeled data into two subsets and conduct kernel ridge regression on them separa… ▽ More

    Submitted 14 March, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: 41 pages, 1 figure

    MSC Class: 62J07; 62G05

  23. Disentangled Representation for Causal Mediation Analysis

    Authors: Ziqi Xu, Debo Cheng, Jiuyong Li, Jixue Liu, Lin Liu, Ke Wang

    Abstract: Estimating direct and indirect causal effects from observational data is crucial to understanding the causal mechanisms and predicting the behaviour under different interventions. Causal mediation analysis is a method that is often used to reveal direct and indirect effects. Deep learning shows promise in mediation analysis, but the current methods only assume latent confounders that affect treatm… ▽ More

    Submitted 15 December, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: This paper has been accepted by AAAI 2023. Please check: https://doi.org/10.1609/aaai.v37i9.26266

  24. arXiv:2302.06059  [pdf, other

    stat.AP

    Spatio-temporal Joint Modelling on Moderate and Extreme Air Pollution in Spain

    Authors: Kai Wang, Chengxiu Ling, Ying Chen, Zhengjun Zhang

    Abstract: Very unhealthy air quality is consistently connected with numerous diseases. Appropriate extreme analysis and accurate predictions are in rising demand for exploring potential linked causes and for providing suggestions for the environmental agency in public policy strategy. This paper aims to model the spatial and temporal pattern of both moderate and extremely poor PM10 concentrations (of daily… ▽ More

    Submitted 24 August, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

  25. arXiv:2302.05918  [pdf, other

    stat.ML cs.LG stat.AP

    Efficient Fraud Detection Using Deep Boosting Decision Trees

    Authors: Biao Xu, Yao Wang, Xiuwu Liao, Kaidong Wang

    Abstract: Fraud detection is to identify, monitor, and prevent potentially fraudulent activities from complex data. The recent development and success in AI, especially machine learning, provides a new data-driven way to deal with fraud. From a methodological point of view, machine learning based fraud detection can be divided into two categories, i.e., conventional methods (decision tree, boosting...) and… ▽ More

    Submitted 18 May, 2023; v1 submitted 12 February, 2023; originally announced February 2023.

    Comments: 34 pages, 8 figures

  26. arXiv:2302.03201  [pdf, ps, other

    cs.LG math.OC math.ST stat.ML

    Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR

    Authors: Kaiwen Wang, Nathan Kallus, Wen Sun

    Abstract: In this paper, we study risk-sensitive Reinforcement Learning (RL), focusing on the objective of Conditional Value at Risk (CVaR) with risk tolerance $τ$. Starting with multi-arm bandits (MABs), we show the minimax CVaR regret rate is $Ω(\sqrt{τ^{-1}AK})$, where $A$ is the number of actions and $K$ is the number of episodes, and that it is achieved by an Upper Confidence Bound algorithm with a nov… ▽ More

    Submitted 24 May, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: Accepted at ICML 2023

  27. arXiv:2301.01766  [pdf, other

    math.ST cs.LG math.OC stat.ML

    Learning Gaussian Mixtures Using the Wasserstein-Fisher-Rao Gradient Flow

    Authors: Yuling Yan, Kaizheng Wang, Philippe Rigollet

    Abstract: Gaussian mixture models form a flexible and expressive parametric family of distributions that has found applications in a wide variety of applications. Unfortunately, fitting these models to data is a notoriously hard problem from a computational perspective. Currently, only moment-based methods enjoy theoretical guarantees while likelihood-based methods are dominated by heuristics such as Expect… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

  28. arXiv:2212.00862  [pdf, ps, other

    cs.AI math.OC math.PR stat.AP

    An introduction to optimization under uncertainty -- A short survey

    Authors: Keivan Shariatmadar, Kaizheng Wang, Calvin R. Hubbard, Hans Hallez, David Moens

    Abstract: Optimization equips engineers and scientists in a variety of fields with the ability to transcribe their problems into a generic formulation and receive optimal solutions with relative ease. Industries ranging from aerospace to robotics continue to benefit from advancements in optimization theory and the associated algorithmic developments. Nowadays, optimization is used in real time on autonomous… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 13 pages

  29. arXiv:2210.12832  [pdf, other

    stat.ME

    Functional Bayesian Networks for Discovering Causality from Multivariate Functional Data

    Authors: Fangting Zhou, Kejun He, Kunbo Wang, Yanxun Xu, Yang Ni

    Abstract: Multivariate functional data arise in a wide range of applications. One fundamental task is to understand the causal relationships among these functional objects of interest, which has not yet been fully explored. In this article, we develop a novel Bayesian network model for multivariate functional data where the conditional independence and causal structure are both encoded by a directed acyclic… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

  30. arXiv:2210.12334  [pdf, other

    stat.ML cs.LG math.OC

    Adaptive Data Fusion for Multi-task Non-smooth Optimization

    Authors: Henry Lam, Kaizheng Wang, Yuhang Wu, Yichen Zhang

    Abstract: We study the problem of multi-task non-smooth optimization that arises ubiquitously in statistical learning, decision-making and risk management. We develop a data fusion approach that adaptively leverages commonalities among a large number of objectives to improve sample efficiency while tackling their unknown heterogeneities. We provide sharp statistical guarantees for our approach. Numerical ex… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 25 pages

  31. arXiv:2210.11363  [pdf, other

    stat.ME

    Bayesian Tensor-on-Tensor Regression with Efficient Computation

    Authors: Kunbo Wang, Yanxun Xu

    Abstract: We propose a Bayesian tensor-on-tensor regression approach to predict a multidimensional array (tensor) of arbitrary dimensions from another tensor of arbitrary dimensions, building upon the Tucker decomposition of the regression coefficient tensor. Traditional tensor regression methods making use of the Tucker decomposition either assume the dimension of the core tensor to be known or estimate it… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

  32. arXiv:2209.09845  [pdf, other

    cs.LG cs.MA stat.ML

    Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL

    Authors: Fengzhuo Zhang, Boyi Liu, Kaixin Wang, Vincent Y. F. Tan, Zhuoran Yang, Zhaoran Wang

    Abstract: The cooperative Multi-A gent R einforcement Learning (MARL) with permutation invariant agents framework has achieved tremendous empirical successes in real-world applications. Unfortunately, the theoretical understanding of this MARL problem is lacking due to the curse of many agents and the limited exploration of the relational reasoning in existing works. In this paper, we verify that the transf… ▽ More

    Submitted 16 October, 2022; v1 submitted 20 September, 2022; originally announced September 2022.

  33. arXiv:2207.05837  [pdf, other

    cs.LG math.OC math.ST stat.ML

    Learning Bellman Complete Representations for Offline Policy Evaluation

    Authors: Jonathan D. Chang, Kaiwen Wang, Nathan Kallus, Wen Sun

    Abstract: We study representation learning for Offline Reinforcement Learning (RL), focusing on the important task of Offline Policy Evaluation (OPE). Recent work shows that, in contrast to supervised learning, realizability of the Q-function is not enough for learning it. Two sufficient conditions for sample-efficient OPE are Bellman completeness and coverage. Prior work often assumes that representations… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: Accepted for Long Talk at ICML 2022

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:2938-2971, 2022

  34. arXiv:2206.11056  [pdf

    cs.LG econ.GN stat.AP

    Generational Differences in Automobility: Comparing America's Millennials and Gen Xers Using Gradient Boosting Decision Trees

    Authors: Kailai Wang, Xize Wang

    Abstract: Whether the Millennials are less auto-centric than the previous generations has been widely discussed in the literature. Most existing studies use regression models and assume that all factors are linear-additive in contributing to the young adults' driving behaviors. This study relaxes this assumption by applying a non-parametric statistical learning method, namely the gradient boosting decision… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

    Journal ref: Cities, 114, 103204 (2021)

  35. Simulated redistricting plans for the analysis and evaluation of redistricting in the United States

    Authors: Cory McCartan, Christopher T. Kenny, Tyler Simko, George Garcia III, Kevin Wang, Melissa Wu, Shiro Kuriwaki, Kosuke Imai

    Abstract: This article introduces the 50stateSimulations, a collection of simulated congressional districting plans and underlying code developed by the Algorithm-Assisted Redistricting Methodology (ALARM) Project. The 50stateSimulations allow for the evaluation of enacted and other congressional redistricting plans in the United States. While the use of redistricting simulation algorithms has become standa… ▽ More

    Submitted 20 October, 2022; v1 submitted 21 June, 2022; originally announced June 2022.

    Comments: 11 pages, 3 figures

    Journal ref: Sci Data (2022) 9, 689

  36. arXiv:2205.10490  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Aligning Logits Generatively for Principled Black-Box Knowledge Distillation

    Authors: **g Ma, Xiang Xiang, Ke Wang, Yuchuan Wu, Yongbin Li

    Abstract: Black-Box Knowledge Distillation (B2KD) is a formulated problem for cloud-to-edge model compression with invisible data and models hosted on the server. B2KD faces challenges such as limited Internet exchange and edge-cloud disparity of data distributions. In this paper, we formalize a two-step workflow consisting of deprivatization and distillation, and theoretically provide a new optimization di… ▽ More

    Submitted 30 March, 2024; v1 submitted 20 May, 2022; originally announced May 2022.

    Comments: To appear at CVPR 2024; significantly rewritten with extra experiments since the preliminary report

  37. arXiv:2205.02410  [pdf, other

    stat.ML cs.LG

    Sequential Importance Sampling for Hybrid Model Bayesian Inference to Support Bioprocess Mechanism Learning and Robust Control

    Authors: Wei Xie, Keqi Wang, Hua Zheng, Ben Feng

    Abstract: Driven by the critical needs of biomanufacturing 4.0, we introduce a probabilistic knowledge graph hybrid model characterizing the risk- and science-based understanding of bioprocess mechanisms. It can faithfully capture the important properties, including nonlinear reactions, partially observed state, and nonstationary dynamics. Given very limited real process observations, we derive a posterior… ▽ More

    Submitted 29 September, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

    Comments: 11 pages, 2 figures

  38. arXiv:2203.08980  [pdf, other

    stat.ME eess.SY

    Stochastic Simulation Uncertainty Analysis to Accelerate Flexible Biomanufacturing Process Development

    Authors: Wei Xie, Russell R. Barton, Barry L. Nelson, Keqi Wang

    Abstract: Motivated by critical challenges and needs from biopharmaceuticals manufacturing, we propose a general metamodel-assisted stochastic simulation uncertainty analysis framework to accelerate the development of a simulation model with modular design for flexible production processes. There are often very limited process observations. Thus, there exist both simulation and model uncertainties in the sy… ▽ More

    Submitted 3 September, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: 32 pages, 3 figures. arXiv admin note: substantial text overlap with arXiv:2011.04207

  39. arXiv:2202.09667  [pdf, other

    cs.LG math.OC math.ST stat.ML

    Doubly Robust Distributionally Robust Off-Policy Evaluation and Learning

    Authors: Nathan Kallus, Xiaojie Mao, Kaiwen Wang, Zhengyuan Zhou

    Abstract: Off-policy evaluation and learning (OPE/L) use offline observational data to make better decisions, which is crucial in applications where online experimentation is limited. However, depending entirely on logged data, OPE/L is sensitive to environment distribution shifts -- discrepancies between the data-generating environment and that where policies are deployed. \citet{si2020distributional} prop… ▽ More

    Submitted 18 July, 2022; v1 submitted 19 February, 2022; originally announced February 2022.

    Comments: Short Talk at ICML 2022

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:10598-10632, 2022

  40. arXiv:2202.05250  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Adaptive and Robust Multi-Task Learning

    Authors: Yaqi Duan, Kaizheng Wang

    Abstract: We study the multi-task learning problem that aims to simultaneously analyze multiple datasets collected from different sources and learn one model for each of them. We propose a family of adaptive methods that automatically utilize possible similarities among those tasks while carefully handling their differences. We derive sharp statistical guarantees for the methods and prove their robustness a… ▽ More

    Submitted 16 September, 2023; v1 submitted 10 February, 2022; originally announced February 2022.

    Comments: 72 pages, 2 figures

    MSC Class: 62F10; 62R07

  41. arXiv:2201.10745  [pdf, other

    stat.CO stat.ME

    Control Variate Polynomial Chaos: Optimal Fusion of Sampling and Surrogates for Multifidelity Uncertainty Quantification

    Authors: Hang Yang, Yuji Fujii, K. W. Wang, Alex A. Gorodetsky

    Abstract: We present a hybrid sampling-surrogate approach for reducing the computational expense of uncertainty quantification in nonlinear dynamical systems. Our motivation is to enable rapid uncertainty quantification in complex mechanical systems such as automotive propulsion systems. Our approach is to build upon ideas from multifidelity uncertainty quantification to leverage the benefits of both sampli… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    MSC Class: 62-08; 65Pxx; 65D15; 65C05; 41A10

  42. arXiv:2112.14754  [pdf, other

    cs.LG cs.CV stat.ML

    Disentanglement and Generalization Under Correlation Shifts

    Authors: Christina M. Funke, Paul Vicol, Kuan-Chieh Wang, Matthias Kümmerer, Richard Zemel, Matthias Bethge

    Abstract: Correlations between factors of variation are prevalent in real-world data. Exploiting such correlations may increase predictive performance on noisy data; however, often correlations are not robust (e.g., they may change between domains, datasets, or applications) and models that exploit them do not generalize when correlations shift. Disentanglement methods aim to learn representations which cap… ▽ More

    Submitted 23 December, 2022; v1 submitted 29 December, 2021; originally announced December 2021.

    Comments: CoLLAs 2022

  43. arXiv:2112.12986  [pdf, other

    cs.LG stat.ML

    Is Importance Weighting Incompatible with Interpolating Classifiers?

    Authors: Ke Alexander Wang, Niladri S. Chatterji, Saminul Haque, Tatsunori Hashimoto

    Abstract: Importance weighting is a classic technique to handle distribution shifts. However, prior work has presented strong empirical and theoretical evidence demonstrating that importance weights can have little to no effect on overparameterized neural networks. Is importance weighting truly incompatible with the training of overparameterized neural networks? Our paper answers this in the negative. We sh… ▽ More

    Submitted 4 March, 2022; v1 submitted 24 December, 2021; originally announced December 2021.

    Comments: International Conference on Learning Representations (ICLR), 2022

  44. arXiv:2112.04640  [pdf, other

    cs.LG cs.CR stat.ML

    Differentially Private Ensemble Classifiers for Data Streams

    Authors: Lovedeep Gondara, Ke Wang, Ricardo Silva Carvalho

    Abstract: Learning from continuous data streams via classification/regression is prevalent in many domains. Adapting to evolving data characteristics (concept drift) while protecting data owners' private information is an open challenge. We present a differentially private ensemble solution to this problem with two distinguishing features: it allows an \textit{unbounded} number of ensemble updates to deal w… ▽ More

    Submitted 8 December, 2021; originally announced December 2021.

    Comments: Accepted at WSDM 2022

  45. arXiv:2110.15468  [pdf, other

    stat.ME math.ST

    Interval Estimation of Relative Risks for Combined Unilateral and Bilateral Correlated Data

    Authors: Kejia Wang, Chang-Xing Ma

    Abstract: Measurements are generally collected as unilateral or bilateral data in clinical trials or observational studies. For example, in ophthalmology studies, the primary outcome is often obtained from one eye or both eyes of an individual. In medical studies, the relative risk is usually the parameter of interest and is commonly used. In this article, we develop three confidence intervals for the relat… ▽ More

    Submitted 28 October, 2021; originally announced October 2021.

  46. arXiv:2110.01602  [pdf, other

    stat.ML cs.IT cs.LG math.OC math.ST

    Clustering a Mixture of Gaussians with Unknown Covariance

    Authors: Damek Davis, Mateo Díaz, Kaizheng Wang

    Abstract: We investigate a clustering problem with data from a mixture of Gaussians that share a common but unknown, and potentially ill-conditioned, covariance matrix. We start by considering Gaussian mixtures with two equally-sized components and derive a Max-Cut integer program based on maximum likelihood estimation. We prove its solutions achieve the optimal misclassification rate when the number of sam… ▽ More

    Submitted 29 November, 2021; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: 89 pages

    MSC Class: 62H30; 62H12; 62H05

  47. arXiv:2108.10129  [pdf, other

    cs.LG stat.ML

    Effective Streaming Low-tubal-rank Tensor Approximation via Frequent Directions

    Authors: Qianxin Yi, Chenhao Wang, Kaidong Wang, Yao Wang

    Abstract: Low-tubal-rank tensor approximation has been proposed to analyze large-scale and multi-dimensional data. However, finding such an accurate approximation is challenging in the streaming setting, due to the limited computational resources. To alleviate this issue, this paper extends a popular matrix sketching technique, namely Frequent Directions, for constructing an efficient and accurate low-tubal… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

  48. arXiv:2107.05545  [pdf, other

    cs.LG cs.AI stat.ML

    Towards Better Laplacian Representation in Reinforcement Learning with Generalized Graph Drawing

    Authors: Kaixin Wang, Kuangqi Zhou, Qixin Zhang, Jie Shao, Bryan Hooi, Jiashi Feng

    Abstract: The Laplacian representation recently gains increasing attention for reinforcement learning as it provides succinct and informative representation for states, by taking the eigenvectors of the Laplacian matrix of the state-transition graph as state embeddings. Such representation captures the geometry of the underlying state space and is beneficial to RL tasks such as option discovery and reward s… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

    Comments: ICML 2021

  49. arXiv:2107.03003  [pdf, other

    cs.LG cs.AI stat.ML

    Harnessing Heterogeneity: Learning from Decomposed Feedback in Bayesian Modeling

    Authors: Kai Wang, Bryan Wilder, Sze-chuan Suen, Bistra Dilkina, Milind Tambe

    Abstract: There is significant interest in learning and optimizing a complex system composed of multiple sub-components, where these components may be agents or autonomous sensors. Among the rich literature on this topic, agent-based and domain-specific simulations can capture complex dynamics and subgroup interaction, but optimizing over such simulations can be computationally and algorithmically challengi… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

  50. arXiv:2106.12498  [pdf, other

    cs.LG cs.IT stat.ML

    Universal Consistency of Deep Convolutional Neural Networks

    Authors: Shao-Bo Lin, Kaidong Wang, Yao Wang, Ding-Xuan Zhou

    Abstract: Compared with avid research activities of deep convolutional neural networks (DCNNs) in practice, the study of theoretical behaviors of DCNNs lags heavily behind. In particular, the universal consistency of DCNNs remains open. In this paper, we prove that implementing empirical risk minimization on DCNNs with expansive convolution (with zero-padding) is strongly universally consistent. Motivated b… ▽ More

    Submitted 23 June, 2021; originally announced June 2021.

    Comments: 9pages, 4 figures