Showing 1–19 of 19 results for author: Ha, W

  1. arXiv:2311.17867  [pdf, other]

    stat.ME

    A Class of Directed Acyclic Graphs with Mixed Data Types in Mediation Analysis

    Authors: Wei Hao, Canyi Chen, Peter X.-K. Song

    Abstract: We propose a unified class of generalized structural equation models (GSEMs) with data of mixed types in mediation analysis, including continuous, categorical, and count variables. Such models extend substantially the classical linear structural equation model to accommodate many data types arising from the application of mediation analysis. Invoking the hierarchical modeling approach, we specify…

    Submitted 4 December, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: 33 pages, 3 figures, 3 tables

  2. arXiv:2311.16628  [pdf, ps, other]

    stat.ML cs.LG

    Symmetry-regularized neural ordinary differential equations

    Authors: Wenbo Hao

    Abstract: Neural Ordinary Differential Equations (Neural ODEs) are a class of deep neural network models that interpret the hidden state dynamics of neural networks as an ordinary differential equation and are thereby capable of capturing system dynamics in a continuous time framework. In this work, I integrate symmetry regularization into Neural ODEs. In particular, I use continuous Lie symmetry of ODEs and PDEs a…

    Submitted 28 November, 2023; originally announced November 2023.

  3. arXiv:2309.10301  [pdf, other]

    stat.ML cs.LG

    Prominent Roles of Conditionally Invariant Components in Domain Adaptation: Theory and Algorithms

    Authors: Keru Wu, Yuansi Chen, Wooseok Ha, Bin Yu

    Abstract: Domain adaptation (DA) is a statistical learning problem that arises when the distribution of the source data used to train a model differs from that of the target data used to evaluate the model. While many DA algorithms have demonstrated considerable empirical success, blindly applying these algorithms can often lead to worse performance on new datasets. To address this, it is crucial to clarify…

    Submitted 19 September, 2023; originally announced September 2023.

  4. arXiv:2308.03215  [pdf, other]

    stat.ML cs.LG

    The Effect of SGD Batch Size on Autoencoder Learning: Sparsity, Sharpness, and Feature Learning

    Authors: Nikhil Ghosh, Spencer Frei, Wooseok Ha, Bin Yu

    Abstract: In this work, we investigate the dynamics of stochastic gradient descent (SGD) when training a single-neuron autoencoder with linear or ReLU activation on orthogonal data. We show that for this non-convex problem, randomly initialized SGD with a constant step size successfully finds a global minimum for any batch size choice. However, the particular global minimum found depends upon the batch size…

    Submitted 6 August, 2023; originally announced August 2023.

  5. arXiv:2306.17347  [pdf, other]

    stat.ME

    Mediation with External Summary Statistic Information (MESSI)

    Authors: Jonathan Boss, Wei Hao, Amber Cathey, Barrett M. Welch, Kelly K. Ferguson, John D. Meeker, Jian Kang, Bhramar Mukherjee

    Abstract: Environmental health studies are increasingly measuring endogenous omics data ($\boldsymbol{M}$) to study intermediary biological pathways by which an exogenous exposure ($\boldsymbol{A}$) affects a health outcome ($\boldsymbol{Y}$), given confounders ($\boldsymbol{C}$). Mediation analysis is frequently carried out to understand such mechanisms. If intermediary pathways are of interest, then there…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 32 pages, 6 figures

  6. arXiv:2203.13293  [pdf, other]

    stat.ME stat.AP

    Methods for Large-scale Single Mediator Hypothesis Testing: Possible Choices and Comparisons

    Authors: Jiacong Du, Xiang Zhou, Wei Hao, Yongmei Liu, Jennifer A. Smith, Bhramar Mukherjee

    Abstract: Mediation hypothesis testing for a large number of mediators is challenging due to the composite structure of the null hypothesis, $H_0\colon \alpha\beta = 0$ ($\alpha$: effect of the exposure on the mediator after adjusting for confounders; $\beta$: effect of the mediator on the outcome after adjusting for exposure and confounders). In this paper, we reviewed three classes of methods for multiple mediation hypothe…

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: 24 pages, 6 figures, 4 tables

  7. arXiv:2108.06847  [pdf, other]

    stat.ML cs.LG

    Interpreting and improving deep-learning models with reality checks

    Authors: Chandan Singh, Wooseok Ha, Bin Yu

    Abstract: Recent deep-learning models have achieved impressive predictive performance by learning complex functions of many variables, often at the cost of interpretability. This chapter covers recent work aiming to interpret models by attributing importance to features and feature groups for a single prediction. Importantly, the proposed attributions assign importance to interactions between features, in a…

    Submitted 18 August, 2021; v1 submitted 15 August, 2021; originally announced August 2021.

  8. arXiv:2107.09145  [pdf, other]

    stat.ML cs.LG

    Adaptive wavelet distillation from neural networks through interpretations

    Authors: Wooseok Ha, Chandan Singh, Francois Lanusse, Srigokul Upadhyayula, Bin Yu

    Abstract: Recent deep-learning models have achieved impressive prediction performance, but often sacrifice interpretability and computational efficiency. Interpretability is crucial in many disciplines, such as science and medicine, where models must be carefully vetted or where interpretation is the goal itself. Moreover, interpretable models are concise and often yield computational efficiency. Here, we p…

    Submitted 26 August, 2021; v1 submitted 19 July, 2021; originally announced July 2021.

  9. arXiv:2104.13417  [pdf, other]

    cs.CV cs.LG stat.ML

    Towards Fair Federated Learning with Zero-Shot Data Augmentation

    Authors: Weituo Hao, Mostafa El-Khamy, Jungwon Lee, Jianyi Zhang, Kevin J Liang, Changyou Chen, Lawrence Carin

    Abstract: Federated learning has emerged as an important distributed learning paradigm, where a server aggregates a global model from many client-trained models while having no access to the client data. Although it is recognized that statistical heterogeneity of the client local data yields slower global model convergence, it is less commonly recognized that it also yields a biased federated global model w…

    Submitted 27 April, 2021; originally announced April 2021.

    Comments: Accepted by IEEE CVPR Workshop on Fair, Data Efficient And Trusted Computer Vision

  10. arXiv:2011.00593  [pdf, other]

    cs.CL stat.ML

    MixKD: Towards Efficient Distillation of Large-scale Language Models

    Authors: Kevin J Liang, Weituo Hao, Dinghan Shen, Yufan Zhou, Weizhu Chen, Changyou Chen, Lawrence Carin

    Abstract: Large-scale language models have recently demonstrated impressive empirical performance. Nevertheless, the improved results are attained at the price of bigger models, more power consumption, and slower inference, which hinder their applicability to low-resource (both memory and computation) platforms. Knowledge distillation (KD) has been demonstrated as an effective framework for compressing such…

    Submitted 17 March, 2021; v1 submitted 1 November, 2020; originally announced November 2020.

    Comments: ICLR 2021 Camera Ready

  11. arXiv:2008.05687  [pdf, other]

    cs.LG stat.ML

    WAFFLe: Weight Anonymized Factorization for Federated Learning

    Authors: Weituo Hao, Nikhil Mehta, Kevin J Liang, Pengyu Cheng, Mostafa El-Khamy, Lawrence Carin

    Abstract: In domains where data are sensitive or private, there is great value in methods that can learn in a distributed manner without the data ever leaving the local devices. In light of this need, federated learning has emerged as a popular training paradigm. However, many federated learning approaches trade transmitting data for communicating updated weight parameters for each local device. Therefore,…

    Submitted 13 August, 2020; originally announced August 2020.

  12. arXiv:2006.12013  [pdf, other]

    cs.LG stat.ML

    CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information

    Authors: Pengyu Cheng, Weituo Hao, Shuyang Dai, Jiachang Liu, Zhe Gan, Lawrence Carin

    Abstract: Mutual information (MI) minimization has gained considerable interest in various machine learning tasks. However, estimating and minimizing MI in high-dimensional spaces remains a challenging problem, especially when only samples, rather than distribution forms, are accessible. Previous works mainly focus on MI lower bound approximation, which is not applicable to MI minimization problems. In thi…

    Submitted 23 July, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: Accepted by the 37th International Conference on Machine Learning (ICML 2020)

  13. arXiv:2006.09543  [pdf, other]

    cs.LG eess.SY stat.ML

    Data Driven Control with Learned Dynamics: Model-Based versus Model-Free Approach

    Authors: Wenjian Hao, Yiqiang Han

    Abstract: This paper compares two different types of data-driven control methods, representing model-based and model-free approaches. One is a recently proposed method, Deep Koopman Representation for Control (DKRC), which utilizes a deep neural network to map an unknown nonlinear dynamical system to a high-dimensional linear system, allowing state-of-the-art control strategies to be employed. The other o…

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: 17 pages, 16 figures

  14. arXiv:2003.01926  [pdf, other]

    stat.ML astro-ph.IM cs.LG

    Transformation Importance with Applications to Cosmology

    Authors: Chandan Singh, Wooseok Ha, Francois Lanusse, Vanessa Boehm, Jia Liu, Bin Yu

    Abstract: Machine learning lies at the heart of new possibilities for scientific discovery, knowledge generation, and artificial intelligence. Its potential benefits to these fields require going beyond predictive accuracy and focusing on interpretability. In particular, many scientific problems require interpretations in a domain-specific interpretable feature space (e.g. the frequency domain) whereas att…

    Submitted 14 June, 2021; v1 submitted 4 March, 2020; originally announced March 2020.

    Comments: Published in ICLR 2020 Workshop on Fundamental Science in the era of AI

  15. arXiv:1906.04863  [pdf, other]

    cs.LG math.OC math.ST stat.ML

    Statistical guarantees for local graph clustering

    Authors: Wooseok Ha, Kimon Fountoulakis, Michael W. Mahoney

    Abstract: Local graph clustering methods aim to find small clusters in very large graphs. These methods take as input a graph and a seed node, and they return as output a good cluster in a running time that depends on the size of the output cluster but that is independent of the size of the input graph. In this paper, we adopt a statistical perspective on local graph clustering, and we analyze the performan…

    Submitted 10 January, 2020; v1 submitted 11 June, 2019; originally announced June 2019.

    Comments: 52 pages, 4 figures, 8 tables

  16. arXiv:1903.03712  [pdf, other]

    cs.LG eess.SY stat.ML

    Adaptive Power System Emergency Control using Deep Reinforcement Learning

    Authors: Qiuhua Huang, Renke Huang, Weituo Hao, Jie Tan, Rui Fan, Zhenyu Huang

    Abstract: Power system emergency control is generally regarded as the last safety net for grid security and resiliency. Existing emergency control schemes are usually designed off-line based on either the conceived "worst" case scenario or a few typical operation scenarios. These schemes are facing significant adaptiveness and robustness issues as increasing uncertainties and variations occur in modern elec…

    Submitted 22 April, 2019; v1 submitted 8 March, 2019; originally announced March 2019.

    Comments: 12 pages

  17. arXiv:1712.01995  [pdf]

    stat.AP

    Short-Term Prediction of Signal Cycle in Actuated-Controlled Corridor Using Sparse Time Series Models

    Authors: Bahman Moghimi, Abolfazl Safikhani, Camille Kamga, Wei Hao, JiaQi Ma

    Abstract: Traffic signals as part of intelligent transportation systems can play a significant role toward making cities smart. Conventionally, most traffic lights are designed with fixed-time control, which induces a lot of slack time (unused green time). Actuated traffic lights control traffic flow in real time and are more responsive to the variation of traffic demands. For an isolated signal, a family o…

    Submitted 18 March, 2018; v1 submitted 5 December, 2017; originally announced December 2017.

  18. arXiv:1709.04451  [pdf, other]

    math.OC stat.ML

    Alternating minimization and alternating descent over nonconvex sets

    Authors: Wooseok Ha, Rina Foygel Barber

    Abstract: We analyze the performance of alternating minimization for loss functions optimized over two variables, where each variable may be restricted to lie in some potentially nonconvex constraint set. This type of setting arises naturally in high-dimensional statistics and signal processing, where the variables often reflect different structures or components within the signals being considered. Our ana…

    Submitted 25 February, 2019; v1 submitted 13 September, 2017; originally announced September 2017.

  19. arXiv:1312.2041  [pdf, other]

    q-bio.PE q-bio.GN q-bio.QM stat.AP stat.ME

    Probabilistic models of genetic variation in structured populations applied to global human studies

    Authors: Wei Hao, Minsun Song, John D. Storey

    Abstract: Modern population genetics studies typically involve genome-wide genotyping of individuals from a diverse network of ancestries. An important, unsolved problem is how to formulate and estimate probabilistic models of observed genotypes that allow for complex population structure. We formulate two general probabilistic models, and we propose computationally efficient algorithms to estimate them. Fi…

    Submitted 3 March, 2015; v1 submitted 6 December, 2013; originally announced December 2013.

    Comments: Wei Hao and Minsun Song contributed equally to this work