Skip to main content

Showing 1–24 of 24 results for author: Xie, Z

Searching in archive stat. Search in all archives.
.
  1. arXiv:2309.04957  [pdf, other

    stat.ME

    Winner's Curse Free Robust Mendelian Randomization with Summary Data

    Authors: Zhongming Xie, Wanheng Zhang, **gshen Wang, Chong Wu

    Abstract: In the past decade, the increased availability of genome-wide association studies summary data has popularized Mendelian Randomization (MR) for conducting causal inference. MR analyses, incorporating genetic variants as instrumental variables, are known for their robustness against reverse causation bias and unmeasured confounders. Nevertheless, classical MR analyses utilizing summary data may sti… ▽ More

    Submitted 10 September, 2023; originally announced September 2023.

  2. Functional PCA and Deep Neural Networks-based Bayesian Inverse Uncertainty Quantification with Transient Experimental Data

    Authors: Ziyu Xie, Mahmoud Yaseen, Xu Wu

    Abstract: Inverse UQ is the process to inversely quantify the model input uncertainties based on experimental data. This work focuses on develo** an inverse UQ process for time-dependent responses, using dimensionality reduction by functional principal component analysis (PCA) and deep neural network (DNN)-based surrogate models. The demonstration is based on the inverse UQ of TRACE physical model paramet… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: 31 pages, 21 figures

  3. arXiv:2303.18067  [pdf, other

    physics.ao-ph stat.AP

    Rediscover Climate Change during Global Warming Slowdown via Wasserstein Stability Analysis

    Authors: Zhiang Xie, Dongwei Chen, Puxi Li

    Abstract: Climate change is one of the key topics in climate science. However, previous research has predominantly concentrated on changes in mean values, and few research examines changes in Probability Distribution Function (PDF). In this study, a novel method called Wasserstein Stability Analysis (WSA) is developed to identify PDF changes, especially the extreme event shift and non-linear physical value… ▽ More

    Submitted 28 May, 2023; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: 14 pages, 4 figures, 1 Algorithm, and 3-page supplementary materials

  4. arXiv:2212.02083  [pdf, other

    cs.LG stat.ML

    On the Overlooked Structure of Stochastic Gradients

    Authors: Zeke Xie, Qian-Yuan Tang, Mingming Sun, ** Li

    Abstract: Stochastic gradients closely relate to both optimization and generalization of deep neural networks (DNNs). Some works attempted to explain the success of stochastic optimization for deep learning by the arguably heavy-tail properties of gradient noise, while other works presented theoretical and empirical evidence against the heavy-tail hypothesis on gradient noise. Unfortunately, formal statisti… ▽ More

    Submitted 20 October, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2023. 20 pages, 16 figures, 17 Tables; Key Words: Deep Learning, Stochastic Gradient, Optimization. arXiv admin note: text overlap with arXiv:2201.13011

  5. arXiv:2208.07959  [pdf, other

    stat.ME stat.AP

    Variable Selection in Latent Regression IRT Models via Knockoffs: An Application to International Large-scale Assessment in Education

    Authors: Zilong Xie, Yunxiao Chen, Matthias von Davier, Haolei Weng

    Abstract: International large-scale assessments (ILSAs) play an important role in educational research and policy making. They collect valuable data on education quality and performance development across many education systems, giving countries the opportunity to share techniques, organizational structures, and policies that have proven efficient and successful. To gain insights from ILSA data, we identify… ▽ More

    Submitted 14 November, 2023; v1 submitted 16 August, 2022; originally announced August 2022.

  6. arXiv:2207.02943  [pdf, other

    econ.EM stat.ME

    Degrees of Freedom and Information Criteria for the Synthetic Control Method

    Authors: Guillaume Allaire Pouliot, Zhen Xie

    Abstract: We provide an analytical characterization of the model flexibility of the synthetic control method (SCM) in the familiar form of degrees of freedom. We obtain estimable information criteria. These may be used to circumvent cross-validation when selecting either the weighting matrix in the SCM with covariates, or the tuning parameter in model averaging or penalized variants of SCM. We assess the im… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

  7. Bayesian Inverse Uncertainty Quantification of a MOOSE-based Melt Pool Model for Additive Manufacturing Using Experimental Data

    Authors: Ziyu Xie, Wen Jiang, Congjian Wang, Xu Wu

    Abstract: Additive manufacturing (AM) technology is being increasingly adopted in a wide variety of application areas due to its ability to rapidly produce, prototype, and customize designs. AM techniques afford significant opportunities in regard to nuclear materials, including an accelerated fabrication process and reduced cost. High-fidelity modeling and simulation (M\&S) of AM processes is being develop… ▽ More

    Submitted 17 May, 2021; v1 submitted 11 May, 2021; originally announced May 2021.

    Comments: 26 pages, 11 figures

  8. Towards Improving the Predictive Capability of Computer Simulations by Integrating Inverse Uncertainty Quantification and Quantitative Validation with Bayesian Hypothesis Testing

    Authors: Ziyu Xie, Farah Alsafadi, Xu Wu

    Abstract: The Best Estimate plus Uncertainty (BEPU) approach for nuclear systems modeling and simulation requires that the prediction uncertainty must be quantified in order to prove that the investigated design stays within acceptance criteria. A rigorous Uncertainty Quantification (UQ) process should simultaneously consider multiple sources of quantifiable uncertainties: (1) parameter uncertainty due to r… ▽ More

    Submitted 2 May, 2021; originally announced May 2021.

    Comments: 29 pages, 11 figures

  9. A Comprehensive Survey of Inverse Uncertainty Quantification of Physical Model Parameters in Nuclear System Thermal-Hydraulics Codes

    Authors: Xu Wu, Ziyu Xie, Farah Alsafadi, Tomasz Kozlowski

    Abstract: Uncertainty Quantification (UQ) is an essential step in computational model validation because assessment of the model accuracy requires a concrete, quantifiable measure of uncertainty in the model predictions. The concept of UQ in the nuclear community generally means forward UQ (FUQ), in which the information flow is from the inputs to the outputs. Inverse UQ (IUQ), in which the information flow… ▽ More

    Submitted 26 April, 2021; originally announced April 2021.

    Comments: 76 pages, 10 figures

  10. arXiv:2010.13520  [pdf, other

    cs.LG cs.CR stat.ML

    Differentially Private (Gradient) Expectation Maximization Algorithm with Statistical Guarantees

    Authors: Di Wang, Jiahao Ding, Lijie Hu, Zejun Xie, Miao Pan, **hui Xu

    Abstract: (Gradient) Expectation Maximization (EM) is a widely used algorithm for estimating the maximum likelihood of mixture models or incomplete data problems. A major challenge facing this popular technique is how to effectively preserve the privacy of sensitive data. Previous research on this problem has already lead to the discovery of some Differentially Private (DP) algorithms for (Gradient) EM. How… ▽ More

    Submitted 16 January, 2022; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: Submiited. arXiv admin note: text overlap with arXiv:2010.09576

  11. arXiv:2009.11469  [pdf, other

    cs.LG cs.AI stat.ML

    Revisiting Graph Convolutional Network on Semi-Supervised Node Classification from an Optimization Perspective

    Authors: Hongwei Zhang, Ti** Yan, Zenjun Xie, Yuanqing Xia, Yuan Zhang

    Abstract: Graph convolutional networks (GCNs) have achieved promising performance on various graph-based tasks. However they suffer from over-smoothing when stacking more layers. In this paper, we present a quantitative study on this observation and develop novel insights towards the deeper GCN. First, we interpret the current graph convolutional operations from an optimization perspective and argue that ov… ▽ More

    Submitted 24 September, 2020; v1 submitted 23 September, 2020; originally announced September 2020.

  12. arXiv:2008.11832  [pdf, other

    cs.LG cs.DC physics.comp-ph stat.ML

    Adaptive Neural Network-Based Approximation to Accelerate Eulerian Fluid Simulation

    Authors: Wenqian Dong, Jie Liu, Zhen Xie, Dong Li

    Abstract: The Eulerian fluid simulation is an important HPC application. The neural network has been applied to accelerate it. The current methods that accelerate the fluid simulation with neural networks lack flexibility and generalization. In this paper, we tackle the above limitation and aim to enhance the applicability of neural networks in the Eulerian fluid simulation. We introduce Smartfluidnet, a fr… ▽ More

    Submitted 26 August, 2020; originally announced August 2020.

  13. arXiv:2006.15815  [pdf, other

    cs.LG stat.ML

    Adaptive Inertia: Disentangling the Effects of Adaptive Learning Rate and Momentum

    Authors: Zeke Xie, Xinrui Wang, Huishuai Zhang, Issei Sato, Masashi Sugiyama

    Abstract: Adaptive Moment Estimation (Adam), which combines Adaptive Learning Rate and Momentum, would be the most popular stochastic optimizer for accelerating the training of deep neural networks. However, it is empirically known that Adam often generalizes worse than Stochastic Gradient Descent (SGD). The purpose of this paper is to unveil the mystery of this behavior in the diffusion theoretical framewo… ▽ More

    Submitted 14 June, 2022; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: ICML2022, Long Oral Presentation, 30 pages, 14 figures, Key Words: Deep Learning Theory, Optimization, Adam, Adaptive Inertia, Flat Minima

  14. arXiv:2005.08704  [pdf, other

    cs.CV cs.LG stat.ML

    A Biologically Inspired Feature Enhancement Framework for Zero-Shot Learning

    Authors: Zhongwu Xie, Weipeng Cao, Xizhao Wang, Zhong Ming, **g**g Zhang, Jiyong Zhang

    Abstract: Most of the Zero-Shot Learning (ZSL) algorithms currently use pre-trained models as their feature extractors, which are usually trained on the ImageNet data set by using deep neural networks. The richness of the feature information embedded in the pre-trained models can help the ZSL model extract more useful features from its limited training samples. However, sometimes the difference between the… ▽ More

    Submitted 13 May, 2020; originally announced May 2020.

  15. arXiv:2003.01762  [pdf, other

    cs.LG stat.ML

    FLAME: A Self-Adaptive Auto-labeling System for Heterogeneous Mobile Processors

    Authors: Jie Liu, Jiawen Liu, Zhen Xie, Dong Li

    Abstract: How to accurately and efficiently label data on a mobile device is critical for the success of training machine learning models on mobile devices. Auto-labeling data on mobile devices is challenging, because data is usually incrementally generated and there is possibility of having unknown labels. Furthermore, the rich hardware heterogeneity on mobile devices creates challenges on efficiently exec… ▽ More

    Submitted 3 March, 2020; originally announced March 2020.

  16. arXiv:2002.03495  [pdf, other

    cs.LG stat.ML

    A Diffusion Theory For Deep Learning Dynamics: Stochastic Gradient Descent Exponentially Favors Flat Minima

    Authors: Zeke Xie, Issei Sato, Masashi Sugiyama

    Abstract: Stochastic Gradient Descent (SGD) and its variants are mainstream methods for training deep networks in practice. SGD is known to find a flat minimum that often generalizes well. However, it is mathematically unclear how deep learning can select a flat minimum among so many minima. To answer the question quantitatively, we develop a density diffusion theory (DDT) to reveal how minima selection qua… ▽ More

    Submitted 15 January, 2021; v1 submitted 9 February, 2020; originally announced February 2020.

    Comments: ICLR 2021; 28 pages; 19 figures

  17. arXiv:1912.05903   

    q-bio.QM cs.LG q-bio.BM stat.ML

    Prediction and optimization of NaV1.7 inhibitors based on machine learning methods

    Authors: Weikaixin Kong, Xinyu Tu, Zhengwei Xie, Zhuo Huang

    Abstract: We used machine learning methods to predict NaV1.7 inhibitors and found the model RF-CDK that performed best on the imbalanced dataset. Using the RF-CDK model for screening drugs, we got effective compounds K1. We use the cell patch clamp method to verify K1. However, because the model evaluation method in this article is not comprehensive enough, there is still a lot of research work to be perfor… ▽ More

    Submitted 15 February, 2020; v1 submitted 29 November, 2019; originally announced December 2019.

    Comments: The evaluation of the model in the results section of this article is not comprehensive enough.We will carry out further work. The article needs to be polished. There are certain disadvantages to the molecular optimization method. The discussion part is not deep enough, so withdraw is needed

  18. arXiv:1912.03015  [pdf, other

    cs.LG cs.RO stat.ML

    Learning to Correspond Dynamical Systems

    Authors: Nam Hee Kim, Zhaoming Xie, Michiel van de Panne

    Abstract: Many dynamical systems exhibit similar structure, as often captured by hand-designed simplified models that can be used for analysis and control. We develop a method for learning to correspond pairs of dynamical systems via a learned latent dynamical system. Given trajectory data from two dynamical systems, we learn a shared latent state space and a shared latent dynamics model, along with an enco… ▽ More

    Submitted 4 June, 2020; v1 submitted 6 December, 2019; originally announced December 2019.

  19. arXiv:1812.00335  [pdf, other

    cs.LG stat.ML

    GAN-EM: GAN based EM learning framework

    Authors: Wentian Zhao, Shaojie Wang, Zhihuai Xie, **g Shi, Chenliang Xu

    Abstract: Expectation maximization (EM) algorithm is to find maximum likelihood solution for models having latent variables. A typical example is Gaussian Mixture Model (GMM) which requires Gaussian assumption, however, natural images are highly non-Gaussian so that GMM cannot be applied to perform clustering task on pixel space. To overcome such limitation, we propose a GAN based EM learning framework that… ▽ More

    Submitted 2 December, 2018; originally announced December 2018.

  20. arXiv:1809.00083  [pdf, other

    q-bio.BM cs.LG stat.ME

    Predicting protein inter-residue contacts using composite likelihood maximization and deep learning

    Authors: Haicang Zhang, Qi Zhang, Fusong Ju, Jianwei Zhu, Shiwei Sun, Yujuan Gao, Ziwei Xie, Minghua Deng, Shiwei Sun, Wei-Mou Zheng, Dongbo Bu

    Abstract: Accurate prediction of inter-residue contacts of a protein is important to calcu- lating its tertiary structure. Analysis of co-evolutionary events among residues has been proved effective to inferring inter-residue contacts. The Markov ran- dom field (MRF) technique, although being widely used for contact prediction, suffers from the following dilemma: the actual likelihood function of MRF is acc… ▽ More

    Submitted 31 August, 2018; originally announced September 2018.

  21. arXiv:1807.11790  [pdf, other

    cs.GT cs.LG stat.ML

    Practical Constrained Optimization of Auction Mechanisms in E-Commerce Sponsored Search Advertising

    Authors: Gang Bai, Zhihui Xie, Liang Wang

    Abstract: Sponsored search in E-commerce platforms such as Amazon, Taobao and Tmall provides sellers an effective way to reach potential buyers with most relevant purpose. In this paper, we study the auction mechanism optimization problem in sponsored search on Alibaba's mobile E-commerce platform. Besides generating revenue, we are supposed to maintain an efficient marketplace with plenty of quality users,… ▽ More

    Submitted 31 July, 2018; originally announced July 2018.

    Comments: 6 pages, 1 figure

  22. arXiv:1803.08010  [pdf, other

    cs.SI physics.soc-ph stat.AP stat.ML

    Social Media Would Not Lie: Prediction of the 2016 Taiwan Election via Online Heterogeneous Data

    Authors: Zheng Xie, Guannan Liu, Junjie Wu, Yong Tan

    Abstract: The prevalence of online media has attracted researchers from various domains to explore human behavior and make interesting predictions. In this research, we leverage heterogeneous social media data collected from various online platforms to predict Taiwan's 2016 presidential election. In contrast to most existing research, we take a "signal" view of heterogeneous information and adopt the Kalman… ▽ More

    Submitted 3 April, 2018; v1 submitted 21 March, 2018; originally announced March 2018.

    Journal ref: EPJ Data Science,2018,7:32

  23. arXiv:1711.09534  [pdf, other

    cs.CL cs.LG stat.ML

    Neural Text Generation: A Practical Guide

    Authors: Ziang Xie

    Abstract: Deep learning methods have recently achieved great empirical success on machine translation, dialogue response generation, summarization, and other text generation tasks. At a high level, the technique has been to train end-to-end neural network models consisting of an encoder model to produce a hidden representation of the source text, followed by a decoder model to generate the target. While suc… ▽ More

    Submitted 26 November, 2017; originally announced November 2017.

  24. arXiv:1406.7806  [pdf, other

    cs.CL cs.LG cs.NE stat.ML

    Building DNN Acoustic Models for Large Vocabulary Speech Recognition

    Authors: Andrew L. Maas, Peng Qi, Ziang Xie, Awni Y. Hannun, Christopher T. Lengerich, Daniel Jurafsky, Andrew Y. Ng

    Abstract: Deep neural networks (DNNs) are now a central component of nearly all state-of-the-art speech recognition systems. Building neural network acoustic models requires several design decisions including network architecture, size, and training loss function. This paper offers an empirical investigation on which aspects of DNN acoustic model design are most important for speech recognition system perfo… ▽ More

    Submitted 20 January, 2015; v1 submitted 30 June, 2014; originally announced June 2014.