Showing 1–17 of 17 results for author: He, P

  1. arXiv:2404.13177  [pdf, other]

    stat.ME stat.AP

    A Bayesian Hybrid Design with Borrowing from Historical Study

    Authors: Zhaohua Lu, John Toso, Girma Ayele, Philip He

    Abstract: In early phase drug development of combination therapy, the primary objective is to preliminarily assess whether there is additive activity when a novel agent is combined with an established monotherapy. Due to potential feasibility issues with a large randomized study, uncontrolled single-arm trials have been the mainstream approach in cancer clinical trials. However, such trials often present signi…

    Submitted 29 April, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

  2. arXiv:2401.17426  [pdf, other]

    cs.LG cs.AI stat.ML

    Superiority of Multi-Head Attention in In-Context Linear Regression

    Authors: Yingqian Cui, Jie Ren, Pengfei He, Jiliang Tang, Yue Xing

    Abstract: We present a theoretical analysis of the performance of transformers with softmax attention in in-context learning with linear regression tasks. While the existing literature predominantly focuses on the convergence of transformers with single-/multi-head attention, our research centers on comparing their performance. We conduct an exact theoretical analysis to demonstrate that multi-head attention…

    Submitted 30 January, 2024; originally announced January 2024.

  3. arXiv:2312.15352  [pdf]

    stat.AP

    A Bayesian Basket Trial Design Using Local Power Prior

    Authors: Haiming Zhou, Rex Shen, Sutan Wu, Philip He

    Abstract: In recent years, basket trials, which enable the evaluation of an experimental therapy across multiple tumor types within a single protocol, have gained prominence in early-phase oncology development. Unlike traditional trials, where each tumor type is evaluated separately with limited sample size, basket trials offer the advantage of borrowing information across various tumor types. However, a ke…

    Submitted 19 April, 2024; v1 submitted 23 December, 2023; originally announced December 2023.

  4. arXiv:2310.06389  [pdf, other]

    cs.CV stat.ML

    Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling

    Authors: Huangjie Zheng, Zhendong Wang, Jianbo Yuan, Guanghan Ning, Pengcheng He, Quanzeng You, Hongxia Yang, Mingyuan Zhou

    Abstract: Diffusion models excel at generating photo-realistic images but come with significant computational costs in both training and sampling. While various techniques address these computational challenges, a less-explored issue is designing an efficient and adaptable network backbone for iterative refinement. Current options like U-Net and Vision Transformer often rely on resource-intensive deep netwo…

    Submitted 27 June, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

  5. arXiv:2305.00350  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV stat.ML

    POUF: Prompt-oriented unsupervised fine-tuning for large pre-trained models

    Authors: Korawat Tanwisuth, Shujian Zhang, Huangjie Zheng, Pengcheng He, Mingyuan Zhou

    Abstract: Through prompting, large-scale pre-trained models have become more expressive and powerful, gaining significant attention in recent years. Though these big models have zero-shot capabilities, in general, labeled data are still required to adapt them to downstream tasks. To overcome this critical limitation, we propose an unsupervised fine-tuning framework to directly fine-tune the model or prompt…

    Submitted 29 April, 2023; originally announced May 2023.

    Comments: ICML 2023; PyTorch code is available at https://github.com/korawat-tanwisuth/POUF

  6. arXiv:2206.03254  [pdf, ps, other]

    cs.LG stat.ML

    Demystifying the Global Convergence Puzzle of Learning Over-parameterized ReLU Nets in Very High Dimensions

    Authors: Peng He

    Abstract: This theoretical paper is devoted to developing a rigorous theory for demystifying the global convergence phenomenon in a challenging scenario: learning over-parameterized Rectified Linear Unit (ReLU) nets for very high-dimensional datasets under very mild assumptions. A major ingredient of our analysis is a fine-grained analysis of random activation matrices. The essential virtue of dissecting act…

    Submitted 4 June, 2022; originally announced June 2022.

  7. arXiv:2206.02262  [pdf, other]

    cs.LG stat.ML

    Diffusion-GAN: Training GANs with Diffusion

    Authors: Zhendong Wang, Huangjie Zheng, Pengcheng He, Weizhu Chen, Mingyuan Zhou

    Abstract: Generative adversarial networks (GANs) are challenging to train stably, and a promising remedy of injecting instance noise into the discriminator input has not been very effective in practice. In this paper, we propose Diffusion-GAN, a novel GAN framework that leverages a forward diffusion chain to generate Gaussian-mixture distributed instance noise. Diffusion-GAN consists of three components, in…

    Submitted 25 August, 2023; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: Project homepage: https://github.com/Zhendong-Wang/Diffusion-GAN; ICLR 2023 camera ready version

  8. arXiv:2202.09671  [pdf, other]

    stat.ML cs.LG

    Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-Encoders

    Authors: Huangjie Zheng, Pengcheng He, Weizhu Chen, Mingyuan Zhou

    Abstract: Employing a forward diffusion chain to gradually map the data to a noise distribution, diffusion-based generative models learn how to generate the data by inferring a reverse diffusion chain. However, this approach is slow and costly because it needs many forward and reverse steps. We propose a faster and cheaper approach that adds noise not until the data become pure random noise, but until they…

    Submitted 7 September, 2023; v1 submitted 19 February, 2022; originally announced February 2022.

    Comments: ICLR 2023 camera-ready version

  9. arXiv:2107.13610  [pdf, other]

    cs.LG math.DG math.SP stat.ML

    Large sample spectral analysis of graph-based multi-manifold clustering

    Authors: Nicolas Garcia Trillos, Pengfei He, Chenghui Li

    Abstract: In this work we study statistical properties of graph-based algorithms for multi-manifold clustering (MMC). In MMC the goal is to retrieve the multi-manifold structure underlying a given Euclidean data set when this one is assumed to be obtained by sampling a distribution on a union of manifolds $\mathcal{M} = \mathcal{M}_1 \cup\dots \cup \mathcal{M}_N$ that may intersect with each other and that…

    Submitted 10 November, 2022; v1 submitted 28 July, 2021; originally announced July 2021.

    Comments: We fixed some typos and added some detailed discussion to our theoretical results. We also supplemented more experimental discussions

  10. arXiv:2104.12301  [pdf, ps, other]

    stat.ME astro-ph.IM physics.comp-ph

    Data-Based Optimal Bandwidth for Kernel Density Estimation of Statistical Samples

    Authors: Zhen-Wei Li, Ping He

    Abstract: It is a common practice to evaluate probability density function or matter spatial density function from statistical samples. Kernel density estimation is a frequently used method, but to select an optimal bandwidth of kernel estimation, which is completely based on data samples, is a long-term issue that has not been well settled so far. There exist analytic formulae of optimal kernel bandwidth,…

    Submitted 25 April, 2021; originally announced April 2021.

    Comments: 7 pages, 8 figures

    Journal ref: Commun. Theor. Phys. 70 (2018) 728-734

  11. arXiv:2009.00827  [pdf, ps, other]

    stat.ME

    Estimating the reciprocal of a binomial proportion

    Authors: Jiajin Wei, Ping He, Tiejun Tong

    Abstract: As a classic parameter from the binomial distribution, the binomial proportion has been well studied in the literature owing to its wide range of applications. In contrast, the reciprocal of the binomial proportion, also known as the inverse proportion, is often overlooked, even though it also plays an important role in various fields including clinical studies and random sampling. The maximum lik…

    Submitted 30 August, 2021; v1 submitted 2 September, 2020; originally announced September 2020.

  12. arXiv:2006.04164  [pdf, other]

    cs.IR cs.LG stat.ML

    Single-Layer Graph Convolutional Networks For Recommendation

    Authors: Yue Xu, Hao Chen, Zengde Deng, Junxiong Zhu, Yanghua Li, Peng He, Wenyao Gao, Wenjun Xu

    Abstract: Graph Convolutional Networks (GCNs) and their variants have received significant attention and achieved state-of-the-art performance on various recommendation tasks. However, many existing GCN models tend to perform recursive aggregations among all related nodes, which incurs a severe computational burden. Moreover, they favor multi-layer architectures in conjunction with complicated modeling techn…

    Submitted 7 June, 2020; originally announced June 2020.

  13. arXiv:1908.03265  [pdf, other]

    cs.LG cs.CL stat.ML

    On the Variance of the Adaptive Learning Rate and Beyond

    Authors: Liyuan Liu, Haoming Jiang, Pengcheng He, Weizhu Chen, Xiaodong Liu, Jianfeng Gao, Jiawei Han

    Abstract: The learning rate warmup heuristic achieves remarkable success in stabilizing training, accelerating convergence and improving generalization for adaptive stochastic optimization algorithms like RMSprop and Adam. Here, we study its mechanism in detail. Pursuing the theory behind warmup, we identify a problem of the adaptive learning rate (i.e., it has problematically large variance in the early s…

    Submitted 25 October, 2021; v1 submitted 8 August, 2019; originally announced August 2019.

    Comments: ICLR 2020. Fix several typos in the previous version

  14. arXiv:1907.04707  [pdf, other]

    cs.LG stat.ML

    Label-Aware Graph Convolutional Networks

    Authors: Hao Chen, Yue Xu, Feiran Huang, Zengde Deng, Wenbing Huang, Senzhang Wang, Peng He, Zhoujun Li

    Abstract: Recent advances in Graph Convolutional Networks (GCNs) have led to state-of-the-art performance on various graph-related tasks. However, most existing GCN models do not explicitly identify whether all the aggregated neighbors are valuable to the learning tasks, which may harm the learning performance. In this paper, we consider the problem of node classification and propose the Label-Aware Graph C…

    Submitted 5 September, 2020; v1 submitted 10 July, 2019; originally announced July 2019.

    Comments: Accepted by CIKM 2020

  15. arXiv:1712.07107  [pdf, other]

    cs.LG cs.CR cs.CV stat.ML

    Adversarial Examples: Attacks and Defenses for Deep Learning

    Authors: Xiaoyong Yuan, Pan He, Qile Zhu, Xiaolin Li

    Abstract: With rapid progress and significant successes in a wide spectrum of applications, deep learning is being applied in many safety-critical environments. However, deep neural networks have been recently found vulnerable to well-designed input samples, called adversarial examples. Adversarial examples are imperceptible to humans but can easily fool deep neural networks in the testing/deploying stage. T…

    Submitted 6 July, 2018; v1 submitted 19 December, 2017; originally announced December 2017.

    Comments: Github: https://github.com/chbrian/awesome-adversarial-examples-dl

  16. arXiv:1712.01145  [pdf, other]

    cs.CR cs.LG stat.ML

    Learning Fast and Slow: PROPEDEUTICA for Real-time Malware Detection

    Authors: Ruimin Sun, Xiaoyong Yuan, Pan He, Qile Zhu, Aokun Chen, Andre Gregio, Daniela Oliveira, Xiaolin Li

    Abstract: Existing malware detectors on safety-critical devices have difficulties in runtime detection due to the performance overhead. In this paper, we introduce PROPEDEUTICA, a framework for efficient and effective real-time malware detection, leveraging the best of conventional machine learning (ML) and deep learning (DL) techniques. In PROPEDEUTICA, all software that starts execution is considered as benign…

    Submitted 17 October, 2021; v1 submitted 4 December, 2017; originally announced December 2017.

    Comments: 12 pages, 4 figures. This paper has been accepted to IEEE Transactions on Neural Networks and Learning Systems (TNNLS)

  17. arXiv:1410.5356  [pdf, ps, other]

    stat.ME physics.comp-ph physics.data-an

    Statistical computation of Boltzmann entropy and estimation of the optimal probability density function from statistical sample

    Authors: Ning Sui, Min Li, Ping He

    Abstract: In this work, we investigate the statistical computation of the Boltzmann entropy of statistical samples. For this purpose, we use both histogram and kernel function to estimate the probability density function of statistical samples. We find that, due to coarse-graining, the entropy is a monotonic increasing function of the bin width for histogram or bandwidth for kernel estimation, which seems t…

    Submitted 17 October, 2014; originally announced October 2014.

    Comments: 8 pages, 6 figures, MNRAS, in the press

    Journal ref: MNRAS (2014), 445, 4211 - 4217