Skip to main content

Showing 1–18 of 18 results for author: Deng, S

Searching in archive stat. Search in all archives.
.
  1. arXiv:2406.14535  [pdf, other

    stat.ME math.ST

    On estimation and order selection for multivariate extremes via clustering

    Authors: Shiyuan Deng, He Tang, Shuyang Bai

    Abstract: We investigate the estimation of multivariate extreme models with a discrete spectral measure using spherical clustering techniques. The primary contribution involves devising a method for selecting the order, that is, the number of clusters. The method consistently identifies the true order, i.e., the number of spectral atoms, and enjoys intuitive implementation in practice. Specifically, we intr… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 31 pages, 12 figures

    MSC Class: 62G32 (Primary); 60G70 (Secondary)

  2. arXiv:2406.05287  [pdf, ps, other

    cs.LG cs.GT stat.ML

    Group-wise oracle-efficient algorithms for online multi-group learning

    Authors: Samuel Deng, Daniel Hsu, **gwen Liu

    Abstract: We study the problem of online multi-group learning, a learning model in which an online learner must simultaneously achieve small prediction regret on a large collection of (possibly overlap**) subsequences corresponding to a family of groups. Groups are subsets of the context space, and in fairness applications, they may correspond to subpopulations defined by expressive functions of demograph… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2406.04657  [pdf, other

    cs.LG cs.AI math.ST stat.ML

    Crafting Heavy-Tails in Weight Matrix Spectrum without Gradient Noise

    Authors: Vignesh Kothapalli, Tianyu Pang, Shenyang Deng, Zongmin Liu, Yaoqing Yang

    Abstract: Modern training strategies of deep neural networks (NNs) tend to induce a heavy-tailed (HT) spectra of layer weights. Extensive efforts to study this phenomenon have found that NNs with HT weight spectra tend to generalize well. A prevailing notion for the occurrence of such HT spectra attributes gradient noise during training as a key contributing factor. Our work shows that gradient noise is unn… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: 31 pages, 37 figures

  4. arXiv:2401.00540  [pdf, other

    stat.ME stat.AP

    Study Duration Prediction for Clinical Trials with Time-to-Event Endpoints Using Mixture Distributions Accounting for Heterogeneous Population

    Authors: Hong Zhang, Jie Pu, Shibing Deng, Satrajit Roychoudhury, Haitao Chu, Douglas Robinson

    Abstract: In the era of precision medicine, more and more clinical trials are now driven or guided by biomarkers, which are patient characteristics objectively measured and evaluated as indicators of normal biological processes, pathogenic processes, or pharmacologic responses to therapeutic interventions. With the overarching objective to optimize and personalize disease management, biomarker-guided clinic… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

  5. arXiv:2201.07348  [pdf, other

    cs.LG stat.ML

    Learning Tensor Representations for Meta-Learning

    Authors: Samuel Deng, Yilin Guo, Daniel Hsu, Debmalya Mandal

    Abstract: We introduce a tensor-based model of shared representation for meta-learning from a diverse set of tasks. Prior works on learning linear representations for meta-learning assume that there is a common shared representation across different tasks, and do not consider the additional task-specific observable side information. In this work, we model the meta-parameter through an order-$3$ tensor, whic… ▽ More

    Submitted 18 January, 2022; originally announced January 2022.

    Comments: Forthcoming at AISTATS-2022

  6. arXiv:2112.12909  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Optimal Variable Clustering for High-Dimensional Matrix Valued Data

    Authors: Inbeom Lee, Siyi Deng, Yang Ning

    Abstract: Matrix valued data has become increasingly prevalent in many applications. Most of the existing clustering methods for this type of data are tailored to the mean model and do not account for the dependence structure of the features, which can be very informative, especially in high-dimensional settings or when mean information is not available. To extract the information from the dependence struct… ▽ More

    Submitted 6 December, 2023; v1 submitted 23 December, 2021; originally announced December 2021.

  7. arXiv:2105.09347  [pdf, ps, other

    stat.OT

    An Introduction to DoSStoolkit

    Authors: Rohan Alexander, Samantha-Jo Caetano, Haoluan Chen, Michael Chong, Annie Collins, Shirley Deng, Isaac Ehrlich, Paul Hodgetts, Yena Joo, Marija Pejcinovska, Mariam Walaa, Matthew Wankiewicz

    Abstract: We describe a series of interactive, student-developed, self-paced, modules for learning R. We detail the components of this resource, and the pedagogical underpinning. We discuss the development of this resource, and avenues for future work. Our resource is available as an R package: DoSStoolkit.

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: For associated R package, see https://github.com/RohanAlexander/DoSStoolkit

  8. arXiv:2103.00476  [pdf, other

    cs.NE stat.ML

    Optimal Conversion of Conventional Artificial Neural Networks to Spiking Neural Networks

    Authors: Shikuang Deng, Shi Gu

    Abstract: Spiking neural networks (SNNs) are biology-inspired artificial neural networks (ANNs) that comprise of spiking neurons to process asynchronous discrete signals. While more efficient in power consumption and inference speed on the neuromorphic hardware, SNNs are usually difficult to train directly from scratch with spikes due to the discreteness. As an alternative, many efforts have been devoted to… ▽ More

    Submitted 28 February, 2021; originally announced March 2021.

  9. arXiv:2011.14185  [pdf, other

    stat.ME math.ST stat.ML

    Optimal and Safe Estimation for High-Dimensional Semi-Supervised Learning

    Authors: Siyi Deng, Yang Ning, Jiwei Zhao, He** Zhang

    Abstract: We consider the estimation problem in high-dimensional semi-supervised learning. Our goal is to investigate when and how the unlabeled data can be exploited to improve the estimation of the regression parameters of linear model in light of the fact that such linear models may be misspecified in data analysis. We first establish the minimax lower bound for parameter estimation in the semi-supervise… ▽ More

    Submitted 18 March, 2023; v1 submitted 28 November, 2020; originally announced November 2020.

  10. arXiv:2009.07022  [pdf, other

    cs.LG cs.CL cs.DB cs.IR stat.ML

    The Devil is the Classifier: Investigating Long Tail Relation Classification with Decoupling Analysis

    Authors: Haiyang Yu, Ningyu Zhang, Shumin Deng, Zonggang Yuan, Yantao Jia, Huajun Chen

    Abstract: Long-tailed relation classification is a challenging problem as the head classes may dominate the training phase, thereby leading to the deterioration of the tail performance. Existing solutions usually address this issue via class-balancing strategies, e.g., data re-sampling and loss re-weighting, but all these methods adhere to the schema of entangling learning of the representation and classifi… ▽ More

    Submitted 15 September, 2020; originally announced September 2020.

  11. arXiv:2007.06029  [pdf, other

    cs.LG stat.ML

    Ensuring Fairness Beyond the Training Data

    Authors: Debmalya Mandal, Samuel Deng, Suman Jana, Jeannette M. Wing, Daniel Hsu

    Abstract: We initiate the study of fair classifiers that are robust to perturbations in the training distribution. Despite recent progress, the literature on fairness has largely ignored the design of fair and robust classifiers. In this work, we develop classifiers that are fair not only with respect to the training distribution, but also for a class of distributions that are weighted perturbations of the… ▽ More

    Submitted 4 November, 2020; v1 submitted 12 July, 2020; originally announced July 2020.

    Comments: 18 pages, 3 figures, To appear at NeurIPS-2020

  12. arXiv:2007.05627  [pdf, other

    stat.ML cs.LG

    A Performance Guarantee for Spectral Clustering

    Authors: March Boedihardjo, Shaofeng Deng, Thomas Strohmer

    Abstract: The two-step spectral clustering method, which consists of the Laplacian eigenmap and a rounding step, is a widely used method for graph partitioning. It can be seen as a natural relaxation to the NP-hard minimum ratio cut problem. In this paper we study the central question: when is spectral clustering able to find the global solution to the minimum ratio cut problem? First we provide a condition… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

  13. arXiv:2004.09780  [pdf, ps, other

    stat.ML cs.LG cs.SI

    Strong Consistency, Graph Laplacians, and the Stochastic Block Model

    Authors: Shaofeng Deng, Shuyang Ling, Thomas Strohmer

    Abstract: Spectral clustering has become one of the most popular algorithms in data clustering and community detection. We study the performance of classical two-step spectral clustering via the graph Laplacian to learn the stochastic block model. Our aim is to answer the following question: when is spectral clustering via the graph Laplacian able to achieve strong consistency, i.e., the exact recovery of t… ▽ More

    Submitted 21 April, 2020; originally announced April 2020.

  14. arXiv:2003.12020  [pdf, ps, other

    cs.LG cs.CR stat.ML

    A Separation Result Between Data-oblivious and Data-aware Poisoning Attacks

    Authors: Samuel Deng, Sanjam Garg, Somesh Jha, Saeed Mahloujifar, Mohammad Mahmoody, Abhradeep Thakurta

    Abstract: Poisoning attacks have emerged as a significant security threat to machine learning algorithms. It has been demonstrated that adversaries who make small changes to the training set, such as adding specially crafted data points, can hurt the performance of the output model. Some of the stronger poisoning attacks require the full knowledge of the training data. This leaves open the possibility of ac… ▽ More

    Submitted 13 December, 2021; v1 submitted 26 March, 2020; originally announced March 2020.

  15. arXiv:2002.09062  [pdf

    q-bio.MN cs.LG physics.chem-ph stat.ML

    Autonomous Discovery of Unknown Reaction Pathways from Data by Chemical Reaction Neural Network

    Authors: Weiqi Ji, Sili Deng

    Abstract: Chemical reactions occur in energy, environmental, biological, and many other natural systems, and the inference of the reaction networks is essential to understand and design the chemical processes in engineering and life sciences. Yet, revealing the reaction pathways for complex systems and processes is still challenging due to the lack of knowledge of the involved species and reactions. Here, w… ▽ More

    Submitted 8 January, 2021; v1 submitted 20 February, 2020; originally announced February 2020.

    Journal ref: The Journal of Physical Chemistry A, 2021

  16. arXiv:1912.10202  [pdf, other

    cs.LG cs.SI stat.ML

    Graph Message Passing with Cross-location Attentions for Long-term ILI Prediction

    Authors: Songgaojun Deng, Shusen Wang, Huzefa Rangwala, Li**g Wang, Yue Ning

    Abstract: Forecasting influenza-like illness (ILI) is of prime importance to epidemiologists and health-care providers. Early prediction of epidemic outbreaks plays a pivotal role in disease intervention and control. Most existing work has either limited long-term prediction performance or lacks a comprehensive ability to capture spatio-temporal dependencies in data. Accurate and early disease forecasting m… ▽ More

    Submitted 28 December, 2019; v1 submitted 21 December, 2019; originally announced December 2019.

    Comments: 17 pages, 22 figures, 5 tables

  17. arXiv:1908.08507  [pdf, other

    cs.LG cs.CL stat.ML

    Transfer Learning for Relation Extraction via Relation-Gated Adversarial Learning

    Authors: Ningyu Zhang, Shumin Deng, Zhanlin Sun, Jiaoyan Chen, Wei Zhang, Huajun Chen

    Abstract: Relation extraction aims to extract relational facts from sentences. Previous models mainly rely on manually labeled datasets, seed instances or human-crafted patterns, and distant supervision. However, the human annotation is expensive, while human-crafted patterns suffer from semantic drift and distant supervision samples are usually noisy. Domain adaptation methods enable leveraging labeled dat… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

  18. arXiv:1811.09886  [pdf, other

    cs.LG stat.ML

    Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications

    Authors: Jongsoo Park, Maxim Naumov, Protonu Basu, Summer Deng, Aravind Kalaiah, Daya Khudia, James Law, Parth Malani, Andrey Malevich, Satish Nadathur, Juan Pino, Martin Schatz, Alexander Sidorov, Viswanath Sivakumar, Andrew Tulloch, Xiaodong Wang, Yiming Wu, Hector Yuen, Utku Diril, Dmytro Dzhulgakov, Kim Hazelwood, Bill Jia, Yangqing Jia, Lin Qiao, Vijay Rao , et al. (3 additional authors not shown)

    Abstract: The application of deep learning techniques resulted in remarkable improvement of machine learning models. In this paper provides detailed characterizations of deep learning models used in many Facebook social network services. We present computational characteristics of our models, describe high performance optimizations targeting existing systems, point out their limitations and make suggestions… ▽ More

    Submitted 29 November, 2018; v1 submitted 24 November, 2018; originally announced November 2018.