Skip to main content

Showing 1–32 of 32 results for author: Shen, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.20088  [pdf, other

    stat.AP stat.ME

    Personalized Predictions from Population Level Experiments: A Study on Alzheimer's Disease

    Authors: Dennis Shen, Anish Agarwal, Vishal Misra, Bjoern Schelter, Devavrat Shah, Helen Shiells, Claude Wischik

    Abstract: The purpose of this article is to infer patient level outcomes from population level randomized control trials (RCTs). In this pursuit, we utilize the recently proposed synthetic nearest neighbors (SNN) estimator. At its core, SNN leverages information across patients to impute missing data associated with each patient of interest. We focus on two types of missing data: (i) unrecorded outcomes fro… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  2. arXiv:2309.15769  [pdf, other

    math.ST cs.LG stat.ME

    Algebraic and Statistical Properties of the Ordinary Least Squares Interpolator

    Authors: Dennis Shen, Dogyoon Song, Peng Ding, Jasjeet S. Sekhon

    Abstract: Deep learning research has uncovered the phenomenon of benign overfitting for overparameterized statistical models, which has drawn significant theoretical interest in recent years. Given its simplicity and practicality, the ordinary least squares (OLS) interpolator has become essential to gain foundational insights into this phenomenon. While properties of OLS are well established in classical, u… ▽ More

    Submitted 30 May, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

  3. arXiv:2209.12388  [pdf, other

    stat.ME stat.AP

    Joint and Individual Component Regression

    Authors: Peiyao Wang, Haodong Wang, Quefeng Li, Dinggang Shen, Yufeng Liu

    Abstract: Multi-group data are commonly seen in practice. Such data structure consists of data from multiple groups and can be challenging to analyze due to data heterogeneity. We propose a novel Joint and Individual Component Regression (JICO) model to analyze multi-group data. In particular, our proposed model decomposes the response into shared and group-specific components, which are driven by low-rank… ▽ More

    Submitted 25 September, 2022; originally announced September 2022.

  4. arXiv:2207.14481  [pdf, other

    econ.EM stat.ME

    Same Root Different Leaves: Time Series and Cross-Sectional Methods in Panel Data

    Authors: Dennis Shen, Peng Ding, Jasjeet Sekhon, Bin Yu

    Abstract: A central goal in social science is to evaluate the causal effect of a policy. One dominant approach is through panel data analysis in which the behaviors of multiple units are observed over time. The information across time and space motivates two general approaches: (i) horizontal regression (i.e., unconfoundedness), which exploits time series patterns, and (ii) vertical regression (e.g., synthe… ▽ More

    Submitted 8 October, 2022; v1 submitted 29 July, 2022; originally announced July 2022.

  5. arXiv:2109.15154  [pdf, other

    econ.EM cs.LG math.ST stat.ML

    Causal Matrix Completion

    Authors: Anish Agarwal, Munther Dahleh, Devavrat Shah, Dennis Shen

    Abstract: Matrix completion is the study of recovering an underlying matrix from a sparse subset of noisy observations. Traditionally, it is assumed that the entries of the matrix are "missing completely at random" (MCAR), i.e., each entry is revealed at random, independent of everything else, with uniform probability. This is likely unrealistic due to the presence of "latent confounders", i.e., unobserved… ▽ More

    Submitted 30 September, 2021; originally announced September 2021.

  6. arXiv:2011.03127  [pdf, other

    stat.ME

    Causal Imputation via Synthetic Interventions

    Authors: Chandler Squires, Dennis Shen, Anish Agarwal, Devavrat Shah, Caroline Uhler

    Abstract: Consider the problem of determining the effect of a compound on a specific cell type. To answer this question, researchers traditionally need to run an experiment applying the drug of interest to that cell type. This approach is not scalable: given a large number of different actions (compounds) and a large number of different contexts (cell types), it is infeasible to run an experiment for every… ▽ More

    Submitted 11 June, 2023; v1 submitted 5 November, 2020; originally announced November 2020.

  7. arXiv:2011.00593  [pdf, other

    cs.CL stat.ML

    MixKD: Towards Efficient Distillation of Large-scale Language Models

    Authors: Kevin J Liang, Weituo Hao, Dinghan Shen, Yufan Zhou, Weizhu Chen, Changyou Chen, Lawrence Carin

    Abstract: Large-scale language models have recently demonstrated impressive empirical performance. Nevertheless, the improved results are attained at the price of bigger models, more power consumption, and slower inference, which hinder their applicability to low-resource (both memory and computation) platforms. Knowledge distillation (KD) has been demonstrated as an effective framework for compressing such… ▽ More

    Submitted 17 March, 2021; v1 submitted 1 November, 2020; originally announced November 2020.

    Comments: ICLR 2021 Camera Ready

  8. arXiv:2010.14449  [pdf, other

    math.ST cs.LG stat.ML

    On Model Identification and Out-of-Sample Prediction of Principal Component Regression: Applications to Synthetic Controls

    Authors: Anish Agarwal, Devavrat Shah, Dennis Shen

    Abstract: We analyze principal component regression (PCR) in a high-dimensional error-in-variables setting with fixed design. Under suitable conditions, we show that PCR consistently identifies the unique model with minimum $\ell_2$-norm. These results enable us to establish non-asymptotic out-of-sample prediction guarantees that improve upon the best known rates. In the course of our analysis, we introduce… ▽ More

    Submitted 25 August, 2023; v1 submitted 27 October, 2020; originally announced October 2020.

  9. arXiv:2007.16103  [pdf, other

    cs.LG cs.CV stat.ML

    Learning-based Computer-aided Prescription Model for Parkinson's Disease: A Data-driven Perspective

    Authors: Yinghuan Shi, Wanqi Yang, Kim-Han Thung, Hao Wang, Yang Gao, Yang Pan, Li Zhang, Dinggang Shen

    Abstract: In this paper, we study a novel problem: "automatic prescription recommendation for PD patients." To realize this goal, we first build a dataset by collecting 1) symptoms of PD patients, and 2) their prescription drug provided by neurologists. Then, we build a novel computer-aided prescription model by learning the relation between observed symptoms and prescription drug. Finally, for the new comi… ▽ More

    Submitted 31 July, 2020; originally announced July 2020.

    Comments: IEEE JBHI 2020

  10. arXiv:2006.08858  [pdf, other

    cs.LG cs.CL stat.ML

    Generative Semantic Hashing Enhanced via Boltzmann Machines

    Authors: Lin Zheng, Qinliang Su, Dinghan Shen, Changyou Chen

    Abstract: Generative semantic hashing is a promising technique for large-scale information retrieval thanks to its fast retrieval speed and small memory footprint. For the tractability of training, existing generative-hashing methods mostly assume a factorized form for the posterior distribution, enforcing independence among the bits of hash codes. From the perspectives of both model representation and code… ▽ More

    Submitted 15 June, 2020; originally announced June 2020.

  11. arXiv:2006.07691  [pdf, other

    econ.EM cs.LG stat.ML

    Synthetic Interventions

    Authors: Anish Agarwal, Devavrat Shah, Dennis Shen

    Abstract: Consider a setting with $N$ heterogeneous units (e.g., individuals, sub-populations) and $D$ interventions (e.g., socio-economic policies). Our goal is to learn the expected potential outcome associated with every intervention on every unit, totaling $N \times D$ causal parameters. Towards this, we present a causal framework, synthetic interventions (SI), to infer these $N \times D$ causal paramet… ▽ More

    Submitted 31 October, 2023; v1 submitted 13 June, 2020; originally announced June 2020.

  12. arXiv:2006.00693  [pdf, other

    cs.LG stat.ML

    Improving Disentangled Text Representation Learning with Information-Theoretic Guidance

    Authors: Pengyu Cheng, Martin Renqiang Min, Dinghan Shen, Christopher Malon, Yizhe Zhang, Yitong Li, Lawrence Carin

    Abstract: Learning disentangled representations of natural language is essential for many NLP tasks, e.g., conditional text generation, style transfer, personalized dialogue systems, etc. Similar problems have been studied extensively for other forms of data, such as images and videos. However, the discrete nature of natural language makes the disentangling of textual representations more challenging (e.g.,… ▽ More

    Submitted 12 January, 2022; v1 submitted 31 May, 2020; originally announced June 2020.

    Comments: Accepted by the 58th Annual Meeting of the Association for Computational Linguistics (ACL2020)

  13. arXiv:2005.00072  [pdf, other

    econ.EM cs.LG stat.AP

    Two Burning Questions on COVID-19: Did shutting down the economy help? Can we (partially) reopen the economy without risking the second wave?

    Authors: Anish Agarwal, Abdullah Alomar, Arnab Sarker, Devavrat Shah, Dennis Shen, Cindy Yang

    Abstract: As we reach the apex of the COVID-19 pandemic, the most pressing question facing us is: can we even partially reopen the economy without risking a second wave? We first need to understand if shutting down the economy helped. And if it did, is it possible to achieve similar gains in the war against the pandemic while partially opening up the economy? To do so, it is critical to understand the effec… ▽ More

    Submitted 10 May, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

  14. arXiv:1911.06156  [pdf, other

    cs.CL cs.LG stat.ML

    Syntax-Infused Transformer and BERT models for Machine Translation and Natural Language Understanding

    Authors: Dhanasekar Sundararaman, Vivek Subramanian, Guoyin Wang, Shi**g Si, Dinghan Shen, Dong Wang, Lawrence Carin

    Abstract: Attention-based models have shown significant improvement over traditional algorithms in several NLP tasks. The Transformer, for instance, is an illustrative example that generates abstract representations of tokens inputted to an encoder based on their relationships to all tokens in a sequence. Recent studies have shown that although such models are capable of learning syntactic features purely b… ▽ More

    Submitted 9 November, 2019; originally announced November 2019.

  15. arXiv:1910.02176  [pdf, other

    cs.LG stat.ML

    Straight-Through Estimator as Projected Wasserstein Gradient Flow

    Authors: Pengyu Cheng, Chang Liu, Chunyuan Li, Dinghan Shen, Ricardo Henao, Lawrence Carin

    Abstract: The Straight-Through (ST) estimator is a widely used technique for back-propagating gradients through discrete random variables. However, this effective method lacks theoretical justification. In this paper, we show that ST can be interpreted as the simulation of the projected Wasserstein gradient flow (pWGF). Based on this understanding, a theoretical foundation is established to justify the conv… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

    Comments: Accepted as NeurIPS 2018 Bayesian Deep Learning Workshop

  16. arXiv:1907.04924  [pdf, other

    cs.IR cs.LG stat.ML

    Infer Implicit Contexts in Real-time Online-to-Offline Recommendation

    Authors: Xichen Ding, Jie Tang, Tracy Liu, Cheng Xu, Ya** Zhang, Feng Shi, Qixia Jiang, Dan Shen

    Abstract: Understanding users' context is essential for successful recommendations, especially for Online-to-Offline (O2O) recommendation, such as Yelp, Groupon, and Koubei. Different from traditional recommendation where individual preference is mostly static, O2O recommendation should be dynamic to capture variation of users' purposes across time and location. However, precisely inferring users' real-time… ▽ More

    Submitted 8 July, 2019; originally announced July 2019.

    Comments: 9 pages,KDD,KDD2019

  17. arXiv:1906.02181  [pdf, other

    stat.ML cs.CL cs.LG

    Syntax-Infused Variational Autoencoder for Text Generation

    Authors: Xinyuan Zhang, Yi Yang, Siyang Yuan, Dinghan Shen, Lawrence Carin

    Abstract: We present a syntax-infused variational autoencoder (SIVAE), that integrates sentences with their syntactic trees to improve the grammar of generated sentences. Distinct from existing VAE-based text generative models, SIVAE contains two separate latent spaces, for sentences and syntactic trees. The evidence lower bound objective is redesigned correspondingly, by optimizing a joint distribution tha… ▽ More

    Submitted 5 June, 2019; originally announced June 2019.

    Comments: Accepted by ACL 2019

  18. arXiv:1905.06400  [pdf, other

    stat.ME econ.EM

    mRSC: Multi-dimensional Robust Synthetic Control

    Authors: Muhummad Amjad, Vishal Misra, Devavrat Shah, Dennis Shen

    Abstract: When evaluating the impact of a policy on a metric of interest, it may not be possible to conduct a randomized control trial. In settings where only observational data is available, Synthetic Control (SC) methods provide a popular data-driven approach to estimate a "synthetic" control by combining measurements of "similar" units (donors). Recently, Robust SC (RSC) was proposed as a generalization… ▽ More

    Submitted 23 September, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

  19. arXiv:1902.10920  [pdf, other

    cs.LG stat.ML

    On Robustness of Principal Component Regression

    Authors: Anish Agarwal, Devavrat Shah, Dennis Shen, Dogyoon Song

    Abstract: Principal component regression (PCR) is a simple, but powerful and ubiquitously utilized method. Its effectiveness is well established when the covariates exhibit low-rank structure. However, its ability to handle settings with noisy, missing, and mixed-valued, i.e., discrete and continuous, covariates is not understood and remains an important open challenge. As the main contribution of this work… ▽ More

    Submitted 19 May, 2021; v1 submitted 28 February, 2019; originally announced February 2019.

  20. Population-Guided Large Margin Classifier for High-Dimension Low -Sample-Size Problems

    Authors: Qingbo Yin, Ehsan Adeli, Liran Shen, Dinggang Shen

    Abstract: Various applications in different fields, such as gene expression analysis or computer vision, suffer from data sets with high-dimensional low-sample-size (HDLSS), which has posed significant challenges for standard statistical and modern machine learning methods. In this paper, we propose a novel linear binary classifier, denoted by population-guided large margin classifier (PGLMC), which is appl… ▽ More

    Submitted 25 January, 2021; v1 submitted 5 January, 2019; originally announced January 2019.

    Journal ref: Pattern Recognition, vol. 97, pp. 107030, 2020/01/01/, 2020

  21. arXiv:1812.04103  [pdf, other

    cs.CV cs.LG stat.AP stat.ML

    Non-local U-Net for Biomedical Image Segmentation

    Authors: Zhengyang Wang, Na Zou, Dinggang Shen, Shuiwang Ji

    Abstract: Deep learning has shown its great promise in various biomedical image segmentation tasks. Existing models are typically based on U-Net and rely on an encoder-decoder architecture with stacked local operators to aggregate long-range information gradually. However, only using the local operators limits the efficiency and effectiveness. In this work, we propose the non-local U-Nets, which are equippe… ▽ More

    Submitted 18 February, 2020; v1 submitted 10 December, 2018; originally announced December 2018.

    Comments: In Proceedings of the 34th AAAI Conference on Artificial Intelligence (AAAI), 2019

  22. arXiv:1811.02629  [pdf, other

    cs.CV cs.AI cs.LG stat.ML

    Identifying the Best Machine Learning Algorithms for Brain Tumor Segmentation, Progression Assessment, and Overall Survival Prediction in the BRATS Challenge

    Authors: Spyridon Bakas, Mauricio Reyes, Andras Jakab, Stefan Bauer, Markus Rempfler, Alessandro Crimi, Russell Takeshi Shinohara, Christoph Berger, Sung Min Ha, Martin Rozycki, Marcel Prastawa, Esther Alberts, Jana Lipkova, John Freymann, Justin Kirby, Michel Bilello, Hassan Fathallah-Shaykh, Roland Wiest, Jan Kirschke, Benedikt Wiestler, Rivka Colen, Aikaterini Kotrotsou, Pamela Lamontagne, Daniel Marcus, Mikhail Milchenko , et al. (402 additional authors not shown)

    Abstract: Gliomas are the most common primary brain malignancies, with different degrees of aggressiveness, variable prognosis and various heterogeneous histologic sub-regions, i.e., peritumoral edematous/invaded tissue, necrotic core, active and non-enhancing core. This intrinsic heterogeneity is also portrayed in their radio-phenotype, as their sub-regions are depicted by varying intensity profiles dissem… ▽ More

    Submitted 23 April, 2019; v1 submitted 5 November, 2018; originally announced November 2018.

    Comments: The International Multimodal Brain Tumor Segmentation (BraTS) Challenge

  23. arXiv:1807.09157  [pdf, other

    stat.AP

    Robust Group Comparison Using Non-Parametric Block-Based Statistics

    Authors: Geng Chen, Pei Zhang, Ke Li, Chong-Yaw Wee, Wenliang Pan, Yafeng Wu, Panteleimon Giannakopoulos, Sven Haller, Dinggang Shen, Pew-Thian Yap

    Abstract: Voxel-based analysis methods localize brain structural differences by performing voxel-wise statistical comparisons on two groups of images aligned to a common space. This procedure requires highly accurate registration as well as a sufficiently large dataset. However, in practice, the registration algorithms are not perfect due to noise, artifacts, and complex structural variations. The sample si… ▽ More

    Submitted 24 July, 2018; originally announced July 2018.

    Comments: 17 pages, 9 figures

  24. arXiv:1805.09906  [pdf, other

    cs.CL cs.SI stat.ML

    Diffusion Maps for Textual Network Embedding

    Authors: Xinyuan Zhang, Yitong Li, Dinghan Shen, Lawrence Carin

    Abstract: Textual network embedding leverages rich text information associated with the network to learn low-dimensional vectorial representations of vertices. Rather than using typical natural language processing (NLP) approaches, recent research exploits the relationship of texts on the same edge to graphically embed text. However, these models neglect to measure the complete level of connectivity between… ▽ More

    Submitted 14 January, 2019; v1 submitted 24 May, 2018; originally announced May 2018.

    Comments: This paper is a spotlight paper of NeurIPS 2018

  25. arXiv:1802.09064  [pdf, other

    cs.LG stat.ML

    Model Agnostic Time Series Analysis via Matrix Estimation

    Authors: Anish Agarwal, Muhammad Jehangir Amjad, Devavrat Shah, Dennis Shen

    Abstract: We propose an algorithm to impute and forecast a time series by transforming the observed time series into a matrix, utilizing matrix estimation to recover missing values and de-noise observed entries, and performing linear regression to make predictions. At the core of our analysis is a representation result, which states that for a large model class, the transformed time series matrix is (approx… ▽ More

    Submitted 26 April, 2019; v1 submitted 25 February, 2018; originally announced February 2018.

  26. arXiv:1711.06940  [pdf, other

    econ.EM stat.AP stat.ML

    Robust Synthetic Control

    Authors: Muhammad Jehangir Amjad, Devavrat Shah, Dennis Shen

    Abstract: We present a robust generalization of the synthetic control method for comparative case studies. Like the classical method, we present an algorithm to estimate the unobservable counterfactual of a treatment unit. A distinguishing feature of our algorithm is that of de-noising the data matrix via singular value thresholding, which renders our approach robust in multiple facets: it automatically ide… ▽ More

    Submitted 18 November, 2017; originally announced November 2017.

  27. arXiv:1709.08294  [pdf, other

    cs.CL cs.LG stat.ML

    Learning Context-Sensitive Convolutional Filters for Text Processing

    Authors: Dinghan Shen, Martin Renqiang Min, Yitong Li, Lawrence Carin

    Abstract: Convolutional neural networks (CNNs) have recently emerged as a popular building block for natural language processing (NLP). Despite their success, most existing CNN models employed in NLP share the same learned (and static) set of filters for all input sentences. In this paper, we consider an approach of using a small meta network to learn context-sensitive convolutional filters for text process… ▽ More

    Submitted 30 August, 2018; v1 submitted 24 September, 2017; originally announced September 2017.

    Comments: Accepted by EMNLP 2018 as a full paper

  28. arXiv:1709.07109  [pdf, other

    cs.CL cs.LG stat.ML

    Deconvolutional Latent-Variable Model for Text Sequence Matching

    Authors: Dinghan Shen, Yizhe Zhang, Ricardo Henao, Qinliang Su, Lawrence Carin

    Abstract: A latent-variable model is introduced for text matching, inferring sentence representations by jointly optimizing generative and discriminative objectives. To alleviate typical optimization challenges in latent-variable models for text, we employ deconvolutional networks as the sequence decoder (generator), providing learned latent codes with more semantic information and better generalization. Ou… ▽ More

    Submitted 21 November, 2017; v1 submitted 20 September, 2017; originally announced September 2017.

    Comments: Accepted by AAAI-2018

  29. arXiv:1708.04729  [pdf, other

    cs.CL cs.LG stat.ML

    Deconvolutional Paragraph Representation Learning

    Authors: Yizhe Zhang, Dinghan Shen, Guoyin Wang, Zhe Gan, Ricardo Henao, Lawrence Carin

    Abstract: Learning latent representations from long text sequences is an important first step in many natural language processing applications. Recurrent Neural Networks (RNNs) have become a cornerstone for this challenging task. However, the quality of sentences during RNN-based decoding (reconstruction) decreases with the length of the text. We propose a sequence-to-sequence, purely convolutional and deco… ▽ More

    Submitted 22 September, 2017; v1 submitted 15 August, 2017; originally announced August 2017.

    Comments: Accepted by NIPS 2017

  30. arXiv:1706.03850  [pdf, other

    stat.ML cs.CL cs.LG

    Adversarial Feature Matching for Text Generation

    Authors: Yizhe Zhang, Zhe Gan, Kai Fan, Zhi Chen, Ricardo Henao, Dinghan Shen, Lawrence Carin

    Abstract: The Generative Adversarial Network (GAN) has achieved great success in generating realistic (real-valued) synthetic data. However, convergence issues and difficulties dealing with discrete data hinder the applicability of GAN to text. We propose a framework for generating realistic text via adversarial training. We employ a long short-term memory network as generator, and a convolutional network a… ▽ More

    Submitted 18 November, 2017; v1 submitted 12 June, 2017; originally announced June 2017.

    Comments: Accepted by ICML 2017

  31. arXiv:1412.6592  [pdf, other

    stat.ME

    Tensor Generalized Estimating Equations for Longitudinal Imaging Analysis

    Authors: Xiang Zhang, Lexin Li, Hua Zhou, Dinggang Shen, the Alzheimer's Disease Neuroimaging Initiative

    Abstract: In an increasing number of neuroimaging studies, brain images, which are in the form of multidimensional arrays (tensors), have been collected on multiple subjects at multiple time points. Of scientific interest is to analyze such massive and complex longitudinal images to diagnose neurodegenerative disorders and to identify disease relevant brain regions. In this article, we treat those problems… ▽ More

    Submitted 19 December, 2014; originally announced December 2014.

    Comments: 40 pages, 4 figures, 2 tables

  32. arXiv:1211.2679  [pdf, other

    stat.AP

    High Dimensional Principal Component Scores and Data Visualization

    Authors: Dan Shen, Haipeng Shen, Hongtu Zhu, J. S. Marron

    Abstract: Principal component analysis is a useful dimension reduction and data visualization method. However, in high dimension, low sample size asymptotic contexts, where the sample size is fixed and the dimension goes to infinity,a paradox has arisen. In particular, despite the useful real data insights commonly obtained from principal component score visualization, these scores are not consistent even w… ▽ More

    Submitted 19 November, 2012; v1 submitted 12 November, 2012; originally announced November 2012.