Showing 1–26 of 26 results for author: Patel, A

  1. arXiv:2403.05336  [pdf, other]

    stat.ME stat.AP

    Estimating time-varying exposure effects through continuous-time modelling in Mendelian randomization

    Authors: Haodong Tian, Ashish Patel, Stephen Burgess

    Abstract: Mendelian randomization is an instrumental variable method that utilizes genetic information to investigate the causal effect of a modifiable exposure on an outcome. In most cases, the exposure changes over time. Understanding the time-varying causal effect of the exposure can yield detailed insights into mechanistic effects and the potential impact of public health interventions. Recently, a grow… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  2. arXiv:2402.12171  [pdf, other]

    stat.ME q-bio.GN

    A frequentist test of proportional colocalization after selecting relevant genetic variants

    Authors: Ashish Patel, John C. Whittaker, Stephen Burgess

    Abstract: Colocalization analyses assess whether two traits are affected by the same or distinct causal genetic variants in a single gene region. A class of Bayesian colocalization tests are now routinely used in practice; for example, for genetic analyses in drug development pipelines. In this work, we consider an alternative frequentist approach to colocalization testing that examines the proportionality… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  3. arXiv:2310.18278  [pdf, other]

    q-bio.BM physics.bio-ph physics.chem-ph stat.ML

    Navigating protein landscapes with a machine-learned transferable coarse-grained model

    Authors: Nicholas E. Charron, Felix Musil, Andrea Guljas, Yaoyi Chen, Klara Bonneau, Aldo S. Pasos-Trejo, Jacopo Venturin, Daria Gusew, Iryna Zaporozhets, Andreas Krämer, Clark Templeton, Atharva Kelkar, Aleksander E. P. Durumeric, Simon Olsson, Adrià Pérez, Maciej Majewski, Brooke E. Husic, Ankit Patel, Gianni De Fabritiis, Frank Noé, Cecilia Clementi

    Abstract: The most popular and universally predictive protein simulation models employ all-atom molecular dynamics (MD), but they come at extreme computational cost. The development of a universal, computationally efficient coarse-grained (CG) model with similar prediction performance has been a long-standing challenge. By combining recent deep learning methods with a large and diverse training set of all-a… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

  4. arXiv:2302.03750  [pdf, other]

    cs.CV cs.LG stat.ME

    Linking convolutional kernel size to generalization bias in face analysis CNNs

    Authors: Hao Liang, Josue Ortega Caro, Vikram Maheshri, Ankit B. Patel, Guha Balakrishnan

    Abstract: Training dataset biases are by far the most scrutinized factors when explaining algorithmic biases of neural networks. In contrast, hyperparameters related to the neural network architecture have largely been ignored even though different network parameterizations are known to induce different implicit biases over learned features. For example, convolutional kernel size is known to affect the freq… ▽ More

    Submitted 3 December, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: WACV 2024

  5. arXiv:2109.12361  [pdf, ps, other]

    stat.ME

    Disentangling the effects of traits with shared clustered genetic predictors using multivariable Mendelian randomization

    Authors: Fatima Batool, Ashish Patel, Dipender Gill, Stephen Burgess

    Abstract: When genetic variants in a gene cluster are associated with a disease outcome, the causal pathway from the variants to the outcome can be difficult to disentangle. For example, the chemokine receptor gene cluster contains genetic variants associated with various cytokines. Associations between variants in this cluster and stroke risk may be driven by any of these cytokines. Multivariable Mendelian… ▽ More

    Submitted 2 October, 2021; v1 submitted 25 September, 2021; originally announced September 2021.

  6. arXiv:2107.01513  [pdf, other]

    stat.ME

    Selecting invalid instruments to improve Mendelian randomization with two-sample summary data

    Authors: Ashish Patel, Francis J. DiTraglia, Verena Zuber, Stephen Burgess

    Abstract: Mendelian randomization (MR) is a widely-used method to estimate the causal relationship between a risk factor and disease. A fundamental part of any MR analysis is to choose appropriate genetic variants as instrumental variables. Genome-wide association studies often reveal that hundreds of genetic variants may be robustly associated with a risk factor, but in some situations investigators may ha… ▽ More

    Submitted 25 April, 2023; v1 submitted 3 July, 2021; originally announced July 2021.

    Comments: 44 pages, 13 figures

  7. arXiv:2008.01772  [pdf]

    cs.LG stat.ML

    Shallow Univariate ReLu Networks as Splines: Initialization, Loss Surface, Hessian, & Gradient Flow Dynamics

    Authors: Justin Sahs, Ryan Pyle, Aneel Damaraju, Josue Ortega Caro, Onur Tavaslioglu, Andy Lu, Ankit Patel

    Abstract: Understanding the learning dynamics and inductive bias of neural networks (NNs) is hindered by the opacity of the relationship between NN parameters and the function represented. We propose reparametrizing ReLU NNs as continuous piecewise linear splines. Using this spline lens, we study learning dynamics in shallow univariate ReLU NNs, finding unexpected insights and explanations for several perpl… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

    Comments: 14 pages, 4 figures in main text

    ACM Class: I.2.0

  8. arXiv:2006.11440  [pdf, other]

    stat.ML cs.LG

    Local Convolutions Cause an Implicit Bias towards High Frequency Adversarial Examples

    Authors: Josue Ortega Caro, Yilong Ju, Ryan Pyle, Sourav Dey, Wieland Brendel, Fabio Anselmi, Ankit Patel

    Abstract: Adversarial Attacks are still a significant challenge for neural networks. Recent work has shown that adversarial perturbations typically contain high-frequency features, but the root cause of this phenomenon remains unknown. Inspired by theoretical work on linear full-width convolutional models, we hypothesize that the local (i.e. bounded-width) convolutional operations commonly used in current n… ▽ More

    Submitted 8 March, 2023; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: 23 pages, 11 figures, 12 Tables

  9. arXiv:2006.09286  [pdf, other]

    cs.LG cs.CL stat.ML

    On the Computational Power of Transformers and its Implications in Sequence Modeling

    Authors: Satwik Bhattamishra, Arkil Patel, Navin Goyal

    Abstract: Transformers are being used extensively across several sequence modeling tasks. Significant research effort has been devoted to experimentally probe the inner workings of Transformers. However, our conceptual and theoretical understanding of their power and inherent limitations is still nascent. In particular, the roles of various components in Transformers such as positional encodings, attention… ▽ More

    Submitted 10 October, 2020; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: CoNLL 2020

  10. arXiv:2006.07460  [pdf, other]

    cs.LG stat.ML

    An Improved Semi-Supervised VAE for Learning Disentangled Representations

    Authors: Weili Nie, Zichao Wang, Ankit B. Patel, Richard G. Baraniuk

    Abstract: Learning interpretable and disentangled representations is a crucial yet challenging task in representation learning. In this work, we focus on semi-supervised disentanglement learning and extend work by Locatello et al. (2019) by introducing another source of supervision that we denote as label replacement. Specifically, during training, we replace the inferred representation associated with a da… ▽ More

    Submitted 22 June, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

  11. arXiv:2005.08033  [pdf, other]

    cs.LG stat.ML

    Towards classification parity across cohorts

    Authors: Aarsh Patel, Rahul Gupta, Mukund Harakere, Satyapriya Krishna, Aman Alok, Peng Liu

    Abstract: Recently, there has been a lot of interest in ensuring algorithmic fairness in machine learning where the central question is how to prevent sensitive information (e.g. knowledge about the ethnic group of an individual) from adding "unfair" bias to a learning algorithm (Feldman et al. (2015), Zemel et al. (2013)). This has led to several debiasing algorithms on word embeddings (Qian et al. (2019)… ▽ More

    Submitted 16 May, 2020; originally announced May 2020.

    Comments: Published in ML-IRL ICLR 2020 workshop

  12. arXiv:2005.01765  [pdf, other]

    stat.ME

    Conditional inference in cis-Mendelian randomization using weak genetic factors

    Authors: Ashish Patel, Dipender Gill, Paul J. Newcombe, Stephen Burgess

    Abstract: Mendelian randomization is a widely-used method to estimate the unconfounded effect of an exposure on an outcome by using genetic variants as instrumental variables. Mendelian randomization analyses which use variants from a single genetic region (cis-MR) have gained popularity for being an economical way to provide supporting evidence for drug target validation. This paper proposes methods for ci… ▽ More

    Submitted 31 December, 2021; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: 34 pages, 6 figures

  13. arXiv:2004.12028  [pdf, other]

    stat.ME cs.LG q-bio.QM stat.ML

    Two-Stage Penalized Regression Screening to Detect Biomarker-Treatment Interactions in Randomized Clinical Trials

    Authors: Jixiong Wang, Ashish Patel, James M. S. Wason, Paul J. Newcombe

    Abstract: High-dimensional biomarkers such as genomics are increasingly being measured in randomized clinical trials. Consequently, there is a growing interest in developing methods that improve the power to detect biomarker-treatment interactions. We adapt recently proposed two-stage interaction detecting procedures in the setting of randomized clinical trials. We also propose a new stage 1 multivariate sc… ▽ More

    Submitted 28 April, 2021; v1 submitted 24 April, 2020; originally announced April 2020.

    Comments: Accepted version, to be published in Biometrics

  14. arXiv:1911.04389  [pdf, other]

    cs.LG stat.ML

    A Biologically Plausible Benchmark for Contextual Bandit Algorithms in Precision Oncology Using in vitro Data

    Authors: Niklas T. Rindtorff, MingYu Lu, Nisarg A. Patel, Huahua Zheng, Alexander D'Amour

    Abstract: Precision oncology, the genetic sequencing of tumors to identify druggable targets, has emerged as the standard of care in the treatment of many cancers. Nonetheless, due to the pace of therapy development and variability in patient information, designing effective protocols for individual treatment assignment in a sample-efficient way remains a major challenge. One promising approach to this prob… ▽ More

    Submitted 11 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract

  15. arXiv:1908.01760  [pdf, other]

    cs.LG stat.ML

    The Myths of Our Time: Fake News

    Authors: Vít Růžička, Eunsu Kang, David Gordon, Ankita Patel, Jacqui Fashimpaur, Manzil Zaheer

    Abstract: While the purpose of most fake news is misinformation and political propaganda, our team sees it as a new type of myth that is created by people in the age of internet identities and artificial intelligence. Seeking insights on the fear and desire hidden underneath these modified or generated stories, we use machine learning methods to generate fake articles and present them in the form of an onli… ▽ More

    Submitted 5 August, 2019; originally announced August 2019.

    Comments: 5 pages, 5 figures, in proceedings of International Symposium on Electronic Art 2019 (ISEA)

    Journal ref: Proceedings of International Symposium on Electronic Art 2019 (ISEA), pages 494-498

  16. arXiv:1904.07032  [pdf]

    q-bio.QM cs.LG stat.ML

    Deep neural networks can predict mortality from 12-lead electrocardiogram voltage data

    Authors: Sushravya Raghunath, Alvaro E. Ulloa Cerna, Linyuan Jing, David P. vanMaanen, Joshua Stough, Dustin N. Hartzel, Joseph B. Leader, H. Lester Kirchner, Christopher W. Good, Aalpen A. Patel, Brian P. Delisle, Amro Alsaid, Dominik Beer, Christopher M. Haggerty, Brandon K. Fornwalt

    Abstract: The electrocardiogram (ECG) is a widely-used medical test, typically consisting of 12 voltage versus time traces collected from surface recordings over the heart. Here we hypothesize that a deep neural network can predict an important future clinical event (one-year all-cause mortality) from ECG voltage-time traces. We show good performance for predicting one-year mortality with an average AUC of… ▽ More

    Submitted 11 May, 2020; v1 submitted 15 April, 2019; originally announced April 2019.

    Comments: An updated version of this paper is now published with Nature Medicine (2020)

  17. arXiv:1901.08125  [pdf, other]

    cs.LG stat.ML

    Interpretable Neural Networks for Predicting Mortality Risk using Multi-modal Electronic Health Records

    Authors: Alvaro E. Ulloa Cerna, Marios Pattichis, David P. vanMaanen, Linyuan Jing, Aalpen A. Patel, Joshua V. Stough, Christopher M. Haggerty, Brandon K. Fornwalt

    Abstract: We present an interpretable neural network for predicting an important clinical outcome (1-year mortality) from multi-modal Electronic Health Record (EHR) data. Our approach builds on prior multi-modal machine learning models by now enabling visualization of how individual factors contribute to the overall outcome risk, assuming other factors remain constant, which was previously impossible. We… ▽ More

    Submitted 23 January, 2019; originally announced January 2019.

    Comments: Submitted to IEEE JBHI

    Journal ref: IEEE Journal of Biomedical and Health Informatics, 2019

  18. arXiv:1812.10234  [pdf, other]

    cs.LG cs.CL stat.ML

    A New Concept of Deep Reinforcement Learning based Augmented General Sequence Tagging System

    Authors: Yu Wang, Abhishek Patel, Hongxia Jin

    Abstract: In this paper, a new deep reinforcement learning based augmented general sequence tagging system is proposed. The new system contains two parts: a deep neural network (DNN) based sequence tagging model and a deep reinforcement learning (DRL) based augmented tagger. The augmented tagger helps improve system performance by modeling the data with minority tags. The new system is evaluated on SLU and… ▽ More

    Submitted 26 December, 2018; originally announced December 2018.

    Comments: Published at 2018 COLING

  19. arXiv:1811.02657  [pdf, other]

    cs.CV cs.AI cs.LG cs.NE stat.ML

    A Bayesian Perspective of Convolutional Neural Networks through a Deconvolutional Generative Model

    Authors: Tan Nguyen, Nhat Ho, Ankit Patel, Anima Anandkumar, Michael I. Jordan, Richard G. Baraniuk

    Abstract: Inspired by the success of Convolutional Neural Networks (CNNs) for supervised prediction in images, we design the Deconvolutional Generative Model (DGM), a new probabilistic generative model whose inference calculations correspond to those in a given CNN architecture. The DGM uses a CNN to design the prior distribution in the probabilistic model. Furthermore, the DGM generates images from coarse… ▽ More

    Submitted 9 December, 2019; v1 submitted 31 October, 2018; originally announced November 2018.

    Comments: Keywords: neural nets, generative models, semi-supervised learning, cross-entropy, statistical guarantees 80 pages, 7 figures, 8 tables

  20. arXiv:1806.09235  [pdf, other]

    stat.ML cs.LG

    Towards a Better Understanding and Regularization of GAN Training Dynamics

    Authors: Weili Nie, Ankit Patel

    Abstract: Generative adversarial networks (GANs) are notoriously difficult to train and the reasons underlying their (non-)convergence behaviors are still not completely understood. By first considering a simple yet representative GAN example, we mathematically analyze its local convergence behavior in a non-asymptotic way. Furthermore, the analysis is extended to general GANs under certain assumptions. We… ▽ More

    Submitted 1 July, 2019; v1 submitted 24 June, 2018; originally announced June 2018.

    Comments: UAI 2019

  21. arXiv:1712.09117  [pdf, other]

    eess.AS cs.SD stat.ML

    Overcomplete Frame Thresholding for Acoustic Scene Analysis

    Authors: Romain Cosentino, Randall Balestriero, Richard Baraniuk, Ankit Patel

    Abstract: In this work, we derive a generic overcomplete frame thresholding scheme based on risk minimization. Overcomplete frames being favored for analysis tasks such as classification, regression or anomaly detection, we provide a way to leverage those optimal representations in real-world applications through the use of thresholding. We validate the method on a large scale bird activity detection task v… ▽ More

    Submitted 25 December, 2017; originally announced December 2017.

  22. arXiv:1709.02280  [pdf, other]

    stat.ML cs.PF cs.SE

    Transfer Learning for Performance Modeling of Configurable Systems: An Exploratory Analysis

    Authors: Pooyan Jamshidi, Norbert Siegmund, Miguel Velez, Christian Kästner, Akshay Patel, Yuvraj Agarwal

    Abstract: Modern software systems provide many configuration options which significantly influence their non-functional properties. To understand and predict the effect of configuration options, several sampling and learning strategies have been proposed, albeit often with significant cost to cover the highly dimensional configuration space. Recently, transfer learning has been applied to reduce the effort… ▽ More

    Submitted 7 September, 2017; originally announced September 2017.

    Comments: To appear in 32nd IEEE/ACM International Conference on Automated Software Engineering (ASE 2017), 12 pages

  23. arXiv:1612.01942  [pdf, other]

    stat.ML cs.LG cs.NE

    Semi-Supervised Learning with the Deep Rendering Mixture Model

    Authors: Tan Nguyen, Wanjia Liu, Ethan Perez, Richard G. Baraniuk, Ankit B. Patel

    Abstract: Semi-supervised learning algorithms reduce the high cost of acquiring labeled training data by using both labeled and unlabeled data during learning. Deep Convolutional Networks (DCNs) have achieved great success in supervised tasks and as such have been widely employed in the semi-supervised learning. In this paper we leverage the recently developed Deep Rendering Mixture Model (DRMM), a probabil… ▽ More

    Submitted 6 December, 2016; originally announced December 2016.

  24. arXiv:1612.01936  [pdf, other]

    stat.ML cs.LG cs.NE

    A Probabilistic Framework for Deep Learning

    Authors: Ankit B. Patel, Tan Nguyen, Richard G. Baraniuk

    Abstract: We develop a probabilistic framework for deep learning based on the Deep Rendering Mixture Model (DRMM), a new generative probabilistic model that explicitly captures variations in data due to latent task nuisance variables. We demonstrate that max-sum inference in the DRMM yields an algorithm that exactly reproduces the operations in deep convolutional neural networks (DCNs), providing a first pri… ▽ More

    Submitted 6 December, 2016; originally announced December 2016.

    Comments: arXiv admin note: substantial text overlap with arXiv:1504.00641

  25. A Deep Learning Approach to Structured Signal Recovery

    Authors: Ali Mousavi, Ankit B. Patel, Richard G. Baraniuk

    Abstract: In this paper, we develop a new framework for sensing and recovering structured signals. In contrast to compressive sensing (CS) systems that employ linear measurements, sparse representations, and computationally complex convex/greedy algorithms, we introduce a deep learning framework that supports both linear and mildly nonlinear measurements, that learns a structured representation from trainin… ▽ More

    Submitted 17 August, 2015; originally announced August 2015.

    Journal ref: In Proceeding of 2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton)

  26. arXiv:1504.00641  [pdf, other]

    stat.ML cs.CV cs.LG cs.NE

    A Probabilistic Theory of Deep Learning

    Authors: Ankit B. Patel, Tan Nguyen, Richard G. Baraniuk

    Abstract: A grand challenge in machine learning is the development of computational algorithms that match or outperform humans in perceptual inference tasks that are complicated by nuisance variation. For instance, visual object recognition involves the unknown object position, orientation, and scale in object recognition while speech recognition involves the unknown voice pronunciation, pitch, and speed. R… ▽ More

    Submitted 2 April, 2015; originally announced April 2015.

    Comments: 56 pages, 6 figures, 2 tables

    Report number: Rice University Electrical and Computer Engineering Dept. Technical Report No 2015-1