Skip to main content

Showing 1–27 of 27 results for author: Singh, C

Searching in archive stat. Search in all archives.
.
  1. arXiv:2306.17547  [pdf, other

    physics.soc-ph stat.OT

    Spaces of innovation and venture formation: the case of biotech in the United Kingdom

    Authors: Francesco Marzolla, Przemysław Nowak, Rohit Sahasrabuddhe, Chakresh Singh, Matteo Straccamore, Erik Zhivkoplias, Elsa Arcaute

    Abstract: Patents serve as valuable indicators of innovation and provide insights into the spaces of innovation and venture formation within geographic regions. In this study, we utilise patent data to examine the dynamics of innovation and venture formation in the biotech sector across the United Kingdom (UK). By analysing patents, we identify key regions that drive biotech innovation in the UK. Our findin… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

  2. arXiv:2210.01848  [pdf, other

    cs.LG cs.AI cs.CL q-bio.NC stat.ML

    Explaining Patterns in Data with Language Models via Interpretable Autoprompting

    Authors: Chandan Singh, John X. Morris, Jyoti Aneja, Alexander M. Rush, Jianfeng Gao

    Abstract: Large language models (LLMs) have displayed an impressive ability to harness natural language to perform complex tasks. In this work, we explore whether we can leverage this learned ability to find and explain patterns in data. Specifically, given a pre-trained LLM and data examples, we introduce interpretable autoprompting (iPrompt), an algorithm that generates a natural-language string explainin… ▽ More

    Submitted 26 January, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: The two first authors contributed equally

  3. arXiv:2209.11799  [pdf, other

    cs.AI cs.CL cs.LG stat.ME

    Augmenting Interpretable Models with LLMs during Training

    Authors: Chandan Singh, Armin Askari, Rich Caruana, Jianfeng Gao

    Abstract: Recent large language models (LLMs) have demonstrated remarkable prediction performance for a growing array of tasks. However, their proliferation into high-stakes domains (e.g. medicine) and compute-limited settings has created a burgeoning need for interpretability and efficiency. We address this need by proposing Augmented Interpretable Models (Aug-imodels), a framework for leveraging the knowl… ▽ More

    Submitted 24 April, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Journal ref: Nature Communications, 2023

  4. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  5. arXiv:2205.15135  [pdf, other

    cs.LG cs.AI stat.AP stat.ME stat.ML

    Group Probability-Weighted Tree Sums for Interpretable Modeling of Heterogeneous Data

    Authors: Keyan Nasseri, Chandan Singh, James Duncan, Aaron Kornblith, Bin Yu

    Abstract: Machine learning in high-stakes domains, such as healthcare, faces two critical challenges: (1) generalizing to diverse data distributions given limited training data while (2) maintaining interpretability. To address these challenges, we propose an instance-weighted tree-sum method that effectively pools data across diverse groups to output a concise, rule-based model. Given distinct groups of in… ▽ More

    Submitted 30 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2201.11931

  6. arXiv:2202.00858  [pdf, other

    cs.LG cs.AI stat.AP stat.ME stat.ML

    Hierarchical Shrinkage: improving the accuracy and interpretability of tree-based methods

    Authors: Abhineet Agarwal, Yan Shuo Tan, Omer Ronen, Chandan Singh, Bin Yu

    Abstract: Tree-based models such as decision trees and random forests (RF) are a cornerstone of modern machine-learning practice. To mitigate overfitting, trees are typically regularized by a variety of techniques that modify their structure (e.g. pruning). We introduce Hierarchical Shrinkage (HS), a post-hoc algorithm that does not modify the tree structure, and instead regularizes the tree by shrinking th… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.

  7. arXiv:2201.11931  [pdf, other

    cs.LG cs.AI stat.AP stat.ME stat.ML

    Fast Interpretable Greedy-Tree Sums

    Authors: Yan Shuo Tan, Chandan Singh, Keyan Nasseri, Abhineet Agarwal, James Duncan, Omer Ronen, Matthew Epland, Aaron Kornblith, Bin Yu

    Abstract: Modern machine learning has achieved impressive prediction performance, but often sacrifices interpretability, a critical consideration in high-stakes domains such as medicine. In such settings, practitioners often use highly interpretable decision tree models, but these suffer from inductive bias against additive structure. To overcome this bias, we propose Fast Interpretable Greedy-Tree Sums (FI… ▽ More

    Submitted 8 July, 2023; v1 submitted 27 January, 2022; originally announced January 2022.

  8. arXiv:2108.08445  [pdf, ps, other

    stat.AP

    Seven Principles for Rapid-Response Data Science: Lessons Learned from Covid-19 Forecasting

    Authors: Bin Yu, Chandan Singh

    Abstract: In this article, we take a step back to distill seven principles out of our experience in the spring of 2020, when our 12-person rapid-response team used skills of data science and beyond to help distribute Covid PPE. This process included tap** into domain knowledge of epidemiology and medical logistics chains, curating a relevant data repository, develo** models for short-term county-level d… ▽ More

    Submitted 29 March, 2022; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: 4 pages, accepted in special issue of "Statistical Science" on COVID-19 Response

  9. arXiv:2108.06847  [pdf, other

    stat.ML cs.LG

    Interpreting and improving deep-learning models with reality checks

    Authors: Chandan Singh, Wooseok Ha, Bin Yu

    Abstract: Recent deep-learning models have achieved impressive predictive performance by learning complex functions of many variables, often at the cost of interpretability. This chapter covers recent work aiming to interpret models by attributing importance to features and feature groups for a single prediction. Importantly, the proposed attributions assign importance to interactions between features, in a… ▽ More

    Submitted 18 August, 2021; v1 submitted 15 August, 2021; originally announced August 2021.

  10. arXiv:2107.09145  [pdf, other

    stat.ML cs.LG

    Adaptive wavelet distillation from neural networks through interpretations

    Authors: Wooseok Ha, Chandan Singh, Francois Lanusse, Srigokul Upadhyayula, Bin Yu

    Abstract: Recent deep-learning models have achieved impressive prediction performance, but often sacrifice interpretability and computational efficiency. Interpretability is crucial in many disciplines, such as science and medicine, where models must be carefully vetted or where interpretation is the goal itself. Moreover, interpretable models are concise and often yield computational efficiency. Here, we p… ▽ More

    Submitted 26 August, 2021; v1 submitted 19 July, 2021; originally announced July 2021.

  11. arXiv:2103.13455  [pdf, other

    cs.CV cs.AI cs.LG stat.AP

    Matched sample selection with GANs for mitigating attribute confounding

    Authors: Chandan Singh, Guha Balakrishnan, Pietro Perona

    Abstract: Measuring biases of vision systems with respect to protected attributes like gender and age is critical as these systems gain widespread use in society. However, significant correlations between attributes in benchmark datasets make it difficult to separate algorithmic bias from dataset bias. To mitigate such attribute confounding during bias analysis, we propose a matching approach that selects a… ▽ More

    Submitted 24 March, 2021; originally announced March 2021.

  12. arXiv:2007.01420  [pdf, other

    cs.LG physics.comp-ph quant-ph stat.ML

    CoPhy-PGNN: Learning Physics-guided Neural Networks with Competing Loss Functions for Solving Eigenvalue Problems

    Authors: Mohannad Elhamod, Jie Bu, Christopher Singh, Matthew Redell, Abantika Ghosh, Viktor Podolskiy, Wei-Cheng Lee, Anuj Karpatne

    Abstract: Physics-guided Neural Networks (PGNNs) represent an emerging class of neural networks that are trained using physics-guided (PG) loss functions (capturing violations in network outputs with known physics), along with the supervision contained in data. Existing work in PGNNs has demonstrated the efficacy of adding single PG loss functions in the neural network objectives, using constant trade-off p… ▽ More

    Submitted 16 December, 2021; v1 submitted 2 July, 2020; originally announced July 2020.

  13. arXiv:2006.10189  [pdf, other

    cs.LG cs.IT math.ST stat.ML

    Revisiting minimum description length complexity in overparameterized models

    Authors: Raaz Dwivedi, Chandan Singh, Bin Yu, Martin J. Wainwright

    Abstract: Complexity is a fundamental concept underlying statistical learning theory that aims to inform generalization performance. Parameter count, while successful in low-dimensional settings, is not well-justified for overparameterized settings when the number of parameters is more than the number of training samples. We revisit complexity measures based on Rissanen's principle of minimum description le… ▽ More

    Submitted 12 October, 2023; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: First two authors contributed equally

  14. Curating a COVID-19 data repository and forecasting county-level death counts in the United States

    Authors: Nick Altieri, Rebecca L. Barter, James Duncan, Raaz Dwivedi, Karl Kumbier, Xiao Li, Robert Netzorg, Briton Park, Chandan Singh, Yan Shuo Tan, Tiffany Tang, Yu Wang, Chao Zhang, Bin Yu

    Abstract: As the COVID-19 outbreak evolves, accurate forecasting continues to play an extremely important role in informing policy decisions. In this paper, we present our continuous curation of a large data repository containing COVID-19 information from a range of sources. We use this data to develop predictions and corresponding prediction intervals for the short-term trajectory of COVID-19 cumulative de… ▽ More

    Submitted 9 August, 2020; v1 submitted 16 May, 2020; originally announced May 2020.

    Comments: Authors ordered alphabetically. All authors contributed significantly to this work. All collected data, modeling code, forecasts, and visualizations are updated daily and available at \url{https://github.com/Yu-Group/covid19-severity-prediction}

    Journal ref: Published in Harvard Data Science Review, 2020

  15. arXiv:2003.01926  [pdf, other

    stat.ML astro-ph.IM cs.LG

    Transformation Importance with Applications to Cosmology

    Authors: Chandan Singh, Wooseok Ha, Francois Lanusse, Vanessa Boehm, Jia Liu, Bin Yu

    Abstract: Machine learning lies at the heart of new possibilities for scientific discovery, knowledge generation, and artificial intelligence. Its potential benefits to these fields requires going beyond predictive accuracy and focusing on interpretability. In particular, many scientific problems require interpretations in a domain-specific interpretable feature space (e.g. the frequency domain) whereas att… ▽ More

    Submitted 14 June, 2021; v1 submitted 4 March, 2020; originally announced March 2020.

    Comments: Published in ICLR 2020 Workshop on Fundamental Science in the era of AI

  16. arXiv:1909.13584  [pdf, other

    cs.LG cs.CV stat.ML

    Interpretations are useful: penalizing explanations to align neural networks with prior knowledge

    Authors: Laura Rieger, Chandan Singh, W. James Murdoch, Bin Yu

    Abstract: For an explanation of a deep learning model to be effective, it must provide both insight into a model and suggest a corresponding action in order to achieve some objective. Too often, the litany of proposed explainable deep learning methods stop at the first step, providing practitioners with insight into a model, but no way to act on it. In this paper, we propose contextual decomposition explana… ▽ More

    Submitted 8 October, 2020; v1 submitted 30 September, 2019; originally announced September 2019.

    Comments: 18 pages; published in ICML2020; Erratum: numbers in table 1 were too high (now corrected) with the trend remaining the same

  17. arXiv:1908.08652  [pdf

    cs.LG cs.CV stat.ML

    MTCNET: Multi-task Learning Paradigm for Crowd Count Estimation

    Authors: Abhay Kumar, Nishant Jain, Suraj Tripathi, Chirag Singh, Kamal Krishna

    Abstract: We propose a Multi-Task Learning (MTL) paradigm based deep neural network architecture, called MTCNet (Multi-Task Crowd Network) for crowd density and count estimation. Crowd count estimation is challenging due to the non-uniform scale variations and the arbitrary perspective of an individual image. The proposed model has two related tasks, with Crowd Density Estimation as the main task and Crowd-… ▽ More

    Submitted 22 August, 2019; originally announced August 2019.

    Comments: 5 pages, 3 figures, Accepted in IEEE AVSS 2019

  18. arXiv:1906.09986  [pdf

    cs.CV cs.LG stat.ML

    Visual Context-aware Convolution Filters for Transformation-invariant Neural Network

    Authors: Suraj Tripathi, Abhay Kumar, Chirag Singh

    Abstract: We propose a novel visual context-aware filter generation module which incorporates contextual information present in images into Convolutional Neural Networks (CNNs). In contrast to traditional CNNs, we do not employ the same set of learned convolution filters for all input image instances. Our proposed input-conditioned convolution filters when combined with techniques inspired by Multi-instance… ▽ More

    Submitted 15 June, 2019; originally announced June 2019.

    Comments: Under-Review

  19. arXiv:1906.08873  [pdf, other

    cs.SD cs.LG eess.AS stat.ML

    Learning Discriminative features using Center Loss and Reconstruction as Regularizer for Speech Emotion Recognition

    Authors: Suraj Tripathi, Abhiram Ramesh, Abhay Kumar, Chirag Singh, Promod Yenigalla

    Abstract: This paper proposes a Convolutional Neural Network (CNN) inspired by Multitask Learning (MTL) and based on speech features trained under the joint supervision of softmax loss and center loss, a powerful metric learning strategy, for the recognition of emotion in speech. Speech features such as Spectrograms and Mel-frequency Cepstral Coefficient s (MFCCs) help retain emotion-related low-level chara… ▽ More

    Submitted 31 August, 2019; v1 submitted 18 June, 2019; originally announced June 2019.

    Comments: 10 pages, Accepted in IJCAI Affective Computing Workshop 2019

  20. arXiv:1906.05682  [pdf

    eess.AS cs.LG cs.SD stat.ML

    Focal Loss based Residual Convolutional Neural Network for Speech Emotion Recognition

    Authors: Suraj Tripathi, Abhay Kumar, Abhiram Ramesh, Chirag Singh, Promod Yenigalla

    Abstract: This paper proposes a Residual Convolutional Neural Network (ResNet) based on speech features and trained under Focal Loss to recognize emotion in speech. Speech features such as Spectrogram and Mel-frequency Cepstral Coefficients (MFCCs) have shown the ability to characterize emotion better than just plain text. Further Focal Loss, first used in One-Stage Object Detectors, has shown the ability t… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.

    Comments: Accepted in CICLing 2019

  21. arXiv:1906.05681  [pdf

    eess.AS cs.CL cs.LG cs.SD stat.ML

    Deep Learning based Emotion Recognition System Using Speech Features and Transcriptions

    Authors: Suraj Tripathi, Abhay Kumar, Abhiram Ramesh, Chirag Singh, Promod Yenigalla

    Abstract: This paper proposes a speech emotion recognition method based on speech features and speech transcriptions (text). Speech features such as Spectrogram and Mel-frequency Cepstral Coefficients (MFCC) help retain emotion-related low-level characteristics in speech whereas text helps capture semantic meaning, both of which help in different aspects of emotion detection. We experimented with several De… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.

    Comments: Accepted in CICLing 2019

  22. arXiv:1906.04914  [pdf

    cs.IR cs.CL cs.LG stat.ML

    From Fully Supervised to Zero Shot Settings for Twitter Hashtag Recommendation

    Authors: Abhay Kumar, Nishant Jain, Suraj Tripathi, Chirag Singh

    Abstract: We propose a comprehensive end-to-end pipeline for Twitter hashtags recommendation system including data collection, supervised training setting and zero shot training setting. In the supervised training setting, we have proposed and compared the performance of various deep learning architectures, namely Convolutional Neural Network (CNN), Recurrent Neural Network (RNN) and Transformer Network. Ho… ▽ More

    Submitted 11 June, 2019; originally announced June 2019.

    Comments: Accepted in CICLing 2019

  23. arXiv:1905.07631  [pdf, other

    stat.ML cs.LG stat.ME

    Disentangled Attribution Curves for Interpreting Random Forests and Boosted Trees

    Authors: Summer Devlin, Chandan Singh, W. James Murdoch, Bin Yu

    Abstract: Tree ensembles, such as random forests and AdaBoost, are ubiquitous machine learning models known for achieving strong predictive performance across a wide variety of domains. However, this strong performance comes at the cost of interpretability (i.e. users are unable to understand the relationships a trained random forest has learned and why it is making its predictions). In particular, it is ch… ▽ More

    Submitted 18 May, 2019; originally announced May 2019.

    Comments: Under review

  24. arXiv:1904.00197  [pdf

    cs.CV cs.LG stat.ML

    Exploiting SIFT Descriptor for Rotation Invariant Convolutional Neural Network

    Authors: Abhay Kumar, Nishant Jain, Chirag Singh, Suraj Tripathi

    Abstract: This paper presents a novel approach to exploit the distinctive invariant features in convolutional neural network. The proposed CNN model uses Scale Invariant Feature Transform (SIFT) descriptor instead of the max-pooling layer. Max-pooling layer discards the pose, i.e., translational and rotational relationship between the low-level features, and hence unable to capture the spatial hierarchies b… ▽ More

    Submitted 30 March, 2019; originally announced April 2019.

    Comments: Accepted in IEEE INDICON 2018

  25. arXiv:1901.04592  [pdf, other

    stat.ML cs.AI cs.LG stat.AP

    Interpretable machine learning: definitions, methods, and applications

    Authors: W. James Murdoch, Chandan Singh, Karl Kumbier, Reza Abbasi-Asl, Bin Yu

    Abstract: Machine-learning models have demonstrated great success in learning complex patterns that enable them to make predictions about unobserved data. In addition to using models for prediction, the ability to interpret what a model has learned is receiving an increasing amount of attention. However, this increased focus has led to considerable confusion about the notion of interpretability. In particul… ▽ More

    Submitted 14 January, 2019; originally announced January 2019.

    Comments: 11 pages

    Journal ref: Published in PNAS 2019

  26. arXiv:1806.05337  [pdf, other

    cs.LG cs.AI cs.CL cs.CV stat.ML

    Hierarchical interpretations for neural network predictions

    Authors: Chandan Singh, W. James Murdoch, Bin Yu

    Abstract: Deep neural networks (DNNs) have achieved impressive predictive performance due to their ability to learn complex, non-linear relationships between variables. However, the inability to effectively visualize these relationships has led to DNNs being characterized as black boxes and consequently limited their applications. To ameliorate this problem, we introduce the use of hierarchical interpretati… ▽ More

    Submitted 16 January, 2019; v1 submitted 13 June, 2018; originally announced June 2018.

    Comments: Published in ICLR 2019

    Journal ref: ICLR 2019

  27. arXiv:1806.01600  [pdf, other

    cs.LG stat.ML

    Accelerated Randomized Coordinate Descent Algorithms for Stochastic Optimization and Online Learning

    Authors: Akshita Bhandari, Chandramani Singh

    Abstract: We propose accelerated randomized coordinate descent algorithms for stochastic optimization and online learning. Our algorithms have significantly less per-iteration complexity than the known accelerated gradient algorithms. The proposed algorithms for online learning have better regret performance than the known randomized online coordinate descent algorithms. Furthermore, the proposed algorithms… ▽ More

    Submitted 16 July, 2018; v1 submitted 5 June, 2018; originally announced June 2018.

    Comments: 20 pages, 4 figures, 2 tables