Skip to main content

Showing 1–15 of 15 results for author: Gupta, G

Searching in archive stat. Search in all archives.
.
  1. arXiv:2405.05852  [pdf, other

    cs.CV cs.AI cs.CL cs.LG cs.RO stat.ML

    Pre-trained Text-to-Image Diffusion Models Are Versatile Representation Learners for Control

    Authors: Gunshi Gupta, Karmesh Yadav, Yarin Gal, Dhruv Batra, Zsolt Kira, Cong Lu, Tim G. J. Rudner

    Abstract: Embodied AI agents require a fine-grained understanding of the physical world mediated through visual and language inputs. Such capabilities are difficult to learn solely from task-specific data. This has led to the emergence of pre-trained vision-language models as a tool for transferring representations learned from internet-scale data to downstream tasks and new domains. However, commonly used… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  2. arXiv:2305.18404  [pdf, ps, other

    cs.CL cs.LG stat.ML

    Conformal Prediction with Large Language Models for Multi-Choice Question Answering

    Authors: Bhawesh Kumar, Charlie Lu, Gauri Gupta, Anil Palepu, David Bellamy, Ramesh Raskar, Andrew Beam

    Abstract: As large language models continue to be widely developed, robust uncertainty quantification techniques will become crucial for their safe deployment in high-stakes scenarios. In this work, we explore how conformal prediction can be used to provide uncertainty quantification in language models for the specific task of multiple-choice question-answering. We find that the uncertainty estimates from c… ▽ More

    Submitted 7 July, 2023; v1 submitted 28 May, 2023; originally announced May 2023.

    Comments: Updated sections on prompt engineering. Expanded sections 4.1 and 4.2 and appendix. Included additional references. Work published at the ICML 2023 (Neural Conversational AI TEACH) workshop

  3. arXiv:2305.15786  [pdf, other

    cs.LG math.ST stat.ML

    Theoretical Guarantees of Learning Ensembling Strategies with Applications to Time Series Forecasting

    Authors: Hilaf Hasson, Danielle C. Maddix, Yuyang Wang, Gaurav Gupta, Youngsuk Park

    Abstract: Ensembling is among the most popular tools in machine learning (ML) due to its effectiveness in minimizing variance and thus improving generalization. Most ensembling methods for black-box base learners fall under the umbrella of "stacked generalization," namely training an ML algorithm that takes the inferences from the base learners as input. While stacking has been widely applied in practice, i… ▽ More

    Submitted 28 August, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: ICML 2023

  4. arXiv:2008.09858  [pdf, other

    stat.ME cs.AI cs.LG

    Hi-CI: Deep Causal Inference in High Dimensions

    Authors: Ankit Sharma, Garima Gupta, Ranjitha Prasad, Arnab Chatterjee, Lovekesh Vig, Gautam Shroff

    Abstract: We address the problem of counterfactual regression using causal inference (CI) in observational studies consisting of high dimensional covariates and high cardinality treatments. Confounding bias, which leads to inaccurate treatment effect estimation, is attributed to covariates that affect both treatments and outcome. The presence of high-dimensional co-variates exacerbates the impact of bias as… ▽ More

    Submitted 9 April, 2021; v1 submitted 22 August, 2020; originally announced August 2020.

    Comments: 23 pages, 5 figures, Accepted in Causal Discovery Workshop - KDD 2020

  5. arXiv:2007.13904  [pdf, other

    cs.LG stat.ML

    La-MAML: Look-ahead Meta Learning for Continual Learning

    Authors: Gunshi Gupta, Karmesh Yadav, Liam Paull

    Abstract: The continual learning problem involves training models with limited capacity to perform well on a set of an unknown number of sequentially arriving tasks. While meta-learning shows great potential for reducing interference between old and new tasks, the current training procedures tend to be either slow or offline, and sensitive to many hyper-parameters. In this work, we propose Look-ahead MAML (… ▽ More

    Submitted 11 November, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: Accepted for Oral Presentation at NeurIPS 2020

  6. arXiv:2006.14554  [pdf, other

    stat.ML cs.LG

    STORM: Foundations of End-to-End Empirical Risk Minimization on the Edge

    Authors: Benjamin Coleman, Gaurav Gupta, John Chen, Anshumali Shrivastava

    Abstract: Empirical risk minimization is perhaps the most influential idea in statistical learning, with applications to nearly all scientific and technical domains in the form of regression and classification models. To analyze massive streaming datasets in distributed computing environments, practitioners increasingly prefer to deploy regression models on edge rather than in the cloud. By kee** data on… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

  7. arXiv:2004.13446  [pdf, ps, other

    stat.ME cs.LG cs.MA

    MultiMBNN: Matched and Balanced Causal Inference with Neural Networks

    Authors: Ankit Sharma, Garima Gupta, Ranjitha Prasad, Arnab Chatterjee, Lovekesh Vig, Gautam Shroff

    Abstract: Causal inference (CI) in observational studies has received a lot of attention in healthcare, education, ad attribution, policy evaluation, etc. Confounding is a typical hazard, where the context affects both, the treatment assignment and response. In a multiple treatment scenario, we propose the neural network based MultiMBNN, where we overcome confounding by employing generalized propensity scor… ▽ More

    Submitted 14 August, 2021; v1 submitted 28 April, 2020; originally announced April 2020.

    Comments: 7 pages, 3 figures, Accepted in ESANN 2020

  8. arXiv:1912.03960  [pdf, ps, other

    cs.LG cs.MA stat.ML

    MetaCI: Meta-Learning for Causal Inference in a Heterogeneous Population

    Authors: Ankit Sharma, Garima Gupta, Ranjitha Prasad, Arnab Chatterjee, Lovekesh Vig, Gautam Shroff

    Abstract: Performing inference on data obtained through observational studies is becoming extremely relevant due to the widespread availability of data in fields such as healthcare, education, retail, etc. Furthermore, this data is accrued from multiple homogeneous subgroups of a heterogeneous population, and hence, generalizing the inference mechanism over such data is essential. We propose the MetaCI fram… ▽ More

    Submitted 17 February, 2021; v1 submitted 9 December, 2019; originally announced December 2019.

    Comments: 10 pages, 4 figures, Accepted in CausalML Workshop - NeurIPS 2019

  9. arXiv:1910.10367  [pdf, other

    stat.ML cs.LG

    Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales

    Authors: Sanjay Thakur, Herke Van Hoof, Gunshi Gupta, David Meger

    Abstract: Neural Network based controllers hold enormous potential to learn complex, high-dimensional functions. However, they are prone to overfitting and unwarranted extrapolations. PAC Bayes is a generalized framework which is more resistant to overfitting and that yields performance bounds that hold with arbitrarily high probability even on the unjustified extrapolations. However, optimizing to learn su… ▽ More

    Submitted 17 December, 2019; v1 submitted 23 October, 2019; originally announced October 2019.

    Comments: 13 pages, 8 figures, 8 tables

  10. arXiv:1909.12473  [pdf, other

    cs.LG stat.ML

    Noisy Batch Active Learning with Deterministic Annealing

    Authors: Gaurav Gupta, Anit Kumar Sahu, Wan-Yi Lin

    Abstract: We study the problem of training machine learning models incrementally with batches of samples annotated with noisy oracles. We select each batch of samples that are important and also diverse via clustering and importance sampling. More importantly, we incorporate model uncertainty into the sampling probability to compensate for poor estimation of the importance scores when the training data is t… ▽ More

    Submitted 28 October, 2020; v1 submitted 26 September, 2019; originally announced September 2019.

  11. arXiv:1905.11226  [pdf, other

    cs.LG cs.LO stat.ML

    Induction of Non-Monotonic Rules From Statistical Learning Models Using High-Utility Itemset Mining

    Authors: Farhad Shakerin, Gopal Gupta

    Abstract: We present a fast and scalable algorithm to induce non-monotonic logic programs from statistical learning models. We reduce the problem of search for best clauses to instances of the High-Utility Itemset Mining (HUIM) problem. In the HUIM problem, feature values and their importance are treated as transactions and utilities respectively. We make use of TreeExplainer, a fast and scalable implementa… ▽ More

    Submitted 28 May, 2019; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: arXiv admin note: text overlap with arXiv:1808.00629

  12. arXiv:1811.00703  [pdf, ps, other

    cs.LG stat.ML

    Learning Latent Fractional dynamics with Unknown Unknowns

    Authors: Gaurav Gupta, Sergio Pequito, Paul Bogdan

    Abstract: Despite significant effort in understanding complex systems (CS), we lack a theory for modeling, inference, analysis and efficient control of time-varying complex networks (TVCNs) in uncertain environments. From brain activity dynamics to microbiome, and even chromatin interactions within the genome architecture, many such TVCNs exhibits a pronounced spatio-temporal fractality. Moreover, for many… ▽ More

    Submitted 21 March, 2019; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: 8 pages, 5 figures, American Control Conference 2019

  13. arXiv:1811.00688  [pdf, ps, other

    cs.LG q-bio.NC stat.ML

    Data-driven Perception of Neuron Point Process with Unknown Unknowns

    Authors: Ruochen Yang, Gaurav Gupta, Paul Bogdan

    Abstract: Identification of patterns from discrete data time-series for statistical inference, threat detection, social opinion dynamics, brain activity prediction has received recent momentum. In addition to the huge data size, the associated challenges are, for example, (i) missing data to construct a closed time-varying complex network, and (ii) contribution of unknown sources which are not probed. Towar… ▽ More

    Submitted 21 February, 2019; v1 submitted 1 November, 2018; originally announced November 2018.

  14. arXiv:1808.00629  [pdf, other

    cs.LG stat.ML

    Induction of Non-Monotonic Logic Programs to Explain Boosted Tree Models Using LIME

    Authors: Farhad Shakerin, Gopal Gupta

    Abstract: We present a heuristic based algorithm to induce \textit{nonmonotonic} logic programs that will explain the behavior of XGBoost trained classifiers. We use the technique based on the LIME approach to locally select the most important features contributing to the classification decision. Then, in order to explain the model's global behavior, we propose the LIME-FOLD algorithm ---a heuristic-based i… ▽ More

    Submitted 9 November, 2018; v1 submitted 1 August, 2018; originally announced August 2018.

  15. arXiv:1708.06246  [pdf, other

    cs.AI stat.ML

    Comparative Benchmarking of Causal Discovery Techniques

    Authors: Karamjit Singh, Garima Gupta, Vartika Tewari, Gautam Shroff

    Abstract: In this paper we present a comprehensive view of prominent causal discovery algorithms, categorized into two main categories (1) assuming acyclic and no latent variables, and (2) allowing both cycles and latent variables, along with experimental results comparing them from three perspectives: (a) structural accuracy, (b) standard predictive accuracy, and (c) accuracy of counterfactual inference. F… ▽ More

    Submitted 12 September, 2017; v1 submitted 18 August, 2017; originally announced August 2017.

    Comments: arXiv admin note: text overlap with arXiv:1506.07669, arXiv:1611.03977 by other authors