Skip to main content

Showing 1–27 of 27 results for author: Shah, V

Searching in archive stat. Search in all archives.
.
  1. arXiv:2402.01207  [pdf, other

    cs.LG cs.AI stat.ME

    Efficient Causal Graph Discovery Using Large Language Models

    Authors: Thomas Jiralerspong, Xiaoyin Chen, Yash More, Vedant Shah, Yoshua Bengio

    Abstract: We propose a novel framework that leverages LLMs for full causal graph discovery. While previous LLM-based methods have used a pairwise query approach, this requires a quadratic number of queries which quickly becomes impractical for larger causal graphs. In contrast, the proposed framework uses a breadth-first search (BFS) approach which allows it to use only a linear number of queries. We also s… ▽ More

    Submitted 13 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  2. arXiv:2105.06371  [pdf, other

    cs.LG stat.ML

    Provably Convergent Algorithms for Solving Inverse Problems Using Generative Models

    Authors: Viraj Shah, Rakib Hyder, M. Salman Asif, Chinmay Hegde

    Abstract: The traditional approach of hand-crafting priors (such as sparsity) for solving inverse problems is slowly being replaced by the use of richer learned priors (such as those modeled by deep generative networks). In this work, we study the algorithmic aspects of such a learning-based approach from a theoretical perspective. For certain generative network architectures, we establish a simple non-conv… ▽ More

    Submitted 13 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: text overlap with arXiv:1810.03587, arXiv:1802.08406

  3. arXiv:2102.08554  [pdf, other

    stat.ML cs.LG

    Recoverability Landscape of Tree Structured Markov Random Fields under Symmetric Noise

    Authors: Ashish Katiyar, Soumya Basu, Vatsal Shah, Constantine Caramanis

    Abstract: We study the problem of learning tree-structured Markov random fields (MRF) on discrete random variables with common support when the observations are corrupted by a $k$-ary symmetric noise channel with unknown probability of error. For Ising models (support size = 2), past work has shown that graph structure can only be recovered up to the leaf clusters (a leaf node, its parent, and its siblings… ▽ More

    Submitted 14 June, 2021; v1 submitted 16 February, 2021; originally announced February 2021.

  4. arXiv:2011.14066  [pdf, other

    stat.ML cs.LG

    On Generalization of Adaptive Methods for Over-parameterized Linear Regression

    Authors: Vatsal Shah, Soumya Basu, Anastasios Kyrillidis, Sujay Sanghavi

    Abstract: Over-parameterization and adaptive methods have played a crucial role in the success of deep learning in the last decade. The widespread use of over-parameterization has forced us to rethink generalization by bringing forth new phenomena, such as implicit regularization of optimization algorithms and double descent with training progression. A series of recent works have started to shed light on t… ▽ More

    Submitted 27 November, 2020; originally announced November 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1811.07055

  5. arXiv:2006.05601  [pdf, other

    stat.ML cs.LG

    Robust Estimation of Tree Structured Ising Models

    Authors: Ashish Katiyar, Vatsal Shah, Constantine Caramanis

    Abstract: We consider the task of learning Ising models when the signs of different random variables are flipped independently with possibly unequal, unknown probabilities. In this paper, we focus on the problem of robust estimation of tree-structured Ising models. Without any additional assumption of side information, this is an open problem. We first prove that this problem is unidentifiable, however, thi… ▽ More

    Submitted 9 June, 2020; originally announced June 2020.

  6. arXiv:2001.03316  [pdf, other

    stat.ML cs.LG

    Choosing the Sample with Lowest Loss makes SGD Robust

    Authors: Vatsal Shah, Xiaoxia Wu, Sujay Sanghavi

    Abstract: The presence of outliers can potentially significantly skew the parameters of machine learning models trained via stochastic gradient descent (SGD). In this paper we propose a simple variant of the simple SGD method: in each step, first choose a set of k samples, then from these choose the one with the smallest current loss, and do an SGD-like update with this chosen sample. Vanilla SGD correspond… ▽ More

    Submitted 10 January, 2020; originally announced January 2020.

  7. arXiv:1911.05166  [pdf, other

    cs.LG stat.ML

    Negative sampling in semi-supervised learning

    Authors: John Chen, Vatsal Shah, Anastasios Kyrillidis

    Abstract: We introduce Negative Sampling in Semi-Supervised Learning (NS3L), a simple, fast, easy to tune algorithm for semi-supervised learning (SSL). NS3L is motivated by the success of negative sampling/contrastive estimation. We demonstrate that adding the NS3L loss to state-of-the-art SSL algorithms, such as the Virtual Adversarial Training (VAT), significantly improves upon vanilla VAT and its variant… ▽ More

    Submitted 28 June, 2020; v1 submitted 12 November, 2019; originally announced November 2019.

  8. arXiv:1910.08179  [pdf

    stat.AP

    High Performance Implementation of the Hierarchical Likelihood for Generalized Linear Mixed Models. An Application to estimate the potassium reference range in massive Electronic Health Records datasets

    Authors: Cristian H Bologa, Vernon Shane Pankratz, Mark L Unruh, Maria Eleni Roumelioti, Vallabh Shah, Saeed Kamran Shaffi, Soraya Arzhan, John Cook, Christos Argyropoulos

    Abstract: Converting electronic health record (EHR) entries to useful clinical inferences requires one to address the poor scalability of existing implementations of Generalized Linear Mixed Models (GLMM) for repeated measures. The major computational bottleneck concerns the numerical evaluation of integrals, which even for the simplest EHR analyses may involve millions of dimensions (one for each patient).… ▽ More

    Submitted 17 April, 2021; v1 submitted 17 October, 2019; originally announced October 2019.

    MSC Class: 62P10; 65D25; 62-04; 62F10; 62R07

  9. arXiv:1906.03183  [pdf, other

    q-bio.QM cs.CL cs.IR stat.AP

    Exploring Diseases and Syndromes in Neurology Case Reports from 1955 to 2017 with Text Mining

    Authors: Amir Karami, Mehdi Ghasemi, Souvik Sen, Marcos Moraes, Vishal Shah

    Abstract: Background: A large number of neurology case reports have been published, but it is a challenging task for human medical experts to explore all of these publications. Text mining offers a computational approach to investigate neurology literature and capture meaningful patterns. The overarching goal of this study is to provide a new perspective on case reports of neurological disease and syndrome… ▽ More

    Submitted 23 May, 2019; originally announced June 2019.

  10. arXiv:1906.01626  [pdf, other

    cs.LG eess.IV stat.ML

    Encoding Invariances in Deep Generative Models

    Authors: Viraj Shah, Ameya Joshi, Sambuddha Ghosal, Balaji Pokuri, Soumik Sarkar, Baskar Ganapathysubramanian, Chinmay Hegde

    Abstract: Reliable training of generative adversarial networks (GANs) typically require massive datasets in order to model complicated distributions. However, in several applications, training samples obey invariances that are \textit{a priori} known; for example, in complex physics simulations, the training data obey universal laws encoded as well-defined mathematical equations. In this paper, we propose a… ▽ More

    Submitted 4 June, 2019; originally announced June 2019.

  11. arXiv:1903.02706  [pdf, ps, other

    cs.SI cs.CY stat.AP stat.ML

    Twitter Speaks: A Case of National Disaster Situational Awareness

    Authors: Amir Karami, Vishal Shah, Reza Vaezi, Amit Bansal

    Abstract: In recent years, we have been faced with a series of natural disasters causing a tremendous amount of financial, environmental, and human losses. The unpredictable nature of natural disasters' behavior makes it hard to have a comprehensive situational awareness (SA) to support disaster management. Using opinion surveys is a traditional approach to analyze public concerns during natural disasters;… ▽ More

    Submitted 6 March, 2019; originally announced March 2019.

    Comments: 17 pages, 3 figures, 5 tables

  12. arXiv:1901.02045  [pdf, other

    cs.LG econ.EM stat.ML

    Semi-parametric dynamic contextual pricing

    Authors: Virag Shah, Jose Blanchet, Ramesh Johari

    Abstract: Motivated by the application of real-time pricing in e-commerce platforms, we consider the problem of revenue-maximization in a setting where the seller can leverage contextual information describing the customer's history and the product's type to predict her valuation of the product. However, her true valuation is unobservable to the seller, only binary outcome in the form of success-failure of… ▽ More

    Submitted 10 August, 2019; v1 submitted 7 January, 2019; originally announced January 2019.

    Comments: 28 pages, 1 table, 1 figure

  13. arXiv:1812.00557  [pdf, other

    stat.ML cs.LG

    Signal Reconstruction from Modulo Observations

    Authors: Viraj Shah, Chinmay Hegde

    Abstract: We consider the problem of reconstructing a signal from under-determined modulo observations (or measurements). This observation model is inspired by a (relatively) less well-known imaging mechanism called modulo imaging, which can be used to extend the dynamic range of imaging systems; variations of this model have also been studied under the category of phase unwrap**. Signal reconstruction in… ▽ More

    Submitted 16 July, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

  14. arXiv:1811.09669  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.comp-ph stat.ML

    Physics-aware Deep Generative Models for Creating Synthetic Microstructures

    Authors: Rahul Singh, Viraj Shah, Balaji Pokuri, Soumik Sarkar, Baskar Ganapathysubramanian, Chinmay Hegde

    Abstract: A key problem in computational material science deals with understanding the effect of material distribution (i.e., microstructure) on material performance. The challenge is to synthesize microstructures, given a finite number of microstructure images, and/or some physical invariances that the microstructure exhibits. Conventional approaches are based on stochastic optimization and are computation… ▽ More

    Submitted 21 November, 2018; originally announced November 2018.

  15. arXiv:1811.07055   

    stat.ML cs.LG

    Minimum weight norm models do not always generalize well for over-parameterized problems

    Authors: Vatsal Shah, Anastasios Kyrillidis, Sujay Sanghavi

    Abstract: This work is substituted by the paper in arXiv:2011.14066. Stochastic gradient descent is the de facto algorithm for training deep neural networks (DNNs). Despite its popularity, it still requires fine tuning in order to achieve its best performance. This has led to the development of adaptive methods, that claim automatic hyper-parameter optimization. Recently, researchers have studied both algo… ▽ More

    Submitted 1 December, 2020; v1 submitted 16 November, 2018; originally announced November 2018.

    Comments: This work is substituted by the paper in arXiv:2011.14066

  16. arXiv:1805.11754  [pdf, other

    stat.ML cs.LG

    Optimal Testing in the Experiment-rich Regime

    Authors: Sven Schmit, Virag Shah, Ramesh Johari

    Abstract: Motivated by the widespread adoption of large-scale A/B testing in industry, we propose a new experimentation framework for the setting where potential experiments are abundant (i.e., many hypotheses are available to test), and observations are costly; we refer to this as the experiment-rich regime. Such scenarios require the experimenter to internalize the opportunity cost of assigning a sample t… ▽ More

    Submitted 29 May, 2018; originally announced May 2018.

  17. arXiv:1802.08406  [pdf, other

    stat.ML cs.LG

    Solving Linear Inverse Problems Using GAN Priors: An Algorithm with Provable Guarantees

    Authors: Viraj Shah, Chinmay Hegde

    Abstract: In recent works, both sparsity-based methods as well as learning-based methods have proven to be successful in solving several challenging linear inverse problems. However, sparsity priors for natural signals and images suffer from poor discriminative capability, while learning-based methods seldom provide concrete theoretical guarantees. In this work, we advocate the idea of replacing hand-crafte… ▽ More

    Submitted 23 February, 2018; originally announced February 2018.

  18. arXiv:1802.05693  [pdf, other

    cs.LG stat.ML

    Bandit Learning with Positive Externalities

    Authors: Virag Shah, Jose Blanchet, Ramesh Johari

    Abstract: In many platforms, user arrivals exhibit a self-reinforcing behavior: future user arrivals are likely to have preferences similar to users who were satisfied in the past. In other words, arrivals exhibit positive externalities. We study multiarmed bandit (MAB) problems with positive externalities. We show that the self-reinforcing preferences may lead standard benchmark algorithms such as UCB to e… ▽ More

    Submitted 6 March, 2019; v1 submitted 15 February, 2018; originally announced February 2018.

    Comments: 31 pages, 1 table, 2 figures

  19. arXiv:1801.08085  [pdf

    q-bio.QM eess.SP q-bio.NC stat.ML

    The Temple University Hospital Seizure Detection Corpus

    Authors: Vinit Shah, Eva von Weltin, Silvia Lopez, James Riley McHugh, Lily Veloso, Meysam Golmohammadi, Iyad Obeid, Joseph Picone

    Abstract: We introduce the TUH EEG Seizure Corpus (TUSZ), which is the largest open source corpus of its type, and represents an accurate characterization of clinical conditions. In this paper, we describe the techniques used to develop TUSZ, evaluate their effectiveness, and present some descriptive statistics on the resulting corpus.

    Submitted 2 January, 2018; originally announced January 2018.

    Comments: Under review in Frontiers in Neuroscience

  20. arXiv:1801.02472  [pdf

    eess.SP cs.CV q-bio.NC stat.ML

    Optimizing Channel Selection for Seizure Detection

    Authors: Vinit Shah, Meysam Golmohammadi, Saeedeh Ziyabari, Eva Von Weltin, Iyad Obeid, Joseph Picone

    Abstract: Interpretation of electroencephalogram (EEG) signals can be complicated by obfuscating artifacts. Artifact detection plays an important role in the observation and analysis of EEG signals. Spatial information contained in the placement of the electrodes can be exploited to accurately detect artifacts. However, when fewer electrodes are used, less spatial information is available, making it harder… ▽ More

    Submitted 2 January, 2018; originally announced January 2018.

    Comments: Published in Dec 2017 publication IEEE Signal Processing in Medicine and Biology Symposium. Philadelphia, Pennsylvania, USA

  21. arXiv:1801.02471  [pdf

    eess.SP cs.AI stat.ML

    Gated Recurrent Networks for Seizure Detection

    Authors: Meysam Golmohammadi, Saeedeh Ziyabari, Vinit Shah, Eva Von Weltin, Christopher Campbell, Iyad Obeid, Joseph Picone

    Abstract: Recurrent Neural Networks (RNNs) with sophisticated units that implement a gating mechanism have emerged as powerful technique for modeling sequential signals such as speech or electroencephalography (EEG). The latter is the focus on this paper. A significant big data resource, known as the TUH EEG Corpus (TUEEG), has recently become available for EEG research, creating a unique opportunity to eva… ▽ More

    Submitted 2 January, 2018; originally announced January 2018.

    Comments: Published in Dec 2017 publication In IEEE Signal Processing in Medicine and Biology Symposium. Philadelphia, Pennsylvania, USA. arXiv admin note: text overlap with arXiv:1712.09776

  22. arXiv:1712.10107  [pdf

    cs.LG eess.SP stat.ML

    Objective evaluation metrics for automatic classification of EEG events

    Authors: Saeedeh Ziyabari, Vinit Shah, Meysam Golmohammadi, Iyad Obeid, Joseph Picone

    Abstract: The evaluation of machine learning algorithms in biomedical fields for applications involving sequential data lacks standardization. Common quantitative scalar evaluation metrics such as sensitivity and specificity can often be misleading depending on the requirements of the application. Evaluation metrics must ultimately reflect the needs of users yet be sufficiently sensitive to guide algorithm… ▽ More

    Submitted 2 December, 2019; v1 submitted 28 December, 2017; originally announced December 2017.

    Comments: 22 pages, 11 figures, 9 tables

  23. arXiv:1712.09776  [pdf

    cs.LG eess.SP q-bio.NC stat.ML

    Deep Architectures for Automated Seizure Detection in Scalp EEGs

    Authors: Meysam Golmohammadi, Saeedeh Ziyabari, Vinit Shah, Silvia Lopez de Diego, Iyad Obeid, Joseph Picone

    Abstract: Automated seizure detection using clinical electroencephalograms is a challenging machine learning problem because the multichannel signal often has an extremely low signal to noise ratio. Events of interest such as seizures are easily confused with signal artifacts (e.g, eye movements) or benign variants (e.g., slowing). Commercially available systems suffer from unacceptably high false alarm rat… ▽ More

    Submitted 28 December, 2017; originally announced December 2017.

    Comments: nder review in International Conference on Machine Learning, Stockholm, Sweden

  24. arXiv:1710.00109  [pdf, other

    stat.ML

    Reconstruction from Periodic Nonlinearities, With Applications to HDR Imaging

    Authors: Viraj Shah, Mohammadreza Soltani, Chinmay Hegde

    Abstract: We consider the problem of reconstructing signals and images from periodic nonlinearities. For such problems, we design a measurement scheme that supports efficient reconstruction; moreover, our method can be adapted to extend to compressive sensing-based signal and image acquisition systems. Our techniques can be potentially useful for reducing the measurement complexity of high dynamic range (HD… ▽ More

    Submitted 29 September, 2017; originally announced October 2017.

  25. arXiv:1705.02047  [pdf, other

    stat.ML cs.LG

    Matrix Completion via Factorizing Polynomials

    Authors: Vatsal Shah, Nikhil Rao, Weicong Ding

    Abstract: Predicting unobserved entries of a partially observed matrix has found wide applicability in several areas, such as recommender systems, computational biology, and computer vision. Many scalable methods with rigorous theoretical guarantees have been developed for algorithms where the matrix is factored into low-rank components, and embeddings are learned for the row and column entities. While ther… ▽ More

    Submitted 13 February, 2018; v1 submitted 4 May, 2017; originally announced May 2017.

  26. arXiv:1703.00674  [pdf, other

    cs.AI cs.LG stat.ML

    Adaptive Matching for Expert Systems with Uncertain Task Types

    Authors: Virag Shah, Lennart Gulikers, Laurent Massoulie, Milan Vojnovic

    Abstract: A matching in a two-sided market often incurs an externality: a matched resource may become unavailable to the other side of the market, at least for a while. This is especially an issue in online platforms involving human experts as the expert resources are often scarce. The efficient utilization of experts in these platforms is made challenging by the fact that the information available about th… ▽ More

    Submitted 26 October, 2018; v1 submitted 2 March, 2017; originally announced March 2017.

    Comments: A part of it presented at Allerton Conference 2017, 18 pages

  27. arXiv:1603.06861  [pdf, other

    stat.ML cs.IT cs.LG math.OC

    Trading-off variance and complexity in stochastic gradient descent

    Authors: Vatsal Shah, Megasthenis Asteris, Anastasios Kyrillidis, Sujay Sanghavi

    Abstract: Stochastic gradient descent is the method of choice for large-scale machine learning problems, by virtue of its light complexity per iteration. However, it lags behind its non-stochastic counterparts with respect to the convergence rate, due to high variance introduced by the stochastic updates. The popular Stochastic Variance-Reduced Gradient (SVRG) method mitigates this shortcoming, introducing… ▽ More

    Submitted 22 March, 2016; originally announced March 2016.

    Comments: 14 pages, 13 figures, first edition on 9th of October 2015