Showing 1–29 of 29 results for author: Dasarathy, G

Searching in archive cs.
  1. arXiv:2405.14422  [pdf, other]

    cs.LG cs.AI cs.CY

    Unraveling overoptimism and publication bias in ML-driven science

    Authors: Pouria Saidi, Gautam Dasarathy, Visar Berisha

    Abstract: Machine Learning (ML) is increasingly used across many disciplines with impressive reported results across many domain areas. However, recent studies suggest that the published performance of ML models is often overoptimistic. Validity concerns are underscored by findings of an inverse relationship between sample size and reported accuracy in published ML models, contrasting with the theory of le…

    Submitted 10 June, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 31 pages, 7 figures, 6 tables

  2. arXiv:2405.06253  [pdf, ps, other]

    cs.GT math.OC

    On Characterizations of Potential and Ordinal Potential Games

    Authors: Sina Arefizadeh, Angelia Nedich, Gautam Dasarathy

    Abstract: This paper investigates some necessary and sufficient conditions for a game to be a potential game. At first, we extend the classical results of Slade and Monderer and Shapley from games with one-dimensional action spaces to games with multi-dimensional action spaces, which require differentiable cost functions. Then, we provide necessary and sufficient conditions for a game to have a potential…

    Submitted 10 May, 2024; originally announced May 2024.

  3. arXiv:2301.12616  [pdf, other]

    cs.LG stat.ME

    Active Sequential Two-Sample Testing

    Authors: Weizhi Li, Prad Kadambi, Pouria Saidi, Karthikeyan Natesan Ramamurthy, Gautam Dasarathy, Visar Berisha

    Abstract: A two-sample hypothesis test is a statistical procedure used to determine whether the distributions generating two samples are identical. We consider the two-sample testing problem in a new scenario where the sample measurements (or sample features) are inexpensive to access, but their group memberships (or labels) are costly. To address the problem, we devise the first \emph{active sequential two…

    Submitted 27 June, 2024; v1 submitted 29 January, 2023; originally announced January 2023.

  4. arXiv:2211.05690  [pdf, other]

    stat.ML cs.LG math.ST

    Robust Model Selection of Gaussian Graphical Models

    Authors: Abrar Zahin, Rajasekhar Anguluri, Lalitha Sankar, Oliver Kosut, Gautam Dasarathy

    Abstract: In Gaussian graphical model selection, noise-corrupted samples present significant challenges. It is known that even minimal amounts of noise can obscure the underlying structure, leading to fundamental identifiability issues. A recent line of work addressing this "robust model selection" problem narrows its focus to tree-structured graphical models. Even within this specific class of models, exac…

    Submitted 7 May, 2024; v1 submitted 10 November, 2022; originally announced November 2022.

  5. arXiv:2206.10569  [pdf, other]

    eess.SY cs.LG eess.SP stat.ML

    Controllability of Coarsely Measured Networked Linear Dynamical Systems (Extended Version)

    Authors: Nafiseh Ghoroghchian, Rajasekhar Anguluri, Gautam Dasarathy, Stark C. Draper

    Abstract: We consider the controllability of large-scale linear networked dynamical systems when complete knowledge of network structure is unavailable and knowledge is limited to coarse summaries. We provide conditions under which average controllability of the fine-scale system can be well approximated by average controllability of the (synthesized, reduced-order) coarse-scale system. To this end, we requ…

    Submitted 21 June, 2022; originally announced June 2022.

  6. arXiv:2206.07083  [pdf, other]

    stat.ML cs.LG eess.SP math.OC math.ST

    Learning the Structure of Large Networked Systems Obeying Conservation Laws

    Authors: Anirudh Rayas, Rajasekhar Anguluri, Gautam Dasarathy

    Abstract: Many networked systems such as electric networks, the brain, and social networks of opinion dynamics are known to obey conservation laws. Examples of this phenomenon include the Kirchhoff laws in electric networks and opinion consensus in social networks. Conservation laws in networked systems may be modeled as balance equations of the form $X = B^{*} Y$, where the sparsity pattern of $B^{*}$ captu…

    Submitted 14 June, 2022; originally announced June 2022.

  7. A Machine Learning Framework for Event Identification via Modal Analysis of PMU Data

    Authors: Nima T. Bazargani, Gautam Dasarathy, Lalitha Sankar, Oliver Kosut

    Abstract: Power systems are prone to a variety of events (e.g. line trips and generation loss) and real-time identification of such events is crucial in terms of situational awareness, reliability, and security. Using measurements from multiple synchrophasors, i.e., phasor measurement units (PMUs), we propose to identify events by extracting features based on modal dynamics. We combine such traditional phys…

    Submitted 3 October, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: 12 pages. Accepted for publication in the IEEE Transactions on Power Systems

    Journal ref: IEEE Transactions on Power Systems, 2022

  8. arXiv:2111.08861  [pdf, other]

    cs.LG stat.ML

    A label-efficient two-sample test

    Authors: Weizhi Li, Gautam Dasarathy, Karthikeyan Natesan Ramamurthy, Visar Berisha

    Abstract: Two-sample tests evaluate whether two samples are realizations of the same distribution (the null hypothesis) or two different distributions (the alternative hypothesis). We consider a new setting for this problem where sample features are easily measured whereas sample labels are unknown and costly to obtain. Accordingly, we devise a three-stage framework in service of performing an effective two…

    Submitted 19 July, 2022; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: Accepted to the 38th conference on Uncertainty in Artificial Intelligence (UAI2022)

  9. arXiv:2108.01152  [pdf, ps, other]

    cs.LG stat.ML

    Maximizing and Satisficing in Multi-armed Bandits with Graph Information

    Authors: Parth K. Thaker, Mohit Malu, Nikhil Rao, Gautam Dasarathy

    Abstract: Pure exploration in multi-armed bandits has emerged as an important framework for modeling decision-making and search under uncertainty. In modern applications, however, one is often faced with a tremendously large number of options. Even obtaining one observation per option may be too costly, rendering traditional pure exploration algorithms ineffective. Fortunately, one often has access to simila…

    Submitted 20 November, 2022; v1 submitted 2 August, 2021; originally announced August 2021.

  10. arXiv:2104.07208  [pdf]

    cs.LG eess.SP

    State and Topology Estimation for Unobservable Distribution Systems using Deep Neural Networks

    Authors: Behrouz Azimian, Reetam Sen Biswas, Shiva Moshtagh, Anamitra Pal, Lang Tong, Gautam Dasarathy

    Abstract: Time-synchronized state estimation for reconfigurable distribution networks is challenging because of limited real-time observability. This paper addresses this challenge by formulating a deep learning (DL)-based approach for topology identification (TI) and unbalanced three-phase distribution system state estimation (DSSE). Two deep neural networks (DNNs) are trained for time-synchronized DNN-bas…

    Submitted 26 March, 2022; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: 13 pages. arXiv admin note: substantial text overlap with arXiv:2011.04272

    Journal ref: IEEE Transactions on Instrumentation and Measurement, 2022

  11. arXiv:2102.13135  [pdf, other]

    math.ST cs.IT cs.LG eess.SP stat.ML

    Graph Community Detection from Coarse Measurements: Recovery Conditions for the Coarsened Weighted Stochastic Block Model

    Authors: Nafiseh Ghoroghchian, Gautam Dasarathy, Stark C. Draper

    Abstract: We study the problem of community recovery from coarse measurements of a graph. In contrast to the problem of community recovery of a fully observed graph, one often encounters situations when measurements of a graph are made at low-resolution, each measurement integrating across multiple graph nodes. Such low-resolution measurements effectively induce a coarse graph with its own communities. Our…

    Submitted 25 February, 2021; originally announced February 2021.

  12. arXiv:2011.09645  [pdf, other]

    cs.LG

    Finding the Homology of Decision Boundaries with Active Learning

    Authors: Weizhi Li, Gautam Dasarathy, Karthikeyan Natesan Ramamurthy, Visar Berisha

    Abstract: Accurately and efficiently characterizing the decision boundary of classifiers is important for problems related to model selection and meta-learning. Inspired by topological data analysis, the characterization of decision boundaries using their homology has recently emerged as a general and powerful tool. In this paper, we propose an active learning algorithm to recover the homology of decision b…

    Submitted 18 November, 2020; originally announced November 2020.

    Journal ref: Advances in Neural Information Processing Systems 33 (2020)

  13. arXiv:2007.05996  [pdf, other]

    cs.CV eess.IV physics.ao-ph

    Differentiable Programming for Hyperspectral Unmixing using a Physics-based Dispersion Model

    Authors: John Janiczek, Parth Thaker, Gautam Dasarathy, Christopher S. Edwards, Philip Christensen, Suren Jayasuriya

    Abstract: Hyperspectral unmixing is an important remote sensing task with applications including material identification and analysis. Characteristic spectral features make many pure materials identifiable from their visible-to-infrared spectra, but quantifying their presence within a mixture is a challenging task due to nonlinearities and factors of variation. In this paper, spectral variation is considere…

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: 36 pages, 11 figures. Accepted to European Conference on Computer Vision (ECCV) 2020

  14. arXiv:2006.12406  [pdf, ps, other]

    cs.LG cs.IT stat.ML

    On the alpha-loss Landscape in the Logistic Model

    Authors: Tyler Sypherd, Mario Diaz, Lalitha Sankar, Gautam Dasarathy

    Abstract: We analyze the optimization landscape of a recently introduced tunable class of loss functions called $α$-loss, $α\in (0,\infty]$, in the logistic model. This family encapsulates the exponential loss ($α= 1/2$), the log-loss ($α= 1$), and the 0-1 loss ($α= \infty$) and contains compelling properties that enable the practitioner to discern among a host of operating conditions relevant to emerging l…

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: 5 pages, appeared in ISIT 2020. arXiv admin note: text overlap with arXiv:1906.02314

  15. arXiv:2002.01066  [pdf, ps, other]

    eess.SP cs.IT cs.LG math.OC stat.ML

    On the Sample Complexity and Optimization Landscape for Quadratic Feasibility Problems

    Authors: Parth Thaker, Gautam Dasarathy, Angelia Nedić

    Abstract: We consider the problem of recovering a complex vector $\mathbf{x}\in \mathbb{C}^n$ from $m$ quadratic measurements $\{\langle A_i\mathbf{x}, \mathbf{x}\rangle\}_{i=1}^m$. This problem, known as quadratic feasibility, encompasses the well-known phase retrieval problem and has applications in a wide range of important areas including power system state estimation and x-ray crystallography. In gener…

    Submitted 14 December, 2020; v1 submitted 3 February, 2020; originally announced February 2020.

    Comments: 21 pages

  16. arXiv:2001.01900  [pdf, other]

    cs.LG stat.ML

    Regularization via Structural Label Smoothing

    Authors: Weizhi Li, Gautam Dasarathy, Visar Berisha

    Abstract: Regularization is an effective way to promote the generalization performance of machine learning models. In this paper, we focus on label smoothing, a form of output distribution regularization that prevents overfitting of a neural network by softening the ground-truth labels in the training data in an attempt to penalize overconfident outputs. Existing approaches typically use cross-validation to…

    Submitted 4 July, 2020; v1 submitted 7 January, 2020; originally announced January 2020.

  17. arXiv:1906.02314  [pdf, other]

    cs.LG stat.ML

    A Tunable Loss Function for Robust Classification: Calibration, Landscape, and Generalization

    Authors: Tyler Sypherd, Mario Diaz, John Kevin Cava, Gautam Dasarathy, Peter Kairouz, Lalitha Sankar

    Abstract: We introduce a tunable loss function called $α$-loss, parameterized by $α\in (0,\infty]$, which interpolates between the exponential loss ($α= 1/2$), the log-loss ($α= 1$), and the 0-1 loss ($α= \infty$), for the machine learning setting of classification. Theoretically, we illustrate a fundamental connection between $α$-loss and Arimoto conditional entropy, verify the classification-calibration o…

    Submitted 21 December, 2022; v1 submitted 5 June, 2019; originally announced June 2019.

    Comments: Published in the IEEE Transactions on Information Theory

  18. arXiv:1905.09190  [pdf, other]

    cs.LG stat.ML

    Thresholding Graph Bandits with GrAPL

    Authors: Daniel LeJeune, Gautam Dasarathy, Richard G. Baraniuk

    Abstract: In this paper, we introduce a new online decision making paradigm that we call Thresholding Graph Bandits. The main goal is to efficiently identify a subset of arms in a multi-armed bandit problem whose means are above a specified threshold. While traditionally in such problems, the arms are assumed to be independent, in our paradigm we further suppose that we have access to the similarity between…

    Submitted 24 March, 2020; v1 submitted 22 May, 2019; originally announced May 2019.

    Comments: 14 pages, 3 figures. To appear in AISTATS 2020

  19. arXiv:1905.08831  [pdf, other]

    cs.SI cs.LG eess.SP stat.ML

    IdeoTrace: A Framework for Ideology Tracing with a Case Study on the 2016 U.S. Presidential Election

    Authors: Indu Manickam, Andrew S. Lan, Gautam Dasarathy, Richard G. Baraniuk

    Abstract: The 2016 United States presidential election has been characterized as a period of extreme divisiveness that was exacerbated on social media by the influence of fake news, trolls, and social bots. However, the extent to which the public became more polarized in response to these influences over the course of the election is not well understood. In this paper we propose IdeoTrace, a framework for (…

    Submitted 30 May, 2019; v1 submitted 21 May, 2019; originally announced May 2019.

    Comments: 9 pages, 4 figures, submitted to ASONAM 2019

  20. arXiv:1806.04310  [pdf, other]

    cs.DS cs.LG stat.ML

    MISSION: Ultra Large-Scale Feature Selection using Count-Sketches

    Authors: Amirali Aghazadeh, Ryan Spring, Daniel LeJeune, Gautam Dasarathy, Anshumali Shrivastava, Richard G. Baraniuk

    Abstract: Feature selection is an important challenge in machine learning. It plays a crucial role in the explainability of machine-driven decisions that are rapidly permeating throughout modern society. Unfortunately, the explosion in the size and dimensionality of real-world datasets poses a severe challenge to standard feature selection algorithms. Today, it is not uncommon for datasets to have billions…

    Submitted 11 June, 2018; originally announced June 2018.

  21. arXiv:1707.04300  [pdf, other]

    cs.LG math.PR math.ST q-bio.PE

    Coalescent-based species tree estimation: a stochastic Farris transform

    Authors: Gautam Dasarathy, Elchanan Mossel, Robert Nowak, Sebastien Roch

    Abstract: The reconstruction of a species phylogeny from genomic data faces two significant hurdles: 1) the trees describing the evolution of each individual gene--i.e., the gene trees--may differ from the species phylogeny and 2) the molecular sequences corresponding to each gene often provide limited information about the gene trees themselves. In this paper we consider an approach to species tree reconst…

    Submitted 13 July, 2017; originally announced July 2017.

    Comments: Submitted. 49 pages

  22. arXiv:1707.03386  [pdf, ps, other]

    stat.ML cs.LG

    DeepCodec: Adaptive Sensing and Recovery via Deep Convolutional Neural Networks

    Authors: Ali Mousavi, Gautam Dasarathy, Richard G. Baraniuk

    Abstract: In this paper we develop a novel computational sensing framework for sensing and recovering structured signals. When trained on a set of representative signals, our framework learns to take undersampled measurements and recover signals from them using a deep convolutional neural network. In other words, it learns a transformation from the original signals to a near-optimal number of undersampled m…

    Submitted 11 July, 2017; originally announced July 2017.

  23. arXiv:1610.09726  [pdf, other]

    cs.LG

    The Multi-fidelity Multi-armed Bandit

    Authors: Kirthevasan Kandasamy, Gautam Dasarathy, Jeff Schneider, Barnabás Póczos

    Abstract: We study a variant of the classical stochastic $K$-armed bandit where observing the outcome of each arm is expensive, but cheap approximations to this outcome are available. For example, in online advertising the performance of an ad can be approximated by displaying it for shorter time periods or to narrower audiences. We formalise this task as a multi-fidelity bandit, where, at each time step, t…

    Submitted 30 October, 2016; originally announced October 2016.

    Comments: To appear at NIPS 2016

  24. arXiv:1603.06288  [pdf, other]

    stat.ML cs.AI cs.LG

    Multi-fidelity Gaussian Process Bandit Optimisation

    Authors: Kirthevasan Kandasamy, Gautam Dasarathy, Junier B. Oliva, Jeff Schneider, Barnabas Poczos

    Abstract: In many scientific and engineering applications, we are tasked with the maximisation of an expensive-to-evaluate black-box function $f$. Traditional settings for this problem assume just the availability of this single function. However, in many cases, cheap approximations to $f$ may be obtainable. For example, the expensive real world behaviour of a robot can be approximated by a cheap computer s…

    Submitted 15 March, 2019; v1 submitted 20 March, 2016; originally announced March 2016.

    Comments: Preliminary version appeared at NIPS 2016

  25. arXiv:1602.00354  [pdf, other]

    stat.ML cs.IT cs.LG math.ST

    Active Learning Algorithms for Graphical Model Selection

    Authors: Gautam Dasarathy, Aarti Singh, Maria-Florina Balcan, Jong Hyuk Park

    Abstract: The problem of learning the structure of a high dimensional graphical model from data has received considerable attention in recent years. In many applications such as sensor networks and proteomics it is often expensive to obtain samples from all the variables involved simultaneously. For instance, this might involve the synchronization of a large number of sensors or the tagging of a large numbe…

    Submitted 7 April, 2016; v1 submitted 31 January, 2016; originally announced February 2016.

    Comments: 26 pages, 3 figures. Preliminary version to appear in AI & Statistics 2016

  26. arXiv:1506.08760  [pdf, other]

    cs.LG stat.ML

    S2: An Efficient Graph Based Active Learning Algorithm with Application to Nonparametric Classification

    Authors: Gautam Dasarathy, Robert Nowak, Xiaojin Zhu

    Abstract: This paper investigates the problem of active learning for binary label prediction on a graph. We introduce a simple and label-efficient algorithm called S2 for this task. At each step, S2 selects the vertex to be labeled based on the structure of the graph and all previously gathered labels. Specifically, S2 queries for the label of the vertex that bisects the *shortest shortest* path between any…

    Submitted 29 June, 2015; originally announced June 2015.

    Comments: A version of this paper appears in the Conference on Learning Theory (COLT) 2015

  27. arXiv:1404.7055  [pdf, other]

    q-bio.PE cs.CE cs.DS math.PR math.ST stat.ML

    Data Requirement for Phylogenetic Inference from Multiple Loci: A New Distance Method

    Authors: Gautam Dasarathy, Robert Nowak, Sebastien Roch

    Abstract: We consider the problem of estimating the evolutionary history of a set of species (phylogeny or species tree) from several genes. It is known that the evolutionary history of individual genes (gene trees) might be topologically distinct from each other and from the underlying species tree, possibly confounding phylogenetic analysis. A further complication in practice is that one has to estimate g…

    Submitted 30 June, 2014; v1 submitted 28 April, 2014; originally announced April 2014.

    Comments: 19 pages, 2 figures. Preliminary version to appear in IEEE ISIT 2014. Added acknowledgements and made the proof of the "equality" part of Theorem 3 explicit in Appendix C

  28. arXiv:1303.6544  [pdf, other]

    cs.IT math.OC

    Sketching Sparse Matrices

    Authors: Gautam Dasarathy, Parikshit Shah, Badri Narayan Bhaskar, Robert Nowak

    Abstract: This paper considers the problem of recovering an unknown sparse $p\times p$ matrix $X$ from an $m\times m$ matrix $Y=AXB^T$, where $A$ and $B$ are known $m \times p$ matrices with $m \ll p$. The main result shows that there exist constructions of the "sketching" matrices $A$ and $B$ so that even if $X$ has $O(p)$ non-zeros, it can be recovered exactly and efficiently using a convex program as long as these non-zeros ar…

    Submitted 26 March, 2013; originally announced March 2013.

  29. arXiv:1102.3887  [pdf, ps, other]

    cs.IT cs.LG stat.ML

    Active Clustering: Robust and Efficient Hierarchical Clustering using Adaptively Selected Similarities

    Authors: Brian Eriksson, Gautam Dasarathy, Aarti Singh, Robert Nowak

    Abstract: Hierarchical clustering based on pairwise similarities is a common tool used in a broad range of scientific applications. However, in many problems it may be expensive to obtain or compute similarities between the items to be clustered. This paper investigates the hierarchical clustering of N items based on a small subset of pairwise similarities, significantly less than the complete set of N(N-1)…

    Submitted 18 February, 2011; originally announced February 2011.