Skip to main content

Showing 1–23 of 23 results for author: Nanda, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.12957  [pdf, other

    cs.CL cs.LG

    Towards Reliable Latent Knowledge Estimation in LLMs: In-Context Learning vs. Prompting Based Factual Knowledge Extraction

    Authors: Qinyuan Wu, Mohammad Aflah Khan, Soumi Das, Vedant Nanda, Bishwamittra Ghosh, Camila Kolling, Till Speicher, Laurent Bindschaedler, Krishna P. Gummadi, Evimaria Terzi

    Abstract: We propose an approach for estimating the latent knowledge embedded inside large language models (LLMs). We leverage the in-context learning (ICL) abilities of LLMs to estimate the extent to which an LLM knows the facts stored in a knowledge base. Our knowledge estimator avoids reliability concerns with previous prompting-based methods, is both conceptually simpler and easier to apply, and we demo… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  2. arXiv:2404.06993  [pdf, other

    stat.ML cs.LG math.CO math.RT math.ST q-bio.QM

    Quiver Laplacians and Feature Selection

    Authors: Otto Sumray, Heather A. Harrington, Vidit Nanda

    Abstract: The challenge of selecting the most relevant features of a given dataset arises ubiquitously in data analysis and dimensionality reduction. However, features found to be of high importance for the entire dataset may not be relevant to subsets of interest, and vice versa. Given a feature selector and a fixed decomposition of the data into subsets, we describe a method for identifying selected featu… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 40 pages, 7 figures

    MSC Class: 16G20; 05C50; 62P05; 62H25

  3. arXiv:2311.04171  [pdf, other

    cs.LG cs.AI math.AT math.DG math.ST

    HADES: Fast Singularity Detection with Local Measure Comparison

    Authors: Uzu Lim, Harald Oberhauser, Vidit Nanda

    Abstract: We introduce Hades, an unsupervised algorithm to detect singularities in data. This algorithm employs a kernel goodness-of-fit test, and as a consequence it is much faster and far more scaleable than the existing topology-based alternatives. Using tools from differential geometry and optimal transport theory, we prove that Hades correctly detects singularities with high probability when the data s… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    MSC Class: 55N31; 32S50

  4. arXiv:2307.06006  [pdf, other

    cs.CV cs.LG

    What Happens During Finetuning of Vision Transformers: An Invariance Based Investigation

    Authors: Gabriele Merlin, Vedant Nanda, Ruchit Rawal, Mariya Toneva

    Abstract: The pretrain-finetune paradigm usually improves downstream performance over training a model from scratch on the same task, becoming commonplace across many areas of machine learning. While pretraining is empirically observed to be beneficial for a range of tasks, there is not a clear understanding yet of the reasons for this effect. In this work, we examine the relationship between pretrained vis… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Accepted to CoLLAs 2023

  5. arXiv:2306.00183  [pdf, other

    cs.LG cs.AI

    Diffused Redundancy in Pre-trained Representations

    Authors: Vedant Nanda, Till Speicher, John P. Dickerson, Soheil Feizi, Krishna P. Gummadi, Adrian Weller

    Abstract: Representations learned by pre-training a neural network on a large dataset are increasingly used successfully to perform a variety of downstream tasks. In this work, we take a closer look at how features are encoded in such pre-trained representations. We find that learned representations in a given layer exhibit a degree of diffuse redundancy, ie, any randomly chosen subset of neurons in the lay… ▽ More

    Submitted 14 November, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  6. arXiv:2305.19294  [pdf, other

    cs.LG

    Pointwise Representational Similarity

    Authors: Camila Kolling, Till Speicher, Vedant Nanda, Mariya Toneva, Krishna P. Gummadi

    Abstract: With the increasing reliance on deep neural networks, it is important to develop ways to better understand their learned representations. Representation similarity measures have emerged as a popular tool for examining learned representations However, existing measures only provide aggregate estimates of similarity at a global level, i.e. over a set of representations for N input examples. As such,… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  7. arXiv:2206.11939  [pdf, other

    cs.LG cs.AI

    Measuring Representational Robustness of Neural Networks Through Shared Invariances

    Authors: Vedant Nanda, Till Speicher, Camila Kolling, John P. Dickerson, Krishna P. Gummadi, Adrian Weller

    Abstract: A major challenge in studying robustness in deep learning is defining the set of ``meaningless'' perturbations to which a given Neural Network (NN) should be invariant. Most work on robustness implicitly uses a human as the reference model to define such perturbations. Our work offers a new view on robustness by using another reference NN to define the set of perturbations a given NN should be inv… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: Accepted for oral presentation at ICML 2022

  8. arXiv:2206.06320  [pdf, other

    cs.CL cs.AI cs.LG cs.SI q-fin.ST

    Cryptocurrency Bubble Detection: A New Stock Market Dataset, Financial Task & Hyperbolic Models

    Authors: Ramit Sawhney, Shivam Agarwal, Vivek Mittal, Paolo Rosso, Vikram Nanda, Sudheer Chava

    Abstract: The rapid spread of information over social media influences quantitative trading and investments. The growing popularity of speculative trading of highly volatile assets such as cryptocurrencies and meme stocks presents a fresh challenge in the financial realm. Investigating such "bubbles" - periods of sudden anomalous behavior of markets are critical in better understanding investor behavior and… ▽ More

    Submitted 11 May, 2022; originally announced June 2022.

    Comments: Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

  9. arXiv:2201.06021  [pdf, other

    cs.GT cs.AI cs.DS

    Rawlsian Fairness in Online Bipartite Matching: Two-sided, Group, and Individual

    Authors: Seyed A. Esmaeili, Sharmila Duppala, Davidson Cheng, Vedant Nanda, Aravind Srinivasan, John P. Dickerson

    Abstract: Online bipartite-matching platforms are ubiquitous and find applications in important areas such as crowdsourcing and ridesharing. In the most general form, the platform consists of three entities: two sides to be matched and a platform operator that decides the matching. The design of algorithms for such platforms has traditionally focused on the operator's (expected) profit. Since fairness has b… ▽ More

    Submitted 4 June, 2023; v1 submitted 16 January, 2022; originally announced January 2022.

    Comments: Accepted to AAAI 2023

  10. arXiv:2111.14726  [pdf, other

    cs.CV cs.AI cs.LG

    Do Invariances in Deep Neural Networks Align with Human Perception?

    Authors: Vedant Nanda, Ayan Majumdar, Camila Kolling, John P. Dickerson, Krishna P. Gummadi, Bradley C. Love, Adrian Weller

    Abstract: An evaluation criterion for safe and trustworthy deep learning is how well the invariances captured by representations of deep neural networks (DNNs) are shared with humans. We identify challenges in measuring these invariances. Prior works used gradient-based methods to generate identically represented inputs (IRIs), ie, inputs which have identical representations (on a given layer) of a neural n… ▽ More

    Submitted 2 December, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: AAAI 2023

  11. arXiv:2110.15182  [pdf, other

    cs.LG math.AT

    Dist2Cycle: A Simplicial Neural Network for Homology Localization

    Authors: Alexandros Dimitrios Keros, Vidit Nanda, Kartic Subr

    Abstract: Simplicial complexes can be viewed as high dimensional generalizations of graphs that explicitly encode multi-way ordered relations between vertices at different resolutions, all at once. This concept is central towards detection of higher dimensional topological features of data, features to which graphs, encoding only pairwise relationships, remain oblivious. While attempts have been made to ext… ▽ More

    Submitted 3 July, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

    Comments: 9 pages, 5 figures

    MSC Class: 55N31 ACM Class: I.2

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence. 36, 7 (Jun. 2022), 7133-7142

  12. arXiv:2110.06357  [pdf, other

    math.ST cs.LG

    Tangent Space and Dimension Estimation with the Wasserstein Distance

    Authors: Uzu Lim, Harald Oberhauser, Vidit Nanda

    Abstract: Consider a set of points sampled independently near a smooth compact submanifold of Euclidean space. We provide mathematically rigorous bounds on the number of sample points required to estimate both the dimension and the tangent spaces of that manifold with high confidence. The algorithm for this estimation is Local PCA, a local version of principal component analysis. Our results accommodate for… ▽ More

    Submitted 25 September, 2023; v1 submitted 12 October, 2021; originally announced October 2021.

    Comments: Main theorems rewritten. Introduction is written more compactly

  13. arXiv:2106.14555  [pdf, other

    math.AG cs.SC math.AC math.AT

    Conormal Spaces and Whitney Stratifications

    Authors: Martin Helmer, Vidit Nanda

    Abstract: We describe a new algorithm for computing Whitney stratifications of complex projective varieties. The main ingredients are (a) an algebraic criterion, due to Lê and Teissier, which reformulates Whitney regularity in terms of conormal spaces and maps, and (b) a new interpretation of this conormal criterion via primary decomposition, which can be practically implemented on a computer. We show that… ▽ More

    Submitted 26 December, 2022; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: There is an error in the published version of the article (Found Comput Math, 2022) which has been fixed in this update. Section 3 is entirely new, but the downstream results Sections 4-6 remain largely the same. We have also updated the Runtimes and Complexity estimates in Section 7. The def. of the integral closure of an ideal has also been corrected

    MSC Class: 14B05; 14Q20; 32S60; 32S15

  14. arXiv:2106.00639  [pdf, other

    eess.AS cs.SD eess.SP

    Multi-modal Point-of-Care Diagnostics for COVID-19 Based On Acoustics and Symptoms

    Authors: Srikanth Raj Chetupalli, Prashant Krishnan, Neeraj Sharma, Ananya Muguli, Rohit Kumar, Viral Nanda, Lancelot Mark Pinto, Prasanta Kumar Ghosh, Sriram Ganapathy

    Abstract: The research direction of identifying acoustic bio-markers of respiratory diseases has received renewed interest following the onset of COVID-19 pandemic. In this paper, we design an approach to COVID-19 diagnostic using crowd-sourced multi-modal data. The data resource, consisting of acoustic signals like cough, breathing, and speech signals, along with the data of symptoms, are recorded using a… ▽ More

    Submitted 5 June, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: The Manuscript is submitted to IEEE-EMBS Journal of Biomedical and Health Informatics on June 1, 2021

  15. arXiv:2103.09148  [pdf, other

    eess.AS cs.SD

    DiCOVA Challenge: Dataset, task, and baseline system for COVID-19 diagnosis using acoustics

    Authors: Ananya Muguli, Lancelot Pinto, Nirmala R., Neeraj Sharma, Prashant Krishnan, Prasanta Kumar Ghosh, Rohit Kumar, Shrirama Bhat, Srikanth Raj Chetupalli, Sriram Ganapathy, Shreyas Ramoji, Viral Nanda

    Abstract: The DiCOVA challenge aims at accelerating research in diagnosing COVID-19 using acoustics (DiCOVA), a topic at the intersection of speech and audio processing, respiratory health diagnosis, and machine learning. This challenge is an open call for researchers to analyze a dataset of sound recordings collected from COVID-19 infected and non-COVID-19 individuals for a two-class classification. These… ▽ More

    Submitted 17 June, 2021; v1 submitted 16 March, 2021; originally announced March 2021.

    Comments: To appear in Proceedings of Interspeech, 2021

  16. arXiv:2102.06764  [pdf, other

    cs.LG cs.AI cs.CY

    Technical Challenges for Training Fair Neural Networks

    Authors: Valeriia Cherepanova, Vedant Nanda, Micah Goldblum, John P. Dickerson, Tom Goldstein

    Abstract: As machine learning algorithms have been widely deployed across applications, many concerns have been raised over the fairness of their predictions, especially in high stakes settings (such as facial recognition and medical imaging). To respond to these concerns, the community has proposed and formalized various notions of fairness as well as methods for rectifying unfair behavior. While fairness… ▽ More

    Submitted 12 February, 2021; originally announced February 2021.

  17. GIS-Based Estimation of Seasonal Solar Energy Potential for Parking Lots and Roads

    Authors: Vishnu Mahesh Vivek Nanda, Laura Tateosian, Perver Baran

    Abstract: The amount of sun cast on roads and parking lots determines the charging opportunities for solar vehicles and impacts the efficiency of conventional vehicles. Estimates of solar energy potential on urban surfaces to assess parking and driving conditions need to account for the shadows cast by surrounding trees and buildings. However, though existing GIS tools can calculate solar potential on surfa… ▽ More

    Submitted 24 December, 2020; originally announced December 2020.

  18. arXiv:2007.00251  [pdf, other

    cs.AI cs.CY cs.LG

    Unifying Model Explainability and Robustness via Machine-Checkable Concepts

    Authors: Vedant Nanda, Till Speicher, John P. Dickerson, Krishna P. Gummadi, Muhammad Bilal Zafar

    Abstract: As deep neural networks (DNNs) get adopted in an ever-increasing number of applications, explainability has emerged as a crucial desideratum for these models. In many real-world tasks, one of the principal reasons for requiring explainability is to in turn assess prediction robustness, where predictions (i.e., class labels) that do not conform to their respective explanations (e.g., presence or ab… ▽ More

    Submitted 2 July, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: 22 pages, 12 figures, 11 tables

  19. arXiv:2006.12621  [pdf, other

    cs.LG cs.CY

    Fairness Through Robustness: Investigating Robustness Disparity in Deep Learning

    Authors: Vedant Nanda, Samuel Dooley, Sahil Singla, Soheil Feizi, John P. Dickerson

    Abstract: Deep neural networks (DNNs) are increasingly used in real-world applications (e.g. facial recognition). This has resulted in concerns about the fairness of decisions made by these models. Various notions and measures of fairness have been proposed to ensure that a decision-making system does not disproportionately harm (or benefit) particular subgroups of the population. In this paper, we argue th… ▽ More

    Submitted 21 January, 2021; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Accepted at ACM Conference on Fairness, Accountability, and Transparency (FAccT) 2021

  20. arXiv:1912.08388  [pdf, other

    cs.AI cs.CY

    Balancing the Tradeoff between Profit and Fairness in Rideshare Platforms During High-Demand Hours

    Authors: Vedant Nanda, Pan Xu, Karthik Abinav Sankararaman, John P. Dickerson, Aravind Srinivasan

    Abstract: Rideshare platforms, when assigning requests to drivers, tend to maximize profit for the system and/or minimize waiting time for riders. Such platforms can exacerbate biases that drivers may have over certain types of requests. We consider the case of peak hours when the demand for rides is more than the supply of drivers. Drivers are well aware of their advantage during the peak hours and can cho… ▽ More

    Submitted 6 September, 2020; v1 submitted 18 December, 2019; originally announced December 2019.

    Comments: 8 pages, 4 figures, Accepted at AAAI 2020 & AIES (Oral) 2020

  21. arXiv:1903.01209  [pdf, other

    cs.CY cs.AI

    On the Long-term Impact of Algorithmic Decision Policies: Effort Unfairness and Feature Segregation through Social Learning

    Authors: Hoda Heidari, Vedant Nanda, Krishna P. Gummadi

    Abstract: Most existing notions of algorithmic fairness are one-shot: they ensure some form of allocative equality at the time of decision making, but do not account for the adverse impact of the algorithmic decisions today on the long-term welfare and prosperity of certain segments of the population. We take a broader perspective on algorithmic fairness. We propose an effort-based measure of fairness and p… ▽ More

    Submitted 27 June, 2019; v1 submitted 4 March, 2019; originally announced March 2019.

  22. arXiv:1806.00381  [pdf, other

    stat.ML cs.LG math.PR math.ST

    Persistence paths and signature features in topological data analysis

    Authors: Ilya Chevyrev, Vidit Nanda, Harald Oberhauser

    Abstract: We introduce a new feature map for barcodes that arise in persistent homology computation. The main idea is to first realize each barcode as a path in a convenient vector space, and to then compute its path signature which takes values in the tensor algebra of that vector space. The composition of these two operations - barcode to path, path to tensor series - results in a feature map that has sev… ▽ More

    Submitted 12 December, 2018; v1 submitted 1 June, 2018; originally announced June 2018.

    Comments: Additional experiment and further details. To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence

    Journal ref: IEEE TPAMI (2020) Volume: 42, Issue: 1, pp. 192 - 202

  23. arXiv:1205.6990  [pdf, ps, other

    math.AG cs.CC

    The devil is in Asymmetries (Rough Version)

    Authors: Edinah K. Gnang, Vidit Nanda

    Abstract: We formally investigate some computational obstacles to tractability of computing the variety determined by K complex polynomials in N boolean variables. We show that using algebraic methods for solving combinatorial problems, the obstacles to tractability lies in the order of magnitude of asymmetries admitted by the given system of equations.

    Submitted 27 May, 2012; originally announced May 2012.