Skip to main content

Showing 1–8 of 8 results for author: Speicher, T

.
  1. arXiv:2407.04325  [pdf, other

    cs.LG

    Understanding the Role of Invariance in Transfer Learning

    Authors: Till Speicher, Vedant Nanda, Krishna P. Gummadi

    Abstract: Transfer learning is a powerful technique for knowledge-sharing between different tasks. Recent work has found that the representations of models with certain invariances, such as to adversarial input perturbations, achieve higher performance on downstream tasks. These findings suggest that invariance may be an important property in the context of transfer learning. However, the relationship of in… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: Published at TMLR 2024

  2. arXiv:2404.12957  [pdf, other

    cs.CL cs.LG

    Towards Reliable Latent Knowledge Estimation in LLMs: In-Context Learning vs. Prompting Based Factual Knowledge Extraction

    Authors: Qinyuan Wu, Mohammad Aflah Khan, Soumi Das, Vedant Nanda, Bishwamittra Ghosh, Camila Kolling, Till Speicher, Laurent Bindschaedler, Krishna P. Gummadi, Evimaria Terzi

    Abstract: We propose an approach for estimating the latent knowledge embedded inside large language models (LLMs). We leverage the in-context learning (ICL) abilities of LLMs to estimate the extent to which an LLM knows the facts stored in a knowledge base. Our knowledge estimator avoids reliability concerns with previous prompting-based methods, is both conceptually simpler and easier to apply, and we demo… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  3. arXiv:2306.00183  [pdf, other

    cs.LG cs.AI

    Diffused Redundancy in Pre-trained Representations

    Authors: Vedant Nanda, Till Speicher, John P. Dickerson, Soheil Feizi, Krishna P. Gummadi, Adrian Weller

    Abstract: Representations learned by pre-training a neural network on a large dataset are increasingly used successfully to perform a variety of downstream tasks. In this work, we take a closer look at how features are encoded in such pre-trained representations. We find that learned representations in a given layer exhibit a degree of diffuse redundancy, ie, any randomly chosen subset of neurons in the lay… ▽ More

    Submitted 14 November, 2023; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: NeurIPS 2023

  4. arXiv:2305.19294  [pdf, other

    cs.LG

    Pointwise Representational Similarity

    Authors: Camila Kolling, Till Speicher, Vedant Nanda, Mariya Toneva, Krishna P. Gummadi

    Abstract: With the increasing reliance on deep neural networks, it is important to develop ways to better understand their learned representations. Representation similarity measures have emerged as a popular tool for examining learned representations However, existing measures only provide aggregate estimates of similarity at a global level, i.e. over a set of representations for N input examples. As such,… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  5. arXiv:2206.11939  [pdf, other

    cs.LG cs.AI

    Measuring Representational Robustness of Neural Networks Through Shared Invariances

    Authors: Vedant Nanda, Till Speicher, Camila Kolling, John P. Dickerson, Krishna P. Gummadi, Adrian Weller

    Abstract: A major challenge in studying robustness in deep learning is defining the set of ``meaningless'' perturbations to which a given Neural Network (NN) should be invariant. Most work on robustness implicitly uses a human as the reference model to define such perturbations. Our work offers a new view on robustness by using another reference NN to define the set of perturbations a given NN should be inv… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: Accepted for oral presentation at ICML 2022

  6. arXiv:2007.00251  [pdf, other

    cs.AI cs.CY cs.LG

    Unifying Model Explainability and Robustness via Machine-Checkable Concepts

    Authors: Vedant Nanda, Till Speicher, John P. Dickerson, Krishna P. Gummadi, Muhammad Bilal Zafar

    Abstract: As deep neural networks (DNNs) get adopted in an ever-increasing number of applications, explainability has emerged as a crucial desideratum for these models. In many real-world tasks, one of the principal reasons for requiring explainability is to in turn assess prediction robustness, where predictions (i.e., class labels) that do not conform to their respective explanations (e.g., presence or ab… ▽ More

    Submitted 2 July, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: 22 pages, 12 figures, 11 tables

  7. arXiv:1807.00787  [pdf, other

    cs.LG cs.CY stat.ML

    A Unified Approach to Quantifying Algorithmic Unfairness: Measuring Individual & Group Unfairness via Inequality Indices

    Authors: Till Speicher, Hoda Heidari, Nina Grgic-Hlaca, Krishna P. Gummadi, Adish Singla, Adrian Weller, Muhammad Bilal Zafar

    Abstract: Discrimination via algorithmic decision making has received considerable attention. Prior work largely focuses on defining conditions for fairness, but does not define satisfactory measures of algorithmic unfairness. In this paper, we focus on the following question: Given two unfair algorithms, how should we determine which of the two is more unfair? Our core idea is to use existing inequality in… ▽ More

    Submitted 2 July, 2018; originally announced July 2018.

    Comments: 12 pages 7 figures To be published in: KDD '18: The 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining Proceedings

  8. arXiv:1404.3377  [pdf, ps, other

    cs.CL

    A Generalized Language Model as the Combination of Skipped n-grams and Modified Kneser-Ney Smoothing

    Authors: Rene Pickhardt, Thomas Gottron, Martin Körner, Paul Georg Wagner, Till Speicher, Steffen Staab

    Abstract: We introduce a novel approach for building language models based on a systematic, recursive exploration of skip n-gram models which are interpolated using modified Kneser-Ney smoothing. Our approach generalizes language models as it contains the classical interpolation with lower order models as a special case. In this paper we motivate, formalize and present our approach. In an extensive empirica… ▽ More

    Submitted 13 April, 2014; originally announced April 2014.

    Comments: 13 pages, 2 figures, ACL 2014