Skip to main content

Showing 1–4 of 4 results for author: Palit, V

.
  1. arXiv:2406.16320  [pdf, other

    cs.CL

    What Do VLMs NOTICE? A Mechanistic Interpretability Pipeline for Noise-free Text-Image Corruption and Evaluation

    Authors: Michal Golovanevsky, William Rudman, Vedant Palit, Ritambhara Singh, Carsten Eickhoff

    Abstract: Vision-Language Models (VLMs) have gained community-spanning prominence due to their ability to integrate visual and textual inputs to perform complex tasks. Despite their success, the internal decision-making processes of these models remain opaque, posing challenges in high-stakes applications. To address this, we introduce NOTICE, the first Noise-free Text-Image Corruption and Evaluation pipeli… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2406.12058  [pdf, other

    cs.AI cs.CL

    WellDunn: On the Robustness and Explainability of Language Models and Large Language Models in Identifying Wellness Dimensions

    Authors: Seyedali Mohammadi, Edward Raff, **endra Malekar, Vedant Palit, Francis Ferraro, Manas Gaur

    Abstract: Language Models (LMs) are being proposed for mental health applications where the heightened risk of adverse outcomes means predictive performance may not be a sufficient litmus test of a model's utility in clinical practice. A model that can be trusted for practice should have a correspondence between explanation and clinical determination, yet no prior research has examined the attention fidelit… ▽ More

    Submitted 28 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 26 pages, including reference and appendix sections, 8 figures, and 16 tables

  3. arXiv:2308.14179  [pdf, other

    cs.CL cs.AI cs.CV

    Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP

    Authors: Vedant Palit, Rohan Pandey, Aryaman Arora, Paul Pu Liang

    Abstract: Mechanistic interpretability seeks to understand the neural mechanisms that enable specific behaviors in Large Language Models (LLMs) by leveraging causality-based methods. While these approaches have identified neural circuits that copy spans of text, capture factual knowledge, and more, they remain unusable for multimodal models since adapting these tools to the vision-language domain requires c… ▽ More

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: Final version for 5th Workshop on Closing the Loop Between Vision and Language (CLVL) @ ICCV 2023. 4 pages, 5 figures

  4. arXiv:2305.04989  [pdf, other

    cs.CL cs.AI

    Knowledge Graph Guided Semantic Evaluation of Language Models For User Trust

    Authors: Kaushik Roy, Tarun Garg, Vedant Palit, Yuxin Zi, Vignesh Narayanan, Amit Sheth

    Abstract: A fundamental question in natural language processing is - what kind of language structure and semantics is the language model capturing? Graph formats such as knowledge graphs are easy to evaluate as they explicitly express language semantics and structure. This study evaluates the semantics encoded in the self-attention transformers by leveraging explicit knowledge graph structures. We propose n… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.