Skip to main content

Showing 1–2 of 2 results for author: Vergara-Browne, T

.
  1. arXiv:2406.12618  [pdf, other

    cs.CL

    From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP

    Authors: Marius Mosbach, Vagrant Gautam, Tomás Vergara-Browne, Dietrich Klakow, Mor Geva

    Abstract: Interpretability and analysis (IA) research is a growing subfield within NLP with the goal of develo** a deeper understanding of the behavior or inner workings of NLP systems and methods. Despite growing interest in the subfield, a commonly voiced criticism is that it lacks actionable insights and therefore has little impact on NLP. In this paper, we seek to quantify the impact of IA research on… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2404.03147  [pdf, other

    cs.LG cs.AI

    Eigenpruning: an Interpretability-Inspired PEFT Method

    Authors: Tomás Vergara-Browne, Álvaro Soto, Akiko Aizawa

    Abstract: We introduce eigenpruning, a method that removes singular values from weight matrices in an LLM to improve its performance in a particular task. This method is inspired by interpretability methods designed to automatically find subnetworks of a model which solve a specific task. In our tests, the pruned model outperforms the original model by a large margin, while only requiring minimal computatio… ▽ More

    Submitted 20 June, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

    Comments: Extended abstract accepted to LatinX at NAACL 2024