Skip to main content

Showing 1–5 of 5 results for author: Aghazadeh, E

.
  1. arXiv:2406.12824  [pdf, other

    cs.CL cs.AI

    From RAGs to rich parameters: Probing how language models utilize external knowledge over parametric information for factual queries

    Authors: Hitesh Wadhwa, Rahul Seetharaman, Somyaa Aggarwal, Reshmi Ghosh, Samyadeep Basu, Soundararajan Srinivasan, Wenlong Zhao, Shreyas Chaudhari, Ehsan Aghazadeh

    Abstract: Retrieval Augmented Generation (RAG) enriches the ability of language models to reason using external context to augment responses for a given user prompt. This approach has risen in popularity due to practical applications in various applications of language models in search, question/answering, and chat-bots. However, the exact nature of how this approach works isn't clearly understood. In this… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  2. arXiv:2306.02873  [pdf, other

    cs.CL

    DecompX: Explaining Transformers Decisions by Propagating Token Decomposition

    Authors: Ali Modarressi, Mohsen Fayyaz, Ehsan Aghazadeh, Yadollah Yaghoobzadeh, Mohammad Taher Pilehvar

    Abstract: An emerging solution for explaining Transformer-based models is to use vector-based analysis on how the representations are formed. However, providing a faithful vector-based explanation for a multi-layer model could be challenging in three aspects: (1) Incorporating all components into the analysis, (2) Aggregating the layer dynamics to determine the information flow and mixture throughout the en… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: Accepted to ACL 2023 (main conference)

  3. arXiv:2211.05610  [pdf, other

    cs.CL

    BERT on a Data Diet: Finding Important Examples by Gradient-Based Pruning

    Authors: Mohsen Fayyaz, Ehsan Aghazadeh, Ali Modarressi, Mohammad Taher Pilehvar, Yadollah Yaghoobzadeh, Samira Ebrahimi Kahou

    Abstract: Current pre-trained language models rely on large datasets for achieving state-of-the-art performance. However, past research has shown that not all examples in a dataset are equally important during training. In fact, it is sometimes possible to prune a considerable fraction of the training set while maintaining the test performance. Established on standard vision benchmarks, two gradient-based s… ▽ More

    Submitted 28 November, 2022; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: ENLSP @ NeurIPS2022

  4. arXiv:2203.14139  [pdf, other

    cs.CL

    Metaphors in Pre-Trained Language Models: Probing and Generalization Across Datasets and Languages

    Authors: Ehsan Aghazadeh, Mohsen Fayyaz, Yadollah Yaghoobzadeh

    Abstract: Human languages are full of metaphorical expressions. Metaphors help people understand the world by connecting new concepts and domains to more familiar ones. Large pre-trained language models (PLMs) are therefore assumed to encode metaphorical knowledge useful for NLP systems. In this paper, we investigate this hypothesis for PLMs, by probing metaphoricity information in their encodings, and by m… ▽ More

    Submitted 26 March, 2022; originally announced March 2022.

    Comments: Accepted to ACL 2022 (main conference)

  5. arXiv:2109.05958  [pdf, other

    cs.CL cs.AI

    Not All Models Localize Linguistic Knowledge in the Same Place: A Layer-wise Probing on BERToids' Representations

    Authors: Mohsen Fayyaz, Ehsan Aghazadeh, Ali Modarressi, Hosein Mohebbi, Mohammad Taher Pilehvar

    Abstract: Most of the recent works on probing representations have focused on BERT, with the presumption that the findings might be similar to the other models. In this work, we extend the probing studies to two other models in the family, namely ELECTRA and XLNet, showing that variations in the pre-training objectives or architectural choices can result in different behaviors in encoding linguistic informa… ▽ More

    Submitted 15 September, 2021; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: Accepted to BlackboxNLP Workshop at EMNLP 2021