Skip to main content

Showing 1–2 of 2 results for author: Shiri, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.06493  [pdf, other

    cs.CL

    L3 Ensembles: Lifelong Learning Approach for Ensemble of Foundational Language Models

    Authors: Aidin Shiri, Kaushik Roy, Amit Sheth, Manas Gaur

    Abstract: Fine-tuning pre-trained foundational language models (FLM) for specific tasks is often impractical, especially for resource-constrained devices. This necessitates the development of a Lifelong Learning (L3) framework that continuously adapts to a stream of Natural Language Processing (NLP) tasks efficiently. We propose an approach that focuses on extracting meaningful representations from unseen d… ▽ More

    Submitted 11 November, 2023; originally announced November 2023.

  2. arXiv:2308.12272  [pdf, other

    cs.CL cs.AI

    Simple is Better and Large is Not Enough: Towards Ensembling of Foundational Language Models

    Authors: Nancy Tyagi, Aidin Shiri, Surjodeep Sarkar, Abhishek Kumar Umrawal, Manas Gaur

    Abstract: Foundational Language Models (FLMs) have advanced natural language processing (NLP) research. Current researchers are develo** larger FLMs (e.g., XLNet, T5) to enable contextualized language representation, classification, and generation. While develo** larger FLMs has been of significant advantage, it is also a liability concerning hallucination and predictive uncertainty. Fundamentally, larg… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted at the 10th Mid-Atlantic Student Colloquium on Speech, Language and Learning (MASC-SLL 2023)