Showing 1–5 of 5 results for author: Guskin, S

  1. arXiv:2210.17114  [pdf, other]

    cs.CL

    QuaLA-MiniLM: a Quantized Length Adaptive MiniLM

    Authors: Shira Guskin, Moshe Wasserblat, Chang Wang, Haihao Shen

    Abstract: Limited computational budgets often prevent transformers from being used in production and from having their high accuracy utilized. A knowledge distillation approach addresses the computational efficiency by self-distilling BERT into a smaller transformer representation having fewer layers and smaller internal embedding. However, the performance of these models drops as we reduce the number of la…

    Submitted 10 May, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: In this version we updated the reference to the source code in the abstract. arXiv admin note: text overlap with arXiv:2111.09645

  2. arXiv:2111.09645  [pdf, other]

    cs.CL cs.LG

    Dynamic-TinyBERT: Boost TinyBERT's Inference Efficiency by Dynamic Sequence Length

    Authors: Shira Guskin, Moshe Wasserblat, Ke Ding, Gyuwan Kim

    Abstract: Limited computational budgets often prevent transformers from being used in production and from having their high accuracy utilized. TinyBERT addresses the computational efficiency by self-distilling BERT into a smaller transformer representation having fewer layers and smaller internal embedding. However, TinyBERT's performance drops when we reduce the number of layers by 50%, and drops even more…

    Submitted 18 November, 2021; originally announced November 2021.

    Comments: ENLSP NeurIPS Workshop 2021, 7 pages

  3. arXiv:1910.06294  [pdf, other]

    cs.CL cs.LG

    Training Compact Models for Low Resource Entity Tagging using Pre-trained Language Models

    Authors: Peter Izsak, Shira Guskin, Moshe Wasserblat

    Abstract: Training models on low-resource named entity recognition tasks has been shown to be a challenge, especially in industrial applications where deploying updated models is a continuous effort and crucial for business operations. In such cases there is often an abundance of unlabeled data, while labeled data is scarce or unavailable. Pre-trained language models trained to extract contextual features f…

    Submitted 17 October, 2019; v1 submitted 14 October, 2019; originally announced October 2019.

    Comments: Accepted to the 5th Workshop on Energy Efficient Machine Learning and Cognitive Computing - NeurIPS 2019

  4. arXiv:1808.08953  [pdf, other]

    cs.AI cs.CL

    Term Set Expansion based NLP Architect by Intel AI Lab

    Authors: Jonathan Mamou, Oren Pereg, Moshe Wasserblat, Alon Eirew, Yael Green, Shira Guskin, Peter Izsak, Daniel Korat

    Abstract: We present SetExpander, a corpus-based system for expanding a seed set of terms into a more complete set of terms that belong to the same semantic class. SetExpander implements an iterative end-to-end workflow. It enables users to easily select a seed set of terms, expand it, view the expanded set, validate it, re-expand the validated set and store it, thus simplifying the extraction of domain-spec…

    Submitted 15 October, 2018; v1 submitted 27 August, 2018; originally announced August 2018.

    Comments: EMNLP 2018 System Demonstrations. arXiv admin note: substantial text overlap with arXiv:1807.10104

  5. arXiv:1807.10104  [pdf, other]

    cs.AI cs.CL

    Term Set Expansion based on Multi-Context Term Embeddings: an End-to-end Workflow

    Authors: Jonathan Mamou, Oren Pereg, Moshe Wasserblat, Ido Dagan, Yoav Goldberg, Alon Eirew, Yael Green, Shira Guskin, Peter Izsak, Daniel Korat

    Abstract: We present SetExpander, a corpus-based system for expanding a seed set of terms into a more complete set of terms that belong to the same semantic class. SetExpander implements an iterative end-to-end workflow for term set expansion. It enables users to easily select a seed set of terms, expand it, view the expanded set, validate it, re-expand the validated set and store it, thus simplifying the e…

    Submitted 26 July, 2018; originally announced July 2018.

    Comments: COLING 2018 System Demonstration paper

    MSC Class: 68T50; ACM Class: I.2.7