Skip to main content

Showing 1–6 of 6 results for author: Sadde, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2204.12130  [pdf, other

    cs.CL

    LM-Debugger: An Interactive Tool for Inspection and Intervention in Transformer-Based Language Models

    Authors: Mor Geva, Avi Caciularu, Guy Dar, Paul Roit, Shoval Sadde, Micah Shlain, Bar Tamir, Yoav Goldberg

    Abstract: The opaque nature and unexplained behavior of transformer-based language models (LMs) have spurred a wide interest in interpreting their predictions. However, current interpretation methods mostly focus on probing models from outside, executing behavioral tests, and analyzing salience input features, while the internal prediction construction process is largely not understood. In this work, we int… ▽ More

    Submitted 12 October, 2022; v1 submitted 26 April, 2022; originally announced April 2022.

    Comments: EMNLP 2022 System Demonstrations

  2. arXiv:2110.07681  [pdf, other

    cs.CL

    Large Scale Substitution-based Word Sense Induction

    Authors: Matan Eyal, Shoval Sadde, Hillel Taub-Tabib, Yoav Goldberg

    Abstract: We present a word-sense induction method based on pre-trained masked language models (MLMs), which can cheaply scale to large vocabularies and large corpora. The result is a corpus which is sense-tagged according to a corpus-derived sense inventory and where each sense is associated with indicative words. Evaluation on English Wikipedia that was sense-tagged using our method shows that both the in… ▽ More

    Submitted 21 March, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

    Comments: ACL 2022

  3. arXiv:2106.08037  [pdf, other

    cs.CL

    The Possible, the Plausible, and the Desirable: Event-Based Modality Detection for Language Processing

    Authors: Valentina Pyatkin, Shoval Sadde, Aynat Rubinstein, Paul Portner, Reut Tsarfaty

    Abstract: Modality is the linguistic ability to describe events with added information such as how desirable, plausible, or feasible they are. Modality is important for many NLP downstream tasks such as the detection of hedging, uncertainty, speculation, and more. Previous studies that address modality detection in NLP often restrict modal expressions to a closed syntactic class, and the modal sense labels… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: ACL 2021

  4. arXiv:2006.04148  [pdf, other

    cs.CL cs.IR

    Interactive Extractive Search over Biomedical Corpora

    Authors: Hillel Taub-Tabib, Micah Shlain, Shoval Sadde, Dan Lahav, Matan Eyal, Yaara Cohen, Yoav Goldberg

    Abstract: We present a system that allows life-science researchers to search a linguistically annotated corpus of scientific texts using patterns over dependency graphs, as well as using patterns over token sequences and a powerful variant of boolean keyword queries. In contrast to previous attempts to dependency-based search, we introduce a light-weight query language that does not require the user to know… ▽ More

    Submitted 7 June, 2020; originally announced June 2020.

  5. arXiv:2006.03010  [pdf, other

    cs.CL

    Syntactic Search by Example

    Authors: Micah Shlain, Hillel Taub-Tabib, Shoval Sadde, Yoav Goldberg

    Abstract: We present a system that allows a user to search a large linguistically annotated corpus using syntactic patterns over dependency graphs. In contrast to previous attempts to this effect, we introduce a light-weight query language that does not require the user to know the details of the underlying syntactic representations, and instead to query the corpus by providing an example sentence coupled w… ▽ More

    Submitted 4 June, 2020; originally announced June 2020.

  6. arXiv:1908.05453  [pdf, other

    cs.CL

    What's Wrong with Hebrew NLP? And How to Make it Right

    Authors: Reut Tsarfaty, Amit Seker, Shoval Sadde, Stav Klein

    Abstract: For languages with simple morphology, such as English, automatic annotation pipelines such as spaCy or Stanford's CoreNLP successfully serve projects in academia and the industry. For many morphologically-rich languages (MRLs), similar pipelines show sub-optimal performance that limits their applicability for text analysis in research and the industry.The sub-optimal performance is mainly due to e… ▽ More

    Submitted 15 August, 2019; originally announced August 2019.