Skip to main content

Showing 1–7 of 7 results for author: Sulem, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2111.01243  [pdf, other

    cs.CL cs.AI cs.LG

    Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey

    Authors: Bonan Min, Hayley Ross, Elior Sulem, Amir Pouran Ben Veyseh, Thien Huu Nguyen, Oscar Sainz, Eneko Agirre, Ilana Heinz, Dan Roth

    Abstract: Large, pre-trained transformer-based language models such as BERT have drastically changed the Natural Language Processing (NLP) field. We present a survey of recent work that uses these large language models to solve NLP tasks via pre-training then fine-tuning, prompting, or text generation approaches. We also present approaches that use pre-trained language models to generate data for training a… ▽ More

    Submitted 1 November, 2021; originally announced November 2021.

  2. arXiv:1905.05543  [pdf, other

    cs.CL

    The Language of Legal and Illegal Activity on the Darknet

    Authors: Leshem Choshen, Dan Eldad, Daniel Hershcovich, Elior Sulem, Omri Abend

    Abstract: The non-indexed parts of the Internet (the Darknet) have become a haven for both legal and illegal anonymous activity. Given the magnitude of these networks, scalably monitoring their activity necessarily relies on automated tools, and notably on NLP tools. However, little is known about what characteristics texts communicated through the Darknet have, and how well off-the-shelf NLP tools do on th… ▽ More

    Submitted 4 June, 2019; v1 submitted 14 May, 2019; originally announced May 2019.

    Comments: ACL 2019 camera ready; code in https://github.com/huji-nlp/cyber

  3. arXiv:1903.02953  [pdf, other

    cs.CL

    SemEval-2019 Task 1: Cross-lingual Semantic Parsing with UCCA

    Authors: Daniel Hershcovich, Zohar Aizenbud, Leshem Choshen, Elior Sulem, Ari Rappoport, Omri Abend

    Abstract: We present the SemEval 2019 shared task on UCCA parsing in English, German and French, and discuss the participating systems and results. UCCA is a cross-linguistically applicable framework for semantic representation, which builds on extensive typological work and supports rapid annotation. UCCA poses a challenge for existing parsing techniques, as it exhibits reentrancy (resulting in DAG structu… ▽ More

    Submitted 11 June, 2020; v1 submitted 6 March, 2019; originally announced March 2019.

    Comments: SemEval 2019 Shared task. arXiv admin note: substantial text overlap with arXiv:1805.12386

  4. arXiv:1810.05995  [pdf, other

    cs.CL

    BLEU is Not Suitable for the Evaluation of Text Simplification

    Authors: Elior Sulem, Omri Abend, Ari Rappoport

    Abstract: BLEU is widely considered to be an informative metric for text-to-text generation, including Text Simplification (TS). TS includes both lexical and structural aspects. In this paper we show that BLEU is not suitable for the evaluation of sentence splitting, the major structural simplification operation. We manually compiled a sentence splitting gold standard corpus containing multiple structural p… ▽ More

    Submitted 14 October, 2018; originally announced October 2018.

    Comments: Accepted to EMNLP 2018 (Short papers)

  5. arXiv:1810.05104  [pdf, ps, other

    cs.CL

    Simple and Effective Text Simplification Using Semantic and Neural Methods

    Authors: Elior Sulem, Omri Abend, Ari Rappoport

    Abstract: Sentence splitting is a major simplification operator. Here we present a simple and efficient splitting algorithm based on an automatic semantic parser. After splitting, the text is amenable for further fine-tuned simplification operations. In particular, we show that neural Machine Translation can be effectively used in this situation. Previous application of Machine Translation for simplificatio… ▽ More

    Submitted 11 October, 2018; originally announced October 2018.

    Journal ref: Proc. of ACL 2018

  6. arXiv:1810.05022  [pdf, ps, other

    cs.CL

    Semantic Structural Evaluation for Text Simplification

    Authors: Elior Sulem, Omri Abend, Ari Rappoport

    Abstract: Current measures for evaluating text simplification systems focus on evaluating lexical text aspects, neglecting its structural aspects. In this paper we propose the first measure to address structural aspects of text simplification, called SAMSA. It leverages recent advances in semantic parsing to assess simplification quality by decomposing the input based on its semantic structure and comparing… ▽ More

    Submitted 11 October, 2018; originally announced October 2018.

    Journal ref: Proc. of NAACL 2018

  7. arXiv:1805.12386   

    cs.CL

    SemEval 2019 Shared Task: Cross-lingual Semantic Parsing with UCCA - Call for Participation

    Authors: Daniel Hershcovich, Leshem Choshen, Elior Sulem, Zohar Aizenbud, Ari Rappoport, Omri Abend

    Abstract: We announce a shared task on UCCA parsing in English, German and French, and call for participants to submit their systems. UCCA is a cross-linguistically applicable framework for semantic representation, which builds on extensive typological work and supports rapid annotation. UCCA poses a challenge for existing parsing techniques, as it exhibits reentrancy (resulting in DAG structures), disconti… ▽ More

    Submitted 3 February, 2021; v1 submitted 31 May, 2018; originally announced May 2018.

    Comments: Not an actual paper. The shared task summary is at arXiv:1903.02953