
Showing 1–50 of 55 results for author: Abend, O

  1. arXiv:2405.14863  [pdf, other

    cs.CL cs.AI cs.LG

    A Nurse is Blue and Elephant is Rugby: Cross Domain Alignment in Large Language Models Reveal Human-like Patterns

    Authors: Asaf Yehudai, Taelin Karidi, Gabriel Stanovsky, Ariel Goldstein, Omri Abend

    Abstract: Cross-domain alignment refers to the task of mapping a concept from one domain to another. For example, "If a doctor were a color, what color would it be?". This seemingly peculiar task is designed to investigate how people represent concrete and abstract concepts through their mappings between categories and their reasoning processes over those mappings. In this paper, we adap… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: CogSci

  2. arXiv:2405.02650  [pdf, other

    cs.CL cs.AI

    Identifying Narrative Patterns and Outliers in Holocaust Testimonies Using Topic Modeling

    Authors: Maxim Ifergan, Renana Keydar, Omri Abend, Amit Pinchevski

    Abstract: The vast collection of Holocaust survivor testimonies presents invaluable historical insights but poses challenges for manual analysis. This paper leverages advanced Natural Language Processing (NLP) techniques to explore the USC Shoah Foundation Holocaust testimony corpus. By treating testimonies as structured question-and-answer sections, we apply topic modeling to identify key themes. We experi… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 9 pages, 7 figures, LREC-COLING 2024

  3. arXiv:2403.19887  [pdf, other

    cs.CL cs.LG

    Jamba: A Hybrid Transformer-Mamba Language Model

    Authors: Opher Lieber, Barak Lenz, Hofit Bata, Gal Cohen, Jhonathan Osin, Itay Dalmedigos, Erez Safahi, Shaked Meirom, Yonatan Belinkov, Shai Shalev-Shwartz, Omri Abend, Raz Alon, Tomer Asida, Amir Bergman, Roman Glozman, Michael Gokhman, Avashalom Manevich, Nir Ratner, Noam Rozen, Erez Shwartz, Mor Zusman, Yoav Shoham

    Abstract: We present Jamba, a new base large language model based on a novel hybrid Transformer-Mamba mixture-of-experts (MoE) architecture. Specifically, Jamba interleaves blocks of Transformer and Mamba layers, enjoying the benefits of both model families. MoE is added in some of these layers to increase model capacity while keeping active parameter usage manageable. This flexible architecture allows reso… ▽ More

    Submitted 3 July, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: Webpage: https://www.ai21.com/jamba

  4. arXiv:2311.12131  [pdf, other

    cs.CL

    Human Learning by Model Feedback: The Dynamics of Iterative Prompting with Midjourney

    Authors: Shachar Don-Yehiya, Leshem Choshen, Omri Abend

    Abstract: Generating images with a Text-to-Image model often requires multiple trials, where human users iteratively update their prompt based on feedback, namely the output image. Taking inspiration from cognitive work on reference games and dialogue alignment, this paper analyzes the dynamics of the user prompts along such iterations. We compile a dataset of iterative interactions of human users with Midj… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: EMNLP23

  5. arXiv:2310.13583  [pdf, other

    cs.CL cs.LG

    Improving Cross-Lingual Transfer through Subtree-Aware Word Reordering

    Authors: Ofir Arviv, Dmitry Nikolaev, Taelin Karidi, Omri Abend

    Abstract: Despite the impressive growth of the abilities of multilingual language models, such as XLM-R and mT5, it has been shown that they still face difficulties when tackling typologically-distant languages, particularly in the low-resource setting. One obstacle for effective cross-lingual transfer is variability in word-order patterns. It can be potentially mitigated via source- or target-side word reo… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP Findings 2023

  6. arXiv:2307.06908  [pdf, other

    cs.CL cs.AI

    Generating Benchmarks for Factuality Evaluation of Language Models

    Authors: Dor Muhlgay, Ori Ram, Inbal Magar, Yoav Levine, Nir Ratner, Yonatan Belinkov, Omri Abend, Kevin Leyton-Brown, Amnon Shashua, Yoav Shoham

    Abstract: Before deploying a language model (LM) within a given domain, it is important to measure its tendency to generate factually incorrect information in that domain. Existing methods for factuality evaluation of LLM generation focus on facts sampled from the LM itself, and thus do not control the set of evaluated facts and might under-represent domain specific or rare facts. We propose FACTOR: Factual… ▽ More

    Submitted 4 February, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

  7. arXiv:2305.14991  [pdf, other

    cs.CL cs.AI

    MuLER: Detailed and Scalable Reference-based Evaluation

    Authors: Taelin Karidi, Leshem Choshen, Gal Patel, Omri Abend

    Abstract: We propose a novel methodology (namely, MuLER) that transforms any reference-based evaluation metric for text generation, such as machine translation (MT) into a fine-grained analysis tool. Given a system and a metric, MuLER quantifies how much the chosen metric penalizes specific error types (e.g., errors in translating names of locations). MuLER thus enables a detailed error analysis which can l… ▽ More

    Submitted 29 November, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

  8. arXiv:2302.08464  [pdf, other

    cs.CL cs.AI cs.LG

    Evaluating and Improving the Coreference Capabilities of Machine Translation Models

    Authors: Asaf Yehudai, Arie Cattan, Omri Abend, Gabriel Stanovsky

    Abstract: Machine translation (MT) requires a wide range of linguistic capabilities, which current end-to-end models are expected to learn implicitly by observing aligned sentences in bilingual corpora. In this work, we ask: How well do MT models learn coreference resolution from implicit signal? To answer this question, we develop an evaluation methodology that derives coreference clusters from MT o… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: EACL paper

  9. arXiv:2302.04811  [pdf, other

    cs.CL

    A Large-Scale Multilingual Study of Visual Constraints on Linguistic Selection of Descriptions

    Authors: Uri Berger, Lea Frermann, Gabriel Stanovsky, Omri Abend

    Abstract: We present a large, multilingual study into how vision constrains linguistic choice, covering four languages and five linguistic properties, such as verb transitivity or use of numerals. We propose a novel method that leverages existing corpora of images with captions written by native speakers, and apply it to nine corpora, comprising 600k images and 3M captions. We study the relation between vis… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: Accepted to EACL 2023 Findings

  10. arXiv:2212.10947  [pdf, other

    cs.CL

    Parallel Context Windows for Large Language Models

    Authors: Nir Ratner, Yoav Levine, Yonatan Belinkov, Ori Ram, Inbal Magar, Omri Abend, Ehud Karpas, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham

    Abstract: When applied to processing long text, Large Language Models (LLMs) are limited by their context window. Existing efforts to address this limitation involve training specialized architectures, and cannot be easily applied to off-the-shelf LLMs. We present Parallel Context Windows (PCW), a method that alleviates the context window restriction for any off-the-shelf LLM without further training. The k… ▽ More

    Submitted 1 August, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023)

  11. arXiv:2211.08825  [pdf, other

    cs.CL cs.AI

    Cognitive Simplification Operations Improve Text Simplification

    Authors: Eytan Chamovitz, Omri Abend

    Abstract: Text Simplification (TS) is the task of converting a text into a form that is easier to read while maintaining the meaning of the original text. A sub-task of TS is Cognitive Simplification (CS), converting text to a form that is readily understood by people with cognitive disabilities without rendering it childish or simplistic. This sub-task has yet to be explored with neural methods in NLP, and… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 25 pages, 7 figures, 8 tables, uses emnlp2022.sty, to be published in CoNLL 2022

  12. arXiv:2211.05655  [pdf, other

    cs.CL cs.AI cs.LG

    DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering

    Authors: Ella Neeman, Roee Aharoni, Or Honovich, Leshem Choshen, Idan Szpektor, Omri Abend

    Abstract: Question answering models commonly have access to two sources of "knowledge" during inference time: (1) parametric knowledge - the factual knowledge encoded in the model weights, and (2) contextual knowledge - external knowledge (e.g., a Wikipedia passage) given to the model to generate a grounded answer. Having these two sources of knowledge entangled together is a core issue for generative QA mo… ▽ More

    Submitted 10 November, 2022; originally announced November 2022.

    Comments: 12 pages, 2 figures

  13. arXiv:2210.13783  [pdf, other

    cs.CL cs.LG

    Topical Segmentation of Spoken Narratives: A Test Case on Holocaust Survivor Testimonies

    Authors: Eitan Wagner, Renana Keydar, Amit Pinchevski, Omri Abend

    Abstract: The task of topical segmentation is well studied, but previous work has mostly addressed it in the context of structured, well-defined segments, such as segmentation into paragraphs, chapters, or segmenting text that originated from multiple sources. We tackle the task of segmenting running (spoken) narratives, which poses hitherto unaddressed challenges. As a test case, we address Holocaust survi… ▽ More

    Submitted 3 December, 2022; v1 submitted 25 October, 2022; originally announced October 2022.

  14. arXiv:2210.03053  [pdf, other

    cs.CL cs.AI cs.LG

    Reinforcement Learning with Large Action Spaces for Neural Machine Translation

    Authors: Asaf Yehudai, Leshem Choshen, Lior Fox, Omri Abend

    Abstract: Applying Reinforcement learning (RL) following maximum likelihood estimation (MLE) pre-training is a versatile method for enhancing neural machine translation (NMT) performance. However, recent work has argued that the gains produced by RL for NMT are mostly due to promoting tokens that have already received a fairly high probability in pre-training. We hypothesize that the large action space is a… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: Accepted for Coling

  15. arXiv:2205.09178  [pdf, other

    cs.CL cs.LG

    PreQuEL: Quality Estimation of Machine Translation Outputs in Advance

    Authors: Shachar Don-Yehiya, Leshem Choshen, Omri Abend

    Abstract: We present the task of PreQuEL, Pre-(Quality-Estimation) Learning. A PreQuEL system predicts how well a given sentence will be translated, without recourse to the actual translation, thus eschewing unnecessary resource allocation when translation quality is bound to be low. PreQuEL can be defined relative to a given MT system (e.g., some industry service) or generally relative to the state-of-the-… ▽ More

    Submitted 4 December, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: Accepted to the main conference of EMNLP 2022

  16. arXiv:2205.05974  [pdf, other

    cs.CL

    A Computational Acquisition Model for Multimodal Word Categorization

    Authors: Uri Berger, Gabriel Stanovsky, Omri Abend, Lea Frermann

    Abstract: Recent advances in self-supervised modeling of text and images open new opportunities for computational models of child language acquisition, which is believed to rely heavily on cross-modal signals. However, prior studies have been limited by their reliance on vision models trained on large image datasets annotated with a pre-defined set of depicted object categories. This is (a) not faithful to… ▽ More

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: Accepted to NAACL 2022

  17. arXiv:2205.05730  [pdf, other

    cs.CL cs.AI cs.CY

    Some Grammatical Errors are Frequent, Others are Important

    Authors: Leshem Choshen, Ofir Shifman, Omri Abend

    Abstract: In Grammatical Error Correction, systems are evaluated by the number of errors they correct. However, no one has assessed whether all error types are equally important. We provide and apply a method to quantify the importance of different grammatical error types to humans. We show that some rare errors are considered disturbing while other common ones are not. This affects possible directions to i… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

  18. arXiv:2205.00445  [pdf, other

    cs.CL cs.AI

    MRKL Systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning

    Authors: Ehud Karpas, Omri Abend, Yonatan Belinkov, Barak Lenz, Opher Lieber, Nir Ratner, Yoav Shoham, Hofit Bata, Yoav Levine, Kevin Leyton-Brown, Dor Muhlgay, Noam Rozen, Erez Schwartz, Gal Shachaf, Shai Shalev-Shwartz, Amnon Shashua, Moshe Tenenholtz

    Abstract: Huge language models (LMs) have ushered in a new era for AI, serving as a gateway to natural-language-based knowledge tasks. Although an essential element of modern AI, LMs are also inherently limited in a number of ways. We discuss these limitations and how they can be avoided by adopting a systems approach. Conceptualizing the challenge as one that involves knowledge and reasoning in addition to… ▽ More

    Submitted 1 May, 2022; originally announced May 2022.

  19. Semantics-aware Attention Improves Neural Machine Translation

    Authors: Aviv Slobodkin, Leshem Choshen, Omri Abend

    Abstract: The integration of syntactic structures into Transformer machine translation has shown positive results, but to our knowledge, no work has attempted to do so with semantic structures. In this work we propose two novel parameter-free methods for injecting semantic information into Transformers, both rely on semantics-aware masking of (some of) the attention heads. One such method operates on the en… ▽ More

    Submitted 24 May, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Accepted to *SEM 2022

  20. arXiv:2110.04644  [pdf, other

    cs.CL cs.LG

    On the Relation between Syntactic Divergence and Zero-Shot Performance

    Authors: Ofir Arviv, Dmitry Nikolaev, Taelin Karidi, Omri Abend

    Abstract: We explore the link between the extent to which syntactic relations are preserved in translation and the ease of correctly constructing a parse tree in a zero-shot setting. While previous work suggests such a relation, it tends to focus on the macro level and not on the level of individual edges-a gap we aim to address. As a test case, we take the transfer of Universal Dependencies (UD) parsing fr… ▽ More

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: Accepted to EMNLP 2021

  21. arXiv:2110.03067  [pdf, other

    cs.CL

    On Neurons Invariant to Sentence Structural Changes in Neural Machine Translation

    Authors: Gal Patel, Leshem Choshen, Omri Abend

    Abstract: We present a methodology that explores how sentence structure is reflected in neural representations of machine translation systems. We demonstrate our model-agnostic approach with the Transformer English-German translation model. We analyze neuron-level correlation of activations between paraphrases while discussing the methodology challenges and the need for confound analysis to isolate the effe… ▽ More

    Submitted 2 November, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

  22. arXiv:2109.11491  [pdf, other

    cs.CL

    Putting Words in BERT's Mouth: Navigating Contextualized Vector Spaces with Pseudowords

    Authors: Taelin Karidi, Yichu Zhou, Nathan Schneider, Omri Abend, Vivek Srikumar

    Abstract: We present a method for exploring regions around individual points in a contextualized vector space (particularly, BERT space), as a way to investigate how these regions correspond to word senses. By inducing a contextualized "pseudoword" as a stand-in for a static embedding in the input layer, and then performing masked prediction of a word in the sentence, we are able to investigate the geometry… ▽ More

    Submitted 4 October, 2021; v1 submitted 23 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021 camera-ready version

  23. arXiv:2109.10952  [pdf, other

    cs.CL

    Cross-linguistically Consistent Semantic and Syntactic Annotation of Child-directed Speech

    Authors: Ida Szubert, Omri Abend, Nathan Schneider, Samuel Gibbon, Louis Mahon, Sharon Goldwater, Mark Steedman

    Abstract: This paper proposes a methodology for constructing such corpora of child directed speech (CDS) paired with sentential logical forms, and uses this method to create two such corpora, in English and Hebrew. The approach enforces a cross-linguistically consistent representation, building on recent advances in dependency representation and semantic parsing. Specifically, the approach involves two step… ▽ More

    Submitted 14 March, 2024; v1 submitted 22 September, 2021; originally announced September 2021.

  24. arXiv:2109.06096  [pdf, other

    cs.CL cs.AI cs.LG

    The Grammar-Learning Trajectories of Neural Language Models

    Authors: Leshem Choshen, Guy Hacohen, Daphna Weinshall, Omri Abend

    Abstract: The learning trajectories of linguistic phenomena in humans provide insight into linguistic representation, beyond what can be gleaned from inspecting the behavior of an adult speaker. To apply a similar approach to analyze neural language models (NLM), it is first necessary to establish that different models are similar enough in the generalizations they make. In this paper, we show that NLMs wit… ▽ More

    Submitted 6 April, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: ACL camera-ready

  25. arXiv:2106.00745  [pdf

    cs.CL cs.AI cs.LG

    Part of Speech and Universal Dependency effects on English Arabic Machine Translation

    Authors: Ofek Rafaeli, Omri Abend, Leshem Choshen, Dmitry Nikolaev

    Abstract: In this research paper, I will elaborate on a method to evaluate machine translation models based on their performance on underlying syntactical phenomena between English and Arabic languages. This method is especially important as such "neural" and "machine learning" are hard to fine-tune and change. Thus, finding a way to evaluate them easily and diversely would greatly help the task of betterin… ▽ More

    Submitted 3 June, 2021; v1 submitted 1 June, 2021; originally announced June 2021.

    Comments: 19 pages

  26. arXiv:2104.08202  [pdf, other

    cs.CL

    $Q^{2}$: Evaluating Factual Consistency in Knowledge-Grounded Dialogues via Question Generation and Question Answering

    Authors: Or Honovich, Leshem Choshen, Roee Aharoni, Ella Neeman, Idan Szpektor, Omri Abend

    Abstract: Neural knowledge-grounded generative models for dialogue often produce content that is factually inconsistent with the knowledge they rely on, making them unreliable and limiting their applicability. Inspired by recent work on evaluating factual consistency in abstractive summarization, we propose an automatic evaluation metric for factual consistency in knowledge-grounded dialogue using automatic… ▽ More

    Submitted 9 September, 2021; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Accepted to EMNLP 2021

  27. Mediators in Determining what Processing BERT Performs First

    Authors: Aviv Slobodkin, Leshem Choshen, Omri Abend

    Abstract: Probing neural models for the ability to perform downstream tasks using their activation patterns is often used to localize what parts of the network specialize in performing what tasks. However, little work addressed potential mediating factors in such comparisons. As a test-case mediating factor, we consider the prediction's context length, namely the length of the span whose processing is minim… ▽ More

    Submitted 13 April, 2021; originally announced April 2021.

    Comments: Accepted to NAACL 2021

  28. arXiv:2104.02310  [pdf, ps, other

    cs.CL

    SERRANT: a syntactic classifier for English Grammatical Error Types

    Authors: Leshem Choshen, Matanel Oren, Dmitry Nikolaev, Omri Abend

    Abstract: SERRANT is a system and code for automatic classification of English grammatical errors that combines SErCl and ERRANT. SERRANT uses ERRANT's annotations when they are informative and those provided by SErCl otherwise.

    Submitted 7 April, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

    Comments: Code library in: https://github.com/matanel-oren/serrant

  29. arXiv:2101.12640  [pdf, other

    cs.CL cs.LG

    Enhancing the Transformer Decoder with Transition-based Syntax

    Authors: Leshem Choshen, Omri Abend

    Abstract: Notwithstanding recent advances, syntactic generalization remains a challenge for text decoders. While some studies showed gains from incorporating source-side symbolic syntactic and semantic structure into text generation Transformers, very little work addressed the decoding of such structure. We propose a general approach for tree decoding using a transition-based approach. Examining the challen… ▽ More

    Submitted 31 October, 2022; v1 submitted 29 January, 2021; originally announced January 2021.

    Comments: Accepted to CoNLL

  30. arXiv:2012.15810  [pdf, other

    cs.CL

    UCCA's Foundational Layer: Annotation Guidelines v2.1

    Authors: Omri Abend, Nathan Schneider, Dotan Dvir, Jakob Prange, Ari Rappoport

    Abstract: This is the annotation manual for Universal Conceptual Cognitive Annotation (UCCA; Abend and Rappoport, 2013), specifically the Foundational Layer. UCCA is a graph-based semantic annotation scheme based on typological linguistic principles. It has been applied to several languages; for ease of exposition these guidelines give examples mainly in English. New annotators may wish to start with the tu… ▽ More

    Submitted 31 December, 2020; originally announced December 2020.

  31. arXiv:2011.00834  [pdf, other

    cs.CL

    Comparison by Conversion: Reverse-Engineering UCCA from Syntax and Lexical Semantics

    Authors: Daniel Hershcovich, Nathan Schneider, Dotan Dvir, Jakob Prange, Miryam de Lhoneux, Omri Abend

    Abstract: Building robust natural language understanding systems will require a clear characterization of whether and how various linguistic meaning representations complement each other. To perform a systematic comparative analysis, we evaluate the mapping between meaning representations from different frameworks using two complementary methods: (i) a rule-based converter, and (ii) a supervised delexicaliz… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: COLING 2020 camera ready

  32. arXiv:2010.11032  [pdf, other

    cs.CL

    Classifying Syntactic Errors in Learner Language

    Authors: Leshem Choshen, Dmitry Nikolaev, Yevgeni Berzak, Omri Abend

    Abstract: We present a method for classifying syntactic errors in learner language, namely errors whose correction alters the morphosyntactic structure of a sentence. The methodology builds on the established Universal Dependencies syntactic representation scheme, and provides complementary information to other error-classification systems. Unlike existing error classification methods, our method is app… ▽ More

    Submitted 27 October, 2020; v1 submitted 21 October, 2020; originally announced October 2020.

    Comments: CoNLL 2020

  33. arXiv:2010.01825  [pdf, other

    cs.LG cs.CL stat.ML

    PMI-Masking: Principled masking of correlated spans

    Authors: Yoav Levine, Barak Lenz, Opher Lieber, Omri Abend, Kevin Leyton-Brown, Moshe Tennenholtz, Yoav Shoham

    Abstract: Masking tokens uniformly at random constitutes a common flaw in the pretraining of Masked Language Models (MLMs) such as BERT. We show that such uniform masking allows an MLM to minimize its training objective by latching onto shallow local signals, leading to pretraining inefficiency and suboptimal downstream performance. To address this flaw, we propose PMI-Masking, a principled masking strategy… ▽ More

    Submitted 5 October, 2020; originally announced October 2020.

  34. arXiv:2005.03436  [pdf, other

    cs.CL

    Fine-Grained Analysis of Cross-Linguistic Syntactic Divergences

    Authors: Dmitry Nikolaev, Ofir Arviv, Taelin Karidi, Neta Kenneth, Veronika Mitnik, Lilja Maria Saeboe, Omri Abend

    Abstract: The patterns in which the syntax of different languages converges and diverges are often used to inform work on cross-lingual transfer. Nevertheless, little empirical work has been done on quantifying the prevalence of different syntactic divergences across language pairs. We propose a framework for extracting divergence patterns for any language pair from a parallel corpus, building on Universal… ▽ More

    Submitted 13 July, 2020; v1 submitted 7 May, 2020; originally announced May 2020.

  35. arXiv:2005.00311  [pdf, other

    cs.CL cs.LG

    Language (Re)modelling: Towards Embodied Language Understanding

    Authors: Ronen Tamari, Chen Shani, Tom Hope, Miriam R. L. Petruck, Omri Abend, Dafna Shahaf

    Abstract: While natural language understanding (NLU) is advancing rapidly, today's technology differs from human-like language understanding in fundamental ways, notably in its inferior efficiency, interpretability, and generalization. This work proposes an approach to representation and learning based on the tenets of embodied cognitive linguistics (ECL). According to ECL, natural language is inherently ex… ▽ More

    Submitted 9 July, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: Accepted to ACL2020 Theme Track. Extended bibliography version

  36. arXiv:1909.08796  [pdf, other

    cs.CL

    Made for Each Other: Broad-coverage Semantic Structures Meet Preposition Supersenses

    Authors: Jakob Prange, Nathan Schneider, Omri Abend

    Abstract: Universal Conceptual Cognitive Annotation (UCCA; Abend and Rappoport, 2013) is a typologically-informed, broad-coverage semantic annotation scheme that describes coarse-grained predicate-argument structure but currently lacks semantic roles. We argue that lexicon-free annotation of the semantic roles marked by prepositions, as formulated by Schneider et al. (2018b), is complementary and suitable f… ▽ More

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: to appear at CoNLL 2019

  37. arXiv:1909.06814  [pdf, other

    cs.CL cs.LG

    Automatically Extracting Challenge Sets for Non local Phenomena in Neural Machine Translation

    Authors: Leshem Choshen, Omri Abend

    Abstract: We show that the state of the art Transformer Machine Translation (MT) model is not biased towards monotonic reordering (unlike previous recurrent neural network models), but that nevertheless, long-distance dependencies remain a challenge for the model. Since most dependencies are short-distance, common evaluation metrics will be little influenced by how well systems perform on them. We, therefor… ▽ More

    Submitted 25 September, 2019; v1 submitted 15 September, 2019; originally announced September 2019.

    Comments: Accepted for CoNLL

  38. arXiv:1907.01752  [pdf, other

    cs.CL cs.AI cs.LG

    On the Weaknesses of Reinforcement Learning for Neural Machine Translation

    Authors: Leshem Choshen, Lior Fox, Zohar Aizenbud, Omri Abend

    Abstract: Reinforcement learning (RL) is frequently used to increase performance in text generation tasks, including machine translation (MT), notably through the use of Minimum Risk Training (MRT) and Generative Adversarial Networks (GAN). However, little is known about what and how these methods learn in the context of MT. We prove that one of the most common RL methods for MT does not optimize the expect… ▽ More

    Submitted 15 January, 2020; v1 submitted 3 July, 2019; originally announced July 2019.

    Comments: Accepted to ICLR 2020 (matching content, different style)

  39. arXiv:1906.00663  [pdf, other

    cs.CL

    Semantically Constrained Multilayer Annotation: The Case of Coreference

    Authors: Jakob Prange, Nathan Schneider, Omri Abend

    Abstract: We propose a coreference annotation scheme as a layer on top of the Universal Conceptual Cognitive Annotation foundational layer, treating units in predicate-argument structure as a basis for entity and event mentions. We argue that this allows coreference annotators to sidestep some of the challenges faced in other schemes, which do not enforce consistency with predicate-argument structure and va… ▽ More

    Submitted 11 June, 2019; v1 submitted 3 June, 2019; originally announced June 2019.

    Comments: Accepted to The First International Workshop on Designing Meaning Representations (DMR), 2019 (in conjunction with ACL 2019)

  40. arXiv:1905.05543  [pdf, other

    cs.CL

    The Language of Legal and Illegal Activity on the Darknet

    Authors: Leshem Choshen, Dan Eldad, Daniel Hershcovich, Elior Sulem, Omri Abend

    Abstract: The non-indexed parts of the Internet (the Darknet) have become a haven for both legal and illegal anonymous activity. Given the magnitude of these networks, scalably monitoring their activity necessarily relies on automated tools, and notably on NLP tools. However, little is known about what characteristics texts communicated through the Darknet have, and how well off-the-shelf NLP tools do on th… ▽ More

    Submitted 4 June, 2019; v1 submitted 14 May, 2019; originally announced May 2019.

    Comments: ACL 2019 camera ready; code in https://github.com/huji-nlp/cyber

  41. arXiv:1903.06494  [pdf, other

    cs.CL

    Content Differences in Syntactic and Semantic Representations

    Authors: Daniel Hershcovich, Omri Abend, Ari Rappoport

    Abstract: Syntactic analysis plays an important role in semantic parsing, but the nature of this role remains a topic of ongoing debate. The debate has been constrained by the scarcity of empirical comparative studies between syntactic and semantic schemes, which hinders the development of parsing methods informed by the details of target schemes and constructions. We target this gap, and take Universal Dep… ▽ More

    Submitted 1 May, 2019; v1 submitted 15 March, 2019; originally announced March 2019.

    Comments: NAACL-HLT 2019 camera ready

  42. arXiv:1903.02953  [pdf, other

    cs.CL

    SemEval-2019 Task 1: Cross-lingual Semantic Parsing with UCCA

    Authors: Daniel Hershcovich, Zohar Aizenbud, Leshem Choshen, Elior Sulem, Ari Rappoport, Omri Abend

    Abstract: We present the SemEval 2019 shared task on UCCA parsing in English, German and French, and discuss the participating systems and results. UCCA is a cross-linguistically applicable framework for semantic representation, which builds on extensive typological work and supports rapid annotation. UCCA poses a challenge for existing parsing techniques, as it exhibits reentrancy (resulting in DAG structu… ▽ More

    Submitted 11 June, 2020; v1 submitted 6 March, 2019; originally announced March 2019.

    Comments: SemEval 2019 Shared task. arXiv admin note: substantial text overlap with arXiv:1805.12386

  43. arXiv:1810.05995  [pdf, other

    cs.CL

    BLEU is Not Suitable for the Evaluation of Text Simplification

    Authors: Elior Sulem, Omri Abend, Ari Rappoport

    Abstract: BLEU is widely considered to be an informative metric for text-to-text generation, including Text Simplification (TS). TS includes both lexical and structural aspects. In this paper we show that BLEU is not suitable for the evaluation of sentence splitting, the major structural simplification operation. We manually compiled a sentence splitting gold standard corpus containing multiple structural p…

    Submitted 14 October, 2018; originally announced October 2018.

    Comments: Accepted to EMNLP 2018 (Short papers)

  44. arXiv:1810.05104  [pdf, ps, other]

    cs.CL

    Simple and Effective Text Simplification Using Semantic and Neural Methods

    Authors: Elior Sulem, Omri Abend, Ari Rappoport

    Abstract: Sentence splitting is a major simplification operator. Here we present a simple and efficient splitting algorithm based on an automatic semantic parser. After splitting, the text is amenable for further fine-tuned simplification operations. In particular, we show that neural Machine Translation can be effectively used in this situation. Previous application of Machine Translation for simplificatio…

    Submitted 11 October, 2018; originally announced October 2018.

    Journal ref: Proc. of ACL 2018

  45. arXiv:1810.05022  [pdf, ps, other]

    cs.CL

    Semantic Structural Evaluation for Text Simplification

    Authors: Elior Sulem, Omri Abend, Ari Rappoport

    Abstract: Current measures for evaluating text simplification systems focus on evaluating lexical text aspects, neglecting its structural aspects. In this paper we propose the first measure to address structural aspects of text simplification, called SAMSA. It leverages recent advances in semantic parsing to assess simplification quality by decomposing the input based on its semantic structure and comparing…

    Submitted 11 October, 2018; originally announced October 2018.

    Journal ref: Proc. of NAACL 2018

  46. arXiv:1808.09354  [pdf, other]

    cs.CL

    Universal Dependency Parsing with a General Transition-Based DAG Parser

    Authors: Daniel Hershcovich, Omri Abend, Ari Rappoport

    Abstract: This paper presents our experiments with applying TUPA to the CoNLL 2018 UD shared task. TUPA is a general neural transition-based DAG parser, which we use to present the first experiments on recovering enhanced dependencies as part of the general parsing task. TUPA was designed for parsing UCCA, a cross-linguistic semantic annotation scheme, exhibiting reentrancy, discontinuity and non-terminal n…

    Submitted 28 August, 2018; originally announced August 2018.

    Comments: CoNLL 2018 UD Shared Task

  47. arXiv:1805.12386   

    cs.CL

    SemEval 2019 Shared Task: Cross-lingual Semantic Parsing with UCCA - Call for Participation

    Authors: Daniel Hershcovich, Leshem Choshen, Elior Sulem, Zohar Aizenbud, Ari Rappoport, Omri Abend

    Abstract: We announce a shared task on UCCA parsing in English, German and French, and call for participants to submit their systems. UCCA is a cross-linguistically applicable framework for semantic representation, which builds on extensive typological work and supports rapid annotation. UCCA poses a challenge for existing parsing techniques, as it exhibits reentrancy (resulting in DAG structures), disconti…

    Submitted 3 February, 2021; v1 submitted 31 May, 2018; originally announced May 2018.

    Comments: Not an actual paper. The shared task summary is at arXiv:1903.02953

  48. arXiv:1805.04905  [pdf, other]

    cs.CL

    Comprehensive Supersense Disambiguation of English Prepositions and Possessives

    Authors: Nathan Schneider, Jena D. Hwang, Vivek Srikumar, Jakob Prange, Austin Blodgett, Sarah R. Moeller, Aviram Stern, Adi Bitan, Omri Abend

    Abstract: Semantic relations are often signaled with prepositional or possessive marking--but extreme polysemy bedevils their analysis and automatic interpretation. We introduce a new annotation scheme, corpus, and task for the disambiguation of prepositions and possessives in English. Unlike previous approaches, our annotations are comprehensive with respect to types and tokens of these markers; use broadl…

    Submitted 13 May, 2018; originally announced May 2018.

    Comments: ACL 2018

  49. arXiv:1805.00287  [pdf, ps, other]

    cs.CL

    Multitask Parsing Across Semantic Representations

    Authors: Daniel Hershcovich, Omri Abend, Ari Rappoport

    Abstract: The ability to consolidate information of different types is at the core of intelligence, and has tremendous practical value in allowing learning for one task to benefit from generalizations learned for others. In this paper we tackle the challenging task of improving semantic parsing performance, taking UCCA parsing as a test case, and AMR, SDP and Universal Dependencies (UD) parsing as auxiliary…

    Submitted 1 May, 2018; originally announced May 2018.

    Comments: Accepted to ACL 2018

  50. arXiv:1804.11254  [pdf, other]

    cs.CL

    Inherent Biases in Reference based Evaluation for Grammatical Error Correction and Text Simplification

    Authors: Leshem Choshen, Omri Abend

    Abstract: The prevalent use of too few references for evaluating text-to-text generation is known to bias estimates of their quality ({\it low coverage bias} or LCB). This paper shows that overcoming LCB in Grammatical Error Correction (GEC) evaluation cannot be attained by re-scaling or by increasing the number of references in any feasible range, contrary to previous suggestions. This is due to the long-t…

    Submitted 18 September, 2019; v1 submitted 30 April, 2018; originally announced April 2018.

    Comments: Accepted to ACL 2018 (figures currently omitted due to technical arxiv issues)