Skip to main content

Showing 1–8 of 8 results for author: Arviv, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.14367  [pdf, other

    cs.CL cs.AI cs.LG

    Genie: Achieving Human Parity in Content-Grounded Datasets Generation

    Authors: Asaf Yehudai, Boaz Carmeli, Yosi Mass, Ofir Arviv, Nathaniel Mills, Assaf Toledo, Eyal Shnarch, Leshem Choshen

    Abstract: The lack of high-quality data for content-grounded generation tasks has been identified as a major obstacle to advancing these tasks. To address this gap, we propose Genie, a novel method for automatically generating high-quality content-grounded data. It consists of three stages: (a) Content Preparation, (b) Generation: creating task-specific examples from the content (e.g., question-answer pairs… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted to ICLR24

  2. arXiv:2401.14019  [pdf, other

    cs.CL cs.AI

    Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI

    Authors: Elron Bandel, Yotam Perlitz, Elad Venezian, Roni Friedman-Melamed, Ofir Arviv, Matan Orbach, Shachar Don-Yehyia, Dafna Sheinwald, Ariel Gera, Leshem Choshen, Michal Shmueli-Scheuer, Yoav Katz

    Abstract: In the dynamic landscape of generative NLP, traditional text processing pipelines limit research flexibility and reproducibility, as they are tailored to specific dataset, task, and model combinations. The escalating complexity, involving system prompts, model-specific formats, instructions, and more, calls for a shift to a structured, modular, and customizable solution. Addressing this need, we p… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Submitted to NAACL demo track

  3. arXiv:2310.13583  [pdf, other

    cs.CL cs.LG

    Improving Cross-Lingual Transfer through Subtree-Aware Word Reordering

    Authors: Ofir Arviv, Dmitry Nikolaev, Taelin Karidi, Omri Abend

    Abstract: Despite the impressive growth of the abilities of multilingual language models, such as XLM-R and mT5, it has been shown that they still face difficulties when tackling typologically-distant languages, particularly in the low-resource setting. One obstacle for effective cross-lingual transfer is variability in word-order patterns. It can be potentially mitigated via source- or target-side word reo… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP Findings 2023

  4. arXiv:2308.11696  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    Efficient Benchmarking of Language Models

    Authors: Yotam Perlitz, Elron Bandel, Ariel Gera, Ofir Arviv, Liat Ein-Dor, Eyal Shnarch, Noam Slonim, Michal Shmueli-Scheuer, Leshem Choshen

    Abstract: The increasing versatility of language models (LMs) has given rise to a new class of benchmarks that comprehensively assess a broad range of capabilities. Such benchmarks are associated with massive computational costs, extending to thousands of GPU hours per model. However, the efficiency aspect of these evaluation efforts had raised little discussion in the literature. In this work, we present t… ▽ More

    Submitted 1 April, 2024; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted to NAACL main track

  5. arXiv:2305.01628  [pdf, other

    cs.CL cs.LG

    The Benefits of Bad Advice: Autocontrastive Decoding across Model Layers

    Authors: Ariel Gera, Roni Friedman, Ofir Arviv, Chulaka Gunasekara, Benjamin Sznajder, Noam Slonim, Eyal Shnarch

    Abstract: Applying language models to natural language processing tasks typically relies on the representations in the final model layer, as intermediate hidden layer representations are presumed to be less informative. In this work, we argue that due to the gradual improvement across model layers, additional information can be gleaned from the contrast between higher and lower layers during inference. Spec… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

    Comments: 9 pages, 8 figures; To be published in ACL 2023

  6. arXiv:2110.04644  [pdf, other

    cs.CL cs.LG

    On the Relation between Syntactic Divergence and Zero-Shot Performance

    Authors: Ofir Arviv, Dmitry Nikolaev, Taelin Karidi, Omri Abend

    Abstract: We explore the link between the extent to which syntactic relations are preserved in translation and the ease of correctly constructing a parse tree in a zero-shot setting. While previous work suggests such a relation, it tends to focus on the macro level and not on the level of individual edges-a gap we aim to address. As a test case, we take the transfer of Universal Dependencies (UD) parsing fr… ▽ More

    Submitted 9 October, 2021; originally announced October 2021.

    Comments: Accepted to EMNLP 2021

  7. arXiv:2010.05710  [pdf, other

    cs.CL cs.LG

    HUJI-KU at MRP~2020: Two Transition-based Neural Parsers

    Authors: Ofir Arviv, Ruixiang Cui, Daniel Hershcovich

    Abstract: This paper describes the HUJI-KU system submission to the shared task on Cross-Framework Meaning Representation Parsing (MRP) at the 2020 Conference for Computational Language Learning (CoNLL), employing TUPA and the HIT-SCIR parser, which were, respectively, the baseline system and winning system in the 2019 MRP shared task. Both are transition-based parsers using BERT contextualized embeddings.… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

  8. arXiv:2005.03436  [pdf, other

    cs.CL

    Fine-Grained Analysis of Cross-Linguistic Syntactic Divergences

    Authors: Dmitry Nikolaev, Ofir Arviv, Taelin Karidi, Neta Kenneth, Veronika Mitnik, Lilja Maria Saeboe, Omri Abend

    Abstract: The patterns in which the syntax of different languages converges and diverges are often used to inform work on cross-lingual transfer. Nevertheless, little empirical work has been done on quantifying the prevalence of different syntactic divergences across language pairs. We propose a framework for extracting divergence patterns for any language pair from a parallel corpus, building on Universal… ▽ More

    Submitted 13 July, 2020; v1 submitted 7 May, 2020; originally announced May 2020.