Showing 1–14 of 14 results for author: Ram, O

  1. arXiv:2310.01558  [pdf, other]

    cs.CL cs.AI

    Making Retrieval-Augmented Language Models Robust to Irrelevant Context

    Authors: Ori Yoran, Tomer Wolfson, Ori Ram, Jonathan Berant

    Abstract: Retrieval-augmented language models (RALMs) hold promise to produce language understanding systems that are factual, efficient, and up-to-date. An important desideratum of RALMs is that retrieved information helps model performance when it is relevant, and does not harm performance when it is not. This is particularly important in multi-hop reasoning scenarios, where misuse of irrelevant evid…

    Submitted 5 May, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

  2. arXiv:2307.06908  [pdf, other]

    cs.CL cs.AI

    Generating Benchmarks for Factuality Evaluation of Language Models

    Authors: Dor Muhlgay, Ori Ram, Inbal Magar, Yoav Levine, Nir Ratner, Yonatan Belinkov, Omri Abend, Kevin Leyton-Brown, Amnon Shashua, Yoav Shoham

    Abstract: Before deploying a language model (LM) within a given domain, it is important to measure its tendency to generate factually incorrect information in that domain. Existing methods for factuality evaluation of LLM generation focus on facts sampled from the LM itself, and thus do not control the set of evaluated facts and might under-represent domain specific or rare facts. We propose FACTOR: Factual…

    Submitted 4 February, 2024; v1 submitted 13 July, 2023; originally announced July 2023.

  3. arXiv:2302.00083  [pdf, other]

    cs.CL cs.IR

    In-Context Retrieval-Augmented Language Models

    Authors: Ori Ram, Yoav Levine, Itay Dalmedigos, Dor Muhlgay, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham

    Abstract: Retrieval-Augmented Language Modeling (RALM) methods, which condition a language model (LM) on relevant documents from a grounding corpus during generation, were shown to significantly improve language modeling performance. In addition, they can mitigate the problem of factually inaccurate text generation and provide a natural source attribution mechanism. Existing RALM approaches focus on modifying…

    Submitted 1 August, 2023; v1 submitted 31 January, 2023; originally announced February 2023.

    Comments: Accepted for publication in Transactions of the Association for Computational Linguistics (TACL). pre-MIT Press publication version

  4. arXiv:2212.10947  [pdf, other]

    cs.CL

    Parallel Context Windows for Large Language Models

    Authors: Nir Ratner, Yoav Levine, Yonatan Belinkov, Ori Ram, Inbal Magar, Omri Abend, Ehud Karpas, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham

    Abstract: When applied to processing long text, Large Language Models (LLMs) are limited by their context window. Existing efforts to address this limitation involve training specialized architectures, and cannot be easily applied to off-the-shelf LLMs. We present Parallel Context Windows (PCW), a method that alleviates the context window restriction for any off-the-shelf LLM without further training. The k…

    Submitted 1 August, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023)

  5. arXiv:2212.10380  [pdf, other]

    cs.CL cs.IR

    What Are You Token About? Dense Retrieval as Distributions Over the Vocabulary

    Authors: Ori Ram, Liat Bezalel, Adi Zicher, Yonatan Belinkov, Jonathan Berant, Amir Globerson

    Abstract: Dual encoders are now the dominant architecture for dense retrieval. Yet, we have little understanding of how they represent text, and why this leads to good performance. In this work, we shed light on this question via distributions over the vocabulary. We propose to interpret the vector representations produced by dual encoders by projecting them into the model's vocabulary space. We show that t…

    Submitted 24 May, 2023; v1 submitted 20 December, 2022; originally announced December 2022.

    Comments: ACL 2023

  6. arXiv:2204.10019  [pdf, other]

    cs.CL cs.AI

    Standing on the Shoulders of Giant Frozen Language Models

    Authors: Yoav Levine, Itay Dalmedigos, Ori Ram, Yoel Zeldes, Daniel Jannai, Dor Muhlgay, Yoni Osin, Opher Lieber, Barak Lenz, Shai Shalev-Shwartz, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham

    Abstract: Huge pretrained language models (LMs) have demonstrated surprisingly good zero-shot capabilities on a wide variety of tasks. This gives rise to the appealing vision of a single, versatile model with a wide range of functionalities across disparate applications. However, current leading techniques for leveraging a "frozen" LM -- i.e., leaving its weights untouched -- still often underperform fine-t…

    Submitted 21 April, 2022; originally announced April 2022.

  7. arXiv:2203.16634  [pdf, other]

    cs.CL cs.AI cs.LG

    Transformer Language Models without Positional Encodings Still Learn Positional Information

    Authors: Adi Haviv, Ori Ram, Ofir Press, Peter Izsak, Omer Levy

    Abstract: Causal transformer language models (LMs), such as GPT-3, typically require some form of positional encoding, such as positional embeddings. However, we show that LMs without any explicit positional encoding are still competitive with standard models, and that this phenomenon is robust across different datasets, model sizes, and sequence lengths. Probing experiments reveal that such models acquire…

    Submitted 5 December, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: Findings of EMNLP 2022

  8. arXiv:2112.07708  [pdf, other]

    cs.CL cs.IR

    Learning to Retrieve Passages without Supervision

    Authors: Ori Ram, Gal Shachaf, Omer Levy, Jonathan Berant, Amir Globerson

    Abstract: Dense retrievers for open-domain question answering (ODQA) have been shown to achieve impressive performance by training on large datasets of question-passage pairs. In this work we ask whether this dependence on labeled data can be reduced via unsupervised pretraining that is geared towards ODQA. We show this is in fact possible, via a novel pretraining scheme designed for retrieval. Our "recurri…

    Submitted 17 May, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: NAACL 2022

  9. arXiv:2108.05857  [pdf, other]

    cs.CL

    How Optimal is Greedy Decoding for Extractive Question Answering?

    Authors: Or Castel, Ori Ram, Avia Efrat, Omer Levy

    Abstract: Fine-tuned language models use greedy decoding to answer reading comprehension questions with relative success. However, this approach does not ensure that the answer is a span in the given passage, nor does it guarantee that it is the most probable one. Does greedy decoding actually perform worse than an algorithm that does adhere to these properties? To study the performance and optimality of gr…

    Submitted 8 November, 2022; v1 submitted 12 August, 2021; originally announced August 2021.

    Comments: AKBC 2022; 12 pages, 3 figures

  10. arXiv:2101.00438  [pdf, other]

    cs.CL

    Few-Shot Question Answering by Pretraining Span Selection

    Authors: Ori Ram, Yuval Kirstain, Jonathan Berant, Amir Globerson, Omer Levy

    Abstract: In several question answering benchmarks, pretrained models have reached human parity through fine-tuning on an order of 100,000 annotated questions and answers. We explore the more realistic few-shot setting, where only a few hundred training examples are available, and observe that standard models perform poorly, highlighting the discrepancy between current pretraining objectives and question an…

    Submitted 2 June, 2021; v1 submitted 2 January, 2021; originally announced January 2021.

    Comments: Accepted to ACL 2021

  11. arXiv:2101.00434  [pdf, other]

    cs.CL

    Coreference Resolution without Span Representations

    Authors: Yuval Kirstain, Ori Ram, Omer Levy

    Abstract: The introduction of pretrained language models has reduced many complex task-specific NLP models to simple lightweight layers. An exception to this trend is coreference resolution, where a sophisticated task-specific model is appended to a pretrained transformer encoder. While highly effective, the model has a very large memory footprint -- primarily due to dynamically-constructed span and span-pa…

    Submitted 31 May, 2021; v1 submitted 2 January, 2021; originally announced January 2021.

    Comments: Accepted to ACL 2021

  12. arXiv:1908.05646  [pdf, other]

    cs.CL cs.LG

    SenseBERT: Driving Some Sense into BERT

    Authors: Yoav Levine, Barak Lenz, Or Dagan, Ori Ram, Dan Padnos, Or Sharir, Shai Shalev-Shwartz, Amnon Shashua, Yoav Shoham

    Abstract: The ability to learn from large unlabeled corpora has allowed neural language models to advance the frontier in natural language understanding. However, existing self-supervision techniques operate at the word form level, which serves as a surrogate for the underlying semantic content. This paper proposes a method to employ weak-supervision directly at the word sense level. Our model, named SenseB…

    Submitted 18 May, 2020; v1 submitted 15 August, 2019; originally announced August 2019.

    Comments: Accepted to ACL 2020

  13. arXiv:1902.09492  [pdf, other]

    cs.CL cs.LG

    Cross-Lingual Alignment of Contextual Word Embeddings, with Applications to Zero-shot Dependency Parsing

    Authors: Tal Schuster, Ori Ram, Regina Barzilay, Amir Globerson

    Abstract: We introduce a novel method for multilingual transfer that utilizes deep contextual embeddings, pretrained in an unsupervised fashion. While contextual embeddings have been shown to yield richer representations of meaning compared to their static counterparts, aligning them poses a challenge due to their dynamic nature. To this end, we construct context-independent variants of the original monolin…

    Submitted 3 April, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: NAACL 2019

  14. arXiv:0811.3515  [pdf]

    q-bio.GN q-bio.PE

    Intronic Alus Influence Alternative Splicing

    Authors: Galit Lev-Maor, Oren Ram, Eddo Kim, Noa Sela, Amir Goren, Erez Y Levanon, Gil Ast

    Abstract: Examination of the human transcriptome reveals higher levels of RNA editing than in any other organism tested to date. This is indicative of extensive double-stranded RNA (dsRNA) formation within the human transcriptome. Most of the editing sites are located in the primate-specific retrotransposed element called Alu. A large fraction of Alus are found in intronic sequences, implying extensive Al…

    Submitted 21 November, 2008; originally announced November 2008.

    Journal ref: PLoS Genet 2008 4(9): e1000204