Showing 1–4 of 4 results for author: Dalmedigos, I

  1. arXiv:2403.19887  [pdf, other]

    cs.CL cs.LG

    Jamba: A Hybrid Transformer-Mamba Language Model

    Authors: Opher Lieber, Barak Lenz, Hofit Bata, Gal Cohen, Jhonathan Osin, Itay Dalmedigos, Erez Safahi, Shaked Meirom, Yonatan Belinkov, Shai Shalev-Shwartz, Omri Abend, Raz Alon, Tomer Asida, Amir Bergman, Roman Glozman, Michael Gokhman, Avashalom Manevich, Nir Ratner, Noam Rozen, Erez Shwartz, Mor Zusman, Yoav Shoham

    Abstract: We present Jamba, a new base large language model based on a novel hybrid Transformer-Mamba mixture-of-experts (MoE) architecture. Specifically, Jamba interleaves blocks of Transformer and Mamba layers, enjoying the benefits of both model families. MoE is added in some of these layers to increase model capacity while keeping active parameter usage manageable. This flexible architecture allows reso…

    Submitted 3 July, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: Webpage: https://www.ai21.com/jamba

  2. arXiv:2302.00083  [pdf, other]

    cs.CL cs.IR

    In-Context Retrieval-Augmented Language Models

    Authors: Ori Ram, Yoav Levine, Itay Dalmedigos, Dor Muhlgay, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham

    Abstract: Retrieval-Augmented Language Modeling (RALM) methods, which condition a language model (LM) on relevant documents from a grounding corpus during generation, were shown to significantly improve language modeling performance. In addition, they can mitigate the problem of factually inaccurate text generation and provide a natural source attribution mechanism. Existing RALM approaches focus on modifying…

    Submitted 1 August, 2023; v1 submitted 31 January, 2023; originally announced February 2023.

    Comments: Accepted for publication in Transactions of the Association for Computational Linguistics (TACL). pre-MIT Press publication version

  3. arXiv:2204.10019  [pdf, other]

    cs.CL cs.AI

    Standing on the Shoulders of Giant Frozen Language Models

    Authors: Yoav Levine, Itay Dalmedigos, Ori Ram, Yoel Zeldes, Daniel Jannai, Dor Muhlgay, Yoni Osin, Opher Lieber, Barak Lenz, Shai Shalev-Shwartz, Amnon Shashua, Kevin Leyton-Brown, Yoav Shoham

    Abstract: Huge pretrained language models (LMs) have demonstrated surprisingly good zero-shot capabilities on a wide variety of tasks. This gives rise to the appealing vision of a single, versatile model with a wide range of functionalities across disparate applications. However, current leading techniques for leveraging a "frozen" LM -- i.e., leaving its weights untouched -- still often underperform fine-t…

    Submitted 21 April, 2022; originally announced April 2022.

  4. arXiv:2002.04358  [pdf, other]

    cond-mat.stat-mech q-bio.PE

    Dynamical persistence in resource-consumer models

    Authors: Itay Dalmedigos, Guy Bunin

    Abstract: We show how highly-diverse ecological communities may display persistent abundance fluctuations, when interacting through resource competition and subjected to migration from a species pool. This turns out to be closely related to the ratio of realized species diversity to the number of resources. This ratio is set by competition, through the balance between species being pushed out and invading.…

    Submitted 11 February, 2020; originally announced February 2020.