
Showing 1–3 of 3 results for author: Manevich, A

  1. arXiv:2403.19887  [pdf, other]

    cs.CL cs.LG

    Jamba: A Hybrid Transformer-Mamba Language Model

    Authors: Opher Lieber, Barak Lenz, Hofit Bata, Gal Cohen, Jhonathan Osin, Itay Dalmedigos, Erez Safahi, Shaked Meirom, Yonatan Belinkov, Shai Shalev-Shwartz, Omri Abend, Raz Alon, Tomer Asida, Amir Bergman, Roman Glozman, Michael Gokhman, Avashalom Manevich, Nir Ratner, Noam Rozen, Erez Shwartz, Mor Zusman, Yoav Shoham

    Abstract: We present Jamba, a new base large language model based on a novel hybrid Transformer-Mamba mixture-of-experts (MoE) architecture. Specifically, Jamba interleaves blocks of Transformer and Mamba layers, enjoying the benefits of both model families. MoE is added in some of these layers to increase model capacity while keeping active parameter usage manageable. This flexible architecture allows reso…

    Submitted 3 July, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: Webpage: https://www.ai21.com/jamba

  2. arXiv:2305.12517  [pdf, other]

    cs.CL cs.IR cs.LG

    Description-Based Text Similarity

    Authors: Shauli Ravfogel, Valentina Pyatkin, Amir DN Cohen, Avshalom Manevich, Yoav Goldberg

    Abstract: Identifying texts with a given semantics is central for many information seeking scenarios. Similarity search over vector embeddings appears to be central to this ability, yet the similarity reflected in current text embeddings is corpus-driven, and is inconsistent and sub-optimal for many use cases. What, then, is a good notion of similarity for effective retrieval of text? We identify the need…

    Submitted 26 April, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: A preprint

  3. arXiv:2106.14321  [pdf, other]

    cs.CL

    Draw Me a Flower: Processing and Grounding Abstraction in Natural Language

    Authors: Royi Lachmy, Valentina Pyatkin, Avshalom Manevich, Reut Tsarfaty

    Abstract: Abstraction is a core tenet of human cognition and communication. When composing natural language instructions, humans naturally evoke abstraction to convey complex procedures in an efficient and concise way. Yet, interpreting and grounding abstraction expressed in NL has not yet been systematically studied in NLP, with no accepted benchmarks specifically eliciting abstraction in NL. In this work,…

    Submitted 30 September, 2022; v1 submitted 27 June, 2021; originally announced June 2021.

    Comments: Accepted to the TACL journal. This is a pre-MIT Press publication version