Showing 1–4 of 4 results for author: Rozen, N

  1. arXiv:2403.19887  [pdf, other]

    cs.CL cs.LG

    Jamba: A Hybrid Transformer-Mamba Language Model

    Authors: Opher Lieber, Barak Lenz, Hofit Bata, Gal Cohen, Jhonathan Osin, Itay Dalmedigos, Erez Safahi, Shaked Meirom, Yonatan Belinkov, Shai Shalev-Shwartz, Omri Abend, Raz Alon, Tomer Asida, Amir Bergman, Roman Glozman, Michael Gokhman, Avashalom Manevich, Nir Ratner, Noam Rozen, Erez Shwartz, Mor Zusman, Yoav Shoham

    Abstract: We present Jamba, a new base large language model based on a novel hybrid Transformer-Mamba mixture-of-experts (MoE) architecture. Specifically, Jamba interleaves blocks of Transformer and Mamba layers, enjoying the benefits of both model families. MoE is added in some of these layers to increase model capacity while keeping active parameter usage manageable. This flexible architecture allows reso…

    Submitted 3 July, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: Webpage: https://www.ai21.com/jamba

  2. arXiv:2205.00445  [pdf, other]

    cs.CL cs.AI

    MRKL Systems: A modular, neuro-symbolic architecture that combines large language models, external knowledge sources and discrete reasoning

    Authors: Ehud Karpas, Omri Abend, Yonatan Belinkov, Barak Lenz, Opher Lieber, Nir Ratner, Yoav Shoham, Hofit Bata, Yoav Levine, Kevin Leyton-Brown, Dor Muhlgay, Noam Rozen, Erez Schwartz, Gal Shachaf, Shai Shalev-Shwartz, Amnon Shashua, Moshe Tenenholtz

    Abstract: Huge language models (LMs) have ushered in a new era for AI, serving as a gateway to natural-language-based knowledge tasks. Although an essential element of modern AI, LMs are also inherently limited in a number of ways. We discuss these limitations and how they can be avoided by adopting a systems approach. Conceptualizing the challenge as one that involves knowledge and reasoning in addition to…

    Submitted 1 May, 2022; originally announced May 2022.

  3. arXiv:2108.08052  [pdf, other]

    stat.ML cs.AI cs.LG

    Moser Flow: Divergence-based Generative Modeling on Manifolds

    Authors: Noam Rozen, Aditya Grover, Maximilian Nickel, Yaron Lipman

    Abstract: We are interested in learning generative models for complex geometries described via manifolds, such as spheres, tori, and other implicit surfaces. Current extensions of existing (Euclidean) generative models are restricted to specific geometries and typically suffer from high computational costs. We introduce Moser Flow (MF), a new class of generative models within the family of continuous normal…

    Submitted 2 November, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

  4. arXiv:1805.09078  [pdf]

    physics.class-ph cond-mat.soft physics.app-ph

    How reproducible are methods to measure the dynamic viscoelastic properties of poroelastic media?

    Authors: Paolo Bonfiglio, Francesco Pompoli, Kirill V. Horoshenkov, Mahmud Iskandar B Seth A Rahim, Luc Jaouen, Julia Rodenas, Francois-Xavier Becot, Emmanuel Gourdon, Dirk Jaeger, Volker Kursch, Maurizio Tarello, Nicolaas Bernardus Roozen, Christ Glorieux, Fabrizio Ferrian, Pierre Leroy, Francesco Briatico Vangosa, Nicolas Dauchez, Felix Foucart, Lei Lei, Kevin Carillo, Olivier Doutres, Franck Sgard, Raymond Panneton, Kevin Verdiere, Claudio Bertolini, et al. (8 additional authors not shown)

    Abstract: There is a considerable number of research publications on the acoustical properties of porous media with an elastic frame. A simple search through the Web of Science™ (last accessed 21 March 2018) suggests that there are at least 819 publications which deal with the acoustics of poroelastic media. A majority of these researches require accurate knowledge of the elastic properties over a broad fr…

    Submitted 23 May, 2018; originally announced May 2018.

    Journal ref: Journal of Sound & Vibration, vol. 428, pp. 26-43, 2018