Skip to main content

Showing 1–5 of 5 results for author: Cohen, A D

.
  1. arXiv:2407.07080  [pdf, other

    cs.CL

    Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities

    Authors: Shaltiel Shmidman, Avi Shmidman, Amir DN Cohen, Moshe Koppel

    Abstract: Training large language models (LLMs) in low-resource languages such as Hebrew poses unique challenges. In this paper, we introduce DictaLM2.0 and DictaLM2.0-Instruct, two LLMs derived from the Mistral model, trained on a substantial corpus of approximately 200 billion tokens in both Hebrew and English. Adapting a pre-trained model to a new language involves specialized techniques that differ sign… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2310.14282  [pdf, other

    cs.CL cs.AI cs.IR

    NERetrieve: Dataset for Next Generation Named Entity Recognition and Retrieval

    Authors: Uri Katz, Matan Vetzler, Amir DN Cohen, Yoav Goldberg

    Abstract: Recognizing entities in texts is a central need in many information-seeking scenarios, and indeed, Named Entity Recognition (NER) is arguably one of the most successful examples of a widely adopted NLP task and corresponding NLP technology. Recent advances in large language models (LLMs) appear to provide effective solutions (also) for NER tasks that were traditionally handled with dedicated model… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP 2023

  3. arXiv:2309.14568  [pdf, other

    cs.CL

    Introducing DictaLM -- A Large Generative Language Model for Modern Hebrew

    Authors: Shaltiel Shmidman, Avi Shmidman, Amir David Nissan Cohen, Moshe Koppel

    Abstract: We present DictaLM, a large-scale language model tailored for Modern Hebrew. Boasting 7B parameters, this model is predominantly trained on Hebrew-centric data. As a commitment to promoting research and development in the Hebrew language, we release both the foundation model and the instruct-tuned model under a Creative Commons license. Concurrently, we introduce DictaLM-Rab, another foundation mo… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

  4. arXiv:2305.12517  [pdf, other

    cs.CL cs.IR cs.LG

    Description-Based Text Similarity

    Authors: Shauli Ravfogel, Valentina Pyatkin, Amir DN Cohen, Avshalom Manevich, Yoav Goldberg

    Abstract: Identifying texts with a given semantics is central for many information seeking scenarios. Similarity search over vector embeddings appear to be central to this ability, yet the similarity reflected in current text embeddings is corpus-driven, and is inconsistent and sub-optimal for many use cases. What, then, is a good notion of similarity for effective retrieval of text? We identify the need… ▽ More

    Submitted 26 April, 2024; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: A preprint

  5. arXiv:2010.04829  [pdf, other

    cs.CL

    Relation Classification as Two-way Span-Prediction

    Authors: Amir DN Cohen, Shachar Rosenman, Yoav Goldberg

    Abstract: The current supervised relation classification (RC) task uses a single embedding to represent the relation between a pair of entities. We argue that a better approach is to treat the RC task as span-prediction (SP) problem, similar to Question answering (QA). We present a span-prediction based system for RC and evaluate its performance compared to the embedding based system. We demonstrate that th… ▽ More

    Submitted 17 April, 2021; v1 submitted 9 October, 2020; originally announced October 2020.