Showing 1–4 of 4 results for author: Nussbaum, Z

  1. arXiv:2406.18587  [pdf, other]

    cs.CV cs.AI

    Nomic Embed Vision: Expanding the Latent Space

    Authors: Zach Nussbaum, Brandon Duderstadt, Andriy Mulyar

    Abstract: This technical report describes the training of nomic-embed-vision, a highly performant, open-code, open-weights image embedding model that shares the same latent space as nomic-embed-text. Together, nomic-embed-vision and nomic-embed-text form the first unified latent space to achieve high performance across vision, language, and multimodal tasks.

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2402.01613  [pdf, other]

    cs.CL cs.AI

    Nomic Embed: Training a Reproducible Long Context Text Embedder

    Authors: Zach Nussbaum, John X. Morris, Brandon Duderstadt, Andriy Mulyar

    Abstract: This technical report describes the training of nomic-embed-text-v1, the first fully reproducible, open-source, open-weights, open-data, 8192 context length English text embedding model that outperforms both OpenAI Ada-002 and OpenAI text-embedding-3-small on short and long-context tasks. We release the training code and model weights under an Apache 2 license. In contrast with other open-source m…

    Submitted 2 February, 2024; originally announced February 2024.

  3. arXiv:2311.04931  [pdf, other]

    cs.CL cs.AI

    GPT4All: An Ecosystem of Open Source Compressed Language Models

    Authors: Yuvanesh Anand, Zach Nussbaum, Adam Treat, Aaron Miller, Richard Guo, Ben Schmidt, GPT4All Community, Brandon Duderstadt, Andriy Mulyar

    Abstract: Large language models (LLMs) have recently achieved human-level performance on a range of professional and academic benchmarks. The accessibility of these models has lagged behind their performance. State-of-the-art LLMs require costly infrastructure; are only accessible via rate-limited, geo-locked, and censored web interfaces; and lack publicly available code and technical reports. In this paper…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: Accepted at NLP-OSS at EMNLP 2023

  4. arXiv:2107.13098  [pdf, other]

    cs.CV cs.LG

    A Tale Of Two Long Tails

    Authors: Daniel D'souza, Zach Nussbaum, Chirag Agarwal, Sara Hooker

    Abstract: As machine learning models are increasingly employed to assist human decision-makers, it becomes critical to communicate the uncertainty associated with these model predictions. However, the majority of work on uncertainty has focused on traditional probabilistic or ranking approaches - where the model assigns low probabilities or scores to uncertain examples. While this captures what examples are…

    Submitted 27 July, 2021; originally announced July 2021.

    Comments: Preliminary results accepted to Workshop on Uncertainty and Robustness in Deep Learning (UDL), ICML, 2021