Skip to main content

Showing 1–3 of 3 results for author: Carignan, D

.
  1. arXiv:2311.16452  [pdf, other

    cs.CL

    Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine

    Authors: Harsha Nori, Yin Tat Lee, Sheng Zhang, Dean Carignan, Richard Edgar, Nicolo Fusi, Nicholas King, Jonathan Larson, Yuanzhi Li, Weishung Liu, Renqian Luo, Scott Mayer McKinney, Robert Osazuwa Ness, Hoifung Poon, Tao Qin, Naoto Usuyama, Chris White, Eric Horvitz

    Abstract: Generalist foundation models such as GPT-4 have displayed surprising capabilities in a wide variety of domains and tasks. Yet, there is a prevalent assumption that they cannot match specialist capabilities of fine-tuned models. For example, most explorations to date on medical competency benchmarks have leveraged domain-specific training, as exemplified by efforts on BioGPT and Med-PaLM. We build… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 21 pages, 7 figures

    ACM Class: I.2.7

  2. arXiv:2303.13375  [pdf, other

    cs.CL cs.AI

    Capabilities of GPT-4 on Medical Challenge Problems

    Authors: Harsha Nori, Nicholas King, Scott Mayer McKinney, Dean Carignan, Eric Horvitz

    Abstract: Large language models (LLMs) have demonstrated remarkable capabilities in natural language understanding and generation across various domains, including medicine. We present a comprehensive evaluation of GPT-4, a state-of-the-art LLM, on medical competency examinations and benchmark datasets. GPT-4 is a general-purpose model that is not specialized for medical problems through training or enginee… ▽ More

    Submitted 12 April, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: 35 pages, 15 figures; added GPT-4-base model results and discussion

  3. arXiv:2110.08413  [pdf, other

    cs.CL cs.LG

    Invariant Language Modeling

    Authors: Maxime Peyrard, Sarvjeet Singh Ghotra, Martin Josifoski, Vidhan Agarwal, Barun Patra, Dean Carignan, Emre Kiciman, Robert West

    Abstract: Large pretrained language models are critical components of modern NLP pipelines. Yet, they suffer from spurious correlations, poor out-of-domain generalization, and biases. Inspired by recent progress in causal machine learning, in particular the invariant risk minimization (IRM) paradigm, we propose invariant language modeling, a framework for learning invariant representations that generalize b… ▽ More

    Submitted 14 November, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: Published at EMNLP 2022