Zero-shot and Few-shot Generation Strategies for Artificial Clinical Records

Frayling, Erlend; Lever, Jake; McDonald, Graham

Computer Science > Computation and Language

arXiv:2403.08664 (cs)

[Submitted on 13 Mar 2024 (v1), last revised 14 Mar 2024 (this version, v2)]

Title:Zero-shot and Few-shot Generation Strategies for Artificial Clinical Records

Authors:Erlend Frayling, Jake Lever, Graham McDonald

View PDF HTML (experimental)

Abstract:The challenge of accessing historical patient data for clinical research, while adhering to privacy regulations, is a significant obstacle in medical science. An innovative approach to circumvent this issue involves utilising synthetic medical records that mirror real patient data without compromising individual privacy. The creation of these synthetic datasets, particularly without using actual patient data to train Large Language Models (LLMs), presents a novel solution as gaining access to sensitive patient information to train models is also a challenge. This study assesses the capability of the Llama 2 LLM to create synthetic medical records that accurately reflect real patient information, employing zero-shot and few-shot prompting strategies for comparison against fine-tuned methodologies that do require sensitive patient data during training. We focus on generating synthetic narratives for the History of Present Illness section, utilising data from the MIMIC-IV dataset for comparison. In this work introduce a novel prompting technique that leverages a chain-of-thought approach, enhancing the model's ability to generate more accurate and contextually relevant medical narratives without prior fine-tuning. Our findings suggest that this chain-of-thought prompted approach allows the zero-shot model to achieve results on par with those of fine-tuned models, based on Rouge metrics evaluation.

Comments:	4 pages
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2403.08664 [cs.CL]
	(or arXiv:2403.08664v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.08664

Submission history

From: Erlend Frayling [view email]
[v1] Wed, 13 Mar 2024 16:17:09 UTC (1,264 KB)
[v2] Thu, 14 Mar 2024 15:57:59 UTC (1,264 KB)

Computer Science > Computation and Language

Title:Zero-shot and Few-shot Generation Strategies for Artificial Clinical Records

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Zero-shot and Few-shot Generation Strategies for Artificial Clinical Records

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators