Halo: Estimation and Reduction of Hallucinations in Open-Source Weak Large Language Models

Elaraby, Mohamed; Lu, Mengyin; Dunn, Jacob; Zhang, Xueying; Wang, Yu; Liu, Shizhu; Tian, Pingchuan; Wang, Yuping; Wang, Yuxuan

arXiv:2308.11764 (cs)

[Submitted on 22 Aug 2023 (v1), last revised 13 Sep 2023 (this version, v4)]

Abstract: Large Language Models (LLMs) have revolutionized Natural Language Processing (NLP). Although convenient for research and practical applications, open-source LLMs with fewer parameters often suffer from more severe hallucinations than their larger counterparts. This paper focuses on measuring and reducing hallucinations in BLOOM 7B, a representative of the weaker open-source LLMs that are publicly available for research and commercial applications. We introduce HaloCheck, a lightweight, black-box, knowledge-free framework designed to quantify the severity of hallucinations in LLMs. Additionally, we explore techniques such as knowledge injection and teacher-student approaches to alleviate hallucinations in low-parameter LLMs. Our experiments demonstrate a reduction of hallucinations in domains that are challenging for these LLMs.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2308.11764 [cs.CL]
	(or arXiv:2308.11764v4 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2308.11764

Submission history

From: Mohamed Elaraby
[v1] Tue, 22 Aug 2023 20:12:49 UTC (220 KB)
[v2] Thu, 24 Aug 2023 17:57:00 UTC (220 KB)
[v3] Thu, 7 Sep 2023 04:16:54 UTC (220 KB)
[v4] Wed, 13 Sep 2023 18:01:36 UTC (220 KB)
