Editing Commonsense Knowledge in GPT

Gupta, Anshita; Mondal, Debanjan; Sheshadri, Akshay Krishna; Zhao, Wenlong; Li, Xiang Lorraine; Wiegreffe, Sarah; Tandon, Niket

arXiv:2305.14956v1 (cs)

[Submitted on 24 May 2023 (this version), latest version 26 Oct 2023 (v3)]

Abstract: Memory editing methods for updating encyclopedic knowledge in transformers have received increasing attention for their efficacy, specificity, and generalization advantages. However, it remains unclear whether such methods can be adapted to the more nuanced domain of commonsense knowledge. We propose $MEMIT_{CSK}$, an adaptation of MEMIT to edit commonsense mistakes in GPT-2 Large and XL. We extend editing to various token locations and employ a robust layer selection strategy. Models edited by $MEMIT_{CSK}$ outperform the fine-tuning baselines by 10.97% and 10.73% in F1 score on subsets of PEP3k and 20Q. We further propose a novel evaluation dataset, MEMIT-CSK-PROBE, that contains unaffected neighborhood, affected neighborhood, affected paraphrase, and affected reasoning challenges. $MEMIT_{CSK}$ demonstrates favorable semantic generalization, outperforming fine-tuning baselines by 13.72% and 5.57% in overall score on MEMIT-CSK-PROBE. These results suggest a compelling future direction: incorporating context-specific user feedback about commonsense into GPT through direct model editing, so that model behaviors can be rectified and customized via human-in-the-loop systems.

Comments:	Code and data are available at this https URL
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2305.14956 [cs.CL]
	(or arXiv:2305.14956v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.14956

Submission history

From: Debanjan Mondal
[v1] Wed, 24 May 2023 09:50:54 UTC (4,465 KB)
[v2] Mon, 9 Oct 2023 19:00:44 UTC (5,722 KB)
[v3] Thu, 26 Oct 2023 15:38:38 UTC (2,919 KB)
