Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification

Zhu, Dawei; Hedderich, Michael A.; Zhai, Fangzhou; Adelani, David Ifeoluwa; Klakow, Dietrich

Computer Science > Computation and Language

arXiv:2204.09371 (cs)

[Submitted on 20 Apr 2022]

Title:Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification

Authors:Dawei Zhu, Michael A. Hedderich, Fangzhou Zhai, David Ifeoluwa Adelani, Dietrich Klakow

View PDF

Abstract:Incorrect labels in training data occur when human annotators make mistakes or when the data is generated via weak or distant supervision. It has been shown that complex noise-handling techniques - by modeling, cleaning or filtering the noisy instances - are required to prevent models from fitting this label noise. However, we show in this work that, for text classification tasks with modern NLP models like BERT, over a variety of noise types, existing noisehandling methods do not always improve its performance, and may even deteriorate it, suggesting the need for further investigation. We also back our observations with a comprehensive analysis.

Comments:	Accepted at Workshop on Insights from Negative Results in NLP 2022 @ACL 2022
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2204.09371 [cs.CL]
	(or arXiv:2204.09371v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2204.09371

Submission history

From: Dawei Zhu [view email]
[v1] Wed, 20 Apr 2022 10:24:19 UTC (717 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2022-04

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Is BERT Robust to Label Noise? A Study on Learning with Noisy Labels in Text Classification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators