CHBias: Bias Evaluation and Mitigation of Chinese Conversational Language Models

Zhao, Jiaxu; Fang, Meng; Shi, Zijing; Li, Yitong; Chen, Ling; Pechenizkiy, Mykola

arXiv:2305.11262 (cs)

[Submitted on 18 May 2023]

Abstract: Warning: This paper contains content that may be offensive or upsetting. Pretrained conversational agents have been shown to have safety issues, exhibiting a range of stereotypical human biases such as gender bias. However, current research covers only a limited set of bias categories, and most of it focuses on English. In this paper, we introduce a new Chinese dataset, CHBias, for bias evaluation and mitigation of Chinese conversational language models. In addition to well-explored bias categories, CHBias includes under-explored categories, such as ageism and appearance bias. We evaluate two popular pretrained Chinese conversational models, CDial-GPT and EVA2.0, using CHBias. Furthermore, to mitigate different biases, we apply several debiasing methods to these models. Experimental results show that these Chinese pretrained models are at risk of generating texts that contain social biases, and that debiasing methods using the proposed dataset can make response generation less biased while preserving the models' conversational capabilities.

Comments:	Accepted by ACL 2023
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2305.11262 [cs.CL]
	(or arXiv:2305.11262v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2305.11262

Submission history

From: Jiaxu Zhao
[v1] Thu, 18 May 2023 18:58:30 UTC (174 KB)
