GIEBench: Towards Holistic Evaluation of Group Identity-based Empathy for Large Language Models

Wang, Leyan; **, Yonggang; Shen, Tianhao; Zheng, Tianyu; Du, Xinrun; Zhang, Chenchen; Huang, Wenhao; Liu, Jiaheng; Wang, Shi; Zhang, Ge; Xiang, Liuyu; He, Zhaofeng

Computer Science > Artificial Intelligence

arXiv:2406.14903 (cs)

[Submitted on 21 Jun 2024 (v1), last revised 24 Jun 2024 (this version, v2)]

Title:GIEBench: Towards Holistic Evaluation of Group Identity-based Empathy for Large Language Models

Authors:Leyan Wang, Yonggang **, Tianhao Shen, Tianyu Zheng, Xinrun Du, Chenchen Zhang, Wenhao Huang, Jiaheng Liu, Shi Wang, Ge Zhang, Liuyu Xiang, Zhaofeng He

View PDF

Abstract:As large language models (LLMs) continue to develop and gain widespread application, the ability of LLMs to exhibit empathy towards diverse group identities and understand their perspectives is increasingly recognized as critical. Most existing benchmarks for empathy evaluation of LLMs focus primarily on universal human emotions, such as sadness and pain, often overlooking the context of individuals' group identities. To address this gap, we introduce GIEBench, a comprehensive benchmark that includes 11 identity dimensions, covering 97 group identities with a total of 999 single-choice questions related to specific group identities. GIEBench is designed to evaluate the empathy of LLMs when presented with specific group identities such as gender, age, occupation, and race, emphasizing their ability to respond from the standpoint of the identified group. This supports the ongoing development of empathetic LLM applications tailored to users with different identities. Our evaluation of 23 LLMs revealed that while these LLMs understand different identity standpoints, they fail to consistently exhibit equal empathy across these identities without explicit instructions to adopt those perspectives. This highlights the need for improved alignment of LLMs with diverse values to better accommodate the multifaceted nature of human identities. Our datasets are available at this https URL.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2406.14903 [cs.AI]
	(or arXiv:2406.14903v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2406.14903

Submission history

From: Leyan Wang [view email]
[v1] Fri, 21 Jun 2024 06:50:42 UTC (687 KB)
[v2] Mon, 24 Jun 2024 14:57:18 UTC (1,367 KB)

Computer Science > Artificial Intelligence

Title:GIEBench: Towards Holistic Evaluation of Group Identity-based Empathy for Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:GIEBench: Towards Holistic Evaluation of Group Identity-based Empathy for Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators