Fairness-Aware Structured Pruning in Transformers

Zayed, Abdelrahman; Mordido, Goncalo; Shabanian, Samira; Baldini, Ioana; Chandar, Sarath

Computer Science > Computation and Language

arXiv:2312.15398 (cs)

[Submitted on 24 Dec 2023]

Title:Fairness-Aware Structured Pruning in Transformers

Authors:Abdelrahman Zayed, Goncalo Mordido, Samira Shabanian, Ioana Baldini, Sarath Chandar

View PDF HTML (experimental)

Abstract:The increasing size of large language models (LLMs) has introduced challenges in their training and inference. Removing model components is perceived as a solution to tackle the large model sizes, however, existing pruning methods solely focus on performance, without considering an essential aspect for the responsible use of LLMs: model fairness. It is crucial to address the fairness of LLMs towards diverse groups, such as women, Black people, LGBTQ+, Jewish communities, among others, as they are being deployed and available to a wide audience. In this work, first, we investigate how attention heads impact fairness and performance in pre-trained transformer-based language models. We then propose a novel method to prune the attention heads that negatively impact fairness while retaining the heads critical for performance, i.e. language modeling capabilities. Our approach is practical in terms of time and resources, as it does not require fine-tuning the final pruned, and fairer, model. Our findings demonstrate a reduction in gender bias by 19%, 19.5%, 39.5%, 34.7%, 23%, and 8% for DistilGPT-2, GPT-2, GPT-Neo of two different sizes, GPT-J, and Llama 2 models, respectively, in comparison to the biased model, with only a slight decrease in performance.

Comments:	In Proceedings of AAAI 2024
Subjects:	Computation and Language (cs.CL); Computers and Society (cs.CY); Machine Learning (cs.LG)
Cite as:	arXiv:2312.15398 [cs.CL]
	(or arXiv:2312.15398v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2312.15398

Submission history

From: Abdelrahman Zayed [view email]
[v1] Sun, 24 Dec 2023 03:57:52 UTC (4,990 KB)

Computer Science > Computation and Language

Title:Fairness-Aware Structured Pruning in Transformers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Fairness-Aware Structured Pruning in Transformers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators