Fine-tuning Strategies for Domain Specific Question Answering under Low Annotation Budget Constraints

Guo, Kunpeng; Diefenbach, Dennis; Gourru, Antoine; Gravier, Christophe

doi:10.1109/ICTAI59109.2023.00032

Computer Science > Computation and Language

arXiv:2401.09168 (cs)

[Submitted on 17 Jan 2024]

Title: Fine-tuning Strategies for Domain Specific Question Answering under Low Annotation Budget Constraints

Authors: Kunpeng Guo, Dennis Diefenbach, Antoine Gourru, Christophe Gravier

Abstract: The progress introduced by pre-trained language models and their fine-tuning has resulted in significant improvements in most downstream NLP tasks. The unsupervised training of a language model combined with further target task fine-tuning has become the standard QA fine-tuning procedure. In this work, we demonstrate that this strategy is sub-optimal for fine-tuning QA models, especially under a low QA annotation budget, which is a common setting in practice because of the cost of labeling extractive QA data. We draw our conclusions from an exhaustive analysis of the performance of alternatives to the sequential fine-tuning strategy on different QA datasets. Based on these experiments, we observe that the best strategy for fine-tuning a QA model in low-budget settings is to take a pre-trained language model (PLM) and fine-tune it on a dataset composed of the target dataset and the SQuAD dataset. With zero extra annotation effort, the best strategy outperforms the standard strategy by 2.28% to 6.48%. Our experiments provide one of the first investigations of how best to fine-tune a QA system under a low budget and are therefore of the utmost practical interest to QA practitioners.
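The merged-dataset strategy named in the abstract can be illustrated with a minimal sketch of the data-preparation step only. The abstract does not state how the two datasets are combined, so plain concatenation followed by a seeded shuffle is an assumption here, as are the function name `build_merged_training_set`, the example fields, and the toy records. The actual fine-tuning of the PLM is left out.

```python
import random
from typing import Dict, List

# Hypothetical record layout for an extractive QA example.
Example = Dict[str, str]  # keys: "question", "context", "answer"


def build_merged_training_set(
    target_examples: List[Example],
    squad_examples: List[Example],
    seed: int = 0,
) -> List[Example]:
    """Combine target-domain and SQuAD examples into one training set.

    Assumption: the two sets are concatenated and shuffled together so
    that a single fine-tuning run sees both sources interleaved. The
    mixing procedure used in the paper is not given in the abstract.
    """
    merged = list(target_examples) + list(squad_examples)
    random.Random(seed).shuffle(merged)  # seeded for reproducibility
    return merged


# Toy data standing in for a small annotated target set and SQuAD.
target = [
    {"question": "Which part fails first?", "context": "The pump seal fails first.", "answer": "The pump seal"},
    {"question": "What is the rated load?", "context": "The rated load is 2 tons.", "answer": "2 tons"},
]
squad = [
    {"question": "Who wrote Hamlet?", "context": "Hamlet was written by Shakespeare.", "answer": "Shakespeare"},
    {"question": "Where is Paris?", "context": "Paris is in France.", "answer": "France"},
    {"question": "What is H2O?", "context": "H2O is water.", "answer": "water"},
]

merged = build_merged_training_set(target, squad, seed=42)
print(len(merged), "training examples in the merged set")
```

In this sketch the merged list would be passed to a single fine-tuning run of the PLM, rather than running separate fine-tuning stages one after another.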

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2401.09168 [cs.CL]
	(or arXiv:2401.09168v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2401.09168
Related DOI:	https://doi.org/10.1109/ICTAI59109.2023.00032

Submission history

From: Kunpeng Guo
[v1] Wed, 17 Jan 2024 12:21:20 UTC (283 KB)
