Advancing continual lifelong learning in neural information retrieval: definition, dataset, framework, and empirical evaluation

Hou, **grui; Cosma, Georgina; Finke, Axel

arXiv:2308.08378 (cs)

[Submitted on 16 Aug 2023 (v1), last revised 19 Jun 2024 (this version, v2)]

Abstract: Continual learning refers to the capability of a machine learning model to learn and adapt to new information, without compromising its performance on previously learned tasks. Although several studies have investigated continual learning methods for information retrieval tasks, a well-defined task formulation is still lacking, and it is unclear how typical learning strategies perform in this context. To address this challenge, a systematic task formulation of continual neural information retrieval is presented, along with a multiple-topic dataset that simulates continuous information retrieval. A comprehensive continual neural information retrieval framework consisting of typical retrieval models and continual learning strategies is then proposed. Empirical evaluations illustrate that the proposed framework can successfully prevent catastrophic forgetting in neural information retrieval and enhance performance on previously learned tasks. The results indicate that embedding-based retrieval models experience a decline in their continual learning performance as the topic shift distance and dataset volume of new tasks increase. In contrast, pretraining-based models do not show any such correlation. Adopting suitable learning strategies can mitigate the effects of topic shift and data augmentation.

Comments:	Submitted to Information Sciences
Subjects:	Information Retrieval (cs.IR); Computation and Language (cs.CL)
Cite as:	arXiv:2308.08378 [cs.IR]
	(or arXiv:2308.08378v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2308.08378

Submission history

From: **grui Hou
[v1] Wed, 16 Aug 2023 14:01:25 UTC (511 KB)
[v2] Wed, 19 Jun 2024 21:45:30 UTC (362 KB)
