PIVOT: Prompting for Video Continual Learning

Villa, Andrés; Alcázar, Juan León; Alfarra, Motasem; Alhamoud, Kumail; Hurtado, Julio; Heilbron, Fabian Caba; Soto, Alvaro; Ghanem, Bernard

Computer Science > Computer Vision and Pattern Recognition

arXiv:2212.04842 (cs)

[Submitted on 9 Dec 2022 (v1), last revised 4 Apr 2023 (this version, v2)]

Title:PIVOT: Prompting for Video Continual Learning

Authors:Andrés Villa, Juan León Alcázar, Motasem Alfarra, Kumail Alhamoud, Julio Hurtado, Fabian Caba Heilbron, Alvaro Soto, Bernard Ghanem

View PDF

Abstract:Modern machine learning pipelines are limited due to data availability, storage quotas, privacy regulations, and expensive annotation processes. These constraints make it difficult or impossible to train and update large-scale models on such dynamic annotated sets. Continual learning directly approaches this problem, with the ultimate goal of devising methods where a deep neural network effectively learns relevant patterns for new (unseen) classes, without significantly altering its performance on previously learned ones. In this paper, we address the problem of continual learning for video data. We introduce PIVOT, a novel method that leverages extensive knowledge in pre-trained models from the image domain, thereby reducing the number of trainable parameters and the associated forgetting. Unlike previous methods, ours is the first approach that effectively uses prompting mechanisms for continual learning without any in-domain pre-training. Our experiments show that PIVOT improves state-of-the-art methods by a significant 27% on the 20-task ActivityNet setup.

Comments:	CVPR 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2212.04842 [cs.CV]
	(or arXiv:2212.04842v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2212.04842

Submission history

From: Andrés Villa [view email]
[v1] Fri, 9 Dec 2022 13:22:27 UTC (2,624 KB)
[v2] Tue, 4 Apr 2023 22:28:05 UTC (5,185 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:PIVOT: Prompting for Video Continual Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:PIVOT: Prompting for Video Continual Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators