Positive Unlabeled Contrastive Learning

Acharya, Anish; Sanghavi, Sujay; **g, Li; Bhushanam, Bhargav; Choudhary, Dhruv; Rabbat, Michael; Dhillon, Inderjit

Computer Science > Machine Learning

arXiv:2206.01206v1 (cs)

[Submitted on 1 Jun 2022 (this version), latest version 28 Mar 2024 (v3)]

Title:Positive Unlabeled Contrastive Learning

Authors:Anish Acharya, Sujay Sanghavi, Li **g, Bhargav Bhushanam, Dhruv Choudhary, Michael Rabbat, Inderjit Dhillon

View PDF

Abstract:Self-supervised pretraining on unlabeled data followed by supervised finetuning on labeled data is a popular paradigm for learning from limited labeled examples. In this paper, we investigate and extend this paradigm to the classical positive unlabeled (PU) setting - the weakly supervised task of learning a binary classifier only using a few labeled positive examples and a set of unlabeled samples. We propose a novel PU learning objective positive unlabeled Noise Contrastive Estimation (puNCE) that leverages the available explicit (from labeled samples) and implicit (from unlabeled samples) supervision to learn useful representations from positive unlabeled input data. The underlying idea is to assign each training sample an individual weight; labeled positives are given unit weight; unlabeled samples are duplicated, one copy is labeled positive and the other as negative with weights $\pi$ and $(1-\pi)$ where $\pi$ denotes the class prior. Extensive experiments across vision and natural language tasks reveal that puNCE consistently improves over existing unsupervised and supervised contrastive baselines under limited supervision.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2206.01206 [cs.LG]
	(or arXiv:2206.01206v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2206.01206

Submission history

From: Anish Acharya [view email]
[v1] Wed, 1 Jun 2022 20:16:32 UTC (2,299 KB)
[v2] Tue, 15 Aug 2023 11:13:59 UTC (4,314 KB)
[v3] Thu, 28 Mar 2024 23:25:14 UTC (2,299 KB)

Computer Science > Machine Learning

Title:Positive Unlabeled Contrastive Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Positive Unlabeled Contrastive Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators