Revisiting Distance Metric Learning for Few-Shot Natural Language Classification

Sosnowski, Witold; Wróblewska, Anna; Seweryn, Karolina; Gawrysiak, Piotr

Computer Science > Computation and Language

arXiv:2211.15202 (cs)

[Submitted on 28 Nov 2022]

Title:Revisiting Distance Metric Learning for Few-Shot Natural Language Classification

Authors:Witold Sosnowski, Anna Wróblewska, Karolina Seweryn, Piotr Gawrysiak

View PDF

Abstract:Distance Metric Learning (DML) has attracted much attention in image processing in recent years. This paper analyzes its impact on supervised fine-tuning language models for Natural Language Processing (NLP) classification tasks under few-shot learning settings. We investigated several DML loss functions in training RoBERTa language models on known SentEval Transfer Tasks datasets. We also analyzed the possibility of using proxy-based DML losses during model inference.
Our systematic experiments have shown that under few-shot learning settings, particularly proxy-based DML losses can positively affect the fine-tuning and inference of a supervised language model. Models tuned with a combination of CCE (categorical cross-entropy loss) and ProxyAnchor Loss have, on average, the best performance and outperform models with only CCE by about 3.27 percentage points -- up to 10.38 percentage points depending on the training dataset.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2211.15202 [cs.CL]
	(or arXiv:2211.15202v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2211.15202

Submission history

From: Karolina Seweryn [view email]
[v1] Mon, 28 Nov 2022 10:19:31 UTC (494 KB)

Computer Science > Computation and Language

Title:Revisiting Distance Metric Learning for Few-Shot Natural Language Classification

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Revisiting Distance Metric Learning for Few-Shot Natural Language Classification

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators