CAPE: Encoding Relative Positions with Continuous Augmented Positional Embeddings

Likhomanenko, Tatiana; Xu, Qiantong; Collobert, Ronan; Synnaeve, Gabriel; Rogozhnikov, Alex

Computer Science > Machine Learning

arXiv:2106.03143v1 (cs)

[Submitted on 6 Jun 2021 (this version), latest version 9 Nov 2021 (v3)]

Title:CAPE: Encoding Relative Positions with Continuous Augmented Positional Embeddings

Authors:Tatiana Likhomanenko, Qiantong Xu, Ronan Collobert, Gabriel Synnaeve, Alex Rogozhnikov

View PDF

Abstract:Without positional information, attention-based transformer neural networks are permutation-invariant. Absolute or relative positional embeddings are the most popular ways to feed transformer models positional information. Absolute positional embeddings are simple to implement, but suffer from generalization issues when evaluating on sequences of different length than those seen at training time. Relative positions are more robust to length change, but are more complex to implement and yield inferior model throughput. In this paper, we propose an augmentation-based approach (CAPE) for absolute positional embeddings, which keeps the advantages of both absolute (simplicity and speed) and relative position embeddings (better generalization). In addition, our empirical evaluation on state-of-the-art models in machine translation, image and speech recognition demonstrates that CAPE leads to better generalization performance as well as increased stability with respect to training hyper-parameters.

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2106.03143 [cs.LG]
	(or arXiv:2106.03143v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2106.03143

Submission history

From: Tatiana Likhomanenko [view email]
[v1] Sun, 6 Jun 2021 14:54:55 UTC (1,521 KB)
[v2] Wed, 28 Jul 2021 02:42:35 UTC (14,410 KB)
[v3] Tue, 9 Nov 2021 03:03:27 UTC (14,969 KB)

Computer Science > Machine Learning

Title:CAPE: Encoding Relative Positions with Continuous Augmented Positional Embeddings

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:CAPE: Encoding Relative Positions with Continuous Augmented Positional Embeddings

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators