Word Order Matters when you Increase Masking

Lasri, Karim; Lenci, Alessandro; Poibeau, Thierry

arXiv:2211.04427 (cs)

[Submitted on 8 Nov 2022]

Abstract: Word order, an essential property of natural languages, is injected into Transformer-based neural language models through position encoding. However, recent experiments have shown that explicit position encoding is not always useful, since some models without such a feature have achieved state-of-the-art performance on some tasks. To better understand this phenomenon, we examine the effect of removing position encodings on the pre-training objective itself (i.e., masked language modelling), to test whether models can reconstruct position information from co-occurrences alone. We do so by controlling the number of masked tokens in the input sentence, as a proxy for varying the importance of position information for the task. We find that the necessity of position information increases with the amount of masking, and that masked language models without position encodings are not able to reconstruct this information on the task. These findings point towards a direct relationship between the amount of masking and the ability of Transformers to capture order-sensitive aspects of language using position encoding.
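
The setup described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the `[MASK]` symbol, the function names, the example sentence, and the masking ratios are all assumptions chosen for illustration. The sketch masks a controllable fraction of a sentence and then shows the order-free "bag of tokens" that remains once no position signal is attached to the input, which is roughly what a bidirectional model without position encodings has to work from.

```python
import random
from collections import Counter

MASK = "[MASK]"  # hypothetical mask symbol, chosen for illustration


def mask_tokens(tokens, ratio, seed=0):
    """Replace round(ratio * len(tokens)) randomly chosen tokens with MASK.

    Returns the masked sequence and a dict mapping each masked index to
    the original token (the prediction target).
    """
    if not 0.0 <= ratio <= 1.0:
        raise ValueError("ratio must be in [0, 1]")
    rng = random.Random(seed)
    n_masked = round(ratio * len(tokens))
    positions = sorted(rng.sample(range(len(tokens)), n_masked))
    targets = {i: tokens[i] for i in positions}
    masked = [MASK if i in targets else tok for i, tok in enumerate(tokens)]
    return masked, targets


def orderless_view(masked):
    """Order-free view of the input: token counts only, no positions."""
    return Counter(masked)


# Assumed example sentence and ratios, not taken from the paper.
sentence = "the dog chased the cat across the yard".split()
for ratio in (0.125, 0.5, 0.875):
    masked, targets = mask_tokens(sentence, ratio)
    print(ratio, " ".join(masked), dict(orderless_view(masked)))
```

As the ratio grows, fewer visible tokens remain to constrain each prediction, and in the order-free view every `[MASK]` is indistinguishable from the others. This is one intuition for why position information would matter more under heavier masking.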

Comments:	Accepted at EMNLP 2022 (main conference)
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2211.04427 [cs.CL]
	(or arXiv:2211.04427v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2211.04427

Submission history

From: Karim Lasri
[v1] Tue, 8 Nov 2022 18:14:04 UTC (6,472 KB)
