Disentangling the Linguistic Competence of Privacy-Preserving BERT

Arnold, Stefan; Kemmerzell, Nils; Schreiner, Annika

Computer Science > Computation and Language

arXiv:2310.11363 (cs)

[Submitted on 17 Oct 2023]

Title:Disentangling the Linguistic Competence of Privacy-Preserving BERT

Authors:Stefan Arnold, Nils Kemmerzell, Annika Schreiner

View PDF

Abstract:Differential Privacy (DP) has been tailored to address the unique challenges of text-to-text privatization. However, text-to-text privatization is known for degrading the performance of language models when trained on perturbed text. Employing a series of interpretation techniques on the internal representations extracted from BERT trained on perturbed pre-text, we intend to disentangle at the linguistic level the distortion induced by differential privacy. Experimental results from a representational similarity analysis indicate that the overall similarity of internal representations is substantially reduced. Using probing tasks to unpack this dissimilarity, we find evidence that text-to-text privatization affects the linguistic competence across several formalisms, encoding localized properties of words while falling short at encoding the contextual relationships between spans of words.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2310.11363 [cs.CL]
	(or arXiv:2310.11363v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2310.11363

Submission history

From: Stefan Arnold [view email]
[v1] Tue, 17 Oct 2023 16:00:26 UTC (508 KB)

Computer Science > Computation and Language

Title:Disentangling the Linguistic Competence of Privacy-Preserving BERT

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Disentangling the Linguistic Competence of Privacy-Preserving BERT

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators