Avengers Ensemble! Improving Transferability of Authorship Obfuscation

Haroon, Muhammad; Zaffar, Fareed; Srinivasan, Padmini; Shafiq, Zubair

arXiv:2109.07028 (cs)

[Submitted on 15 Sep 2021 (v1), last revised 8 Oct 2021 (this version, v2)]

Abstract: Stylometric approaches have been shown to be quite effective for real-world authorship attribution. To mitigate the privacy threat posed by authorship attribution, researchers have proposed automated authorship obfuscation approaches that aim to conceal the stylometric artefacts that give away the identity of an anonymous document's author. Recent work has focused on authorship obfuscation approaches that rely on black-box access to an attribution classifier to evade attribution while preserving semantics. However, to be useful under a realistic threat model, it is important that these obfuscation approaches work well even when the adversary's attribution classifier is different from the one used internally by the obfuscator. Unfortunately, existing authorship obfuscation approaches do not transfer well to unseen attribution classifiers. In this paper, we propose an ensemble-based approach for transferable authorship obfuscation. Our experiments show that if an obfuscator can evade an ensemble attribution classifier, which is based on multiple base attribution classifiers, it is more likely to transfer to different attribution classifiers. Our analysis shows that ensemble-based authorship obfuscation achieves better transferability because it combines the knowledge from each of the base attribution classifiers by essentially averaging their decision boundaries.
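
As a rough illustration of the ensemble idea in the abstract, the following minimal Python sketch averages the per-author probabilities of several base attribution classifiers (soft voting) and checks whether a document evades the averaged decision. The abstract does not state the exact combination rule, so soft voting is an assumption here, and every name in the sketch (`ensemble_predict_proba`, `ensemble_attribute`, `evades`, the two toy classifiers, and the authors "alice" and "bob") is hypothetical rather than taken from the paper.

```python
from typing import Callable, Dict, Sequence

# Assumption: a base attribution classifier maps a document to a
# probability for each candidate author. This interface is hypothetical.
Classifier = Callable[[str], Dict[str, float]]


def ensemble_predict_proba(classifiers: Sequence[Classifier], document: str) -> Dict[str, float]:
    """Soft voting: average the per-author probabilities of all base classifiers."""
    if not classifiers:
        raise ValueError("at least one base classifier is required")
    totals: Dict[str, float] = {}
    for clf in classifiers:
        for author, prob in clf(document).items():
            totals[author] = totals.get(author, 0.0) + prob
    return {author: total / len(classifiers) for author, total in totals.items()}


def ensemble_attribute(classifiers: Sequence[Classifier], document: str) -> str:
    """Return the author with the highest averaged probability."""
    probs = ensemble_predict_proba(classifiers, document)
    return max(probs, key=lambda author: probs[author])


def evades(classifiers: Sequence[Classifier], document: str, true_author: str) -> bool:
    """An obfuscated document evades the ensemble if it is no longer attributed to its true author."""
    return ensemble_attribute(classifiers, document) != true_author


# Toy stand-ins for real stylometric classifiers (illustrative only).
def toy_word_length_clf(document: str) -> Dict[str, float]:
    words = document.split()
    avg_len = sum(len(w) for w in words) / max(len(words), 1)
    p_alice = min(max((avg_len - 3.0) / 4.0, 0.0), 1.0)
    return {"alice": p_alice, "bob": 1.0 - p_alice}


def toy_comma_clf(document: str) -> Dict[str, float]:
    words = document.split()
    comma_rate = document.count(",") / max(len(words), 1)
    p_alice = min(comma_rate * 5.0, 1.0)
    return {"alice": p_alice, "bob": 1.0 - p_alice}


if __name__ == "__main__":
    base = [toy_word_length_clf, toy_comma_clf]
    text = "Notwithstanding, considerable, elaborate, punctuation persists"
    print(ensemble_predict_proba(base, text))
    print(evades(base, text, true_author="alice"))
```

In this sketch an obfuscator would query `evades` in a black-box fashion while rewriting the text. Averaging probabilities means a rewrite has to move the document away from its true author under the classifiers jointly, which is one informal way to read "averaging their decision boundaries".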

Comments:	Submitted to PETS 2021
Subjects:	Machine Learning (cs.LG); Cryptography and Security (cs.CR)
Cite as:	arXiv:2109.07028 [cs.LG]
	(or arXiv:2109.07028v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.07028

Submission history

From: Muhammad Haroon
[v1] Wed, 15 Sep 2021 00:11:40 UTC (563 KB)
[v2] Fri, 8 Oct 2021 17:04:43 UTC (563 KB)
