Effective writing style imitation via combinatorial paraphrasing

Gröndahl, Tommi; Asokan, N.

Computer Science > Computation and Language

arXiv:1905.13464v1 (cs)

[Submitted on 31 May 2019 (this version), latest version 3 Jul 2020 (v3)]

Title:Effective writing style imitation via combinatorial paraphrasing

Authors:Tommi Gröndahl, N. Asokan

View PDF

Abstract:Stylometry can be used to profile authors based on their written text. Transforming text to imitate someone else's writing style while retaining meaning constitutes a defence. A variety of deep learning methods for style imitation have been proposed in recent research literature. Via empirical evaluation of three state-of-the-art models on four datasets, we illustrate that none succeed in semantic retainment, often drastically changing the original meaning or removing important parts of the text. To mitigate this problem we present ParChoice: an alternative approach based on the combinatorial application of multiple paraphrasing techniques. ParChoice first produces a large number of possible candidate paraphrases, from which it then chooses the candidate that maximizes proximity to a target corpus. Through systematic automated and manual evaluation as well as a user study, we demonstrate that ParChoice significantly outperforms prior methods in its ability to retain semantic content. Using state-of-the art deep learning author profiling tools, we additionally show that ParChoice accomplishes better imitation success than A$^4$NT, the state-of-the-art style imitation technique with the best semantic retainment.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1905.13464 [cs.CL]
	(or arXiv:1905.13464v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1905.13464

Submission history

From: Tommi Gröndahl [view email]
[v1] Fri, 31 May 2019 08:42:27 UTC (135 KB)
[v2] Tue, 16 Jun 2020 14:02:31 UTC (130 KB)
[v3] Fri, 3 Jul 2020 12:36:28 UTC (130 KB)

Computer Science > Computation and Language

Title:Effective writing style imitation via combinatorial paraphrasing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Effective writing style imitation via combinatorial paraphrasing

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators