Effective Pre-Training Objectives for Transformer-based Autoencoders

Di Liello, Luca; Gabburo, Matteo; Moschitti, Alessandro

Computer Science > Computation and Language

arXiv:2210.13536 (cs)

[Submitted on 24 Oct 2022]

Title:Effective Pre-Training Objectives for Transformer-based Autoencoders

Authors:Luca Di Liello, Matteo Gabburo, Alessandro Moschitti

View PDF

Abstract:In this paper, we study trade-offs between efficiency, cost and accuracy when pre-training Transformer encoders with different pre-training objectives. For this purpose, we analyze features of common objectives and combine them to create new effective pre-training approaches. Specifically, we designed light token generators based on a straightforward statistical approach, which can replace ELECTRA computationally heavy generators, thus highly reducing cost. Our experiments also show that (i) there are more efficient alternatives to BERT's MLM, and (ii) it is possible to efficiently pre-train Transformer-based models using lighter generators without a significant drop in performance.

Comments:	Accepted at EMNLP 2022 Findings
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2210.13536 [cs.CL]
	(or arXiv:2210.13536v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2210.13536

Submission history

From: Luca Di Liello [view email]
[v1] Mon, 24 Oct 2022 18:39:44 UTC (851 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2022-10

Change to browse by:

References & Citations

export BibTeX citation

Computer Science > Computation and Language

Title:Effective Pre-Training Objectives for Transformer-based Autoencoders

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Effective Pre-Training Objectives for Transformer-based Autoencoders

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators