LT@Helsinki at SemEval-2020 Task 12: Multilingual or language-specific BERT?

Pàmies, Marc; Öhman, Emily; Kajava, Kaisla; Tiedemann, Jörg

Computer Science > Computation and Language

arXiv:2008.00805 (cs)

[Submitted on 3 Aug 2020]

Title:LT@Helsinki at SemEval-2020 Task 12: Multilingual or language-specific BERT?

Authors:Marc Pàmies, Emily Öhman, Kaisla Kajava, Jörg Tiedemann

View PDF

Abstract:This paper presents the different models submitted by the LT@Helsinki team for the SemEval 2020 Shared Task 12. Our team participated in sub-tasks A and C; titled offensive language identification and offense target identification, respectively. In both cases we used the so-called Bidirectional Encoder Representation from Transformer (BERT), a model pre-trained by Google and fine-tuned by us on the OLID and SOLID datasets. The results show that offensive tweet classification is one of several language-based tasks where BERT can achieve state-of-the-art results.

Comments:	Accepted at SemEval-2020 Task 12. Identical to camera-ready version except where adjustments to fit arXiv requirements were necessary
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2008.00805 [cs.CL]
	(or arXiv:2008.00805v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2008.00805

Submission history

From: Emily Ohman [view email]
[v1] Mon, 3 Aug 2020 12:03:17 UTC (214 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2020-08

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Jörg Tiedemann

export BibTeX citation

Computer Science > Computation and Language

Title:LT@Helsinki at SemEval-2020 Task 12: Multilingual or language-specific BERT?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:LT@Helsinki at SemEval-2020 Task 12: Multilingual or language-specific BERT?

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators