AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text

Belay, Tadesse Destaw; Azime, Israel Abebe; Ahmad, Ibrahim Said; Abdulmumin, Idris; Ayele, Abinew Ali; Muhammad, Shamsuddeen Hassan; Yimam, Seid Muhie

Computer Science > Computation and Language

arXiv:2503.18247 (cs)

[Submitted on 24 Mar 2025]

Title:AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text

Authors:Tadesse Destaw Belay, Israel Abebe Azime, Ibrahim Said Ahmad, Idris Abdulmumin, Abinew Ali Ayele, Shamsuddeen Hassan Muhammad, Seid Muhie Yimam

View PDF HTML (experimental)

Abstract:Pretrained Language Models (PLMs) built from various sources are the foundation of today's NLP progress. Language representations learned by such models achieve strong performance across many tasks with datasets of varying sizes drawn from various sources. We explore a thorough analysis of domain and task adaptive continual pretraining approaches for low-resource African languages and a promising result is shown for the evaluated tasks. We create AfriSocial, a corpus designed for domain adaptive finetuning that passes through quality pre-processing steps. Continual pretraining PLMs using AfriSocial as domain adaptive pretraining (DAPT) data, consistently improves performance on fine-grained emotion classification task of 16 targeted languages from 1% to 28.27% macro F1 score. Likewise, using the task adaptive pertaining (TAPT) approach, further finetuning with small unlabeled but similar task data shows promising results. For example, unlabeled sentiment data (source) for fine-grained emotion classification task (target) improves the base model results by an F1 score ranging from 0.55% to 15.11%. Combining the two methods, DAPT + TAPT, achieves also better results than base models. All the resources will be available to improve low-resource NLP tasks, generally, as well as other similar domain tasks such as hate speech and sentiment tasks.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2503.18247 [cs.CL]
	(or arXiv:2503.18247v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2503.18247

Submission history

From: Tadesse Destaw Belay [view email]
[v1] Mon, 24 Mar 2025 00:06:33 UTC (851 KB)

Computer Science > Computation and Language

Title:AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators