ELECTRA and GPT-4o: Cost-Effective Partners for Sentiment Analysis

Beno, James P.

arXiv:2501.00062 (cs)

[Submitted on 29 Dec 2024]

Abstract: Bidirectional transformers excel at sentiment analysis, and large language models (LLMs) are effective zero-shot learners. Might they perform better as a team? This paper explores collaborative approaches between ELECTRA and GPT-4o for three-way sentiment classification. We fine-tuned (FT) four models (ELECTRA Base/Large, GPT-4o/4o-mini) using a mix of reviews from Stanford Sentiment Treebank (SST) and DynaSent. We provided input from ELECTRA to GPT in three forms: predicted label, probabilities, and retrieved examples. Sharing ELECTRA Base FT predictions with GPT-4o-mini significantly improved performance over either model alone (82.74 macro F1 vs. 79.29 for ELECTRA Base FT and 79.52 for GPT-4o-mini) and yielded the lowest cost/performance ratio ($0.12/F1 point). However, when GPT models were fine-tuned, including predictions decreased performance. GPT-4o FT-M was the top performer (86.99 macro F1), with GPT-4o-mini FT close behind (86.77) at much lower cost ($0.38 vs. $1.59/F1 point). Our results show that augmenting prompts with predictions from fine-tuned encoders is an efficient way to boost performance, and that a fine-tuned GPT-4o-mini is nearly as good as GPT-4o FT at 76% less cost. Both are affordable options for projects with limited resources.
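As a rough illustration of the two ideas in the abstract, the Python sketch below builds a prompt that passes an encoder's predicted label and class probabilities to an LLM, and checks the "76% less cost" figure against the two quoted cost/F1-point values. The prompt wording and the names `build_augmented_prompt` and `relative_saving` are assumptions made for this example and are not taken from the paper; reading the 76% figure as a ratio of the two cost/F1-point values is also an assumption.

```python
from typing import Mapping, Optional

# The three sentiment classes used in the paper's three-way setup.
LABELS = ("negative", "neutral", "positive")


def build_augmented_prompt(
    review: str,
    predicted_label: str,
    probabilities: Optional[Mapping[str, float]] = None,
) -> str:
    """Build a classification prompt that includes an encoder's prediction.

    Hypothetical prompt format: the paper's actual template is not shown
    in the abstract, so this wording is an assumption.
    """
    if predicted_label not in LABELS:
        raise ValueError(f"unknown label: {predicted_label!r}")
    lines = [
        "Classify the sentiment of the review as negative, neutral, or positive.",
        f"Review: {review}",
        f"A fine-tuned encoder predicted: {predicted_label}",
    ]
    if probabilities is not None:
        # Report probabilities in a fixed label order for consistency.
        formatted = ", ".join(
            f"{label}: {probabilities[label]:.2f}" for label in LABELS
        )
        lines.append(f"Encoder probabilities: {formatted}")
    lines.append("Answer with a single label.")
    return "\n".join(lines)


def relative_saving(cheaper: float, pricier: float) -> float:
    """Percentage by which `cheaper` undercuts `pricier`."""
    return 100.0 * (1.0 - cheaper / pricier)


if __name__ == "__main__":
    prompt = build_augmented_prompt(
        "The plot dragged, but the acting was superb.",
        "positive",
        {"negative": 0.05, "neutral": 0.10, "positive": 0.85},
    )
    print(prompt)
    # Cost per F1 point quoted in the abstract: $0.38 vs. $1.59.
    print(f"{relative_saving(0.38, 1.59):.0f}% less cost per F1 point")
```

The resulting string would be sent as the user message to whichever GPT model is being evaluated; the API call itself is omitted here.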

Comments:	16 pages, 4 figures. Source code and data available at this https URL
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
ACM classes:	I.2.7
Cite as:	arXiv:2501.00062 [cs.CL]
	(or arXiv:2501.00062v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2501.00062

Submission history

From: James Beno [view email]
[v1] Sun, 29 Dec 2024 05:29:52 UTC (331 KB)
