Enhanced Short Text Modeling: Leveraging Large Language Models for Topic Refinement

Chang, Shuyu; Wang, Rui; Ren, Peng; Huang, Haiping

Computer Science > Computation and Language

arXiv:2403.17706 (cs)

[Submitted on 26 Mar 2024]

Title:Enhanced Short Text Modeling: Leveraging Large Language Models for Topic Refinement

Authors:Shuyu Chang, Rui Wang, Peng Ren, Haiping Huang

View PDF HTML (experimental)

Abstract:Crafting effective topic models for brief texts, like tweets and news headlines, is essential for capturing the swift shifts in social dynamics. Traditional topic models, however, often fall short in accurately representing the semantic intricacies of short texts due to their brevity and lack of contextual data. In our study, we harness the advanced capabilities of Large Language Models (LLMs) to introduce a novel approach termed "Topic Refinement". This approach does not directly involve itself in the initial modeling of topics but focuses on improving topics after they have been mined. By employing prompt engineering, we direct LLMs to eliminate off-topic words within a given topic, ensuring that only contextually relevant words are preserved or substituted with ones that fit better semantically. This method emulates human-like scrutiny and improvement of topics, thereby elevating the semantic quality of the topics generated by various models. Our comprehensive evaluation across three unique datasets has shown that our topic refinement approach significantly enhances the semantic coherence of topics.

Comments:	6 pages, 4 figures
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.17706 [cs.CL]
	(or arXiv:2403.17706v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.17706

Submission history

From: Shuyu Chang [view email]
[v1] Tue, 26 Mar 2024 13:50:34 UTC (1,108 KB)

Computer Science > Computation and Language

Title:Enhanced Short Text Modeling: Leveraging Large Language Models for Topic Refinement

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Enhanced Short Text Modeling: Leveraging Large Language Models for Topic Refinement

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators