ScamFerret: Detecting Scam Websites Autonomously with Large Language Models

Nakano, Hiroki; Koide, Takashi; Chiba, Daiki

Computer Science > Cryptography and Security

arXiv:2502.10110 (cs)

[Submitted on 14 Feb 2025]

Title:ScamFerret: Detecting Scam Websites Autonomously with Large Language Models

Authors:Hiroki Nakano, Takashi Koide, Daiki Chiba

View PDF HTML (experimental)

Abstract:With the rise of sophisticated scam websites that exploit human psychological vulnerabilities, distinguishing between legitimate and scam websites has become increasingly challenging. This paper presents ScamFerret, an innovative agent system employing a large language model (LLM) to autonomously collect and analyze data from a given URL to determine whether it is a scam. Unlike traditional machine learning models that require large datasets and feature engineering, ScamFerret leverages LLMs' natural language understanding to accurately identify scam websites of various types and languages without requiring additional training or fine-tuning. Our evaluation demonstrated that ScamFerret achieves 0.972 accuracy in classifying four scam types in English and 0.993 accuracy in classifying online shopping websites across three different languages, particularly when using GPT-4. Furthermore, we confirmed that ScamFerret collects and analyzes external information such as web content, DNS records, and user reviews as necessary, providing a basis for identifying scam websites from multiple perspectives. These results suggest that LLMs have significant potential in enhancing cybersecurity measures against sophisticated scam websites.

Comments:	Accepted for publication at DIMVA 2025
Subjects:	Cryptography and Security (cs.CR)
Cite as:	arXiv:2502.10110 [cs.CR]
	(or arXiv:2502.10110v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2502.10110

Submission history

From: Hiroki Nakano [view email]
[v1] Fri, 14 Feb 2025 12:16:38 UTC (225 KB)

Computer Science > Cryptography and Security

Title:ScamFerret: Detecting Scam Websites Autonomously with Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Cryptography and Security

Title:ScamFerret: Detecting Scam Websites Autonomously with Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators