Can Zero-Shot Commercial APIs Deliver Regulatory-Grade Clinical Text DeIdentification?

Kocaman, Veysel; Santas, Muhammed; Gul, Yigit; Butgul, Mehmet; Talby, David

Computer Science > Computation and Language

arXiv:2503.20794 (cs)

[Submitted on 21 Mar 2025 (v1), last revised 31 Mar 2025 (this version, v2)]

Title:Can Zero-Shot Commercial APIs Deliver Regulatory-Grade Clinical Text DeIdentification?

Authors:Veysel Kocaman, Muhammed Santas, Yigit Gul, Mehmet Butgul, David Talby

View PDF HTML (experimental)

Abstract:We evaluate the performance of four leading solutions for de-identification of unstructured medical text - Azure Health Data Services, AWS Comprehend Medical, OpenAI GPT-4o, and John Snow Labs - on a ground truth dataset of 48 clinical documents annotated by medical experts. The analysis, conducted at both entity-level and token-level, suggests that John Snow Labs' Medical Language Models solution achieves the highest accuracy, with a 96% F1-score in protected health information (PHI) detection, outperforming Azure (91%), AWS (83%), and GPT-4o (79%). John Snow Labs is not only the only solution which achieves regulatory-grade accuracy (surpassing that of human experts) but is also the most cost-effective solution: It is over 80% cheaper compared to Azure and GPT-4o, and is the only solution not priced by token. Its fixed-cost local deployment model avoids the escalating per-request fees of cloud-based services, making it a scalable and economical choice.

Comments:	14 pages, accepted at Text2Story Workshop at ECIR 2025
Subjects:	Computation and Language (cs.CL); Cryptography and Security (cs.CR); Information Retrieval (cs.IR); Machine Learning (cs.LG)
ACM classes:	H.3; F.2.2; I.2.7
Cite as:	arXiv:2503.20794 [cs.CL]
	(or arXiv:2503.20794v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2503.20794

Submission history

From: Veysel Kocaman Vk [view email]
[v1] Fri, 21 Mar 2025 10:05:04 UTC (2,776 KB)
[v2] Mon, 31 Mar 2025 19:44:35 UTC (2,776 KB)

Computer Science > Computation and Language

Title:Can Zero-Shot Commercial APIs Deliver Regulatory-Grade Clinical Text DeIdentification?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Can Zero-Shot Commercial APIs Deliver Regulatory-Grade Clinical Text DeIdentification?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators