Can Zero-Shot Commercial APIs Deliver Regulatory-Grade Clinical Text DeIdentification?

Kocaman, Veysel; Santas, Muhammed; Gul, Yigit; Butgul, Mehmet; Talby, David

arXiv:2503.20794 (cs)

[Submitted on 21 Mar 2025]

Abstract: We systematically assess three leading API-based de-identification systems (Azure Health Data Services, AWS Comprehend Medical, and OpenAI GPT-4o) against our own de-identification system, Healthcare NLP, on a ground-truth dataset of 48 clinical documents annotated by medical experts. Our analysis, conducted at both the entity level and the token level, shows that Healthcare NLP achieves the highest accuracy, with a 96% F1-score in protected health information (PHI) detection, significantly outperforming Azure (91%), AWS (83%), and GPT-4o (79%). Healthcare NLP is also the most cost-effective solution, reducing processing costs by over 80% compared to Azure and GPT-4o. Its fixed-cost local deployment model avoids the escalating per-request fees of cloud-based services, making it a scalable and economical choice. Our results underscore a critical limitation: zero-shot commercial APIs fail to meet the accuracy, adaptability, and cost-efficiency required for regulatory-grade clinical de-identification. Healthcare NLP's superior performance, customization capabilities, and economic advantages position it as the more viable solution for healthcare organizations seeking compliance and scalability in clinical NLP workflows.
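The abstract reports F1 at both the entity level and the token level but does not spell out the matching criteria. The following is a minimal sketch of how the two granularities can differ. It assumes exact span-and-label matching for entities and whitespace tokenization for tokens; the helper names and the example sentence are hypothetical and are not taken from the paper.

```python
from typing import List, Set, Tuple

# (start_char, end_char_exclusive, label); a hypothetical span format
Span = Tuple[int, int, str]

def f1(tp: int, fp: int, fn: int) -> float:
    """Harmonic mean of precision and recall; 0.0 when undefined."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

def entity_f1(gold: Set[Span], pred: Set[Span]) -> float:
    """Entity level: a prediction counts only if span and label match exactly (assumed)."""
    tp = len(gold & pred)
    return f1(tp, len(pred - gold), len(gold - pred))

def token_labels(text: str, spans: Set[Span]) -> List[str]:
    """Label each whitespace token with the label of an overlapping span, else 'O'."""
    labels, pos = [], 0
    for tok in text.split():
        start = text.index(tok, pos)
        end = start + len(tok)
        pos = end
        label = "O"
        for s, e, lab in spans:
            if start < e and end > s:  # character ranges overlap
                label = lab
                break
        labels.append(label)
    return labels

def token_f1(text: str, gold: Set[Span], pred: Set[Span]) -> float:
    """Token level: each PHI token is scored on its own."""
    pairs = list(zip(token_labels(text, gold), token_labels(text, pred)))
    tp = sum(1 for g, p in pairs if g != "O" and g == p)
    fp = sum(1 for g, p in pairs if p != "O" and g != p)
    fn = sum(1 for g, p in pairs if g != "O" and g != p)
    return f1(tp, fp, fn)

text = "Patient John Smith seen on 2024-01-05"
gold = {(8, 18, "NAME"), (27, 37, "DATE")}
pred = {(8, 12, "NAME"), (27, 37, "DATE")}  # the surname is missed

print(f"entity-level F1: {entity_f1(gold, pred):.2f}")      # 0.50
print(f"token-level F1:  {token_f1(text, gold, pred):.2f}")  # 0.80
```

In this toy case the partially detected name counts as a full miss at the entity level but as one hit and one miss at the token level, which is why the two scores can diverge for the same system.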

Comments:	14 pages, accepted at Text2Story Workshop at ECIR 2025
Subjects:	Computation and Language (cs.CL); Cryptography and Security (cs.CR); Information Retrieval (cs.IR); Machine Learning (cs.LG)
ACM classes:	H.3, F.2.2, I.2.7
Cite as:	arXiv:2503.20794 [cs.CL]
	(or arXiv:2503.20794v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2503.20794

Submission history

From: Veysel Kocaman Vk
[v1] Fri, 21 Mar 2025 10:05:04 UTC (2,776 KB)
