Beyond Negation Detection: Comprehensive Assertion Detection Models for Clinical NLP

Kocaman, Veysel; Gul, Yigit; Kaya, M. Aytug; Haq, Hasham Ul; Butgul, Mehmet; Celik, Cabir; Talby, David

Computer Science > Computation and Language

arXiv:2503.17425 (cs)

[Submitted on 21 Mar 2025]

Title:Beyond Negation Detection: Comprehensive Assertion Detection Models for Clinical NLP

Authors:Veysel Kocaman, Yigit Gul, M. Aytug Kaya, Hasham Ul Haq, Mehmet Butgul, Cabir Celik, David Talby

View PDF HTML (experimental)

Abstract:Assertion status detection is a critical yet often overlooked component of clinical NLP, essential for accurately attributing extracted medical facts. Past studies have narrowly focused on negation detection, leading to underperforming commercial solutions such as AWS Medical Comprehend, Azure AI Text Analytics, and GPT-4o due to their limited domain adaptation. To address this gap, we developed state-of-the-art assertion detection models, including fine-tuned LLMs, transformer-based classifiers, few-shot classifiers, and deep learning (DL) approaches. We evaluated these models against cloud-based commercial API solutions, the legacy rule-based NegEx approach, and GPT-4o. Our fine-tuned LLM achieves the highest overall accuracy (0.962), outperforming GPT-4o (0.901) and commercial APIs by a notable margin, particularly excelling in Present (+4.2%), Absent (+8.4%), and Hypothetical (+23.4%) assertions. Our DL-based models surpass commercial solutions in Conditional (+5.3%) and Associated-with-Someone-Else (+10.1%) categories, while the few-shot classifier offers a lightweight yet highly competitive alternative (0.929), making it ideal for resource-constrained environments. Integrated within Spark NLP, our models consistently outperform black-box commercial solutions while enabling scalable inference and seamless integration with medical NER, Relation Extraction, and Terminology Resolution. These results reinforce the importance of domain-adapted, transparent, and customizable clinical NLP solutions over general-purpose LLMs and proprietary APIs.

Comments:	accepted at Text2Story Workshop at ECIR 2025
Subjects:	Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
MSC classes:	H.3
ACM classes:	H.3
Cite as:	arXiv:2503.17425 [cs.CL]
	(or arXiv:2503.17425v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2503.17425

Submission history

From: Veysel Kocaman Vk [view email]
[v1] Fri, 21 Mar 2025 10:18:47 UTC (919 KB)

Computer Science > Computation and Language

Title:Beyond Negation Detection: Comprehensive Assertion Detection Models for Clinical NLP

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Beyond Negation Detection: Comprehensive Assertion Detection Models for Clinical NLP

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators