Large Language Models are Zero-Shot Clinical Information Extractors

Agrawal, Monica; Hegselmann, Stefan; Lang, Hunter; Kim, Yoon; Sontag, David

Computer Science > Computation and Language

arXiv:2205.12689v1 (cs)

[Submitted on 25 May 2022 (this version), latest version 30 Nov 2022 (v2)]

Title:Large Language Models are Zero-Shot Clinical Information Extractors

Authors:Monica Agrawal, Stefan Hegselmann, Hunter Lang, Yoon Kim, David Sontag

View PDF

Abstract:We show that large language models, such as GPT-3, perform well at zero-shot information extraction from clinical text despite not being trained specifically for the clinical domain. We present several examples showing how to use these models as tools for the diverse tasks of (i) concept disambiguation, (ii) evidence extraction, (iii) coreference resolution, and (iv) concept extraction, all on clinical text. The key to good performance is the use of simple task-specific programs that map from the language model outputs to the label space of the task. We refer to these programs as resolvers, a generalization of the verbalizer, which defines a mapping between output tokens and a discrete label space. We show in our examples that good resolvers share common components (e.g., "safety checks" that ensure the language model outputs faithfully match the input data), and that the common patterns across tasks make resolvers lightweight and easy to create. To better evaluate these systems, we also introduce two new datasets for benchmarking zero-shot clinical information extraction based on manual relabeling of the CASI dataset (Moon et al., 2014) with labels for new tasks. On the clinical extraction tasks we studied, the GPT-3 + resolver systems significantly outperform existing zero- and few-shot baselines.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2205.12689 [cs.CL]
	(or arXiv:2205.12689v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.12689

Submission history

From: Hunter Lang [view email]
[v1] Wed, 25 May 2022 11:49:58 UTC (444 KB)
[v2] Wed, 30 Nov 2022 18:43:44 UTC (532 KB)

Computer Science > Computation and Language

Title:Large Language Models are Zero-Shot Clinical Information Extractors

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Large Language Models are Zero-Shot Clinical Information Extractors

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators