AutoElicit: Using Large Language Models for Expert Prior Elicitation in Predictive Modelling

Capstick, Alexander; Krishnan, Rahul G.; Barnaghi, Payam

Computer Science > Machine Learning

arXiv:2411.17284 (cs)

[Submitted on 26 Nov 2024 (v1), last revised 31 Jan 2025 (this version, v4)]

Title:AutoElicit: Using Large Language Models for Expert Prior Elicitation in Predictive Modelling

Authors:Alexander Capstick, Rahul G. Krishnan, Payam Barnaghi

View PDF HTML (experimental)

Abstract:Large language models (LLMs) acquire a breadth of information across various domains. However, their computational complexity, cost, and lack of transparency often hinder their direct application for predictive tasks where privacy and interpretability are paramount. In fields such as healthcare, biology, and finance, specialised and interpretable linear models still hold considerable value. In such domains, labelled data may be scarce or expensive to obtain. Well-specified prior distributions over model parameters can reduce the sample complexity of learning through Bayesian inference; however, eliciting expert priors can be time-consuming. We therefore introduce AutoElicit to extract knowledge from LLMs and construct priors for predictive models. We show these priors are informative and can be refined using natural language. We perform a careful study contrasting AutoElicit with in-context learning and demonstrate how to perform model selection between the two methods. We find that AutoElicit yields priors that can substantially reduce error over uninformative priors, using fewer labels, and consistently outperform in-context learning. We show that AutoElicit saves over 6 months of labelling effort when building a new predictive model for urinary tract infections from sensor recordings of people living with dementia.

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Machine Learning (stat.ML)
Cite as:	arXiv:2411.17284 [cs.LG]
	(or arXiv:2411.17284v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2411.17284

Submission history

From: Alexander Capstick [view email]
[v1] Tue, 26 Nov 2024 10:13:39 UTC (3,069 KB)
[v2] Tue, 10 Dec 2024 11:36:48 UTC (3,069 KB)
[v3] Wed, 18 Dec 2024 17:51:52 UTC (3,179 KB)
[v4] Fri, 31 Jan 2025 15:04:34 UTC (4,825 KB)

Computer Science > Machine Learning

Title:AutoElicit: Using Large Language Models for Expert Prior Elicitation in Predictive Modelling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:AutoElicit: Using Large Language Models for Expert Prior Elicitation in Predictive Modelling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators