Reducing Aleatoric and Epistemic Uncertainty through Multi-modal Data Acquisition

Hoarau, Arthur; Quost, Benjamin; Destercke, Sébastien; Waegeman, Willem

arXiv:2501.18268 (cs)

[Submitted on 30 Jan 2025]

Abstract: To generate accurate and reliable predictions, modern AI systems need to combine data from multiple modalities, such as text, images, audio, spreadsheets, and time series. Multi-modal data introduces new opportunities and challenges for disentangling uncertainty: it is commonly assumed in the machine learning community that epistemic uncertainty can be reduced by collecting more data, while aleatoric uncertainty is irreducible. However, this assumption is challenged in modern AI systems when information is obtained from different modalities. This paper introduces an innovative data acquisition framework where uncertainty disentanglement leads to actionable decisions, allowing sampling in two directions: sample size and data modality. The main hypothesis is that aleatoric uncertainty decreases as the number of modalities increases, while epistemic uncertainty decreases by collecting more observations. We provide proof-of-concept implementations on two multi-modal datasets to showcase our data acquisition framework, which combines ideas from active learning, active feature acquisition and uncertainty quantification.
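
To make the idea of uncertainty disentanglement driving an acquisition decision concrete, the following is a minimal sketch, not the authors' implementation. It assumes the widely used entropy-based decomposition over an ensemble of classifiers (total entropy = expected member entropy + mutual information); the abstract does not say which uncertainty measure the paper uses. The function names `decompose_uncertainty` and `suggest_action`, and the simple comparison rule, are hypothetical and chosen for illustration only.

```python
import numpy as np


def entropy(p, axis=-1):
    """Shannon entropy in nats; probabilities are clipped to avoid log(0)."""
    p = np.clip(p, 1e-12, 1.0)
    return -np.sum(p * np.log(p), axis=axis)


def decompose_uncertainty(member_probs):
    """Split predictive uncertainty for one input.

    member_probs: array-like of shape (n_members, n_classes), one class
    distribution per ensemble member (assumed setup, not from the paper).
    Returns (total, aleatoric, epistemic) in nats.
    """
    member_probs = np.asarray(member_probs, dtype=float)
    # Total uncertainty: entropy of the averaged prediction.
    total = entropy(member_probs.mean(axis=0))
    # Aleatoric part: average entropy of the individual members.
    aleatoric = entropy(member_probs).mean()
    # Epistemic part: disagreement between members (mutual information).
    epistemic = total - aleatoric
    return total, aleatoric, epistemic


def suggest_action(member_probs):
    """Hypothetical rule: follow the abstract's hypothesis that modalities
    reduce aleatoric uncertainty and observations reduce epistemic uncertainty."""
    _, aleatoric, epistemic = decompose_uncertainty(member_probs)
    if aleatoric >= epistemic:
        return "acquire_modality"
    return "acquire_observations"


if __name__ == "__main__":
    # Members agree that the input is ambiguous: mostly aleatoric.
    print(suggest_action([[0.5, 0.5], [0.5, 0.5]]))
    # Members are individually confident but disagree: mostly epistemic.
    print(suggest_action([[0.99, 0.01], [0.01, 0.99]]))
```

Under these assumptions, an input on which all members predict the same flat distribution is routed toward acquiring another modality, while an input on which confident members disagree is routed toward collecting more observations. A practical rule would likely also weigh acquisition costs, which the abstract does not specify.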

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2501.18268 [cs.LG]
	(or arXiv:2501.18268v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2501.18268

Submission history

From: Arthur Hoarau
[v1] Thu, 30 Jan 2025 11:05:59 UTC (2,175 KB)
