A Quantitative Perspective on Values of Domain Knowledge for Machine Learning

Yang, Jianyi; Ren, Shaolei

Computer Science > Machine Learning

arXiv:2011.08450 (cs)

[Submitted on 17 Nov 2020 (v1), last revised 9 Feb 2021 (this version, v2)]

Title:A Quantitative Perspective on Values of Domain Knowledge for Machine Learning

Authors:Jianyi Yang, Shaolei Ren

View PDF

Abstract:With the exploding popularity of machine learning, domain knowledge in various forms has been playing a crucial role in improving the learning performance, especially when training data is limited. Nonetheless, there is little understanding of to what extent domain knowledge can affect a machine learning task from a quantitative perspective. To increase the transparency and rigorously explain the role of domain knowledge in machine learning, we study the problem of quantifying the values of domain knowledge in terms of its contribution to the learning performance in the context of informed machine learning. We propose a quantification method based on Shapley value that fairly attributes the overall learning performance improvement to different domain knowledge. We also present Monte-Carlo sampling to approximate the fair value of domain knowledge with a polynomial time complexity. We run experiments of injecting symbolic domain knowledge into semi-supervised learning tasks on both MNIST and CIFAR10 datasets, providing quantitative values of different symbolic knowledge and rigorously explaining how it affects the machine learning performance in terms of test accuracy.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2011.08450 [cs.LG]
	(or arXiv:2011.08450v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2011.08450

Submission history

From: Jianyi Yang [view email]
[v1] Tue, 17 Nov 2020 06:12:23 UTC (194 KB)
[v2] Tue, 9 Feb 2021 09:14:56 UTC (190 KB)

Computer Science > Machine Learning

Title:A Quantitative Perspective on Values of Domain Knowledge for Machine Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Quantitative Perspective on Values of Domain Knowledge for Machine Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators