MonoNet: Towards Interpretable Models by Learning Monotonic Features

Nguyen, An-phi; Martínez, María Rodríguez

Computer Science > Machine Learning

arXiv:1909.13611 (cs)

[Submitted on 30 Sep 2019]

Title:MonoNet: Towards Interpretable Models by Learning Monotonic Features

Authors:An-phi Nguyen, María Rodríguez Martínez

View PDF

Abstract:Being able to interpret, or explain, the predictions made by a machine learning model is of fundamental importance. This is especially true when there is interest in deploying data-driven models to make high-stakes decisions, e.g. in healthcare. While recent years have seen an increasing interest in interpretable machine learning research, this field is currently lacking an agreed-upon definition of interpretability, and some researchers have called for a more active conversation towards a rigorous approach to interpretability. Joining this conversation, we claim in this paper that the difficulty of interpreting a complex model stems from the existing interactions among features. We argue that by enforcing monotonicity between features and outputs, we are able to reason about the effect of a single feature on an output independently from other features, and consequently better understand the model. We show how to structurally introduce this constraint in deep learning models by adding new simple layers. We validate our model on benchmark datasets, and compare our results with previously proposed interpretable models.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1909.13611 [cs.LG]
	(or arXiv:1909.13611v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1909.13611

Submission history

From: An-Phi Nguyen [view email]
[v1] Mon, 30 Sep 2019 12:02:16 UTC (185 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-09

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

An-phi Nguyen
María Rodríguez Martínez

export BibTeX citation

Computer Science > Machine Learning

Title:MonoNet: Towards Interpretable Models by Learning Monotonic Features

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:MonoNet: Towards Interpretable Models by Learning Monotonic Features

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators