Interpreting RNN behaviour via excitable network attractors

Ceni, Andrea; Ashwin, Peter; Livi, Lorenzo

Computer Science > Machine Learning

arXiv:1807.10478v1 (cs)

[Submitted on 27 Jul 2018 (this version), latest version 10 Mar 2019 (v6)]

Title:Interpreting RNN behaviour via excitable network attractors

Authors:Andrea Ceni, Peter Ashwin, Lorenzo Livi

View PDF

Abstract:Machine learning has become a basic tool in scientific research and for the development of technologies with significant impact on society. In fact, such methods allow to discover regularities in data and make predictions without explicit knowledge of the rules governing the system under analysis. However, a price must be paid for exploiting such a modeling flexibility: machine learning methods are usually black-box, meaning that it is difficult to fully understand what the machine is doing and how. This poses constraints on the applicability of such methods, neglecting the possibility to gather novel scientific insights from experimental data. Our research aims to open the black-box of recurrent neural networks, an important family of neural networks suitable to process sequential data. Here, we propose a novel methodology that allows to provide a mechanistic interpretation of their behaviour when used to solve computational tasks. The methodology is based on mathematical constructs called excitable network attractors, which are models represented as networks in phase space composed by stable attractors and excitable connections between them. As the behaviour of recurrent neural networks depends on training and inputs driving the autonomous system, we introduce an algorithm to extract network attractors directly from a trajectory generated by the neural network while solving tasks. Simulations conducted on a controlled benchmark highlight the relevance of the proposed methodology for interpreting the behaviour of recurrent neural networks on tasks that involve learning a finite number of stable states.

Subjects:	Machine Learning (cs.LG); Dynamical Systems (math.DS); Machine Learning (stat.ML)
Cite as:	arXiv:1807.10478 [cs.LG]
	(or arXiv:1807.10478v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1807.10478

Submission history

From: Lorenzo Livi [view email]
[v1] Fri, 27 Jul 2018 08:02:45 UTC (6,302 KB)
[v2] Tue, 31 Jul 2018 10:02:41 UTC (6,302 KB)
[v3] Sat, 10 Nov 2018 16:24:50 UTC (5,833 KB)
[v4] Thu, 29 Nov 2018 11:38:47 UTC (5,798 KB)
[v5] Tue, 19 Feb 2019 16:21:30 UTC (5,909 KB)
[v6] Sun, 10 Mar 2019 09:35:27 UTC (5,820 KB)

Computer Science > Machine Learning

Title:Interpreting RNN behaviour via excitable network attractors

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Interpreting RNN behaviour via excitable network attractors

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators