Methodological Issues in Building, Training, and Testing Artificial Neural Networks

Ozesmi, Stacy L.; Ozesmi, Uygar; Tan, Can Ozan

doi:10.1016/j.ecolmodel.2005.11.012

Quantitative Biology > Populations and Evolution

arXiv:q-bio/0510017 (q-bio)

[Submitted on 7 Oct 2005]

Title:Methodological Issues in Building, Training, and Testing Artificial Neural Networks

Authors:Stacy L. Ozesmi, Uygar Ozesmi, Can Ozan Tan

View PDF

Abstract: We review the use of artificial neural networks, particularly the feedforward multilayer perceptron with back-propagation for training (MLP), in ecological modelling. Overtraining on data or giving vague references to how it was avoided is the major problem. Various methods can be used to determine when to stop training in artificial neural networks: 1) early stopping based on cross-validation, 2) stopping after a analyst defined error is reached or after the error levels off, 3) use of a test data set. We do not recommend the third method as the test data set is then not independent of model development. Many studies used the testing data to optimize the model and training. Although this method may give the best model for that set of data it does not give generalizability or improve understanding of the study system. The importance of an independent data set cannot be overemphasized as we found dramatic differences in model accuracy assessed with prediction accuracy on the training data set, as estimated with bootstrapping, and from use of an independent data set. The comparison of the artificial neural network with a general linear model (GLM) as a standard procedure is recommended because a GLM may perform as well or better than the MLP. MLP models should not be treated as black box models but instead techniques such as sensitivity analyses, input variable relevances, neural interpretation diagrams, randomization tests, and partial derivatives should be used to make the model more transparent, and further our ecological understanding which is an important goal of the modelling process. Based on our experience we discuss how to build a MLP model and how to optimize the parameters and architecture.

Comments:	22 pages, 2 figures. Presented in ISEI3 (2002). Ecological Modelling in press
Subjects:	Populations and Evolution (q-bio.PE); Quantitative Methods (q-bio.QM)
Cite as:	arXiv:q-bio/0510017 [q-bio.PE]
	(or arXiv:q-bio/0510017v1 [q-bio.PE] for this version)
	https://doi.org/10.48550/arXiv.q-bio/0510017
Journal reference:	Ecological Modelling, 195:83-93. 2006
Related DOI:	https://doi.org/10.1016/j.ecolmodel.2005.11.012

Submission history

From: Can Ozan Tan Mr. [view email]
[v1] Fri, 7 Oct 2005 19:17:27 UTC (47 KB)

Quantitative Biology > Populations and Evolution

Title:Methodological Issues in Building, Training, and Testing Artificial Neural Networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Quantitative Biology > Populations and Evolution

Title:Methodological Issues in Building, Training, and Testing Artificial Neural Networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators