Learning Convolutional Neural Networks using Hybrid Orthogonal Projection and Estimation

Pan, Hengyue; Jiang, Hui

Computer Science > Neural and Evolutionary Computing

arXiv:1606.05929 (cs)

[Submitted on 20 Jun 2016 (v1), last revised 11 Sep 2016 (this version, v4)]

Title:Learning Convolutional Neural Networks using Hybrid Orthogonal Projection and Estimation

Authors:Hengyue Pan, Hui Jiang

View PDF

Abstract:Convolutional neural networks (CNNs) have yielded the excellent performance in a variety of computer vision tasks, where CNNs typically adopt a similar structure consisting of convolution layers, pooling layers and fully connected layers. In this paper, we propose to apply a novel method, namely Hybrid Orthogonal Projection and Estimation (HOPE), to CNNs in order to introduce orthogonality into the CNN structure. The HOPE model can be viewed as a hybrid model to combine feature extraction using orthogonal linear projection with mixture models. It is an effective model to extract useful information from the original high-dimension feature vectors and meanwhile filter out irrelevant noises. In this work, we present three different ways to apply the HOPE models to CNNs, i.e., {\em HOPE-Input}, {\em single-HOPE-Block} and {\em multi-HOPE-Blocks}. For {\em HOPE-Input} CNNs, a HOPE layer is directly used right after the input to de-correlate high-dimension input feature vectors. Alternatively, in {\em single-HOPE-Block} and {\em multi-HOPE-Blocks} CNNs, we consider to use HOPE layers to replace one or more blocks in the CNNs, where one block may include several convolutional layers and one pooling layer. The experimental results on both Cifar-10 and Cifar-100 data sets have shown that the orthogonal constraints imposed by the HOPE layers can significantly improve the performance of CNNs in these image classification tasks (we have achieved one of the best performance when image augmentation has not been applied, and top 5 performance with image augmentation).

Comments:	7 Pages, 5 figures, submitted to AAAI 2017
Subjects:	Neural and Evolutionary Computing (cs.NE); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1606.05929 [cs.NE]
	(or arXiv:1606.05929v4 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.1606.05929

Submission history

From: Hengyue Pan [view email]
[v1] Mon, 20 Jun 2016 00:19:43 UTC (131 KB)
[v2] Wed, 29 Jun 2016 20:42:04 UTC (131 KB)
[v3] Thu, 8 Sep 2016 08:46:52 UTC (502 KB)
[v4] Sun, 11 Sep 2016 03:25:25 UTC (134 KB)

Computer Science > Neural and Evolutionary Computing

Title:Learning Convolutional Neural Networks using Hybrid Orthogonal Projection and Estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Neural and Evolutionary Computing

Title:Learning Convolutional Neural Networks using Hybrid Orthogonal Projection and Estimation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators