PSConv: Squeezing Feature Pyramid into One Compact Poly-Scale Convolutional Layer

Li, Duo; Yao, Anbang; Chen, Qifeng

arXiv:2007.06191 (cs)

[Submitted on 13 Jul 2020]

Abstract: Despite their strong modeling capacity, Convolutional Neural Networks (CNNs) are often scale-sensitive. To make CNNs more robust to scale variance, existing solutions focus mainly on multi-scale feature fusion across different layers or filters, while the more granular kernel space is overlooked. We bridge this gap by exploiting multi-scale features at a finer granularity. The proposed convolution operation, named Poly-Scale Convolution (PSConv), mixes a spectrum of dilation rates and allocates them to the individual convolutional kernels of each filter within a single convolutional layer. Specifically, dilation rates vary cyclically along the input- and output-channel axes of the filters, aggregating features over a wide range of scales in a compact form. PSConv can serve as a drop-in replacement for vanilla convolution in many prevailing CNN backbones, allowing better representation learning without introducing additional parameters or computational complexity. Comprehensive experiments on the ImageNet and MS COCO benchmarks validate the superior performance of PSConv. Code and models are available at this https URL.
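
The cyclic allocation described in the abstract can be pictured with a small sketch. This is not the authors' implementation: the function names `dilation_grid` and `effective_extent`, the rate set `(1, 2, 4, 8)`, and the shift-by-one rule per output channel are all assumptions chosen for illustration. The only fact relied on is the standard one that a dilated kernel of size k and dilation d spans d * (k - 1) + 1 positions while keeping the same number of weights.

```python
from typing import List, Sequence


def dilation_grid(c_out: int, c_in: int,
                  rates: Sequence[int] = (1, 2, 4, 8)) -> List[List[int]]:
    """Return a c_out x c_in table of dilation rates, one per 2-D kernel.

    Assumed rule (illustrative only): the rate cycles along the
    input-channel axis, and the cycle is shifted by one position for
    each successive output channel.
    """
    t = len(rates)
    return [[rates[(i + o) % t] for i in range(c_in)] for o in range(c_out)]


def effective_extent(kernel_size: int, dilation: int) -> int:
    """Spatial extent covered by a dilated kernel: d * (k - 1) + 1."""
    return dilation * (kernel_size - 1) + 1


# Each row is one filter (output channel); each entry is the dilation
# rate of the kernel applied to one input channel.
grid = dilation_grid(c_out=4, c_in=4)
for row in grid:
    print(row)

# A 3x3 kernel always has 9 weights, but its spatial extent grows with
# the dilation rate, which is why mixing rates adds no parameters.
for d in (1, 2, 4, 8):
    print(d, effective_extent(3, d))
```

Under this assumed rule, every filter (row) and every input channel (column) sees the full set of rates, so each output channel aggregates several scales at once.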

Comments:	Accepted by ECCV 2020. Code and models are available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2007.06191 [cs.CV]
	(or arXiv:2007.06191v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2007.06191

Submission history

From: Duo Li
[v1] Mon, 13 Jul 2020 05:14:11 UTC (4,000 KB)
