Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network

He, Tong; Huang, Weilin; Qiao, Yu; Yao, Jian

Computer Science > Computer Vision and Pattern Recognition

arXiv:1603.09423 (cs)

[Submitted on 31 Mar 2016]

Title:Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network

Authors:Tong He, Weilin Huang, Yu Qiao, Jian Yao

View PDF

Abstract:We introduce a new top-down pipeline for scene text detection. We propose a novel Cascaded Convolutional Text Network (CCTN) that joints two customized convolutional networks for coarse-to-fine text localization. The CCTN fast detects text regions roughly from a low-resolution image, and then accurately localizes text lines from each enlarged region. We cast previous character based detection into direct text region estimation, avoiding multiple bottom- up post-processing steps. It exhibits surprising robustness and discriminative power by considering whole text region as detection object which provides strong semantic information. We customize convolutional network by develop- ing rectangle convolutions and multiple in-network fusions. This enables it to handle multi-shape and multi-scale text efficiently. Furthermore, the CCTN is computationally efficient by sharing convolutional computations, and high-level property allows it to be invariant to various languages and multiple orientations. It achieves 0.84 and 0.86 F-measures on the ICDAR 2011 and ICDAR 2013, delivering substantial improvements over state-of-the-art results [23, 1].

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1603.09423 [cs.CV]
	(or arXiv:1603.09423v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1603.09423

Submission history

From: Weilin Huang [view email]
[v1] Thu, 31 Mar 2016 00:16:31 UTC (11,017 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2016-03

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Tong He
Weilin Huang
Yu Qiao
Jian Yao

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators