ReADS: A Rectified Attentional Double Supervised Network for Scene Text Recognition

Song, Qi; Jiang, Qianyi; Li, Nan; Zhang, Rui; Wei, Xiaolin

Computer Science > Computer Vision and Pattern Recognition

arXiv:2004.02070 (cs)

[Submitted on 5 Apr 2020 (v1), last revised 7 Apr 2020 (this version, v2)]

Title:ReADS: A Rectified Attentional Double Supervised Network for Scene Text Recognition

Authors:Qi Song, Qianyi Jiang, Nan Li, Rui Zhang, Xiaolin Wei

View PDF

Abstract:In recent years, scene text recognition is always regarded as a sequence-to-sequence problem. Connectionist Temporal Classification (CTC) and Attentional sequence recognition (Attn) are two very prevailing approaches to tackle this problem while they may fail in some scenarios respectively. CTC concentrates more on every individual character but is weak in text semantic dependency modeling. Attn based methods have better context semantic modeling ability while tends to overfit on limited training data. In this paper, we elaborately design a Rectified Attentional Double Supervised Network (ReADS) for general scene text recognition. To overcome the weakness of CTC and Attn, both of them are applied in our method but with different modules in two supervised branches which can make a complementary to each other. Moreover, effective spatial and channel attention mechanisms are introduced to eliminate background noise and extract valid foreground information. Finally, a simple rectified network is implemented to rectify irregular text. The ReADS can be trained end-to-end and only word-level annotations are required. Extensive experiments on various benchmarks verify the effectiveness of ReADS which achieves state-of-the-art performance.

Comments:	8 pages, 3 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2004.02070 [cs.CV]
	(or arXiv:2004.02070v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2004.02070

Submission history

From: Qi Song [view email]
[v1] Sun, 5 Apr 2020 02:05:35 UTC (1,397 KB)
[v2] Tue, 7 Apr 2020 01:44:17 UTC (1,445 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:ReADS: A Rectified Attentional Double Supervised Network for Scene Text Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:ReADS: A Rectified Attentional Double Supervised Network for Scene Text Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators