Bidirectional Scene Text Recognition with a Single Decoder

Bleeker, Maurits; de Rijke, Maarten

arXiv:1912.03656 (cs)

[Submitted on 8 Dec 2019 (v1), last revised 2 Mar 2020 (this version, v2)]

Abstract: Scene Text Recognition (STR) is the problem of recognizing the correct word or character sequence in a cropped word image. To obtain more robust output sequences, the notion of bidirectional STR has been introduced. So far, bidirectional STR methods have been implemented by using two separate decoders: one for left-to-right decoding and one for right-to-left decoding. Having two separate decoders for almost the same task with the same output space is undesirable from a computational and optimization point of view. We introduce the bidirectional Scene Text Transformer (Bi-STET), a novel bidirectional STR method with a single decoder for bidirectional text decoding. With its single decoder, Bi-STET outperforms methods that apply bidirectional decoding by using two separate decoders, while also being more efficient than those methods. Furthermore, we match or outperform state-of-the-art (SOTA) methods on all STR benchmarks with Bi-STET. Finally, we provide analyses and insights into the performance of Bi-STET.
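The abstract does not say how the single decoder is told which direction to decode in, so the following is only an illustrative sketch and not the authors' implementation. It assumes a hypothetical direction token (`<l2r>` or `<r2l>`) prepended to each target sequence, so that one decoder can be trained on both reading orders, and a hypothetical `pick_best` helper that keeps whichever hypothesis has the higher score. The token names, function names, and the selection rule are all assumptions made for illustration.

```python
from typing import List, Tuple

# Hypothetical special tokens; the abstract does not name any.
L2R, R2L, EOS = "<l2r>", "<r2l>", "<eos>"


def make_bidirectional_targets(word: str) -> Tuple[List[str], List[str]]:
    """Build one left-to-right and one right-to-left target for the same word.

    Both targets share a single character vocabulary, so a single decoder
    could in principle be trained on both (an assumption of this sketch).
    """
    chars = list(word)
    l2r = [L2R] + chars + [EOS]
    r2l = [R2L] + chars[::-1] + [EOS]
    return l2r, r2l


def pick_best(
    l2r_chars: List[str],
    l2r_score: float,
    r2l_chars: List[str],
    r2l_score: float,
) -> str:
    """Return the hypothesis with the higher score, in reading order.

    The scores are assumed to be log-probabilities supplied by the caller.
    The right-to-left hypothesis is reversed before it is returned.
    """
    if l2r_score >= r2l_score:
        return "".join(l2r_chars)
    return "".join(reversed(r2l_chars))
```

For example, `make_bidirectional_targets("cat")` yields one sequence that spells the word forward and one that spells it backward, each tagged with its direction token. At inference time, `pick_best` would be called on the two decoded hypotheses and their scores.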

Comments:	8 pages. In 24th European Conference on Artificial Intelligence
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1912.03656 [cs.CV]
	(or arXiv:1912.03656v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1912.03656

Submission history

From: Maurits Bleeker
[v1] Sun, 8 Dec 2019 11:20:35 UTC (3,628 KB)
[v2] Mon, 2 Mar 2020 14:44:34 UTC (1,854 KB)
