On-Device Text Image Super Resolution

Jain, Dhruval; Prabhu, Arun D; Ramena, Gopi; Goyal, Manoj; Mohanty, Debi Prasanna; Moharana, Sukumar; Purre, Naresh

doi:10.1109/ICPR48806.2021.9412222

arXiv:2011.10251 (cs)

[Submitted on 20 Nov 2020]

Abstract: Recent research on super-resolution (SR) has witnessed major developments with the advancement of deep convolutional neural networks. There is a need for on-device information extraction from scene text images and document images, most of which are low-resolution (LR). SR therefore becomes an essential pre-processing step, because bicubic upsampling, which is conventionally available on smartphones, performs poorly on LR images. To give users more control over their privacy, and to reduce the carbon footprint by cutting the overhead of cloud computing and hours of GPU usage, executing SR models on the edge has become a necessity. Running and optimizing a model on resource-constrained platforms like smartphones poses various challenges. In this paper, we present a novel deep neural network that reconstructs sharper character edges and thus boosts OCR confidence. The proposed architecture not only achieves a significant improvement in PSNR over bicubic upsampling on various benchmark datasets but also runs with an average inference time of 11.7 ms per image. We outperform the state of the art on the Text330 dataset. We also achieve an OCR accuracy of 75.89% on the ICDAR 2015 TextSR dataset, where the ground-truth images achieve an accuracy of 78.10%.
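The abstract compares methods by PSNR (peak signal-to-noise ratio). As a rough illustration of what that metric measures, here is a minimal sketch using the standard definition, PSNR = 10 * log10(MAX^2 / MSE). It is not the authors' evaluation code: the function name `psnr`, the flat-list image representation, and the toy pixel values are all assumptions made for this example.

```python
import math
from typing import Sequence


def psnr(reference: Sequence[float], estimate: Sequence[float],
         max_value: float = 255.0) -> float:
    """Return the PSNR in dB between two images given as flat pixel sequences.

    Hypothetical helper for illustration; assumes 8-bit intensities by default.
    """
    if len(reference) != len(estimate):
        raise ValueError("images must contain the same number of pixels")
    if len(reference) == 0:
        raise ValueError("images must not be empty")
    # Mean squared error over all pixels.
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)


# Toy data (made up): a sharp black/white edge pattern as the high-resolution
# reference, a heavily blurred reconstruction, and a sharper reconstruction.
high_res = [0, 255, 0, 255]
blurry = [64, 191, 64, 191]
sharper = [16, 239, 16, 239]

print(f"blurry : {psnr(high_res, blurry):.2f} dB")
print(f"sharper: {psnr(high_res, sharper):.2f} dB")
```

A reconstruction whose pixel values sit closer to the reference has a lower mean squared error and therefore a higher PSNR, which is the sense in which the abstract reports an improvement over bicubic upsampling.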

Comments:	Accepted to the International Conference on Pattern Recognition (ICPR), 2020
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2011.10251 [cs.CV]
	(or arXiv:2011.10251v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2011.10251
Related DOI:	https://doi.org/10.1109/ICPR48806.2021.9412222

Submission history

From: Dhruval Jain
[v1] Fri, 20 Nov 2020 07:49:48 UTC (3,137 KB)
