DDI-100: Dataset for Text Detection and Recognition

Zharikov, Ilia; Nikitin, Filipp; Vasiliev, Ilia; Dokholyan, Vladimir

doi:10.1145/3440084.3441192

Computer Science > Computer Vision and Pattern Recognition

arXiv:1912.11658 (cs)

[Submitted on 25 Dec 2019]

Title:DDI-100: Dataset for Text Detection and Recognition

Authors:Ilia Zharikov, Filipp Nikitin, Ilia Vasiliev, Vladimir Dokholyan (Moscow Institute of Physics and Technology)

View PDF

Abstract:Nowadays document analysis and recognition remain challenging tasks. However, only a few datasets designed for text detection (TD) and optical character recognition (OCR) problems exist. In this paper we present Distorted Document Images dataset (DDI-100) and demonstrate its usefulness in a wide range of document analysis problems. DDI-100 dataset is a synthetic dataset based on 7000 real unique document pages and consists of more than 100000 augmented images. Ground truth comprises text and stamp masks, text and characters bounding boxes with relevant annotations. Validation of DDI-100 dataset was conducted using several TD and OCR models that show high-quality performance on real data.

Comments:	Accepted by CCVPR 2019. Dataset is available here: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1912.11658 [cs.CV]
	(or arXiv:1912.11658v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1912.11658
Related DOI:	https://doi.org/10.1145/3440084.3441192

Submission history

From: Ilya Vasilev [view email]
[v1] Wed, 25 Dec 2019 12:47:35 UTC (565 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2019-12

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:DDI-100: Dataset for Text Detection and Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:DDI-100: Dataset for Text Detection and Recognition

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators