Predicting the Original Appearance of Damaged Historical Documents

Yang, Zhenhua; Peng, Dezhi; Shi, Yongxin; Zhang, Yuyi; Liu, Chongyu; Jin, Lianwen

Computer Science > Computer Vision and Pattern Recognition

arXiv:2412.11634 (cs)

[Submitted on 16 Dec 2024]

Title:Predicting the Original Appearance of Damaged Historical Documents

Authors:Zhenhua Yang, Dezhi Peng, Yongxin Shi, Yuyi Zhang, Chongyu Liu, Lianwen Jin

View PDF HTML (experimental)

Abstract:Historical documents encompass a wealth of cultural treasures but suffer from severe damages including character missing, paper damage, and ink erosion over time. However, existing document processing methods primarily focus on binarization, enhancement, etc., neglecting the repair of these damages. To this end, we present a new task, termed Historical Document Repair (HDR), which aims to predict the original appearance of damaged historical documents. To fill the gap in this field, we propose a large-scale dataset HDR28K and a diffusion-based network DiffHDR for historical document repair. Specifically, HDR28K contains 28,552 damaged-repaired image pairs with character-level annotations and multi-style degradations. Moreover, DiffHDR augments the vanilla diffusion framework with semantic and spatial information and a meticulously designed character perceptual loss for contextual and visual coherence. Experimental results demonstrate that the proposed DiffHDR trained using HDR28K significantly surpasses existing approaches and exhibits remarkable performance in handling real damaged documents. Notably, DiffHDR can also be extended to document editing and text block generation, showcasing its high flexibility and generalization capacity. We believe this study could pioneer a new direction of document processing and contribute to the inheritance of invaluable cultures and civilizations. The dataset and code is available at this https URL.

Comments:	Accepted to AAAI 2025; Github Page: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2412.11634 [cs.CV]
	(or arXiv:2412.11634v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2412.11634
Journal reference:	39th AAAI Conference on Artificial Intelligence (AAAI-25), Philadelphia, Pennsylvania, USA, 2025

Submission history

From: Zhenhua Yang [view email]
[v1] Mon, 16 Dec 2024 10:25:03 UTC (19,885 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Predicting the Original Appearance of Damaged Historical Documents

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Predicting the Original Appearance of Damaged Historical Documents

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators