How to Choose Pretrained Handwriting Recognition Models for Single Writer Fine-Tuning

Pippi, Vittorio; Cascianelli, Silvia; Kermorvant, Christopher; Cucchiara, Rita

arXiv:2305.02593 (cs)

[Submitted on 4 May 2023]

Abstract: Recent advancements in Deep Learning-based Handwritten Text Recognition (HTR) have led to models with remarkable performance on both modern and historical manuscripts in large benchmark datasets. Nonetheless, those models struggle to reach the same performance when applied to manuscripts with peculiar characteristics, such as language, paper support, ink, and author handwriting. This issue is very relevant for valuable but small collections of documents preserved in historical archives, for which obtaining sufficient annotated training data is costly or, in some cases, infeasible. To overcome this challenge, a possible solution is to pretrain HTR models on large datasets and then fine-tune them on small single-author collections. In this paper, we take into account large, real benchmark datasets and synthetic ones obtained with a styled Handwritten Text Generation model. Through extensive experimental analysis, also considering the number of fine-tuning lines, we give a quantitative indication of the most relevant characteristics of such data for obtaining an HTR model able to effectively transcribe manuscripts in small collections with as few as five real fine-tuning lines.

Comments: Accepted at ICDAR2023
Subjects: Computer Vision and Pattern Recognition (cs.CV); Digital Libraries (cs.DL)
Cite as: arXiv:2305.02593 [cs.CV] (or arXiv:2305.02593v1 [cs.CV] for this version)
DOI: https://doi.org/10.48550/arXiv.2305.02593

Submission history

From: Silvia Cascianelli PhD
[v1] Thu, 4 May 2023 07:00:28 UTC (3,450 KB)
