Character Queries: A Transformer-based Approach to On-Line Handwritten Character Segmentation

Jungo, Michael; Wolf, Beat; Maksai, Andrii; Musat, Claudiu; Fischer, Andreas

doi:10.1007/978-3-031-41676-7_6

Computer Science > Computer Vision and Pattern Recognition

arXiv:2309.03072 (cs)

[Submitted on 6 Sep 2023]

Title:Character Queries: A Transformer-based Approach to On-Line Handwritten Character Segmentation

Authors:Michael Jungo, Beat Wolf, Andrii Maksai, Claudiu Musat, Andreas Fischer

View PDF

Abstract:On-line handwritten character segmentation is often associated with handwriting recognition and even though recognition models include mechanisms to locate relevant positions during the recognition process, it is typically insufficient to produce a precise segmentation. Decoupling the segmentation from the recognition unlocks the potential to further utilize the result of the recognition. We specifically focus on the scenario where the transcription is known beforehand, in which case the character segmentation becomes an assignment problem between sampling points of the stylus trajectory and characters in the text. Inspired by the $k$-means clustering algorithm, we view it from the perspective of cluster assignment and present a Transformer-based architecture where each cluster is formed based on a learned character query in the Transformer decoder block. In order to assess the quality of our approach, we create character segmentation ground truths for two popular on-line handwriting datasets, IAM-OnDB and HANDS-VNOnDB, and evaluate multiple methods on them, demonstrating that our approach achieves the overall best results.

Comments:	ICDAR 2023 Best Student Paper Award. Code available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2309.03072 [cs.CV]
	(or arXiv:2309.03072v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2309.03072
Journal reference:	International Conference on Document Analysis and Recognition - ICDAR 2023, pp. 98-114. Cham: Springer Nature Switzerland
Related DOI:	https://doi.org/10.1007/978-3-031-41676-7_6

Submission history

From: Michael Jungo [view email]
[v1] Wed, 6 Sep 2023 15:19:04 UTC (114 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Character Queries: A Transformer-based Approach to On-Line Handwritten Character Segmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Character Queries: A Transformer-based Approach to On-Line Handwritten Character Segmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators