SFDLA: Source-Free Document Layout Analysis

Tewes, Sebastian; Chen, Yufan; Moured, Omar; Zhang, Jiaming; Stiefelhagen, Rainer

Computer Science > Computer Vision and Pattern Recognition

arXiv:2503.18742 (cs)

[Submitted on 24 Mar 2025]

Title:SFDLA: Source-Free Document Layout Analysis

Authors:Sebastian Tewes, Yufan Chen, Omar Moured, Jiaming Zhang, Rainer Stiefelhagen

View PDF HTML (experimental)

Abstract:Document Layout Analysis (DLA) is a fundamental task in document understanding. However, existing DLA and adaptation methods often require access to large-scale source data and target labels. This requirements severely limiting their real-world applicability, particularly in privacy-sensitive and resource-constrained domains, such as financial statements, medical records, and proprietary business documents. According to our observation, directly transferring source-domain fine-tuned models on target domains often results in a significant performance drop (Avg. -32.64%). In this work, we introduce Source-Free Document Layout Analysis (SFDLA), aiming for adapting a pre-trained source DLA models to an unlabeled target domain, without access to any source data. To address this challenge, we establish the first SFDLA benchmark, covering three major DLA datasets for geometric- and content-aware adaptation. Furthermore, we propose Document Layout Analysis Adapter (DLAdapter), a novel framework that is designed to improve source-free adaptation across document domains. Our method achieves a +4.21% improvement over the source-only baseline and a +2.26% gain over existing source-free methods from PubLayNet to DocLayNet. We believe this work will inspire the DLA community to further investigate source-free document understanding. To support future research of the community, the benchmark, models, and code will be publicly available at this https URL.

Comments:	The benchmark, models, and code will be publicly available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2503.18742 [cs.CV]
	(or arXiv:2503.18742v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2503.18742

Submission history

From: Jiaming Zhang [view email]
[v1] Mon, 24 Mar 2025 14:50:28 UTC (12,343 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:SFDLA: Source-Free Document Layout Analysis

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:SFDLA: Source-Free Document Layout Analysis

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators