LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field Cameras

Teng, Fei; Zhang, Jiaming; Liu, Jiawei; Peng, Kunyu; Cheng, Xina; Li, Zhiyong; Yang, Kailun

Computer Science > Computer Vision and Pattern Recognition

arXiv:2401.16712 (cs)

[Submitted on 30 Jan 2024 (v1), last revised 26 Aug 2024 (this version, v2)]

Title:LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field Cameras

Authors:Fei Teng, Jiaming Zhang, Jiawei Liu, Kunyu Peng, Xina Cheng, Zhiyong Li, Kailun Yang

View PDF HTML (experimental)

Abstract:Leveraging rich information is crucial for dense prediction tasks. Light field (LF) cameras are instrumental in this regard, as they allow data to be sampled from various perspectives. This capability provides valuable spatial, depth, and angular information, enhancing scene-parsing tasks. However, we have identified two overlooked issues for the LF salient object detection (SOD) task. (1): Previous approaches predominantly employ a customized two-stream design to discover the spatial and depth features within light field images. The network struggles to learn the implicit angular information between different images due to a lack of intra-network data connectivity. (2): Little research has been directed towards the data augmentation strategy for LF SOD. Research on inter-network data connectivity is scant. In this study, we propose an efficient paradigm (LF Tracy) to address those issues. This comprises a single-pipeline encoder paired with a highly efficient information aggregation (IA) module (around 8M parameters) to establish an intra-network connection. Then, a simple yet effective data augmentation strategy called MixLD is designed to bridge the inter-network connections. Owing to this innovative paradigm, our model surpasses the existing state-of-the-art method through extensive experiments. Especially, LF Tracy demonstrates a 23% improvement over previous results on the latest large-scale PKU dataset. The source code is publicly available at: this https URL.

Comments:	Accepted to ICPR 2024. The source code is publicly available at: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Robotics (cs.RO); Image and Video Processing (eess.IV)
Cite as:	arXiv:2401.16712 [cs.CV]
	(or arXiv:2401.16712v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2401.16712

Submission history

From: Kailun Yang [view email]
[v1] Tue, 30 Jan 2024 03:17:02 UTC (1,721 KB)
[v2] Mon, 26 Aug 2024 12:52:25 UTC (2,400 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field Cameras

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field Cameras

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators