Editable-DeepSC: Reliable Cross-Modal Semantic Communications for Facial Editing

Chen, Bin; Yu, Wenbo; Zhang, Qinshan; Xia, Shu-Tao

Abstract:Real-time computer vision (CV) plays a crucial role in various real-world applications, whose performance is highly dependent on communication networks. Nonetheless, the data-oriented characteristics of conventional communications often do not align with the special needs of real-time CV tasks. To alleviate this issue, the recently emerged semantic communications only transmit task-related semantic information and exhibit a promising landscape to address this problem. However, the communication challenges associated with Semantic Facial Editing, one of the most important real-time CV applications on social media, still remain largely unexplored. In this paper, we fill this gap by proposing Editable-DeepSC, a novel cross-modal semantic communication approach for facial editing. Firstly, we theoretically discuss different transmission schemes that separately handle communications and editings, and emphasize the necessity of Joint Editing-Channel Coding (JECC) via iterative attributes matching, which integrates editings into the communication chain to preserve more semantic mutual information. To compactly represent the high-dimensional data, we leverage inversion methods via pre-trained StyleGAN priors for semantic coding. To tackle the dynamic channel noise conditions, we propose SNR-aware channel coding via model fine-tuning. Extensive experiments indicate that Editable-DeepSC can achieve superior editings while significantly saving the transmission bandwidth, even under high-resolution and out-of-distribution (OOD) settings.

Subjects:	Information Theory (cs.IT); Computer Vision and Pattern Recognition (cs.CV); Networking and Internet Architecture (cs.NI)
Cite as:	arXiv:2411.15702 [cs.IT]
	(or arXiv:2411.15702v1 [cs.IT] for this version)
	https://doi.org/10.48550/arXiv.2411.15702

Computer Science > Information Theory

Title:Editable-DeepSC: Reliable Cross-Modal Semantic Communications for Facial Editing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators