MANSY: Generalizing Neural Adaptive Immersive Video Streaming With Ensemble and Representation Learning

Wu, Duo; Wu, Panlong; Zhang, Miao; Wang, Fangxin

doi:10.1109/TMC.2024.3487175

Computer Science > Networking and Internet Architecture

arXiv:2311.06812 (cs)

[Submitted on 12 Nov 2023 (v1), last revised 25 Oct 2024 (this version, v2)]

Title:MANSY: Generalizing Neural Adaptive Immersive Video Streaming With Ensemble and Representation Learning

Authors:Duo Wu, Panlong Wu, Miao Zhang, Fangxin Wang

View PDF HTML (experimental)

Abstract:The popularity of immersive videos has prompted extensive research into neural adaptive tile-based streaming to optimize video transmission over networks with limited bandwidth. However, the diversity of users' viewing patterns and Quality of Experience (QoE) preferences has not been fully addressed yet by existing neural adaptive approaches for viewport prediction and bitrate selection. Their performance can significantly deteriorate when users' actual viewing patterns and QoE preferences differ considerably from those observed during the training phase, resulting in poor generalization. In this paper, we propose MANSY, a novel streaming system that embraces user diversity to improve generalization. Specifically, to accommodate users' diverse viewing patterns, we design a Transformer-based viewport prediction model with an efficient multi-viewport trajectory input output architecture based on implicit ensemble learning. Besides, we for the first time combine the advanced representation learning and deep reinforcement learning to train the bitrate selection model to maximize diverse QoE objectives, enabling the model to generalize across users with diverse preferences. Extensive experiments demonstrate that MANSY outperforms state-of-the-art approaches in viewport prediction accuracy and QoE improvement on both trained and unseen viewing patterns and QoE preferences, achieving better generalization.

Comments:	This article has been accepted for publication in IEEE Transactions on Mobile Computing. Citation information: DOI this https URL
Subjects:	Networking and Internet Architecture (cs.NI)
Cite as:	arXiv:2311.06812 [cs.NI]
	(or arXiv:2311.06812v2 [cs.NI] for this version)
	https://doi.org/10.48550/arXiv.2311.06812
Related DOI:	https://doi.org/10.1109/TMC.2024.3487175

Submission history

From: Duo Wu [view email]
[v1] Sun, 12 Nov 2023 11:20:25 UTC (16,956 KB)
[v2] Fri, 25 Oct 2024 05:52:09 UTC (9,639 KB)

Computer Science > Networking and Internet Architecture

Title:MANSY: Generalizing Neural Adaptive Immersive Video Streaming With Ensemble and Representation Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Networking and Internet Architecture

Title:MANSY: Generalizing Neural Adaptive Immersive Video Streaming With Ensemble and Representation Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators