Cellular UAV-to-Device Communications: Trajectory Design and Mode Selection by Multi-agent Deep Reinforcement Learning

Wu, F.; Zhang, H.; Wu, J.; Song, L.

arXiv:2002.08040 (eess)

[Submitted on 19 Feb 2020 (v1), last revised 12 Mar 2020 (this version, v2)]

Abstract: In current unmanned aircraft systems (UASs) for sensing services, unmanned aerial vehicles (UAVs) transmit their sensory data to terrestrial mobile devices over the unlicensed spectrum. However, because channel access is opportunistic, the interference from surrounding terminals is uncontrollable. In this paper, we consider a cellular Internet of UAVs to guarantee the Quality-of-Service (QoS), where the sensory data can be transmitted to the mobile devices either by UAV-to-Device (U2D) communications over cellular networks, or directly through the base station (BS). Since UAVs' sensing and transmission may influence their trajectories, we study the trajectory design problem for UAVs, taking their sensing and transmission into account. This is a Markov decision problem (MDP) with a large state-action space, and thus we utilize multi-agent deep reinforcement learning (DRL) to approximate the state-action space, and then propose a multi-UAV trajectory design algorithm to solve this problem. Simulation results show that our proposed algorithm can achieve a higher total utility than the policy gradient algorithm and the single-agent algorithm.

Comments:	33 pages, 12 figures
Subjects:	Signal Processing (eess.SP); Networking and Internet Architecture (cs.NI)
Cite as:	arXiv:2002.08040 [eess.SP]
	(or arXiv:2002.08040v2 [eess.SP] for this version)
	https://doi.org/10.48550/arXiv.2002.08040

Submission history

From: Fanyi Wu
[v1] Wed, 19 Feb 2020 07:56:06 UTC (2,409 KB)
[v2] Thu, 12 Mar 2020 14:25:06 UTC (2,409 KB)
