Onboard Deep Deterministic Policy Gradients for Online Flight Resource Allocation of UAVs
Ref: CISTER-TR-200602 Publication Date: 1, Sep, 2020
Onboard Deep Deterministic Policy Gradients for Online Flight Resource Allocation of UAVsRef: CISTER-TR-200602 Publication Date: 1, Sep, 2020
In Unmanned Aerial Vehicle (UAV) enabled data collection, scheduling data transmissions of the ground nodes while controlling flight of the UAV, e.g., heading and velocity, is critical to reduce the data packet loss resulting from buffer overflows and channel fading. In this letter, a new online flight resource allocation scheme based on deep deterministic policy gradients (DDPG-FRAS) is studied to jointly optimize the flight control of the UAV and data collection scheduling along the trajectory in real time, thereby asymptotically minimizing the packet loss of the ground sensor networks. Numerical results confirm that the proposed DDPG-FRAS can gradually converge, while enlarging the buffer size can reduce the packet loss by 47.9%.
Published in IEEE Networking Letters, IEEE, Volume 2, Issue 3, pp 106-110.