UAV-assisted maritime data collection via optical communications using deep reinforcement learning

S Ma, M Li, T Deng, J Wang, R Ruby… - 2023 9th International …, 2023 - ieeexplore.ieee.org
S Ma, M Li, T Deng, J Wang, R Ruby, H Luo
2023 9th International Conference on Big Data Computing and …, 2023ieeexplore.ieee.org
Ocean data collection via unmanned aerial vehicles (UAVs) has received widespread
attention due to its flexibility and low cost. To further improve the efficiency of large maritime
data collection between the UAVs and the ocean surface buoys, optical wireless
transmission is considered as a promising technique because of its low latency and high
bandwidth. However, the waves and other disturbances in the complex oceanic environment
result in drift and instability of the buoy, which deteriorate and even interrupt the line-of-sight …
Ocean data collection via unmanned aerial vehicles (UAVs) has received widespread attention due to its flexibility and low cost. To further improve the efficiency of large maritime data collection between the UAVs and the ocean surface buoys, optical wireless transmission is considered as a promising technique because of its low latency and high bandwidth. However, the waves and other disturbances in the complex oceanic environment result in drift and instability of the buoy, which deteriorate and even interrupt the line-of-sight (LOS) optical transmission. To tackle these challenges, in this paper, we propose a reliable data collection scheme based on a deep reinforcement learning (DRL) approach. We first model the optical channel and calculate the maximum transmission range that satisfies a pre-defined bit error rate (BER) to ensure the quality of service (QoS). Then, we formulate the data collection procedure as a Markov decision process (MDP) aiming at maximizing the received signal intensity and minimizing the energy consumption, in which the system uncertainty caused by the oceanic environmental disturbances is taken into account. Finally, we propose a beam pointing adjustment algorithm based on deep deterministic policy gradient (DDPG) approach to alleviate the performance deterioration and maintain a stable LOS communication. Through extensive simulations, the results demonstrate that the proposed scheme is effective and achieves reliable data collection via the optical links.
ieeexplore.ieee.org
Showing the best result for this search. See all results