Panoutsakopoulos, ChristosYuksek, BurakInalhan, GokhanTsourdos, Antonios2022-01-262022-01-262021-12-29Panoutsakopoulos C, Yuksek B, Inalhan G & Tsourdos A (2021) Towards safe deep reinforcement learning for autonomous airborne collision avoidance systems. In: AIAA SciTech 2022 Forum, 3-7 January 2022, San Diego, CA, USA and Virtual Eventhttps://doi.org/10.2514/6.2022-2102.vidhttps://dspace.lib.cranfield.ac.uk/handle/1826/17504In this paper we consider the application of Safe Deep Reinforcement Learning in the context of a trustworthy autonomous Airborne Collision Avoidance System. A simple 2D airspace model is defined, in which a hypothetical air vehicle attempts to fly to a given waypoint while autonomously avoiding Near Mid-Air collisions (NMACs) with non-cooperative traffic. We use Proximal Policy Optimisation for our learning agent, and we propose a reward engineering approach based on a combination of sparse terminal rewards at natural termination points and dense step rewards providing the agent with continuous feedback on its actions, based on relative geometry and motion attributes of its trajectory with respect to the traffic and the target waypoint. The performance of our trained agent is evaluated through Monte-Carlo simulations, and it is demonstrated that it achieves to master the collision avoidance task with respect to safety for a reasonable trade-off in mission performance.enAttribution-NonCommercial 4.0 Internationalhttp://creativecommons.org/licenses/by-nc/4.0/Towards safe deep reinforcement learning for autonomous airborne collision avoidance systemsConference paper