Reinforcement learning for UAV path planning under complicated constraints with GNSS quality awareness

dc.contributor.authorAlyammahi, Abdulla
dc.contributor.authorXu, Zhengjia
dc.contributor.authorPetrunin, Ivan
dc.contributor.authorPeng, Bo
dc.contributor.authorGrech, Raphael
dc.date.accessioned2025-07-21T15:37:09Z
dc.date.available2025-07-21T15:37:09Z
dc.date.freetoread2025-07-21
dc.date.issued2025-06-25
dc.date.pubOnline2025-06-25
dc.description.abstractRequirements for Unmanned Aerial Vehicle (UAV) applications in low-altitude operations are escalating, which demands resilient Position, Navigation and Timing (PNT) solutions incorporating global navigation satellite system (GNSS) services. However, UAVs often operate in stringent environments with degraded GNSS performance. Practical challenges often arise from dense, dynamic, complex, and uncertain obstacles. When flying in complex environments, it is important to consider signal degradation caused by reflections (multipath) and obscuration (Non-Line of Sight (NLOS)), which can lead to positioning errors that must be minimized to ensure mission reliability. Recent works integrate GNSS reliability maps derived from pseudorange error estimations into path planning to reduce loss-of-GNSS risks with PNT degradations. To accommodate multiple constraint conditions attempting to improve flight resilience against GNSS-degraded environments, this paper proposes a reinforcement learning (RL) approach to feature GNSS signal quality awareness during path planning. The non-linear relations between GNSS signal quality in the form of dilution of precision (DoP), geographic locations, and the policy of searching sub-minima points are learned by the clipped Proximal Policy Optimization (PPO) method. Other constraints considered include static obstacle occurrence, altitude boundary, forbidden flying regions, and operational volumes. The reward and punishment functions and the training method are designed to maximize the success criteria of approaching destinations. The proposed RL approach is demonstrated using a real 3D map of Indianapolis, USA, in the Godot engine, incorporating forecasted DoP data generated by a Geospatial Augmentation system named GNSS Foresight from Spirent. Results indicate a 36% enhancement in mission success rates when GNSS performance is included in the path planning training. Additionally, the varying tensor size, representing the UAV’s DoP perception range, exhibits a positive proportion relation to a higher mission rate, despite an increment in computational complexity.
dc.description.conferencenameEuropean Navigation Conference 2024
dc.description.journalNameEngineering Proceedings
dc.identifier.citationAlyammahi A, Xu Z, Petrunin I, et al., (2025) Reinforcement learning for UAV path planning under complicated constraints with GNSS quality awareness. Engineering Proceedings, Volume 88, Issue 1, June 2025, Article number 66. European Navigation Conference 2024. 22-24 May 2024, Noordwijk, The Netherlandsen_UK
dc.identifier.elementsID674044
dc.identifier.issueNo1
dc.identifier.paperNo66
dc.identifier.urihttps://doi.org/10.3390/engproc2025088066
dc.identifier.urihttps://dspace.lib.cranfield.ac.uk/handle/1826/24201
dc.identifier.volumeNo88
dc.language.isoen
dc.publisherMDPIen_UK
dc.publisher.urihttps://www.mdpi.com/2673-4591/88/1/66
dc.rightsAttribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subject4605 Data Management and Data Scienceen_UK
dc.subject46 Information and Computing Sciencesen_UK
dc.subject4602 Artificial Intelligenceen_UK
dc.subjectBehavioral and Social Scienceen_UK
dc.subjectBasic Behavioral and Social Scienceen_UK
dc.subjectpath planningen_UK
dc.subjectGNSS quality awarenessen_UK
dc.subjectdilution of precisionen_UK
dc.subjectreinforcement learningen_UK
dc.subjectclipped proximal policy optimizationen_UK
dc.titleReinforcement learning for UAV path planning under complicated constraints with GNSS quality awarenessen_UK
dc.typeConference paper
dcterms.coverageNoordwijk, The Netherlands
dcterms.temporal.endDate24-May-2024
dcterms.temporal.startDate22-May-2024

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Reinforcement_learning_for_UAV-2025.pdf
Size:
6.87 MB
Format:
Adobe Portable Document Format
Description:
Published version

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.63 KB
Format:
Plain Text
Description: