Generalising rescue operations in disaster scenarios using drones: a lifelong reinforcement learning approach

dc.contributor.authorXu, Jiangshan
dc.contributor.authorPanagopoulos, Dimitris
dc.contributor.authorPerrusquía, Adolfo
dc.contributor.authorGuo, Weisi
dc.contributor.authorTsourdos, Antonios
dc.date.accessioned2025-06-26T11:50:29Z
dc.date.available2025-06-26T11:50:29Z
dc.date.freetoread2025-06-26
dc.date.issued2025-06-03
dc.date.pubOnline2025-06-03
dc.description.abstractSearch and rescue (SAR) operations in post-earthquake environments are hindered by unseen environment conditions and uncertain victim locations. While reinforcement learning (RL) has been used to enhance unmanned aerial vehicle (UAV) navigation in such scenarios, its limited generalisation to novel environments, such as post-disaster environments, remains a challenge. To deal with this issue, this paper proposes an RL-based framework that combines the principles of lifelong learning and eligibility traces. Here, the approach uses a shaping reward heuristic based on pre-training experiences obtained from similar environments to improve generalisation, and simultaneously, eligibility traces are used to accelerate convergence of the overall approach. The combined contributions allows the RL algorithm to adapt to new environments, whilst ensuring fast convergence, critical for rescue missions. Extensive simulation studies show that the proposed framework can improve the average reward return by 46% compared to baseline RL algorithms. Ablation studies are also conducted, which demonstrate a 23% improvement in the overall reward score in environments with different complexities and a 56% improvement in scenarios with varying numbers of trapped individuals.
dc.description.journalNameDrones
dc.identifier.citationXu J, Panagopoulos D, Perrusquía A, et al., (2025) Generalising rescue operations in disaster scenarios using drones: a lifelong reinforcement learning approach. Drones, Volume 9, Issue 6, June 2025, Article number 409en_UK
dc.identifier.eissn2504-446X
dc.identifier.elementsID673592
dc.identifier.issn2504-446X
dc.identifier.issueNo6
dc.identifier.paperNo409
dc.identifier.urihttps://doi.org/10.3390/drones9060409
dc.identifier.urihttps://dspace.lib.cranfield.ac.uk/handle/1826/24039
dc.identifier.volumeNo9
dc.languageEnglish
dc.language.isoen
dc.publisherMDPIen_UK
dc.publisher.urihttps://www.mdpi.com/2504-446X/9/6/409
dc.rightsAttribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subject4602 Artificial Intelligenceen_UK
dc.subject4 Quality Educationen_UK
dc.subject40 Engineeringen_UK
dc.subject46 Information and computing sciencesen_UK
dc.subjectlifelong reinforcement learningen_UK
dc.subjectshaping reward heuristicen_UK
dc.subjecteligibility tracesen_UK
dc.subjectState-Action-Reward-State-Action (Sarsa)en_UK
dc.subjectrescue environmentsen_UK
dc.subjectdronesen_UK
dc.titleGeneralising rescue operations in disaster scenarios using drones: a lifelong reinforcement learning approachen_UK
dc.typeArticle
dcterms.dateAccepted2025-06-01

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Generalising_Rescue_Operations-2025.pdf
Size:
1.31 MB
Format:
Adobe Portable Document Format
Description:
Published version

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.63 KB
Format:
Plain Text
Description: