Generalising rescue operations in disaster scenarios using drones: a lifelong reinforcement learning approach

Loading...
Thumbnail Image

Date published

Free to read from

2025-06-26

Supervisor/s

Journal Title

Journal ISSN

Volume Title

Publisher

Department

Course name

ISSN

2504-446X

Format

Citation

Xu J, Panagopoulos D, Perrusquía A, et al., (2025) Generalising rescue operations in disaster scenarios using drones: a lifelong reinforcement learning approach. Drones, Volume 9, Issue 6, June 2025, Article number 409

Abstract

Search and rescue (SAR) operations in post-earthquake environments are hindered by unseen environment conditions and uncertain victim locations. While reinforcement learning (RL) has been used to enhance unmanned aerial vehicle (UAV) navigation in such scenarios, its limited generalisation to novel environments, such as post-disaster environments, remains a challenge. To deal with this issue, this paper proposes an RL-based framework that combines the principles of lifelong learning and eligibility traces. Here, the approach uses a shaping reward heuristic based on pre-training experiences obtained from similar environments to improve generalisation, and simultaneously, eligibility traces are used to accelerate convergence of the overall approach. The combined contributions allows the RL algorithm to adapt to new environments, whilst ensuring fast convergence, critical for rescue missions. Extensive simulation studies show that the proposed framework can improve the average reward return by 46% compared to baseline RL algorithms. Ablation studies are also conducted, which demonstrate a 23% improvement in the overall reward score in environments with different complexities and a 56% improvement in scenarios with varying numbers of trapped individuals.

Description

Software Description

Software Language

Github

Keywords

4602 Artificial Intelligence, 4 Quality Education, 40 Engineering, 46 Information and computing sciences, lifelong reinforcement learning, shaping reward heuristic, eligibility traces, State-Action-Reward-State-Action (Sarsa), rescue environments, drones

DOI

Rights

Attribution 4.0 International

Funder/s

Relationships

Relationships

Resources