A COLREGs compliance reinforcement learning approach for USV manoeuvring in track-following and collision avoidance problems
dc.contributor.author | Sonntag, Valentin | |
dc.contributor.author | Perrusquía, Adolfo | |
dc.contributor.author | Tsourdos, Antonios | |
dc.contributor.author | Guo, Weisi | |
dc.date.accessioned | 2025-01-09T14:15:58Z | |
dc.date.available | 2025-01-09T14:15:58Z | |
dc.date.freetoread | 2025-01-09 | |
dc.date.issued | 2025-01-15 | |
dc.date.pubOnline | 2024-12-02 | |
dc.description.abstract | The development of new technologies for autonomous platforms has allowed their integration into sea mine countermeasures. This has allowed to remove the personnel from the potential danger by having the mine search task performed by an unmanned surface vessel (USV). Traditional intelligent systems are built by agglomerating hand-coded behaviours that determine how a good manoeuvre looks like. This induces cognitive bias into the pre-defined behaviours that can violate safety and regulatory rules imposed by the COLREGs. To alleviate this issue, this paper proposes a COLREGs compliant reinforcement learning (RL) approach that gives a solution for the autonomous navigation of USVs. A custom simulation environment is developed. The RL agents are trained to deal with path-following problem with obstacle avoidance capabilities. A custom reward function is defined to consider the turning disks for the agent's decision process. A smoothing decision feature is used to smooth the transitions between consecutive actions. The results demonstrate good convergence and high performance under different scenarios. The collision avoidance with COLREGs compliances shows the effectiveness of the proposed approach under several scenarios with static and moving obstacles. | |
dc.description.journalName | Ocean Engineering | |
dc.identifier.citation | Sonntag V, Perrusquía A, Tsourdos A, Guo W. (2025) A COLREGs compliance reinforcement learning approach for USV manoeuvring in track-following and collision avoidance problems. Ocean Engineering, Volume 316, January 2025, Article number 119907 | |
dc.identifier.elementsID | 560119 | |
dc.identifier.issn | 0029-8018 | |
dc.identifier.paperNo | 119907 | |
dc.identifier.uri | https://doi.org/10.1016/j.oceaneng.2024.119907 | |
dc.identifier.uri | https://dspace.lib.cranfield.ac.uk/handle/1826/23344 | |
dc.identifier.volumeNo | 316 | |
dc.language | English | |
dc.language.iso | en | |
dc.publisher | Elsevier | |
dc.publisher.uri | https://www.sciencedirect.com/science/article/pii/S0029801824032451?via%3Dihub | |
dc.rights | Attribution 4.0 International | en |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
dc.subject | 4012 Fluid Mechanics and Thermal Engineering | |
dc.subject | 4005 Civil Engineering | |
dc.subject | 4015 Maritime Engineering | |
dc.subject | 40 Engineering | |
dc.subject | 14 Life Below Water | |
dc.subject | Civil Engineering | |
dc.subject | 4005 Civil engineering | |
dc.subject | 4012 Fluid mechanics and thermal engineering | |
dc.subject | 4015 Maritime engineering | |
dc.subject | USV | |
dc.subject | COLREGs | |
dc.subject | Reinforcement learning | |
dc.subject | Reward design | |
dc.subject | Smoothing decision feature | |
dc.title | A COLREGs compliance reinforcement learning approach for USV manoeuvring in track-following and collision avoidance problems | |
dc.type | Article | |
dc.type.subtype | Journal Article | |
dcterms.dateAccepted | 2024-11-22 |