Swarm intelligence in cooperative environments: introducing the N-step dynamic tree search algorithm
| dc.contributor.author | Espinós Longa, Marc | |
| dc.contributor.author | Inalhan, Gokhan | |
| dc.contributor.author | Tsourdos, Antonios | |
| dc.date.accessioned | 2022-01-24T09:50:02Z | |
| dc.date.available | 2022-01-24T09:50:02Z | |
| dc.date.issued | 2021-12-29 | |
| dc.description.abstract | Uncertainty and partial or unknown information about environment dynamics have led reward-based methods to play a key role in the Single-Agent and Multi-Agent Learning problem. Tree-based planning approaches such as Monte Carlo Tree Search algorithm have been a striking success in single-agent domains where a perfect simulator model is available, e.g., Go and chess strategic board games. This paper presents a decentralized tree-based planning scheme, that combines forward planning with direct reinforcement learning temporal-difference updates applied to the multi-agent setting. Forward planning requires an engine model which is learned from experience and represented via function approximation. Evaluation and validation are carried out in the Hunter-Prey Pursuit cooperative environment and performance is compared with state-of-the-art RL techniques. N-Step Dynamic Tree Search (NSDTS) pretends to adapt the most successful single-agent learning methods to the multi-agent boundaries in a decentralized system structure, dealing with scalability issues and exponential growth of computational resources suffered by centralized systems. NSDTS demonstrates to be a remarkable advance compared to the conventional Q-Learning temporal-difference method. | en_UK |
| dc.identifier.citation | Espinós Longa M, Inalhan G, Tsourdos A. (2021) Swarm intelligence in cooperative environments: introducing the N-step dynamic tree search algorithm. In: AIAA SciTech 2022 Forum, 3-7 January 2022, San Diego, CA, USA and Virtual Event, Paper number AIAA 2022-1839 | en_UK |
| dc.identifier.uri | https://doi.org/10.2514/6.2022-1839 | |
| dc.identifier.uri | https://dspace.lib.cranfield.ac.uk/handle/1826/17481 | |
| dc.language.iso | en | en_UK |
| dc.publisher | AIAA | en_UK |
| dc.rights | Attribution-NonCommercial 4.0 International | * |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc/4.0/ | * |
| dc.title | Swarm intelligence in cooperative environments: introducing the N-step dynamic tree search algorithm | en_UK |
| dc.type | Conference paper | en_UK |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Swarm_intelligence_in_cooperative_environments-2021.pdf
- Size:
- 693.06 KB
- Format:
- Adobe Portable Document Format
- Description:
License bundle
1 - 1 of 1
Loading...
- Name:
- license.txt
- Size:
- 1.63 KB
- Format:
- Item-specific license agreed upon to submission
- Description: