Multiscale human activity recognition and anticipation network

dc.contributor.authorXing, Yang
dc.contributor.authorGolodetz, Stuart
dc.contributor.authorEveritt, Aluna
dc.contributor.authorMarkham, Andrew
dc.contributor.authorTrigoni, Niki
dc.date.accessioned2022-06-13T15:30:39Z
dc.date.available2022-06-13T15:30:39Z
dc.date.issued2022-05-27
dc.description.abstractDeep convolutional neural networks have been leveraged to achieve huge improvements in video understanding and human activity recognition performance in the past decade. However, most existing methods focus on activities that have similar time scales, leaving the task of action recognition on multiscale human behaviors less explored. In this study, a two-stream multiscale human activity recognition and anticipation (MS-HARA) network is proposed, which is jointly optimized using a multitask learning method. The MS-HARA network fuses the two streams of the network using an efficient temporal-channel attention (TCA)-based fusion approach to improve the model's representational ability for both temporal and spatial features. We investigate the multiscale human activities from two basic categories, namely, midterm activities and long-term activities. The network is designed to function as part of a real-time processing framework to support interaction and mutual understanding between humans and intelligent machines. It achieves state-of-the-art results on several datasets for different tasks and different application domains. The midterm and long-term action recognition and anticipation performance, as well as the network fusion, are extensively tested to show the efficiency of the proposed network. The results show that the MS-HARA network can easily be extended to different application domains.en_UK
dc.identifier.citationXing Y, Golodetz S, Everitt A, et al., (2024) Multiscale human activity recognition and anticipation network. IEEE Transactions on Neural Networks and Learning Systems, Volume 35, Issue 1, January 2024, pp. 451-465en_UK
dc.identifier.issn2162-237X
dc.identifier.urihttps://doi.org.10.1109/TNNLS.2022.3175480
dc.identifier.urihttps://dspace.lib.cranfield.ac.uk/handle/1826/18016
dc.language.isoenen_UK
dc.publisherIEEEen_UK
dc.rightsAttribution-NonCommercial 4.0 International*
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/*
dc.subjectActivity recognition and anticipationen_UK
dc.subjectmultiscale behavior modelingen_UK
dc.subjectmultitask learningen_UK
dc.subjecttwo-stream network fusionen_UK
dc.titleMultiscale human activity recognition and anticipation networken_UK
dc.typeArticleen_UK

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Multi-scale_human_activity_recognition-2022.pdf
Size:
1.36 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.63 KB
Format:
Item-specific license agreed upon to submission
Description: