Showing 1–1 of 1 results for author: Duhme, M

Search v0.5.6 released 2020-02-24

arXiv:2109.12946

cs.CV

Fusion-GCN: Multimodal Action Recognition using Graph Convolutional Networks

Authors: Michael Duhme, Raphael Memmesheimer, Dietrich Paulus

Abstract: In this paper, we present Fusion-GCN, an approach for multimodal action recognition using Graph Convolutional Networks (GCNs). Action recognition methods based on GCNs have recently yielded state-of-the-art performance for skeleton-based action recognition. With Fusion-GCN, we propose to integrate various sensor data modalities into a graph that is trained using a GCN model for multimodal action recognition. Additional sensor measurements are incorporated into the graph representation, either on the channel dimension (introducing additional node attributes) or on the spatial dimension (introducing new nodes). Fusion-GCN was evaluated on two publicly available datasets, UTD-MHAD and MMACT, and demonstrates flexible fusion of RGB sequences, inertial measurements and skeleton sequences. Our approach achieves comparable results on the UTD-MHAD dataset and improves the baseline on the large-scale MMACT dataset by a significant margin of up to 12.37% (F1-Measure) with the fusion of skeleton estimates and accelerometer measurements.
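
The two fusion options named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes a skeleton tensor laid out as (channels, frames, joints) and an inertial signal laid out as (channels, frames). The function names `fuse_channel` and `fuse_spatial`, the zero-padding of the new node, and the choice of which joint the new node attaches to are all hypothetical choices made for illustration.

```python
import numpy as np


def fuse_channel(skeleton, imu):
    """Channel fusion (assumed layout): copy the IMU signal to every joint
    and append it as extra node attributes along the channel axis."""
    c_imu, t = imu.shape
    v = skeleton.shape[2]
    imu_per_joint = np.broadcast_to(imu[:, :, None], (c_imu, t, v))
    return np.concatenate([skeleton, imu_per_joint], axis=0)


def fuse_spatial(skeleton, imu, adjacency, attach_to):
    """Spatial fusion (assumed layout): add one new graph node that carries
    the IMU signal and connect it to an existing joint."""
    c, t, v = skeleton.shape
    # The new node needs the same channel count as the joints; pad with zeros.
    node = np.zeros((c, t, 1), dtype=skeleton.dtype)
    k = min(c, imu.shape[0])
    node[:k, :, 0] = imu[:k]
    fused = np.concatenate([skeleton, node], axis=2)
    # Grow the adjacency matrix by one row and column for the new node.
    adj = np.zeros((v + 1, v + 1), dtype=adjacency.dtype)
    adj[:v, :v] = adjacency
    adj[v, attach_to] = 1
    adj[attach_to, v] = 1
    return fused, adj


# Hypothetical sizes: 3 channels, 50 frames, 20 joints, 3-axis accelerometer.
skeleton = np.random.rand(3, 50, 20)
imu = np.random.rand(3, 50)
adjacency = np.eye(20)

channel_fused = fuse_channel(skeleton, imu)
spatial_fused, new_adjacency = fuse_spatial(skeleton, imu, adjacency, attach_to=0)
print(channel_fused.shape, spatial_fused.shape, new_adjacency.shape)
```

Channel fusion leaves the graph topology unchanged and widens each node's feature vector, whereas spatial fusion keeps the feature width and enlarges the graph, so the adjacency matrix has to grow with it.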

Submitted 27 September, 2021; originally announced September 2021.

Comments: 18 pages, 6 figures, 3 tables, GCPR 2021
