[1]DAI Jinli,CAO Jiangtao,JI Xiaofei.Two-person interaction recognition based on the interactive relationship hypergraph convolution network model[J].CAAI Transactions on Intelligent Systems,2024,19(2):316-324.[doi:10.11992/tis.202208001]
Copy
CAAI Transactions on Intelligent Systems[ISSN 1673-4785/CN 23-1538/TP] Volume:
19
Number of periods:
2024 2
Page number:
316-324
Column:
学术论文—机器感知与模式识别
Public date:
2024-03-05
- Title:
-
Two-person interaction recognition based on the interactive relationship hypergraph convolution network model
- Author(s):
-
DAI Jinli1; CAO Jiangtao1; JI Xiaofei2
-
1. School of Information and Control Engineering, Liaoning Petrochemical University, Fushun 113001, China;
2. School of Automation, Shenyang Aerospace University, Shenyang 110136, China
-
- Keywords:
-
two-person interaction; behavior recognition; skeleton node data; deep learning; ST-GCN; hypergraph; graph structure; neural networks
- CLC:
-
TP183
- DOI:
-
10.11992/tis.202208001
- Abstract:
-
To enhance the security of schools, shopping malls, and other public places, it is important to achieve automatic identification of abnormal two-person interaction behaviors, such as stealing, robbing, fighting, and assaulting, in surveillance videos. However, the current behavior recognition method based on joint data in graph creation neglects the two-person interaction information as well as the interaction relationship between the single unnatural connection joints. To address this issue, a two-person interaction behavior recognition model based on the interactive relationship hypergraph convolution network is proposed to model and identify human interactions. First, the corresponding single hypergraph and two-person interaction graph are created for the joint-point data of each frame, where the hypergraph makes the information of multiple unnaturally connected nodes interchangeable at the same time, and the interaction graph emphasizes the interaction strength between nodes. The above-constructed graph models are fed into the spatiotemporal graph convolution to model the spatial and temporal information separately, and lastly, the recognition results are acquired by the SoftMax classifier. The benefits of the proposed algorithm framework are that the interactive relationship between two persons, the structural relationship between unnatural connections, and the flexible motion characteristics of limbs are regarded in the graph construction process. Tests on the NTU data set demonstrate that the algorithm attains a correct recognition rate of 97.36%. The findings indicate that the network model enhances the ability to represent the characteristics of two-person interaction and has better recognition performance than the current models.