Action recognition using Kinematics Posture Feature on 3D skeleton joint locations
Article
Ahad, M. A. R., Ahmed, M., Antar, A. D., Makihara, Y. and Yagi, Y. 2021. Action recognition using Kinematics Posture Feature on 3D skeleton joint locations. Pattern Recognition Letters. 145, pp. 216-224. https://doi.org/10.1016/j.patrec.2021.02.013
Authors | Ahad, M. A. R., Ahmed, M., Antar, A. D., Makihara, Y. and Yagi, Y. |
---|---|
Abstract | Action recognition is a widely explored research area in computer vision and related fields. We propose Kinematics Posture Feature (KPF) extraction from 3D joint positions based on skeleton data to improve the performance of action recognition. In this approach, we consider the skeleton 3D joints as kinematics sensors. We propose the Linear Joint Position Feature (LJPF) and the Angular Joint Position Feature (AJPF), based on 3D linear joint positions and on the angles between bone segments. We then combine these two kinematics features for each video frame of each action to create the KPF feature sets. These feature sets encode the variation of motion in the temporal domain, as if each body joint were a kinematic position and orientation sensor. In the next stage, we process the extracted KPF descriptor with a low-pass filter and segment it using sliding windows of optimized length, resembling the way kinematics sensor data are processed. From the segmented windows, we compute Position-based Statistical Features (PSF), which consist of temporal-domain statistics (e.g., mean, standard deviation, and variance). These statistical features encode the variation of postures (i.e., joint positions and angles) across the video frames. For classification, we explore linear Support Vector Machine, RNN, CNNRNN, and ConvRNN models. The proposed PSF feature sets demonstrate strong performance in both statistical machine learning- and deep learning-based models. For evaluation, we use five benchmark datasets, namely UTKinect-Action3D, the Kinect Activity Recognition Dataset (KARD), MSR 3D Action Pairs, Florence 3D, and the Office Activity Dataset (OAD). To prevent overfitting, we adopt a leave-one-subject-out experimental setup and perform 10-fold cross-validation. Our approach outperforms several existing methods on these benchmark datasets and achieves promising classification performance. |
Keywords | AI; Activity recognition; Skeleton; Vision; Deep learning |
Journal | Pattern Recognition Letters |
Journal citation | 145, pp. 216-224 |
ISSN | 0167-8655 |
Year | 2021 |
Publisher | Elsevier |
Publisher's version | File access level: Anyone |
Digital Object Identifier (DOI) | https://doi.org/10.1016/j.patrec.2021.02.013 |
Publication dates | |
Online | 03 Mar 2021 |
In print | May 2021 |
Publication process dates | |
Accepted | 26 Feb 2021 |
Deposited | 04 Dec 2023 |
Copyright holder | © 2021, The Authors |
https://repository.uel.ac.uk/item/8wz27
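As a rough illustration of the pipeline summarised in the abstract (per-frame KPF built from linear joint positions and bone-segment angles, low-pass filtering, sliding-window segmentation, and windowed statistics), the sketch below shows one way it might be prototyped in Python. The joint triplets, filter order and cutoff, window length, and the particular set of statistics are assumptions for illustration, not the paper's tuned values.

```python
import numpy as np
from scipy.signal import butter, filtfilt

def bone_angle(joints, a, b, c):
    """Angle (radians) at joint b between bone segments b->a and b->c,
    for a joint array of shape (T, J, 3)."""
    v1 = joints[:, a] - joints[:, b]
    v2 = joints[:, c] - joints[:, b]
    cos = np.sum(v1 * v2, axis=-1) / (
        np.linalg.norm(v1, axis=-1) * np.linalg.norm(v2, axis=-1) + 1e-8)
    return np.arccos(np.clip(cos, -1.0, 1.0))

def kpf(joints, triplets):
    """Per-frame Kinematics Posture Feature: flattened 3D joint positions
    (the linear part, LJPF) concatenated with bone-segment angles (AJPF)."""
    T = joints.shape[0]
    ljpf = joints.reshape(T, -1)                 # linear joint positions
    ajpf = np.stack([bone_angle(joints, *t) for t in triplets], axis=1)
    return np.concatenate([ljpf, ajpf], axis=1)

def psf(kpf_seq, win=30, step=15, cutoff=0.2):
    """Position-based Statistical Features: low-pass filter the KPF stream,
    then compute temporal statistics over sliding windows.
    Filter order/cutoff and window/step sizes are illustrative defaults."""
    b, a = butter(4, cutoff)                     # 4th-order low-pass filter
    smooth = filtfilt(b, a, kpf_seq, axis=0)
    feats = []
    for s in range(0, len(smooth) - win + 1, step):
        w = smooth[s:s + win]
        feats.append(np.concatenate([w.mean(0), w.std(0), w.var(0),
                                     w.min(0), w.max(0)]))
    return np.asarray(feats)

# Example: 100 frames of a 20-joint skeleton; the (shoulder, elbow, wrist)-style
# index triplets below are hypothetical, not a specific skeleton layout.
joints = np.random.rand(100, 20, 3)
features = psf(kpf(joints, triplets=[(4, 5, 6), (8, 9, 10)]))
print(features.shape)   # (number of windows, 5 x KPF dimensionality)
```

The resulting window-level PSF vectors would then be fed to a classifier such as the linear SVM or one of the recurrent models named in the abstract.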