Multi-Grained Temporal Clip Transformer for Skeleton-Based Human Activity Recognition
2025
Peiwang Zhu | Chengwu Liang | Yalong Liu | Songqi Jiang
Skeleton-based human activity recognition is a key research topic in the fields of deep learning and computer vision. However, existing approaches are less effective at capturing short-term sub-action information at different granularity levels and long-term motion correlations, which limits recognition accuracy. To overcome these challenges, an innovative multi-grained temporal clip transformer (MTC-Former) is proposed. Firstly, based on the transformer backbone, a multi-grained temporal clip attention (MTCA) module with a multi-branch architecture is proposed to capture short-term sub-action features at multiple granularities. Secondly, an innovative multi-scale spatial–temporal feature interaction module is proposed to jointly learn sub-action dependencies and facilitate skeletal motion interactions, where long-range motion patterns are embedded to enhance correlation modeling. Experiments were conducted on three datasets, NTU RGB+D, NTU RGB+D 120, and InHARD, and achieved state-of-the-art Top-1 recognition accuracy, demonstrating the superiority of the proposed MTC-Former.
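The abstract does not give implementation details, but the core idea of multi-grained temporal clip attention can be illustrated with a minimal sketch: a skeleton feature sequence is split into non-overlapping temporal clips at several clip lengths (granularities), self-attention is applied within each clip, and the branches are fused. All names, clip lengths, and the averaging fusion below are hypothetical illustrations, not the authors' actual MTCA design.

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def clip_attention(x, clip_len):
    """Self-attention applied independently within non-overlapping
    temporal clips of length clip_len. x has shape (T, d)."""
    T, d = x.shape
    assert T % clip_len == 0, "sequence length must divide into clips"
    out = np.empty_like(x)
    for s in range(0, T, clip_len):
        clip = x[s:s + clip_len]                      # (clip_len, d)
        scores = clip @ clip.T / np.sqrt(d)           # frame-to-frame similarity
        out[s:s + clip_len] = softmax(scores) @ clip  # attention-weighted sum
    return out

def multi_grained_clip_attention(x, clip_lens=(2, 4, 8)):
    """Hypothetical multi-branch fusion: one clip-attention branch per
    granularity, averaged (the paper's fusion rule is not specified)."""
    return np.mean([clip_attention(x, c) for c in clip_lens], axis=0)

# toy skeleton feature sequence: 8 frames, 16-dim features per frame
rng = np.random.default_rng(0)
x = rng.standard_normal((8, 16))
y = multi_grained_clip_attention(x)
print(y.shape)  # (8, 16): per-frame features enriched with clip context
```

Shorter clips emphasize fine-grained sub-actions while longer clips capture coarser motion context; the real model would additionally need the spatial–temporal interaction module for long-range correlations.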
Bibliographic information
This record was provided by the Directory of Open Access Journals.