Efficient Sparse Representation based Action Recognition in video
Ushapreethi P1, Lakshmi Priya G G2
1Ushapreethi P, School of Information Technology and Engineering, Vellore Institute of Technology, Vellore.
2Lakshmi Priya G G, School of Information Technology and Engineering, Vellore Institute of Technology, Vellore.
Manuscript received on November 26, 2019. | Revised Manuscript received on December 15, 2019. | Manuscript published on December 30, 2019. | PP: 728-732 | Volume-9 Issue-2, December, 2019. | Retrieval Number: B2950129219/2020©BEIESP | DOI: 10.35940/ijeat.B2950.129219
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)
Abstract: Human Action Recognition (HAR) is an interesting and helpful topic in various real-life applications such as surveillance based security system, computer vision and robotics. The selected features and feature representation methods, classification algorithms decides the accuracy of the HAR systems. A new feature called, Skeletonized STIP (Spatio Temporal Interest Points) is identified and used in this work. The skeletonization on the action video’s foreground frames are performed and the new feature is generated as STIP values of the skeleton frame sequence. Then the feature set is used for initial dictionary construction in sparse coding. The data for action recognition is huge, since the feature set is represented using the sparse representation. To refine the sparse representation the max pooling method is used and the action recognition is performed using SVM classifier. The proposed approach outperforms on the benchmark datasets.
Keywords: Skeletonization Sparse representation, Action recognition, Sparse coding, Sparse dictionaries, SVM classifier.