Please use this identifier to cite or link to this item: http://hdl.handle.net/2080/3483
Title: A Three Stream Deep Network on Extracted Projected Planes for Human Action Recognition
Authors: Sahoo, Suraj Prakash
Ari, Samit
Keywords: Convolutional neural network
Projected planes, score fusion
Transfer learning
Issue Date: Jan-2020
Citation: International Conference on Computer, Electrical & Communication Engineering (ICCECE-2020), 17-18 January 2020, Kolkata, West Bengal, India
Abstract: Human actions are challenging to recognize as it varies its shape from different angle of perception. To tackle this challenge, a multi view camera set up can be arranged, however, it is not cost effective. To handle this issue, a multi stream deep learning network is proposed in this work which is trained on different 3D projected planes. The extracted projected planes which represents different angle of perception, are used as an alternative to multi view action recognition. The projected planes are such that they represents top, side and front view for the action videos. The projected planes are then fed to a three stream deep convolutional neural network. The network uses transfer learning technique to avoid training from scratch. Finally, the scores from three streams are fused to provide the final score to recognize the query video. To evaluate the proposed work, the challenging KTH dataset is used which is widely used and publicly available. The results show that the proposed work performs better compared to state-of-the-art techniques.
Description: Copyright belongs to proceeding publisher
URI: http://hdl.handle.net/2080/3483
Appears in Collections:Conference Papers

Files in This Item:
File Description SizeFormat 
2020_ICCECE_SPSahoo_Three.pdf1.64 MBAdobe PDFView/Open    Request a copy


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.