References
- I. Lillo et al. "Sparse composition of body poses and atomic actions for human activity recognition in RGB-D videos," Image and Vision Computing, vol. 59, pp. 63-75, 2017. https://doi.org/10.1016/j.imavis.2016.11.004
- H.S. Min et al. "Sparse representation-based human action recognition using an action region-aware dictionary," EEEISM, 2013.
- I. Laptev, "On Space-Time Interest Points," Int. J. of Computer Vision, vol. 64, pp. 107-123, 2005. https://doi.org/10.1007/s11263-005-1838-7
- P. Dollar et al. "Behavior recognition via sparse spatio-temporal features," VS-PETS, 2005.
- G. Willems et al. "An efficient dense and scale invariant spatio-temporal interest point detector," ECCV, 2008.
- I. Laptev et al. "Learning realistic human actions from movies," CVPR, 2008.
- A. Klaeser et al. "A spatio-temporal descriptor based on 3D-gradients," BMVC, 2008.
- H. Wang et al. "Evaluation of local spatio-temporal features for action recognition," BMVC, 2009.
- V. Delaitre et al. "Recognizing human actions in still images: a study of bag-of-features and part-based representations," BMVC, 2010.
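- J. Hong et al. "Human action recognition in still images using weighted Bag-of-Features and ensemble decision trees," The Journal of Korean Institute of Communications and Information Sciences, 2013.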
- L. Breiman, "Random forests," Machine Learning, vol. 45, pp. 5-32, 2001. https://doi.org/10.1023/A:1010933404324
- K. Simonyan et al. "Two-stream convolutional networks for action recognition in videos," NIPS, 2014.
- S. Ji et al. "3D convolutional neural networks for human action recognition," IEEE Trans. PAMI, vol. 35, pp. 221-231, 2013. https://doi.org/10.1109/TPAMI.2012.59
- D. Tran et al. "Learning spatiotemporal features with 3D convolutional networks," ICCV, 2015.
- A. Alahi et al. "Social LSTM: Human Trajectory Prediction in Crowded Spaces," CVPR, 2016.
- S. Hochreiter and J. Schmidhuber, "Long Short-Term Memory," Neural Computation, vol. 9, pp. 1735-1780, 1997. https://doi.org/10.1162/neco.1997.9.8.1735
- J. Donahue et al. "Long-term Recurrent Convolutional Networks for Visual Recognition and Description," Berkeley Tech. Report, 2014.
- X. Wang et al. "Beyond Frame-level CNN: Saliency-aware 3D CNN with LSTM for Video Action Recognition," IEEE Sig. Processing Letters, 2016.
- M. S. Ibrahim et al. "A hierarchical deep temporal model for group activity recognition," CVPR, 2016.