DOI QR코드

DOI QR Code

Adaptive Bayesian Object Tracking with Histograms of Dense Local Image Descriptors

  • Kim, Minyoung (Department of Electronics & IT Media Engineering, Seoul National University of Science & Technology)
  • 투고 : 2016.05.31
  • 심사 : 2016.06.21
  • 발행 : 2016.06.30

초록

Dense local image descriptors like SIFT are fruitful for capturing salient information about image, shown to be successful in various image-related tasks when formed in bag-of-words representation (i.e., histograms). In this paper we consider to utilize these dense local descriptors in the object tracking problem. A notable aspect of our tracker is that instead of adopting a point estimate for the target model, we account for uncertainty in data noise and model incompleteness by maintaining a distribution over plausible candidate models within the Bayesian framework. The target model is also updated adaptively by the principled Bayesian posterior inference, which admits a closed form within our Dirichlet prior modeling. With empirical evaluations on some video datasets, the proposed method is shown to yield more accurate tracking than baseline histogram-based trackers with the same types of features, often being superior to the appearance-based (visual) trackers.

키워드

참고문헌

  1. K. Zhang, Q. Liu, Y. Wu, and M. H. Yang, "Robust visual tracking via convolutional networks without training," IEEE Transactions on Image Processing, vol. 25, no. 4, pp. 1779-1792, 2016. http://dx.doi.org/10.1109/TIP.2016.2531283
  2. J. H. Yoon, M. H. Yang, and K. J. Yoon, "Interacting multiview tracker," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 38, no. 5, pp. 903-917, 2016. http://dx.doi.org/10.1109/TPAMI.2015.2473862
  3. M. J. Black and A. D. Jepson, "EigenTracking: Robust matching and tracking of articulated objects using a viewbased representation," International Journal of Computer Vision, vol. 26, no. 1, pp. 63-84, 1998. http://dx.doi.org/10.1023/A:1007939232436
  4. D. A. Ross, J. Lim, R. S. Lin, and M. H. Yang, "Incremental learning for robust visual tracking," International Journal of Computer Vision, vol. 77, no. 1, pp. 125-141, 2008. http://dx.doi.org/10.1007/s11263-007-0075-7
  5. M. Kim, "Correlation-based incremental visual tracking," Pattern Recognition, vol. 45, no. 3, pp. 1050-1060, 2012. http://dx.doi.org/10.1016/j.patcog.2011.08.026
  6. L. Sevilla-Lara and E. Learned-Miller, "Distribution fields for tracking," in Proceedings of 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, 2012, pp. 1910-1917. http://dx.doi.org/10.1109/CVPR.2012.6247891
  7. D. Comaniciu, V. Ramesh, and P. Meer, "Kernel-based object tracking," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, no. 5, pp. 564-577, 2003. http://dx.doi.org/10.1109/TPAMI.2003.1195991
  8. A. Adam, E. Rivlin, and I. Shimshoni, "Robust fragmentsbased tracking using the integral histogram," in Proceedings of 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York, NY, 2006, pp. 798-805. http://dx.doi.org/10.1109/CVPR.2006.256
  9. S. He, Q. Yang, R. W. H. Lau, J. Wang, and M. H. Yang, "Visual tracking via locality sensitive histograms," in Proceedings of 2013 IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, 2013, pp. 2427-2434. http://dx.doi.org/10.1109/CVPR.2013.314
  10. A. Bolovinou, I. Pratikakis, and S. Perantonis, "Bag of spatio-visual words for context inference in scene classification," Pattern Recognition, vol. 46, no. 3, pp. 1039-1053, 2013. http://dx.doi.org/10.1016/j.patcog.2012.07.024
  11. D. G. Lowe, "Distinctive image features from scaleinvariant keypoints," International Journal of Computer Vision, vol. 60, no. 2, pp. 91-110, 2004. http://dx.doi.org/10.1023/B:VISI.0000029664.99615.94
  12. L. Fei-Fei and P. Perona, "A Bayesian hierarchical model for learning natural scene categories," in Proceedings of 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Diego, CA, 2005, pp. 524-531. http://dx.doi.org/10.1109/CVPR.2005.16
  13. W. Chong, D. Blei, and F. F. Li, "Simultaneous image classification and annotation," in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, 2009, pp. 1903-1910. http://dx.doi.org/10.1109/CVPR.2009.5206800
  14. A. Levey and M. Lindenbaum, "Sequential Karhunen-Loeve basis extraction and its application to images," IEEE Transactions on Image Processing, vol. 9, no. 8, pp. 1371-1374, 2000. http://dx.doi.org/10.1109/83.855432
  15. J. Deng, W. Dong, R. Socher, L. J. Li, K. Li, and L. Fei-Fei, "ImageNet: a large-scale hierarchical image database," in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, 2009, pp. 248-255. http://dx.doi.org/10.1109/CVPR.2009.5206848
  16. O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, et al, "ImageNet large scale visual recognition challenge," International Journal of Computer Vision, vol. 115, no. 3, pp. 211-252, 2015. http://dx.doi.org/10.1007/s11263-015-0816-y
  17. A. Barla, F. Odone, and A. Verri, "Histogram intersection kernel for image classification," in Proceedings of International Conference on Image Processing, Barcelona, Spain, 2003, pp. 513-516. http://dx.doi.org/10.1109/ICIP.2003.1247294
  18. O. Pele and M. Werman, "The quadratic-chi histogram distance family," in Proceedings of 11th European Conference on Computer Vision, Crete, Greece, 2010, pp. 749-762. http://dx.doi.org/10.1007/978-3-642-15552-9_54