DOI QR코드

DOI QR Code

A two-stage cascaded foreground seeds generation for parametric min-cuts

  • Li, Shao-Mei (National Digital Switching System Engineering and Technological Research and Development Center) ;
  • Zhu, Jun-Guang (National Digital Switching System Engineering and Technological Research and Development Center) ;
  • Gao, Chao (National Digital Switching System Engineering and Technological Research and Development Center) ;
  • Li, Chun-Wei (National Digital Switching System Engineering and Technological Research and Development Center)
  • Received : 2016.03.07
  • Accepted : 2016.10.24
  • Published : 2016.11.30

Abstract

Parametric min-cuts is an object proposal algorithm, which can be used for accurate image segmentation. In parametric min-cuts, foreground seeds generation plays an important role since the number and quality of foreground seeds have great effect on its efficiency and accuracy. To improve the performance of parametric min-cuts, this paper proposes a new framework for foreground seeds generation. First, to increase the odds of finding objects, saliency detection at multiple scales is used to generate a large set of diverse candidate seeds. Second, to further select good-quality seeds, a two-stage cascaded ranking classifier is used to filter and rank the candidates based on their appearance features. Experimental results show that parametric min-cuts using our seeding strategy can obtain a relative small pool of proposals with high accuracy.

Keywords

References

  1. J. Hosang, R. Benenson, P. Dollar and B. Schiele, "What makes for effective detection proposals?" IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.38, no.4, pp. 6644-6665, August, 2016.
  2. B. Alexe, T. Deselaers, V. Ferrari, "What is an object?" in Proc. of 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 73-80, June 13-18, 2010.
  3. KEAVD Sande, JRR Uijlings, T. Gevers and AWM Smeulders, "Segmentation as selective search for object recognition," in Proc. of the 2011 IEEE Conference on Computer Vision (ICCV), pp. 1879-1886, January 12-16, 2011.
  4. J. Carreira, C. Sminchisescu, " Constrained parametric min-cuts for automatic object segmentation," in Proc. of the 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3241-3248, June 13-18, 2010.
  5. R. Girshick, J. Donahue, T. Darrell and J. Malik, " Rich feature hierarchies for accurate object detection and semantic segmentation," in Proc. of the 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 580-587, June 23-28,2014.
  6. L. Gomez and D. Karatzas, "Object Proposals for Text Extraction in the Wild," in Proc. of 2015 International Conference on Document Analysis and Recognition, pp.1786-1812, August 23-26,2015.
  7. A. Milan, L. Leal-Taixe, K. Schindler, et al, "Joint tracking and segmentation of multiple targets," in Proc. of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 5397-5406, June 7-12,2015.
  8. J. Carreira, R. Caseiro, J. Batista, et al, "Semantic segmentation with second-order pooling," in Proc. of 2012 European Conference on Computer Vision (ECCV), pp. 430-443, October 7-13, 2012.
  9. A. Borji, D. N Sihite, L. Itti, "Salient object detection: A Benchmark,"IEEE Transactions on Image Processing, vol. 24, no. 12, pp. 5706-5722, October, 2015. https://doi.org/10.1109/TIP.2015.2487833
  10. B. Alexe, T. Deselaers, V. Ferrari, "Measuring the objectness of image windows," PAMI, vol. 34, no.11, pp. 2189-2202, November, 2012. https://doi.org/10.1109/TPAMI.2012.28
  11. E. Rahtu , J. Kannala, M. Blaschko, "Learning a category independent object detection cascade," in Proc. of the 2011 International Conference on Computer Vision (ICCV) , pp. 1052-1059, November 6-13, 2011.
  12. M.M. Cheng, Z. Zhang, W.Y. Lin and P. Torr, "BING: Binarized normed gradients for objectness estimation at 300fps," in Proc. of the 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3286-3293, June 23-28,2014.
  13. S. Manen, M. Guillaumin, L.V. Gool, "Prime object proposals with randomized prims algorithm," in Proc. of the 2013 International Conference on Computer Vision (ICCV), pp. 2536-2543, December 1-8,2013.
  14. X. Wang, M. Yang, S. Zhu and Y. Lin, "Regionlets for generic object detection," IEEE Transactions on Pattern Analysis & Machine Intelligence, vol.37, no.10, pp.17-24, January, 2015.
  15. P.F. Felzenszwalb, D.P. Huttenlocher, "Efficient graph-based image segmentation," IJCV, vol.59, no.2, pp.167-181, September, 2004. https://doi.org/10.1023/B:VISI.0000022288.19776.77
  16. J. Carreira, C. Sminchisescu, "Cpmc: Automatic object segmentation using constrained parametric min-cuts," PAMI, vol.34, no.7, pp. 1312-1328, December, 2011.
  17. I. Endres, D. Hoiem, "Category-independent object proposals with diverse ranking," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.36, no.2, pp. 222-234, June, 2014. https://doi.org/10.1109/TPAMI.2013.122
  18. P. Rantalankila, J. Kannala, E. Rahtu, "Generating object segmentation proposals using global and local search," in Proc. of the 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2417-2424, June 23-28, 2014.
  19. C. L. Zitnick, P. Dollar, "Edge boxes: Locating object proposals from edges," in Proc. of 2014 European Conference on Computer Vision (ECCV), pp. 391-405, September 6-12, 2014.
  20. M. Everingham, S. M. A. Eslami, L. Van Gool, et al, "The pascal visual object classes challenge: A retrospective,"IJCV, vol.111, no.1, pp. 98-136, January, 2015. https://doi.org/10.1007/s11263-014-0733-5
  21. J. Kim, K. Grauman, "Shape sharing for object segmentation," in Proc. of 2012 European Conference on Computer Vision (ECCV), pp. 444-458, October 7-13,2012.
  22. T. Lee, S. Fidler, S. Dickinson, "Multi-cue mid-level grouping," in Proc. of 2014 Asian Conference on Computer Vision (ACCV), pp. 376-390, April 16-20, 2014.
  23. T. Lee, S. Fidler, S. Dickinson, "Learning to Combine Mid-level Cues for Object Proposal Generation," in Proc. of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1680-168, February 18, 2015.
  24. B. Singh, X. Han, Z. Wu, et al, "PSPGC: Part-Based Seeds for Parametric Graph-Cuts," in Proc. of 2014 Asican Conference on Computer Vision (ACCV), pp. 360-375, April 16, 2014.
  25. P. F. Felzenszwalb, R. B. Girshick, D. McAllester, et al, "Object detection with discriminatively trained part-based models," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.32, no.9, pp. 1627-1645, September, 2009. https://doi.org/10.1109/TPAMI.2009.167
  26. P. Krahenbuhl, V. Koltun, "Geodesic object proposals," in Proc. of 2014 European Conference on Computer Vision (ECCV), pp. 725-739, September 5-8, 2014.
  27. X. Hou, L. Zhang, "Saliency detection: A spectral residual approach," in Proc. of the 2007 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1-8, June 17-22, 2007.
  28. T. Y. Liu, Learning to rank for information retrieval, 2nd Edition, Springer Berlin Heidelberg, 2011.
  29. T. Joachims, "Optimizing search engines using clickthrough data," in Proc. of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pp. 133-142, August 12-15,2002.
  30. C. Gu, J. J. Lim, P. Arbelaez, et al, "Recognition using regions," in Proc. of the 2009 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1030-1037, June 20-25, 2009.
  31. N. Dalal, B. Triggs, "Histograms of oriented gradients for human detection," in Proc. of the 2005 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 886-893, June 20-25, 2005.
  32. I. Tsochantaridis, T. Joachims, T. Hofmann, et al, "Large margin methods for structured and interdependent output variables," Journal of Machine Learning Research. vol.6, no.2, pp. 1453-1484, January, 2005.
  33. T. Joachims, T. Finley, C. N. J. Yu, "Cutting-plane training of structural SVMs," Machine Learning, vol.77, no.1, pp. 27-59, October, 2009. https://doi.org/10.1007/s10994-009-5108-8
  34. A. Humayun, F. Li, J. M. Rehg, "RIGOR: Reusing inference in graph cuts for generating object regions," in Proc. of the 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 336-343, June 23-28, 2014.
  35. V. Kolmogorov, Y. Boykov, C. Rother, "Applications of parametric maxflow in computer vision," in Proc. of the 2007 International Conference on Computer Vision (ICCV) , pp. 1-8, Octorber 14-21, 2007.
  36. M. Maire, P. Arbelaez, C. Fowlkes, et al, "Using contours to detect and localize junctions in natural images," in Proc. of the 2008 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1-8, June 23-28,2008.
  37. P. Dollar, C. L. Zitnick, "Fast edge detection using structured forests," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.37, no.8, pp.1558-1570, December, 2015. https://doi.org/10.1109/TPAMI.2014.2377715
  38. J. Pont-Tuset, P. Arbelaez, J. T. Barron, et al, "Multiscale Combinatorial Grouping for Image Segmentation and Object Proposal Generation," IEEE Transactions on Pattern Analysis & Machine Intelligence, pp. 1-1, March, 2016.