DOI QR코드

DOI QR Code

Three-Dimensional Shape Recognition and Classification Using Local Features of Model Views and Sparse Representation of Shape Descriptors

  • Received : 2017.05.17
  • Accepted : 2018.05.31
  • Published : 2020.04.30

Abstract

In this paper, a new algorithm is proposed for three-dimensional (3D) shape recognition using local features of model views and its sparse representation. The algorithm starts with the normalization of 3D models and the extraction of 2D views from uniformly distributed viewpoints. Consequently, the 2D views are stacked over each other to from view cubes. The algorithm employs the descriptors of 3D local features in the view cubes after applying Gabor filters in various directions as the initial features for 3D shape recognition. In the training stage, we store some 3D local features to build the prototype dictionary of local features. To extract an intermediate feature vector, we measure the similarity between the local descriptors of a shape model and the local features of the prototype dictionary. We represent the intermediate feature vectors of 3D models in the sparse domain to obtain the final descriptors of the models. Finally, support vector machine classifiers are used to recognize the 3D models. Experimental results using the Princeton Shape Benchmark database showed the average recognition rate of 89.7% using 20 views. We compared the proposed approach with state-of-the-art approaches and the results showed the effectiveness of the proposed algorithm.

Keywords

References

  1. J. S. Nam, H. Gao, M. K. Kang, K. T. Kim, S. C. Son, C. U. Pom, and K. Heo, "Scenario-based 3D Objects Synthesizing System Design," Journal of Information Processing Systems, vol. 2, no. 1, pp. 18-22, 2006. https://doi.org/10.3745/JIPS.2006.2.1.018
  2. I. Atmosukarto, "3D shape analysis for quantification, classification, and retrieval," Ph.D. dissertation, University of Washington, Seattle, WA, 2010.
  3. J. W. Tangelder and R. C. Veltkamp, "A survey of content based 3D shape retrieval methods," Multimedia Tools and Applications, vol. 39, article no. 441, 2008.
  4. M. A. Savelonas, I. Pratikakis, and K. Sfikas, "An overview of partial 3D object retrieval methodologies," Multimedia Tools and Applications, vol. 74, pp. 11783-11808, 2015. https://doi.org/10.1007/s11042-014-2267-9
  5. C. Li and A. B. Hamza, "Spatially aggregating spectral descriptors for nonrigid 3D shape retrieval: a comparative survey," Multimedia Systems, vol. 20, no. 3, pp. 253-281, 2014. https://doi.org/10.1007/s00530-013-0318-0
  6. G. L. Lopez, A. P. P. Negron, A. D. A. Jimenez, J. R. Rodriguez, and R. I. Paredes, "Comparative analysis of shape descriptors for 3D objects," Multimedia Tools and Applications, vol. 76, pp. 6993-7040, 2017. https://doi.org/10.1007/s11042-016-3330-5
  7. M. Elad, A. Tal, and S. Ar, "Content based retrieval of VRML objects: an iterative and interactive approach," in Multimedia 2001. Vienna: Springer, 2002, pp. 107-118.
  8. R. Osada, T. Funkhouser, B. Chazelle, and D. Dobkin, "Shape distributions," ACM Transactions on Graphics, vol. 21, no. 4, pp. 807-832, 2002. https://doi.org/10.1145/571647.571648
  9. R. Ohbuchi, T. Minamitani, and T. Takei, "Shape-similarity search of 3D models by using enhanced shape functions," International Journal of Computer Applications in Technology, vol. 23, no. 2-4, pp. 70-85, 2005. https://doi.org/10.1504/IJCAT.2005.006466
  10. M. Mahmoudi and G. Sapiro, "Three-dimensional point cloud recognition via distributions of geometric distances," Graphical Models, vol. 71, no. 1, pp. 22-31, 2009. https://doi.org/10.1016/j.gmod.2008.10.002
  11. D. Saupe and D. V. Vranic, "3D model retrieval with spherical harmonics and moments," in Pattern Recognition. Heidelberg: Springer, 2001, pp. 392-397.
  12. M. Kazhdan, T. Funkhouser, and S. Rusinkiewicz, "Rotation invariant spherical harmonic representation of 3D shape descriptors," in Proceedings of Eurographics Symposium on Geometry Processing, Archen, Germany, 2003, pp. 156-164.
  13. H. Laga, H. Takahashi, an M. Nakajima, "Spherical wavelet descriptors for content-based 3D model retrieval," in Proceedings of IEEE International Conference on Shape Modeling and Applications, Matsushima, Japan, 2006.
  14. P. Shilane and T. Funkhouser, "Distinctive regions of 3D surfaces," ACM Transactions on Graphics, vol. 26, no. 2, article no. 7-es, 2007.
  15. Y. Zhao, Y. Liu, Y. Wang, B. Wei, J. Yang, Y. Zhao, and Y. Wang, "Region-based saliency estimation for 3D shape analysis and understanding," Neurocomputing, vol. 197, pp. 1-13, 2016. https://doi.org/10.1016/j.neucom.2016.01.012
  16. I. Atmosukarto, K. Wilamowska, C. Heike, and L. G. Shapiro, "3D object classification using salient point patterns with application to craniofacial research," Pattern Recognition, vol. 43, no. 4, pp. 1502-1517, 2010. https://doi.org/10.1016/j.patcog.2009.11.004
  17. I. Sipiran and B. Bustos, "Harris 3D: a robust extension of the Harris operator for interest point detection on 3D meshes," The Visual Computer, vol. 27, no. 11, article no. 963, 2011.
  18. S. Bu, Z. Liu, J. Han, J. Wu, and R. Ji, "Learning high-level feature by deep belief networks for 3-D model retrieval and recognition," IEEE Transactions on Multimedia, vol. 16, no. 8, pp. 2154-2167, 2014. https://doi.org/10.1109/TMM.2014.2351788
  19. A. Flint, A. Dick, and A. Van den Hengel, "Local 3D structure recognition in range images," IET Computer Vision, vol. 2, no. 4, pp. 208-217, 2008. https://doi.org/10.1049/iet-cvi:20080037
  20. H. Sundar, D. Silver, N. Gagvani, and S. Dickinson, "Skeleton based shape matching and retrieval," in Proceedings of 2003 Shape Modeling International, Seoul, South Korea, 2003, pp. 130-139.
  21. M. Hilaga, Y. Shinagawa, T. Kohmura, and T. L. Kunii, "Topology matching for fully automatic similarity estimation of 3D shapes," in Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques, Los Angeles, CA, 2001, pp. 203-212.
  22. B. Leng, C. Du, S. Guo, X. Zhang, and Z. Xiong, "A powerful 3D model classification mechanism based on fusing multi-graph," Neurocomputing, vol. 168, pp. 761-769, 2015. https://doi.org/10.1016/j.neucom.2015.05.048
  23. A. Kacem, W. Mohamed, and A. B. Hamza, "Spectral geometric descriptor for deformable 3D shape matching and retrieval," in Image Analysis and Recognition. Heidelberg: Springer, 2013, pp. 181-188.
  24. W. Mohamed and A. B. Hamza, "Deformable 3D shape retrieval using a spectral geometric descriptor," Applied Intelligence, vol. 45, no. 2, pp. 213-229, 2016. https://doi.org/10.1007/s10489-015-0746-y
  25. P. Daras and A. Axenopoulos, "A compact multi-view descriptor for 3D object retrieval," in Proceedings of 2009 Seventh International Workshop on Content-Based Multimedia Indexing, Chania, Greece, 2009, pp. 115-119.
  26. H. Su, S. Maji, E. Kalogerakis, and E. Learned-Miller, "Multi-view convolutional neural networks for 3D shape recognition," in Proceedings of the IEEE International Conference on Computer Vision, Santiago, Chile, 2015, pp. 945-953.
  27. K. Ding and Y. Liu, "A probabilistic 3D model retrieval system using sphere image," in Computer Vision - ACCV 2012. Heidelberg: Springer, 2012, pp. 536-547.
  28. S. Zhao, H. Yao, Y. Zhang, Y. Wang, and S. Liu, "View-based 3D object retrieval via multi-modal graph learning," Signal Processing, vol. 112, pp. 110-118, 2015. https://doi.org/10.1016/j.sigpro.2014.09.038
  29. D. V. Vranic and D. Saupe, "3D model retrieval," in Proceedings of the Spring Conference on Computer Graphics and its Applications (SCCG2000), Budmerice, Slovakia, 2000, pp. 89-93.
  30. D. V. Vranic, D. Saupe, and J. Richter, "Tools for 3D-object retrieval: Karhunen-Loeve transform and spherical harmonics," in Proceedings of 2001 IEEE 4th Workshop on Multimedia Signal Processing (Cat. No. 01TH8564), Cannes, France, 2001, pp. 293-298.
  31. P. Papadakis, I. Pratikakis, S. Perantonis, and T. Theoharis, "Efficient 3D shape matching and retrieval using a concrete radialized spherical projection representation," Pattern Recognition, vol. 40, no. 9, pp. 2437-2452, 2007. https://doi.org/10.1016/j.patcog.2006.12.026
  32. J. Mutch and D. G. Lowe, "Object class recognition and localization using sparse features with limited receptive fields," International Journal of Computer Vision, vol. 80, no. 1, pp. 45-57, 2008. https://doi.org/10.1007/s11263-007-0118-0
  33. M. Aharon, M. Elad, and A. Bruckstein, "K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation," IEEE Transactions on Signal Processing, vol. 54, no. 11, pp. 4311-4322, 2006. https://doi.org/10.1109/TSP.2006.881199
  34. H. Lee, A. Battle, R. Raina, and A. Y. Ng, "Efficient sparse coding algorithms," Advances in Neural Information Processing Systems, vol. 20, pp. 801-808, 2007.
  35. P. H. Chen, C. J. Lin, and B. Scholkopf, "A tutorial on ${\nu}$-support vector machines," Applied Stochastic Models in Business and Industry, vol. 21, no. 2, pp. 111-136, 2005. https://doi.org/10.1002/asmb.537
  36. K. Siddiqi, J. Zhang, D. Macrini, A. Shokoufandeh, S. Bouix, and S. Dickinson, "Retrieving articulated 3-D models using medial surfaces," Machine Vision and Applications, vol, 19, no. 4, pp. 261-275, 2008. https://doi.org/10.1007/s00138-007-0097-8
  37. P. Shilane, P. Min, M. Kazhdan, and T, Funkhouser, "The princeton shape benchmark," in Proceedings Shape Modeling Applications, Genova, Italy, 2004, pp. 167-178.
  38. L. Hamid and M. Nakajima, "Supervised learning of salient 2D views of 3D models," The Journal of the Society for Art and Science, vol. 7, no. 4, pp. 124-131, 2008. https://doi.org/10.3756/artsci.7.124