DOI QR코드

DOI QR Code

Comic Image Normalization using the gradient Radon Transform based on OpenCL implementation

OpenCL 기반의 그래디언트 라돈변환을 이용한 만화영상의 정규화

  • Received : 2011.02.21
  • Accepted : 2011.05.25
  • Published : 2011.08.31

Abstract

Digital comic images are one of popular contents on the Internet. Usually, they are scanned from comic books by digital scanners. Without post-processing, they may have different sizes, skews and margins other than contents at the boundary. To normalize the size of their contents without the skews and margins is an important step in comic image analysis and application such as content-based comic image retrieval system. In this paper, we propose a method to detect a box frame in comic images by extracting of line segments using the gradient Radon transform. The box frame in comic images is the maximum rectangle which consists of contents without margins. We use the detected box frame to normalize the size of comic images and to make them no skew. In addition, the proposed method is implemented by OpenCL to speed up the detection of the line segments. Experimental results show that our proposed method effectively detects the box frame in comic images.

디지털 만화영상은 인터넷에서 매우 인기 있는 컨텐츠이다. 일반적으로 디지털 만화영상은 디지털 스캐너에 의해 스캔되며, 후처리를 하지 않으면 서로 다른 크기와 기울어짐을 가질 수 있으며, 경계부분에 내용이외의 여백이 있을 수 있다. 기울어짐과 여백이 없이 영상의 내용의 크기를 정규화하는 것은 내용기반 만화영상 검색과 같은 응용에서 매우 중요한 단계이다. 본 논문에서는 그래디언트 라돈변환을 사용하여 검출한 선분을 이용하여 만화영상의 박스 프레임을 검출하는 방법을 제안한다. 만화영상에서 박스프레임은 여백이 없는 만화영상 내용으로 이루어진 최대 사각영역이다. 만화영상의 크기를 정규화하고, 기울어짐을 없애기 위하여 박스 프레임을 사용하고, 선분 검출 속도를 높이기 위하여 OpenCL로 구현하였다. 제안 방법이 만화영상에서 효과적으로 박스 프레임을 검출함을 실험으로 보였다.

Keywords

References

  1. Yamada et al., "Mamoru Endoo, and Shinya Miyazaki. Comic image decomposition for reading comics on cellular phones," IEICE transactions on information and systems, E87-D(6), pp.1370-1376, 2004.
  2. Kenji et al., Juichi Miyamichi, "Layout Analysis of Tree-Structured Scene Frames in Comic Images," International Joint Conference on Artificial Intelligence, pp.2885-2890, 2007.
  3. 한은정, 장창혁, 정기철, " 다층 신경망을 이용한 모바일 자동변환 시스템," 멀티미디어 학회 논문지, 제 12권 2호, pp.272-280, 2008.
  4. Yasuto Ishitani, "Document layout analysis by interaction between data-driven processing and concept-driven processing", Journal of Information Processing Society of Japan, Vol.42, No.11, pp.2711-2723, 2001.
  5. 김동근, 양승범, 황치정,"그래디언트 라돈변환을 이용한 만화영상의 외곽 경계사각형 검출," 멀티미디어 학회 논문지, 제 14권 4호, pp.538-545, 2011.
  6. Duda, R. O. and P. E. Hart, "Use of the Hough Transformation to detect Lines and Curves in Pictures," Communications of the ACM, Vol.15, pp.11-15, 1972. https://doi.org/10.1145/361237.361242
  7. D. Ballard and C. Brown Computer Vision, Prentice-Hall, 1982.
  8. William H. Press, "Discrete Radon transform has an exact, fast inverse and generalizes to operations other than sums along lines," Proceedings of the National Academy of Sciences(PNAS), Vol.103, No.51, pp.19249-19254, 2006. https://doi.org/10.1073/pnas.0609228103
  9. Peyrin.F, Goutte.R, "Image invariant via the Radon Transform" Image Processing and its Applications, pp.458-461, 1992.
  10. T. S. Durrani and D. Bisset, "The Radon transform and its properties", GEOPHYSICS, Vol.49, No.8, pp.1180-1187, 1984. https://doi.org/10.1190/1.1441747
  11. A.S.M Shihavuddin et. al, "Road Boundary Detection by a Remote Vehicle Using Radon Transform for Path Map Generation of an Unknown Area", International Journal of Computer Science and Network Security(IJCSNS), Vol.8 No.8, pp.64-69, 2008.
  12. Si-Yu Guo et al, "Probabilistic Hough transform for line detection utilizing surround suppression," International Conference on Machine Learning and Cybernetics, pp.2993-2998, 2008.
  13. N. Kiryati, Y. Eldar, A. M. Bruckstein, "A probabilistic Hough transform", Pattern Recognition, Vol.24, No.4, pp.303-316, 1991. https://doi.org/10.1016/0031-3203(91)90073-E
  14. L. Xu, E. Oja, and P. Kultanan, "A new curve detection method: Randomized Hough transform (RHT)", Pattern Recognition Letter, pp.331-338, 1990.
  15. E. Montseny et al, "Edge Orientation-based Fuzzy Hough Transform (EOFHT)," European Society for Fuzzy Logic and Technology(EUSFLAT) Conference Proceedings, 2003.
  16. 김동근, OpenCV programming, 가메출판사, 2010.
  17. OpenCV Reference Manual, http://sourceforge.net/projects/opencvlibrary/
  18. Intel OpenCL SDK, http://software.intel.com/en-us/articles/intel-opencl-sdk/
  19. http://www.khronos.org/opencl/