The Binarization of Text Regions in Natural Scene Images, based on Stroke Width Estimation

자연 영상에서 획 너비 추정 기반 텍스트 영역 이진화

  • ;
  • 김정환 (전남대학교 전자컴퓨터공학부) ;
  • 이귀상 (전남대학교 전자컴퓨터공학부)
  • Received : 2012.08.07
  • Accepted : 2012.12.13
  • Published : 2012.12.31

Abstract

In this paper, a novel text binarization is presented that can deal with some complex conditions, such as shadows, non-uniform illumination due to highlight or object projection, and messy backgrounds. To locate the target text region, a focus line is assumed to pass through a text region. Next, connected component analysis and stroke width estimation based on location information of the focus line is used to locate the bounding box of the text region, and each box of connected components. A series of classifications are applied to identify whether each CC(Connected component) is text or non-text. Also, a modified K-means clustering method based on an HCL color space is applied to reduce the color dimension. A text binarization procedure based on location of text component and seed color pixel is then used to generate the final result.

Keywords