• Title/Summary/Keyword: image binarization

Search Result 283, Processing Time 0.022 seconds

An Adaptive Binarization of Camera Document Image by Image Quality Estimation (화질 분석을 통한 카메라 문서 영상의 적응적 이진화)

  • Kim, In-Jung
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.9
    • /
    • pp.797-803
    • /
    • 2007
  • Adaptive binarization is very important for the camera-based document recognition. This paper proposes a binarization method which can effectively adapt to the variation of image Qualify. Firstly, it analyzes the effect of binarization parameters to the result and proposes a method to measure the image quality. Then, it statistically analyzes the relationship between the image quality and the binarization parameter. Finally, it proposes a binarization method that automatically adapts to the quality of the input image, using the analysis result. The experiment results show that there is a meaningful relationship between the image quality and the binarization parameter, and therefore, the proposed method can effectively adapt to the variation of image quality.

Document Image Binarization Technique using MSER (MSER을 이용한 문서 이미지 이진화 기법)

  • Yu, Young-Jung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.18 no.8
    • /
    • pp.1941-1947
    • /
    • 2014
  • Document image binarization is largely used as previous stage of document recognition. And the result of document recognition is much affected from the result of document image binarization. There were many studies to binarize document images. The results of previous studies for document image binarization is varied according to the state of document images. In this paper, we propose a technique for document image binarization using MSER that is applied to extract objects from an image. At first, raw MSER objects are extracted from a document image. Because the raw MSER objects cannot be used for document image binarization, the extracted raw MSER objects are modified. Then the final MSER objects are used for document image binarization with the contrast image that is extracted from the document image. Experimental results show that the proposed technique is useful for document image binarization.

Document Image Binarization by GAN with Unpaired Data Training

  • Dang, Quang-Vinh;Lee, Guee-Sang
    • International Journal of Contents
    • /
    • v.16 no.2
    • /
    • pp.8-18
    • /
    • 2020
  • Data is critical in deep learning but the scarcity of data often occurs in research, especially in the preparation of the paired training data. In this paper, document image binarization with unpaired data is studied by introducing adversarial learning, excluding the need for supervised or labeled datasets. However, the simple extension of the previous unpaired training to binarization inevitably leads to poor performance compared to paired data training. Thus, a new deep learning approach is proposed by introducing a multi-diversity of higher quality generated images. In this paper, a two-stage model is proposed that comprises the generative adversarial network (GAN) followed by the U-net network. In the first stage, the GAN uses the unpaired image data to create paired image data. With the second stage, the generated paired image data are passed through the U-net network for binarization. Thus, the trained U-net becomes the binarization model during the testing. The proposed model has been evaluated over the publicly available DIBCO dataset and it outperforms other techniques on unpaired training data. The paper shows the potential of using unpaired data for binarization, for the first time in the literature, which can be further improved to replace paired data training for binarization in the future.

Dynamic Adaptive Binarization Method Using Fuzzy Trapezoidal Type and Image Stepwise Segmentation (퍼지의 사다리꼴 타입과 영상 단계적 분할을 이용한 동적 적응적 이진화 방법)

  • Lee, Ho Chang
    • Journal of Korea Multimedia Society
    • /
    • v.25 no.5
    • /
    • pp.670-675
    • /
    • 2022
  • This study proposes an improved binarization method to improve image recognition rate. The research goal is to minimize the information loss that occurs during the binarization process, and to transform the object of the original image that cannot be determined through the transformation process into an image that can be judged. The proposed method uses a stepwise segmentation method of an image and divides blocks using prime numbers. Also, within one block, a trapezoidal type of fuzzy is applied. The fuzzy trapezoid is binarized by dividing the brightness histogram area into three parts according to the degree of membership. As a result of the experiment, information loss was minimized in general images. In addition, it was found that the converted binarized image expressed the object better than the original image in the special image in which the brightness region was tilted to one side.

Best Combination of Binarization Methods for License Plate Character Segmentation

  • Yoon, Youngwoo;Ban, Kyu-Dae;Yoon, Hosub;Lee, Jaeyeon;Kim, Jaehong
    • ETRI Journal
    • /
    • v.35 no.3
    • /
    • pp.491-500
    • /
    • 2013
  • A connected component analysis from a binary image is a popular character segmentation method but occasionally fails to segment the characters owing to image noise and uneven illumination. A multimethod binarization scheme that incorporates two or more binary images is a novel solution, but selection of binarization methods has never been analyzed before. This paper reveals the best combination of binarization methods and parameters and presents an in-depth analysis of the multimethod binarization scheme for better character segmentation. We carry out an extensive quantitative evaluation, which shows a significant improvement over conventional single-method binarization methods. Experiment results of six binarization methods and their combinations with different test images are presented.

Adaptive Application of Modified Niblack Algorithm for Letter Image Binarization (우편 영상 이진화를 위한 수정된 Niblack 알고리듬의 적응적 적용)

  • 이재용;오현화;김두식;진성일
    • Proceedings of the IEEK Conference
    • /
    • 2003.07e
    • /
    • pp.2076-2079
    • /
    • 2003
  • This paper describes an efficient thresholding method for the binarization of a grey-level letter image. This method determines the adaptive threshold for letter image binarization by introducing the readjusting parameter, based on the global variance of the input image. Experimental results show that the proposed binarization method outperforms on the various letter images with a texture or noise when compared to the other methods.

  • PDF

DP-LinkNet: A convolutional network for historical document image binarization

  • Xiong, Wei;Jia, Xiuhong;Yang, Dichun;Ai, Meihui;Li, Lirong;Wang, Song
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.5
    • /
    • pp.1778-1797
    • /
    • 2021
  • Document image binarization is an important pre-processing step in document analysis and archiving. The state-of-the-art models for document image binarization are variants of encoder-decoder architectures, such as FCN (fully convolutional network) and U-Net. Despite their success, they still suffer from three limitations: (1) reduced feature map resolution due to consecutive strided pooling or convolutions, (2) multiple scales of target objects, and (3) reduced localization accuracy due to the built-in invariance of deep convolutional neural networks (DCNNs). To overcome these three challenges, we propose an improved semantic segmentation model, referred to as DP-LinkNet, which adopts the D-LinkNet architecture as its backbone, with the proposed hybrid dilated convolution (HDC) and spatial pyramid pooling (SPP) modules between the encoder and the decoder. Extensive experiments are conducted on recent document image binarization competition (DIBCO) and handwritten document image binarization competition (H-DIBCO) benchmark datasets. Results show that our proposed DP-LinkNet outperforms other state-of-the-art techniques by a large margin. Our implementation and the pre-trained models are available at https://github.com/beargolden/DP-LinkNet.

Vehicle License Plate Recognition System Using Image Binarization and Template Matching (영상 이진화와 템플릿 매칭을 이용한 자동차 번호판 인식 시스템)

  • Oh, Soojin;Park, Chun-Su
    • Journal of the Semiconductor & Display Technology
    • /
    • v.13 no.2
    • /
    • pp.7-12
    • /
    • 2014
  • A vehicle license plate includes the most important information for recognition and classification of the vehicle. In this paper, we propose a vehicle license plate recognition system using image binarization and template matching. In the proposed system, an image of the vehicle license plate is converted into a gray scale image and the gray image undergoes the binarization process. Finally, the numbers on the plate are extracted from the binary image using the template matching algorithm.

An α-cut Automatic Set based on Fuzzy Binarization Using Fuzzy Logic (퍼지논리를 이용한 α-cut 자동 설정 기반 퍼지 이진화)

  • Lee, Ho Chang;Kim, Kwang Baek;Park, Hyun Jun;Cha, Eui-Young
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.19 no.12
    • /
    • pp.2924-2932
    • /
    • 2015
  • Image binarization is a process to divide the image into objects and backgrounds, widely applied to the fields of image analysis and its recognition. In the existing method of binarization, there is some uncertainty when there is insufficient brightness gap between objects and backgrounds in setting threshold. The method of fuzzy binarization has improved the features of objects efficiently. However, since this method sets ${\alpha}$-cut value statically, there remain some problems that important features of objects can be lost during binarization. Therefore, in this paper, we propose a binarization method which does not set ${\alpha}$-cut value statically. The proposed method uses fuzzy membership functions calculated by thresholds of mean, iterative, and Otsu binarization. Experiment results show the proposed method binaries various images with less loss than the existing methods.

Enhanced Fuzzy Binarization Method for Car License Plate Binarization (자동차번호판 이진화를 위한 개선된 퍼지 이진화 방법)

  • Cho, Jae-Hyun
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.6 no.2
    • /
    • pp.231-236
    • /
    • 2011
  • The binarization algorithm frequently applies to one part of the preprocessing phase for a variety of image processing techniques such as image recognition and image analysis, etc. So it is important that binarization algorithm is determined by the selection of threshold value for binarization in image processing. The previous algorithms could get the proper threshold value in the case that shows all the difference of brightness between background and object, but if not, they could not get the proper threshold value. In this paper, we propose the efficient fuzzy binarization method which first, segments the brightness range of gray_scale images to 2 intervals to perform car license plate binarization and applies fuzzy member function to each intervals. The experiment for performance evaluation of the proposed binarization algorithm showed that the proposed algorithm generates the more effective threshold value than the previous algorithms in car license plate.