• Title/Summary/Keyword: 문서영상

Search Result 381, Processing Time 0.033 seconds

Automatic Title Detection by Spatial Feature and Projection Profile for Document Images (공간 정보와 투영 프로파일을 이용한 문서 영상에서의 타이틀 영역 추출)

  • Park, Hyo-Jin;Kim, Bo-Ram;Kim, Wook-Hyun
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.11 no.3
    • /
    • pp.209-214
    • /
    • 2010
  • This paper proposes an algorithm of segmentation and title detection for document image. The automated title detection method that we have developed is composed of two phases, segmentation and title area detection. In the first phase, we extract and segment the document image. To perform this operation, the binary map is segmented by combination of morphological operation and CCA(connected component algorithm). The first phase provides segmented regions that would be detected as title area for the second stage. Candidate title areas are detected using geometric information, then we can extract the title region that is performed by removing non-title regions. After classification step that removes non-text regions, projection is performed to detect a title region. From the fact that usually the largest font is used for the title in the document, horizontal projection is performed within text areas. In this paper, we proposed a method of segmentation and title detection for various forms of document images using geometric features and projection profile analysis. The proposed system is expected to have various applications, such as document title recognition, multimedia data searching, real-time image processing and so on.

Speed-up of Document Image Binarization Method Based on Water Flow Model (Water flow model에 기반한 문서영상 이진화 방법의 속도 개선)

  • 오현화;김도훈;이재용;김두식;임길택;진성일
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.41 no.4
    • /
    • pp.75-86
    • /
    • 2004
  • This paper proposes a method to speed up the document image binarization using a water flow model. The proposed method extracts the region of interest (ROI) around characters from a document image and restricts pouring water onto a 3-dimensional terrain surface of an image only within the ROI. The amount of water to be filed into a local valley is determined automatically depending on its depth and slope. The proposed method accumulates weighted water not only on the locally lowest position but also on its neighbors. Therefore, a valley is filed enough with only one try of pouring water onto the terrain surface of the ROI. Finally, the depth of each pond is adaptively thresholded for robust character segmentation, because the depth of a pond formed at a valley varies widely according to the gray-level difference between characters and backgrounds. In our experiments on real document images, the Proposed method has attained good binarization performance as well as remarkably reduced processing time compared with that of the existing method based on a water flow model.

Text extraction from camera based document image (카메라 기반 문서영상에서의 문자 추출)

  • 박희주;김진호
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.8 no.2
    • /
    • pp.14-20
    • /
    • 2003
  • This paper presents a text extraction method of camera based document image. It is more difficult to recognize camera based document image in comparison with scanner based image because of segmentation problem due to variable lighting condition and versatile fonts. Both document binarization and character extraction are important processes to recognize camera based document image. After converting color image into grey level image, gray level normalization is used to extract character region independent of lighting condition and background image. Local adaptive binarization method is then used to extract character from the background after the removal of noise. In this character extraction step, the information of the horizontal and vertical projection and the connected components is used to extract character line, word region and character region. To evaluate the proposed method, we have experimented with documents mixed Hangul, English, symbols and digits of the ETRI database. An encouraging binarization and character extraction results have been obtained.

  • PDF

Study on Measuring Geometrical Modification of Document Image in Scanning Process (스캐닝 과정에서 발생하는 전자문서의 기하학적 변형감지에 관한 연구)

  • Oh, Dong-Yeol;Oh, Hae-Seok;Rhew, Sung-Yul
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.10 no.8
    • /
    • pp.1869-1876
    • /
    • 2009
  • Scanner which is a kind of optical devices is used to convert paper documents into document image files. The assessment of scanned document image is performed to check if there are any modification on document image files in scanning process. In assessment of scanned documents, user checks the degree of skew, noise, folded state and etc This paper proposed to how to measure geometrical modifications of document image in scanning process. In this study, we check the degree of modification in document image file by image processing and we compare the evaluation value which means the degree of modification in each items with OCR success ratio in a document image file. To analyse the correlation between OCR success ratio and the evaluation value which means the degree of modification in each items, we apply Pearson Correlation Coefficient and calculate weight value for each items to score total evaluation value of image modification degrees on a image file. The document image which has high rating score by proposed method also has high OCR success ratio.

Segmentation and Contents Classification of Document Images Using Local Entropy and Texture-based PCA Algorithm (지역적 엔트로피와 텍스처의 주성분 분석을 이용한 문서영상의 분할 및 구성요소 분류)

  • Kim, Bo-Ram;Oh, Jun-Taek;Kim, Wook-Hyun
    • The KIPS Transactions:PartB
    • /
    • v.16B no.5
    • /
    • pp.377-384
    • /
    • 2009
  • A new algorithm in order to classify various contents in the image documents, such as text, figure, graph, table, etc. is proposed in this paper by classifying contents using texture-based PCA, and by segmenting document images using local entropy-based histogram. Local entropy and histogram made the binarization of image document not only robust to various transformation and noise, but also easy and less time-consuming. And texture-based PCA algorithm for each segmented region was taken notice of each content in the image documents having different texture information. Through this, it was not necessary to establish any pre-defined structural information, and advantages were found from the fact of fast and efficient classification. The result demonstrated that the proposed method had shown better performances of segmentation and classification for various images, and is also found superior to previous methods by its efficiency.

Adaptive Binarization for Camera-based Document Recognition (카메라 기반 문서 인식을 위한 적응적 이진화)

  • Kim, In-Jung
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.12 no.3
    • /
    • pp.132-140
    • /
    • 2007
  • The quality of the camera image is worse than that of the scanner image because of lighting variation and inaccurate focus. This paper proposes a binarization method for camera-based document recognition, which is tolerant to low-quality camera images. Based on an existing method reported to be effective in previous evaluations, we enhanced the adaptability to the image with a low contrast due to low intensity and inaccurate focus. Furthermore, applying an additional small-size window in the binarization process, it is effective to extract the fine detail of character structure, which is often degraded by conventional methods. In experiments, we applied the proposed method as well as other methods to a document recognizer and compared the performance for many cm images. The result showed the proposed method is effective for recognition of document images captured by the camera.

  • PDF

A Method for Thresholding and Correction of Skew in Camera Document Images (카메라 문서 영상의 이진화 및 기울어짐 보정 방법)

  • Jang Dae-Geun;Chun Byung-Tae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.10 no.3 s.35
    • /
    • pp.143-150
    • /
    • 2005
  • Camera image is very sensitive to illumination that result in difficulties for recognizing character. Also Camera captured document images have not only skew but also vignetting effect and geometric distortion. Vignetting effect make it difficult to separate characters from the document images. Geometric distortion, occurred by the mismatch of angle and center position between the document image and the camera, make the shape of characters to be distorted, so that the character recognition is more difficult than the case of using scanner. In this paper, we propose a method that can increase the performance of character recognition by correcting the geometric distortion of document images using a linear approximation which changes the quadrilateral region to the rectangle one. The proposed method also determine the quadrilateral transform region automatically, using the alignment of character lines and the skewed angles of characters located in the edges of each character line. Proposed method, therefore, can correct the geometric distortion without getting positional information from camera.

  • PDF

Word Spotting Algorithms Using SIFT in Document Images (SIFT를 이용한 문서 영상에서의 단어 검색 알고리즘)

  • Lee, Duk-Ryong;Jeon, Hyo-Jong;Oh, Il-Seok
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2011.06a
    • /
    • pp.488-490
    • /
    • 2011
  • 본 논문에서는 문서 영상에서 글자 분할 및 인식이 필요 없는 단어 검색 알고리즘을 제안한다. 글자 분할을 하지 않고 검색하기 위해 영상 검색에 사용되는 SIFT특징을 이용하였다. 제안하는 알고리즘은 사용자가 입력한 질의어를 질의 영상으로 변환하고, 질의 영상에서 SIFT특징을 추출한다. 추출된 특징은 문서영상에서 추출한 특징과 매칭을 통해 매칭점 쌍을 생성한다. 생성된 매칭점 쌍들을 군집화 조건에 따라 군집화 한다. 군집화는 질의 영상과 지리적 분포가 유사하게 군집화 되도록 설계되었다. 생성된 군집은 군집에 포함된 특징점의 개수가 많을수록 질의 영상과 유사하다. 따라서 N개 이상의 원소를 가지는 군집을 결과로 출력한다. 실험한 결과 제안하는 알고리즘의 가능성을 확인할 수 있었다.

Understanding Documents With Chemical Structures Using Image Segmentation (영상 분할을 활용한 화학 구조 문서 이해)

  • Yang, Haeyoon;Cho, Nam Ik
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2022.06a
    • /
    • pp.1297-1300
    • /
    • 2022
  • Document layout analysis는 문서 이미지의 구조와 구성요소를 파악하는 기술이다. 기존 딥러닝을 사용한 학습 기반 방법에는 각 구성 요소를 검출하는 detection 기반 방식이 많으나 이는 다양한 형식의 문서 이미지에 확장될 수 있는 가능성이 낮다는 한계가 존재한다. 특히, 다양한 모양과 크기의 화학 구조를 포함하는 화학 문서 이미지에 적용하기 어렵다. 본 논문에서는 영상분할을 활용하여 화학 구조 문서를 이해하는 연구를 진행하였다. 기존의 블록 단위로 레이블링된 벤치마크와 다르게 객체 단위로 레이블링한 학습 데이터를 가지고 DeepLabv3 구조의 네트워크를 학습하여 화학 문서 이미지를 효과적으로 분할하였다. 객체 단위 레이블링과 영상 분할을 사용한 방식이 문서 이해 및 화학 구조 검출에 준수한 성능을 보이는 것을 확인하였고 이 방식이 다양한 형식의 문서 이미지에 확장될 수 있음을 보였다.

  • PDF

A Design of Image Information Retrieval System based on XML Database (XML 데이터베이스 기반의 영상정보 검색시스템 설계)

  • Kwak Kil-Sin;Joo Kyung-Soo
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.11b
    • /
    • pp.139-141
    • /
    • 2005
  • 최근 인터넷의 발달에 따라 XML 문서의 사용과 각종 영상정보의 양이 크게 증가되었다. 이에 따라 XML 문서를 관리하기 위한 XML 데이터베이스의 필요성과 메타데이터 표준화에 대한 중요성이 증가되고 있다. XML 데이터베이스는 XML 문서의 특성을 고려하여 그 특성을 효율적으로 지원할 수 있다. 또한 국내에서는 교육정보분야 메타데이터 표준인 KEM 2.0이 제정 되었고 국외에서는 멀티미디어 데이터에 대한 표준으로 MPEG-7이 제정이 되었다. 이에 따라 본 논문에서는 MPEG-7을 기반으로 KEM 2.0을 이용한 영상정보 XML 스키마를 생성하고 이를 이용한 영상정보 검색시스템을 XML 데이터베이스 기반으로 설계하고자 한다. 본 논문에서 설계하는 XML 데이터베이스 기반의 영상정보 검색시스템은 XML 문서에 대한 빠른 저장과 검색이 가능할 것이다. 또한 검색 기능에 있어서는 키워드 기반의 의미기반 검색과 유사 이미지를 통한 내용기반 검색, 그리고 이를 내용기반과 의미기반을 통합한 검색 기능을 제공할 것이며 XML 문서에 대한 강력한 질의 수단인 XQuery 질의를 포함하게 될 것이다.

  • PDF