• Title/Summary/Keyword: document image processing

TheReviser : A Gesture-based Editing System on a Digital Desk (TheReviser : 가상 데스크 상의 제스처 기반 문서 교정 시스템)

  • Jung, Ki-Chul;Kang, Hyun
    • The KIPS Transactions:PartB
    • /
    • v.11B no.4
    • /
    • pp.527-536
    • /
    • 2004
  • TheReviser is a digital document revision application on a projection display, which allows us to interact with a digital document using the same gestures used for revising paper documents. To enable these interactions, TheReviser should detect foreground objects such as hands or pens on the projection display, and should spot and recognize gesture commands from the continuous movements of a user. To detect foreground objects against a complex background under various lighting conditions, we perform geometry and color calibration between a captured image and a frame buffer image. TheReviser uses an HMM-based gesture recognition method. Experimental results show that the proposed application recognizes users' gestures with an average accuracy of 93.22% on test gesture sequences.

Feature based Text Watermarking for Binary Document Image (이진 문서 영상을 위한 특징 기반 텍스트 워터마킹)

  • Choo Hyon-Gon;Kim Whoi-yul
    • The KIPS Transactions:PartB
    • /
    • v.12B no.2 s.98
    • /
    • pp.151-156
    • /
    • 2005
  • In this paper, we propose feature-based character watermarking methods based on geometrical features specific to the characters of text in a document image. The proposed methods satisfy both data capacity and robustness simultaneously, which none of the conventional methods can. Depending on the characteristics of the characters, a watermark can be embedded or detected through changes in the connectivity of the characters, differences in the characteristics of edge pixels, or changes in the area of holes. Experimental results show that our identification techniques are very robust to distortion and have high data capacity.

Adaptive Binarization using Integral Image (적분영상을 이용한 적응적 이진화)

  • Lee, Yeon-Kyung;Yoo, Hoon
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference
    • /
    • 2012.10a
    • /
    • pp.109-110
    • /
    • 2012
  • In this paper, we propose an adaptive thresholding method to binarize two-dimensional barcode images. Existing adaptive thresholding methods are typically applied to document image binarization and are therefore inappropriate for recognizing two-dimensional barcode images. To overcome this problem, we propose a new adaptive thresholding method using the integral image. To show the effectiveness of our method, we compared it with a well-known existing method in terms of visual quality and processing time. The experimental results indicate that the proposed method is superior to the existing method.
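
The abstract does not spell out the proposed algorithm, so the following is only a generic sketch of adaptive thresholding with an integral image (summed-area table), not the authors' method. The function names, the `radius` window size, and the `ratio` factor are all assumptions made for illustration. The point it shows is that the sum over any local window costs four table lookups, regardless of window size.

```python
def integral_image(img):
    """Return a (h+1) x (w+1) summed-area table; row 0 and column 0 are zeros."""
    h, w = len(img), len(img[0])
    sat = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            sat[y + 1][x + 1] = sat[y][x + 1] + row_sum
    return sat


def window_sum(sat, y0, x0, y1, x1):
    """Sum of pixels in rows y0..y1-1 and columns x0..x1-1 (four lookups)."""
    return sat[y1][x1] - sat[y0][x1] - sat[y1][x0] + sat[y0][x0]


def adaptive_binarize(img, radius=1, ratio=0.85):
    """Mark a pixel 1 if it is brighter than ratio * (local mean), else 0.

    radius and ratio are illustrative parameters, not values from the paper.
    """
    h, w = len(img), len(img[0])
    sat = integral_image(img)
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        y0, y1 = max(0, y - radius), min(h, y + radius + 1)
        for x in range(w):
            x0, x1 = max(0, x - radius), min(w, x + radius + 1)
            count = (y1 - y0) * (x1 - x0)
            total = window_sum(sat, y0, x0, y1, x1)
            # Compare pixel against ratio * mean without dividing.
            out[y][x] = 1 if img[y][x] * count > total * ratio else 0
    return out


if __name__ == "__main__":
    image = [[200, 200, 200], [200, 50, 200], [200, 200, 200]]
    for row in adaptive_binarize(image):
        print(row)
```

Windows are clipped at the image border, so border pixels are compared against the mean of a smaller neighbourhood.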

XML Document Editing System for Structural Processing of the Digital Document to Including Mathematical Formula (수식을 포함한 전자문헌의 구조적 처리를 위한 XML 문서편집시스템)

  • 윤화묵;유범종;김창수;정회경
    • Journal of the Korean Society for Information Management
    • /
    • v.19 no.4
    • /
    • pp.96-111
    • /
    • 2002
  • Institutions and organizations hold large amounts of accumulated data, but most of it remains in formats standardized only within each institution or organization, which makes information difficult to exchange and share. To overcome this disadvantage, the concept of knowledge information resource management was introduced, and knowledge information resources are being digitized so that accumulated data can be shared and managed. In science and technology and in education and academia in particular, XML is increasingly adopted to structurally process the data needed to exchange and share knowledge information resources. However, the many mathematical expressions used in electronic documents in these fields are handled as unstructured data such as images or text, which limits search, indexing, and reuse. Interest has therefore converged on processing mathematical expressions with MathML, and a solution is needed that can process MathML easily and efficiently in structured documents. In this paper, we design and implement an XML document editing system that makes it easy to structurally process electronic documents for knowledge information resources and to create and render MathML in structured documents without expert knowledge of MathML.
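
To illustrate why structured MathML is easier to search and reuse than a formula stored as an image, here is a minimal sketch that builds the presentation MathML for x² with Python's standard `xml.etree.ElementTree`. It is unrelated to the editing system described in the abstract; the helper names `power`, `wrap_math`, and `identifiers` are hypothetical.

```python
import xml.etree.ElementTree as ET

MATHML_NS = "http://www.w3.org/1998/Math/MathML"


def power(base, exponent):
    """Build <msup><mi>base</mi><mn>exponent</mn></msup>."""
    msup = ET.Element("msup")
    ET.SubElement(msup, "mi").text = base
    ET.SubElement(msup, "mn").text = str(exponent)
    return msup


def wrap_math(child):
    """Wrap an expression in a <math> root carrying the MathML namespace."""
    math = ET.Element("math", {"xmlns": MATHML_NS})
    math.append(child)
    return math


def identifiers(root):
    """List every identifier (<mi>) in the expression tree."""
    return [node.text for node in root.iter("mi")]


if __name__ == "__main__":
    tree = wrap_math(power("x", 2))
    print(ET.tostring(tree, encoding="unicode"))
    print(identifiers(tree))
```

Because the variable is an element of the tree rather than pixels, a query such as `identifiers` can find it directly, which is the kind of indexing an image-based formula does not allow.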

Keyword Spotting on Hangul Document Images Using Character Feature Models (문자 별 특징 모델을 이용한 한글 문서 영상에서 키워드 검색)

  • Park, Sang-Cheol;Kim, Soo-Hyung;Choi, Deok-Jai
    • The KIPS Transactions:PartB
    • /
    • v.12B no.5 s.101
    • /
    • pp.521-526
    • /
    • 2005
  • In this paper, we propose a keyword spotting system as an alternative search system for poor-quality Korean document images and compare the proposed system with an OCR-based document retrieval system. The system is composed of character segmentation, feature extraction for the query keyword, and word-to-word matching. In the character segmentation step, we propose an effective method to remove the connectivity between adjacent characters and a character segmentation method that minimizes the variance of character widths. In the query creation step, the feature vector for the query is constructed by combining character models for each typeface. In the matching step, word-to-word matching is applied based on character-to-character matching. We demonstrate that the proposed keyword spotting system is more efficient than the OCR-based one for searching keywords in Korean document images, especially when the quality of the documents is quite poor and the point size is small.

Character Segmentation on Printed Korean Document Images Using a Simplification of Projection Profiles (투영 프로파일의 간략화 방법을 이용한 인쇄체 한글 문서 영상에서의 문자 분할)

  • Park Sang-Cheol;Kim Soo-Hyung
    • The KIPS Transactions:PartB
    • /
    • v.13B no.2 s.105
    • /
    • pp.89-96
    • /
    • 2006
  • In this paper, we propose two approaches for character segmentation on Korean document images. One is an improved version of a projection profile-based algorithm. It involves estimating the number of characters, obtaining the split points, searching for each character's boundary, and selecting the best segmentation result. The other is developed for low-quality document images in which adjacent characters are connected. In this case, parts of the projection profile are cut to resolve the connection between the characters. This is called an $\alpha$-cut. Afterwards, the revised version of the former segmentation procedure is conducted. The two approaches have been tested with 43,572 low-quality Korean word images printed in various font styles. The segmentation accuracies of the former and the latter are 91.81% and 99.57%, respectively. This result shows that the proposed algorithm using an $\alpha$-cut is effective for low-quality Korean document images.
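
The abstract does not define the $\alpha$-cut precisely. The sketch below shows one plausible reading, stated as an assumption: subtract a constant `alpha` from the vertical projection profile and clamp at zero, so that thin bridges between touching characters become gaps. The function names and the toy image are hypothetical, and the paper's later steps (estimating the character count, selecting the best result) are not modelled.

```python
def projection_profile(img):
    """Vertical projection: number of ink pixels (1s) in each column."""
    return [sum(column) for column in zip(*img)]


def alpha_cut(profile, alpha):
    """Cut the bottom `alpha` units off the profile, clamping at zero."""
    return [max(0, value - alpha) for value in profile]


def segments(profile):
    """Return (start, end) column ranges, end exclusive, with nonzero profile."""
    runs, start = [], None
    for x, value in enumerate(profile):
        if value > 0 and start is None:
            start = x
        elif value == 0 and start is not None:
            runs.append((start, x))
            start = None
    if start is not None:
        runs.append((start, len(profile)))
    return runs


if __name__ == "__main__":
    # Two 2-column "characters" joined by a one-pixel bridge in the middle row.
    word = [
        [1, 1, 0, 1, 1],
        [1, 1, 1, 1, 1],
        [1, 1, 0, 1, 1],
    ]
    profile = projection_profile(word)
    print(segments(profile))                # bridge keeps the word in one piece
    print(segments(alpha_cut(profile, 1)))  # the cut separates the characters
```

In this toy case the plain profile never reaches zero, so no split point exists until the cut removes the bridge column.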

A Framework for Digitalizing Handwritten Document using Digital Pen and Handwriting Recognition Technology (디지털펜과 필기체인식 기술을 이용한 수기문서 전자화 프레임워크)

  • Son, Bong-Ki;Kim, Hak-Joon
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.12 no.3
    • /
    • pp.1417-1426
    • /
    • 2011
  • Business still relies heavily on pen and paper for legal reasons or convenience. Handwritten documents must be converted into digital documents so that IT systems can manage and process them in real time. Because previous document digitalization systems convert handwritten documents by scanning and post-processing them, it is difficult to carry out the work process seamlessly. This paper proposes LiveForm, a framework for digitalizing handwritten documents using digital pen and handwriting recognition technology. To demonstrate the applicability of LiveForm, we also implement a LiveForm-based service for an industrial gas distribution process and analyze the effects of the system. LiveForm generates the same digital image as the handwritten document by recording the absolute coordinates of the digital pen as the paper form is filled in, and converts the handwriting data to digital text to insert the information into the back-end system. The LiveForm-based system eliminates both scanning for document digitalization and keyboard data entry into the back-end system in paper-based information gathering. LiveForm can therefore improve work processes in various business areas.

Document Image Segmentation by the Statistical Distribution Analysis of Wavelet Coefficients (웨이블릿 계수의 통계적 이산 분석을 이용한 문서 영상 분할)

  • Lee, In-Sue;Kim, Min-Soo;Kim, Woo-Sung;Hahn, Kwang-Rok
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2000.10b
    • /
    • pp.927-930
    • /
    • 2000
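  • In this paper, a document image is divided into regions using projection, and each region is classified as text or picture using two features based on the statistical variance and the histogram of the wavelet coefficients in the high-frequency bands. The regions obtained by projection are divided into blocks of a fixed size, and the blocks are classified as text or picture according to the two features. This reduces the chance of misclassifying regions produced by projection in which text and pictures are mixed and the meaning is ambiguous.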
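
As a rough illustration of the variance feature mentioned above, the sketch below applies a one-level 2D Haar transform to a block and computes the variance of its high-frequency coefficients. The abstract does not name the wavelet, so Haar is an assumption, as are the function names; the histogram feature and the text/picture decision rule are not covered.

```python
def haar_details(block):
    """One-level 2D Haar transform of a block with even dimensions.

    Returns the high-frequency coefficients (three per 2x2 cell) as a flat list.
    """
    details = []
    for y in range(0, len(block) - 1, 2):
        for x in range(0, len(block[0]) - 1, 2):
            a, b = block[y][x], block[y][x + 1]
            c, d = block[y + 1][x], block[y + 1][x + 1]
            details.append((a + b - c - d) / 2)  # top vs bottom row
            details.append((a - b + c - d) / 2)  # left vs right column
            details.append((a - b - c + d) / 2)  # diagonal difference
    return details


def variance(values):
    """Population variance of a list of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


if __name__ == "__main__":
    flat = [[128] * 4 for _ in range(4)]        # uniform block
    stripes = [[0, 255, 0, 255] for _ in range(4)]  # sharp vertical strokes
    print(variance(haar_details(flat)))
    print(variance(haar_details(stripes)))
```

A uniform block has no high-frequency energy, while the striped block does, so the two yield clearly different feature values.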

A Study of Development on technique for document & image unite (웹 기반 영상과 문서의 자동결합기법에 관한 연구)

  • Na, Jong-Won
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2006.11a
    • /
    • pp.187-190
    • /
    • 2006
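  • We combined text-oriented materials with video-conference materials and implemented a system that serves the integrated result over the web. To register multimedia conference information, the user runs the DAM executable on their own PC and selects the conference file to register through the Windows Explorer built into the program. The automatic XML parsing module then runs, extracts the core information needed for storage, and performs word indexing for search; on failure it shows an error message, and on success it presents the analyzed core information. Multimedia conference materials sent to the server are classified by type, time, and so on, and two kinds of search service are provided: browsing a tree structure in the style of Windows Explorer, and keyword search in which the user types words directly to find the desired conference information. Users can freely choose to view video plus document, document only, and so on, depending on their service environment. Because no accurate statistics on the economic effect of DAM have yet been published in Korea, we estimated it from an analysis of the similar WCM market. The more diverse and complex systems become, the higher the economic return on investment, so the market for synchronization technology is expected to grow.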

R&D of Intelligent Document Recognition Library for utilizing image data (이미지데이터 활용을 위한 지능형 인식 라이브러리 연구 개발)

  • Kwag, Hee Kue;Kim, Sung Hun;Lee, Jung Woo;Yoo, Ji Hun;Lee, Hyun Joo
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2009.11a
    • /
    • pp.329-330
    • /
    • 2009
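  • This study aims to advance the document recognition system that is essential for implementing a full-text search service, in order to make the image data held by public institutions more usable. The main research direction is to develop image analysis technology and a library, and to build a specialized knowledge base, through analysis of the data held by public institutions. In addition, with future extensibility in mind, a tool will be developed for managing the knowledge base on an ongoing basis. Development of a prototype system incorporating the intelligent recognition library is now complete, and a testbed for various performance evaluations is being built using the extensive holdings of the National Archives of Korea.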