Search | Korea Science

The Extraction of Table Lines and Data in Document Image (문서영상에서 표 구성 직선과 데이터 추출)

Jang, Dae-Geun;Kim, Eui-Jeong
- Journal of the Korea Institute of Information and Communication Engineering
- v.10 no.3
- pp.556-563
- 2006
To classify a table region in a document image and analyze its structure, the lines and data that make up the table must be extracted. Exact extraction is difficult, however, because lines are broken, their lengths change, or characters and noise merge with the table lines. These problems result from errors of the image input device or from image reduction. In this paper, we propose a method of extracting lines and data for table region classification and structure analysis that performs better than previous methods, including commercial software. The proposed method extracts the horizontal and vertical lines that make up the table using a one-dimensional median filter. In the process of extracting lines along the filtering direction, this filter not only eliminates noise attached to a line and lines orthogonal to the filtering direction, but also connects broken lines whose gap is shorter than the length of the filter tap. Furthermore, text attached to a line is separated in the process of extracting vertical lines.
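The filtering idea in this abstract can be illustrated with a minimal sketch. It assumes a binary image stored as lists of 0/1 pixels and a plain majority-style median with zero-padded borders; the function names and the tap lengths are hypothetical, not taken from the paper. In this plain form a gap is bridged only when it is shorter than half the window, so the exact tap-length behaviour described in the abstract may rely on details not given here.

```python
def median_filter_1d(row, taps):
    """Binary 1-D median filter over one row; `taps` is an odd window length."""
    half = taps // 2
    padded = [0] * half + list(row) + [0] * half  # zero-pad the borders
    # For 0/1 pixels the median is 1 exactly when more than half the window is 1.
    return [1 if sum(padded[i:i + taps]) > half else 0 for i in range(len(row))]


def extract_horizontal_lines(image, taps):
    """Filter every row: thin vertical strokes vanish, small gaps are bridged."""
    return [median_filter_1d(row, taps) for row in image]


def extract_vertical_lines(image, taps):
    """Filter every column by transposing, filtering rows, and transposing back."""
    columns = [list(col) for col in zip(*image)]
    filtered = [median_filter_1d(col, taps) for col in columns]
    return [list(row) for row in zip(*filtered)]


# A horizontal line with a one-pixel gap, crossed by a vertical line in column 3.
image = [
    [0, 0, 0, 1, 0, 0, 0, 0, 0, 0],
    [1, 1, 1, 1, 1, 0, 1, 1, 1, 1],
    [0, 0, 0, 1, 0, 0, 0, 0, 0, 0],
]
horizontal = extract_horizontal_lines(image, taps=5)
vertical = extract_vertical_lines(image, taps=3)
```

Here `horizontal` keeps only the middle row, with its gap closed, and `vertical` keeps only column 3.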

Forgery Detection Mechanism with Abnormal Structure Analysis on Office Open XML based MS-Word File

Lee, HanSeong;Lee, Hyung-Woo
- International journal of advanced smart convergence
- v.8 no.4
- pp.47-57
- 2019
We examine the weaknesses of the existing OOXML-based MS-Word file structure and analyze how data concealment and forgery are performed in MS-Word digital documents. When a document is forged by embedding hidden information, the file opens in the MS-Word processor with no visible difference. However, malware or shell code hidden in the document may cause the computer system to malfunction. If a malicious image file or ZIP file is hidden in the document by exploiting a structural vulnerability of the MS-Word format, the system may be infected by ransomware that encrypts all files on the disk even though the MS-Word file opens normally. It is therefore necessary to detect forgery and alteration of digital documents by analyzing the internal structure of the MS-Word file. In this paper, we design and implement a mechanism and automatic detection software that detect such forgery efficiently, and present a method for proactively responding to attacks such as ransomware that exploit MS-Word security vulnerabilities.
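As a rough illustration of what internal structure analysis of an OOXML file can look like, the sketch below treats the document as the ZIP package it is and lists parts whose names are not covered by `[Content_Types].xml`. This is an assumed, simplified check, not the detection mechanism of the paper; the function name `undeclared_parts` is hypothetical.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# Namespace of the [Content_Types].xml part in an Open Packaging Conventions package.
CT_NS = "{http://schemas.openxmlformats.org/package/2006/content-types}"


def undeclared_parts(docx_bytes):
    """Return package entries covered by neither a Default nor an Override entry."""
    with zipfile.ZipFile(io.BytesIO(docx_bytes)) as package:
        names = [n for n in package.namelist() if not n.endswith("/")]
        root = ET.fromstring(package.read("[Content_Types].xml"))
    extensions = {d.get("Extension", "").lower() for d in root.iter(CT_NS + "Default")}
    overrides = {o.get("PartName", "").lstrip("/") for o in root.iter(CT_NS + "Override")}
    suspicious = []
    for name in names:
        if name == "[Content_Types].xml" or name in overrides:
            continue
        extension = name.rsplit(".", 1)[-1].lower() if "." in name else ""
        if extension not in extensions:
            suspicious.append(name)  # e.g. an embedded ZIP the package never declares
    return suspicious
```

A part flagged this way is only a candidate for closer inspection; a real detector would also need to examine relationships and the content of declared parts.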
https://doi.org/10.7236/IJASC.2019.8.4.47

Mongolian Traditional Stamp Recognition using Scalable kNN

Gantuya, P.;Mungunshagai, B.;Suvdaa, B.
- International journal of advanced smart convergence
- v.4 no.2
- pp.170-176
- 2015
Stamps carry crucial traditional, historical, and cultural information for nations. In this paper, we aim to detect official stamps in scanned documents and to recognize traditional, historical Mongolian stamps. We performed the following steps. First, we detected official stamps in scanned documents based on red-color segmentation and the document standard. Then we collected 234 traditional stamp images in 6 classes and 100 official stamp images from scanned document images. We also implemented processing algorithms for noise removal, resizing, reshaping, and so on. Finally, we proposed a new scale-invariant classification algorithm based on kNN (k-nearest neighbor). In the experiments, the proposed method showed an adequate recognition rate.
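For readers unfamiliar with kNN, the classification step can be sketched as a plain majority vote among the nearest labelled samples. This is the textbook form only; the scale-invariant variant proposed in the paper is not described in the abstract, and the feature vectors here are assumed to come from stamps already resized to a common size. The name `knn_classify` and the toy data are hypothetical.

```python
import math
from collections import Counter


def knn_classify(query, samples, k=3):
    """Label `query` by majority vote among its k nearest samples.

    `samples` is a list of (feature_vector, label) pairs; distance is Euclidean.
    """
    nearest = sorted(samples, key=lambda sample: math.dist(query, sample[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Toy two-class data standing in for stamp feature vectors.
samples = [
    ((0.0, 0.0), "class_a"), ((0.0, 1.0), "class_a"), ((1.0, 0.0), "class_a"),
    ((5.0, 5.0), "class_b"), ((5.0, 6.0), "class_b"), ((6.0, 5.0), "class_b"),
]
label = knn_classify((0.2, 0.1), samples, k=3)
```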
https://doi.org/10.7236/IJASC.2015.4.2.170

Development of Intelligent OCR Technology to Utilize Document Image Data (문서 이미지 데이터 활용을 위한 지능형 OCR 기술 개발)

Kim, Sangjun;Yu, Donghui;Hwang, Soyoung;Kim, Minho
- Proceedings of the Korean Institute of Information and Communication Sciences Conference
- 2022.05a
- pp.212-215
- 2022
In today's era of so-called digital transformation, the need to build and utilize big data has increased in various fields. Today, much data is produced and stored in a manner suited to digital devices and media, but for a long time in the past the production and storage of data was dominated by printed books. Optical Character Recognition (OCR) technology is therefore needed to utilize the vast amount of printed books accumulated over time as big data. In this study, we propose a system for digitizing the structure and content of document objects inside scanned book images. The proposed system consists of three main steps: 1) recognition of area information for each document object (table, equation, picture, text body) in the scanned book image; 2) OCR processing of each area by the text body, table, and formula modules according to the recognized document object areas; 3) gathering of the processed document information and returning it in JSON format. The model proposed in this study uses an open-source project with additional training and improvements. The intelligent OCR system proposed in this study showed performance at the level of commercial OCR software in processing four types of document objects (table, equation, image, text body).

Baseline Searching Method for Document Skew Detection (문서 영상의 기울기 검출을 위한 기준선 탐색 기법)

Shin, Myoung-Jin;Kim, Do-Hyeon;Cha, Eui-Young
- Journal of Korea Multimedia Society
- v.10 no.2
- pp.218-225
- 2007
This paper presents a technique for detecting the document skew that often occurs during document scanning. Correcting a skewed document is essential for automatic processing systems, including character segmentation and character recognition. The proposed algorithm detects the skew angle accurately by searching, within a candidate area, for character baselines that carry the slant information of the document. To reduce processing time, we downsized the image and then established an ROI (region of interest) using morphology operations and connected component analysis. We compared our method with an existing method based on morphology operations and demonstrated the correctness and efficiency of the proposed algorithm through experiments and analysis on various kinds of document images.
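Once baseline points have been found, turning them into a skew angle can be sketched as a least-squares line fit. The abstract does not say how the baseline search or the angle estimate is done, so the fit below is an assumption for illustration, and `skew_angle_degrees` is a hypothetical name. Points are (x, y) pixel coordinates; with y growing downward, as in most image formats, the sign of the angle follows that convention.

```python
import math


def skew_angle_degrees(baseline_points):
    """Estimate the skew angle from (x, y) points lying on one text baseline."""
    n = len(baseline_points)
    mean_x = sum(x for x, _ in baseline_points) / n
    mean_y = sum(y for _, y in baseline_points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in baseline_points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in baseline_points)
    if sxx == 0:
        raise ValueError("need baseline points with at least two distinct x values")
    slope = sxy / sxx  # least-squares slope of y against x
    return math.degrees(math.atan(slope))


# Bottoms of characters along a line that rises 1 pixel for every 10 pixels.
points = [(x, 0.1 * x + 5.0) for x in range(0, 100, 10)]
angle = skew_angle_degrees(points)
```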

Simple Image Stenography Technology for Large Scale Text (대용량 텍스트를 위한 손실 없는 영상 은닉기술)

Rhee, Keun-Moo
- Proceedings of the Korea Information Processing Society Conference
- 2008.05a
- pp.1104-1107
- 2008
Image and document hiding techniques are generally being researched for all types of digital data, such as document images and audio, and are used for a variety of purposes. In this research, we implemented a simple hiding technique that can deliver a large amount of text requiring a low level of security. The text image was first encoded into a 24-bit-depth color image and then restored, and analysis of the result using a correlation technique confirmed that the loss ratio of the text image is slight.
https://doi.org/10.3745/PKIPS.y2008m05a.1104

A Image Alignment Algorithm for an OCR System and its Hardware Implementation (OCR 시스템을 위한 화상 정렬 알고리즘과 고속 하드웨어 구현)

최완수;최진호;정윤구;김수원
- Journal of the Korean Institute of Telematics and Electronics B
- v.30B no.8
- pp.33-40
- 1993
This paper presents hardware for image alignment based on a proposed new algorithm that can align a slightly misaligned document image with a single transformation that shifts pixels in parallel. The hardware is simulated in VHDL and is estimated to take about 65 ms to align an image of 380 by 480 pixels. We also demonstrate the effectiveness of the proposed image alignment algorithm in an OCR system by comparing its characteristics with those of existing image rotation algorithms.
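One way to read "a single transformation that shifts pixels in parallel" is as a shear: each column is shifted vertically by an amount proportional to its x position, which approximates a rotation for small angles. The sketch below follows that reading as an assumption; it is not the algorithm or hardware design of the paper, and `align_by_column_shift` is a hypothetical name.

```python
import math


def align_by_column_shift(image, angle_degrees, fill=0):
    """Shear an image by shifting each column vertically by round(x * tan(angle))."""
    height, width = len(image), len(image[0])
    tangent = math.tan(math.radians(angle_degrees))
    aligned = [[fill] * width for _ in range(height)]
    for x in range(width):
        shift = round(x * tangent)  # whole-pixel shift, identical for a full column
        for y in range(height):
            source_y = y + shift
            if 0 <= source_y < height:
                aligned[y][x] = image[source_y][x]
    return aligned


# A line that drops one pixel per column is pulled back onto the top row.
# The 45 degree angle only keeps the toy example small.
diagonal = [[1 if y == x else 0 for x in range(4)] for y in range(4)]
straight = align_by_column_shift(diagonal, 45)
```

Because every pixel in a column moves by the same amount, the shifts are independent of each other, which is the kind of operation that suits a parallel hardware implementation.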

A Study on the Improvement of Retrieval Efficiency Based on the CRFMD (공통기술표현포맷에 기반한 다매체자료의 검색효율 향상에 관한 연구)

Park, Il-Jong;Jeong, Ki-Tai
- Journal of the Korean Society for Information Management
- v.23 no.3 s.61
- pp.5-21
- 2006
In recent years, theories of image and sound analysis have been proposed to work with text retrieval systems and have progressed quickly with the rapid progress in data processing speeds. This study proposes a common representation format for multimedia documents (CRFMD) composed of both images and text to form a single data structure. It also shows that image classification of a given test set is dramatically improved when text features are encoded together with image features. CRFMD might be applicable to other areas of multimedia document retrieval and processing, such as medical image retrieval, World Wide Web searching, and museum collection retrieval.
https://doi.org/10.3743/KOSIM.2006.23.3.005

Correction of Specular Region on Document Images (문서 영상의 전반사 영역 보정 기법)

Simon, Christian;Williem;Park, In Kyu
- Proceedings of the Korean Society of Broadcast Engineers Conference
- 2013.11a
- pp.239-240
- 2013
The quality of document images captured by a digital camera may be degraded by non-uniform illumination. Strong illumination (glare distortion) reduces the contrast of the text in the document image, so an optical character recognition (OCR) system may hardly recognize text in the highly illuminated area. This paper proposes a method to increase the contrast between text (foreground) and background in highly illuminated areas.

Line Edge-Based Type-Specific Corner Points Extraction for the Analysis of Table Form Document Structure (표 서식 문서의 구조 분석을 위한 선분 에지 기반의 유형별 꼭짓점 검출)

Jung, Jae-young
- Journal of Digital Contents Society
- v.15 no.2
- pp.209-217
- 2014
It is very important to classify large numbers of table-form documents into classes of the same type and to automatically extract the information filled into the template. For both, the table-form structure must be analyzed accurately. This paper proposes an algorithm that extracts corner points based on line edge segments and classifies the junction type in table-form images. The algorithm preprocesses the image through binarization, skew correction, and deletion of small isolated black areas, which are probably generated by noise. It then detects edge blocks, line edges within each edge block, and corner points. The extracted corner points are classified into 9 junction types based on the combination of horizontal and vertical line edge segments in a block. The proposed method is applied to several unconstrained document images such as tax forms, transaction receipts, and ordinary documents containing tables. The experimental results show that the performance of point detection is over 99%. Considering that almost all corner points form corresponding pairs in a table, the corner type and line width information may be useful for analyzing the structure of table-form documents.
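The abstract does not enumerate the 9 junction types. A common reading, assumed here, is four L-shaped corners, four T-shaped junctions, and one cross, each determined by which of the four line arms (up, down, left, right) leave the point. The sketch below encodes that reading; the type names and `classify_junction` are hypothetical.

```python
# Assumed mapping from the set of line arms at a point to a junction type.
JUNCTION_TYPES = {
    frozenset({"right", "down"}): "top-left corner",
    frozenset({"left", "down"}): "top-right corner",
    frozenset({"right", "up"}): "bottom-left corner",
    frozenset({"left", "up"}): "bottom-right corner",
    frozenset({"left", "right", "down"}): "top T-junction",
    frozenset({"left", "right", "up"}): "bottom T-junction",
    frozenset({"up", "down", "right"}): "left T-junction",
    frozenset({"up", "down", "left"}): "right T-junction",
    frozenset({"up", "down", "left", "right"}): "cross",
}


def classify_junction(up, down, left, right):
    """Return the junction type for the arms present, or None if not a junction."""
    flags = {"up": up, "down": down, "left": left, "right": right}
    arms = frozenset(name for name, present in flags.items() if present)
    return JUNCTION_TYPES.get(arms)


corner = classify_junction(up=False, down=True, left=False, right=True)
```

Straight segments (only left and right, or only up and down) and isolated line ends return `None`, since they are not corner points under this reading.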
https://doi.org/10.9728/dcs.2014.15.2.209
