• Title/Summary/Keyword: document layout analysis

Search Result 20, Processing Time 0.021 seconds

Analysis of Furniture Planning and Layout Type in Subject Specialization of University Library (대학도서관 주제자료실의 가구계획 및 배치유형 분석)

  • Chang, Ari;Hwang, Yeon-Sook
    • Korean Institute of Interior Design Journal
    • /
    • v.24 no.2
    • /
    • pp.180-188
    • /
    • 2015
  • University libraries aim to improve not only educational effects but also the general quality of colleges. A primary way of pursuing this goal is through providing professors and students with sufficient amounts of available references and materials that can be used for academic purposes. However, even though university libraries are intended to be used by college students majoring in different fields, they tend to provide mostly books. This limited offering of resources means that they are not distinguishing themselves from regular libraries. The purpose of this study is to present basic data for the spatial design of a subject specialization room in a college library. Included in the design are recommendations for the type and placement of the furniture in the room. The summary of results for this study and the conclusions are as follows: The layout of data space and reading space in a subject specialization room can be categorized into both document-oriented (document centralized and document categorized) and reading-oriented (reading centralized, all, and group types). The public reading seats and private reading seats in a subject specialization room, according to their ratio, can be divided into private reading, public reading, and distributed reading sections. The ratio of open-spaced tables is higher for groups of four or more people, but users often sit separately from others in order to ensure privacy. Unfortunately, this practice results in seating gaps that do not make efficient use of space. The result is that the public reading seats are less efficient than the private reading seats in terms of space. Therefore, it is necessary to increase the number of cubicles.

Table Detection from Document Image using Vertical Arrangement of Text Blocks

  • Tran, Dieu Ni;Tran, Tuan Anh;Oh, Aran;Kim, Soo Hyung;Na, In Seop
    • International Journal of Contents
    • /
    • v.11 no.4
    • /
    • pp.77-85
    • /
    • 2015
  • Table detection is a challenging problem and plays an important role in document layout analysis. In this paper, we propose an effective method to identify the table region from document images. First, the regions of interest (ROIs) are recognized as the table candidates. In each ROI, we locate text components and extract text blocks. After that, we check all text blocks to determine if they are arranged horizontally or vertically and compare the height of each text block with the average height. If the text blocks satisfy a series of rules, the ROI is regarded as a table. Experiments on the ICDAR 2013 dataset show that the results obtained are very encouraging. This proves the effectiveness and superiority of our proposed method.

Word Extraction from Table Regions in Document Images (문서 영상 내 테이블 영역에서의 단어 추출)

  • Jeong, Chang-Bu;Kim, Soo-Hyung
    • The KIPS Transactions:PartB
    • /
    • v.12B no.4 s.100
    • /
    • pp.369-378
    • /
    • 2005
  • Document image is segmented and classified into text, picture, or table by a document layout analysis, and the words in table regions are significant for keyword spotting because they are more meaningful than the words in other regions. This paper proposes a method to extract words from table regions in document images. As word extraction from table regions is practically regarded extracting words from cell regions composing the table, it is necessary to extract the cell correctly. In the cell extraction module, table frame is extracted first by analyzing connected components, and then the intersection points are extracted from the table frame. We modify the false intersections using the correlation between the neighboring intersections, and extract the cells using the information of intersections. Text regions in the individual cells are located by using the connected components information that was obtained during the cell extraction module, and they are segmented into text lines by using projection profiles. Finally we divide the segmented lines into words using gap clustering and special symbol detection. The experiment performed on In table images that are extracted from Korean documents, and shows $99.16\%$ accuracy of word extraction.

Web Document Transcoding Technique for Small Display Devices (소형 화면 단말기를 위한 웹 문서 변환 기법)

  • Shin, Hee-Sook;Mah, Pyeong-Soo;Cho, Soo-Sun;Lee, Dong-Woo
    • The KIPS Transactions:PartD
    • /
    • v.9D no.6
    • /
    • pp.1145-1156
    • /
    • 2002
  • We propose a web document transcoding technique that translates existing web pages designed for desktop computers into an appropriate form for hand-held devices connected to the wireless internet. By defining a content block based on a visual separation and using it as a minimum unit for analyzing and converting processes, we can get web pages converted more exactly. We also apply the reallocation of the content block and the generation of new index in order to provide convenient interface without left-right scrolling in small screen devices. These methods, compared with existing ways such as text level summary or partial extraction method, can provide efficient navigation and a full recognition of web documents. To gain those transcoding benefits, we propose the Layout-Forming Tag Analysis Algorithm that analyzes structural tags, which motivate visual separation and the Component Grouping Algorithm that extracts the content block. We also classify and rearrange the content block and generate the new index to produce an appropriate form of web pages for small display devices. We have designed and implemented our transcoding system in a proxy server and evaluated the methods and the algorithms through an analysis of transcoded results. Our transcoding system showed a good result on most of popular web pages that have complicated structures.

Development of an Automated ESG Document Review System using Ensemble-Based OCR and RAG Technologies

  • Eun-Sil Choi
    • Journal of the Korea Society of Computer and Information
    • /
    • v.29 no.9
    • /
    • pp.25-37
    • /
    • 2024
  • This study proposes a novel automation system that integrates Optical Character Recognition (OCR) and Retrieval-Augmented Generation (RAG) technologies to enhance the efficiency of the ESG (Environmental, Social, and Governance) document review process. The proposed system improves text recognition accuracy by applying an ensemble model-based image preprocessing algorithm and hybrid information extraction models in the OCR process. Additionally, the RAG pipeline optimizes information retrieval and answer generation reliability through the implementation of layout analysis algorithms, re-ranking algorithms, and ensemble retrievers. The system's performance was evaluated using certificate images from online portals and corporate internal regulations obtained from various sources, such as the company's websites. The results demonstrated an accuracy of 93.8% for certification reviews and 92.2% for company regulations reviews, indicating that the proposed system effectively supports human evaluators in the ESG assessment process.

Implementation of a Journal's Table of Contents Separation System based on Contents Analysis (내용분석을 통한 논문지의 목차분류 시스템의 구현)

  • Kwon, Young-Bin
    • The KIPS Transactions:PartB
    • /
    • v.14B no.7
    • /
    • pp.481-492
    • /
    • 2007
  • In this paper, a method for automatic indexing of contents to reduce effort for inputting paper information and constructing index is considered. Existing document analysis methods can't analyse various table of contents of journal paper formats efficiently because they have many exceptions. In this paper, various contents formats for journals, which have different features from those for general documents, are analysed and described. The principal elements that we want to represent are titles, authors, and pages for each papers. Thus, the three principal elements are modeled according to the order of their arrangement, and their features are extracted. And a table of content recognition system of journal is implemented, based on the proposed modeling method. The accuracy of exact extraction ratio of 91.5% on title, author, and page type on 660 published papers of various journals is obtained.

A New Method for Nonparametric Document Layout Analysis (매개변수에 무관한 새로운 문서 구조 분석 방법)

  • 류대석;강선미;이성환
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 1999.10b
    • /
    • pp.482-484
    • /
    • 1999
  • 본 논문에서는 매개변수 없이 입력 문서 영상을 최대 동질 영역들로 분할한 다음, 각 동질 영역을 텍스트, 그림, 표 그리고 선으로 자동 분류하는 새로운 방법을 제안한다. 다단계 분석과 하향식 접근 방법을 사용하기 위하여 문서 영상을 피라미드 구조로 계층화하였으며, 어떤 영역을 분할할 지의 여부를 결정하기 위하여 그 영역의 주기성을 이용하여 판단하였다. 이러한 주기성 정보를 이용함으로써, 어떠한 매개변수 없이도 활자체 크기와 행간에 무관하게 텍스트 영역을 정확히 분석할 수 있었으며, 피라미드 구조를 만드는데 걸리는 시간이 질감 분석 접근방법보다 빠른 방법으로 설계되었다. Washington 대학의 문서 영상 데이터베이스를 이용한 실험 결과, 제안된 방법이 기존의 방법들보다 더 정확하게 문서 영상을 분할 및 분류할 수 있음을 확인할 수 있었다.

  • PDF

Page Layout Analysis and Text Segmentation in Document Image (문서영상의 레이아웃 분석과 문자 분할)

  • Choi, Jae-Hyung;Cho, Nam-Ik
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2012.07a
    • /
    • pp.71-74
    • /
    • 2012
  • 본 논문에서는 새로운 문자 분할 알고리즘을 제안한다. 고전적인 문자 분할 알고리즘은 학술적인 문서영상과 같이 단순한 구조를 가진 문서영상을 대상으로 하여 좋은 성능을 보였지만 다양한 문자 크기와 색상, 그림, 복잡한 배경 등으로 구성된 문서영상에서는 좋지 못한 성능을 보인다. 최근에 제안고 있는 방법들은 복잡한 문서영상에서도 좋은 성능을 보이도록 다양한 기법들을 적용하여 우수한 성능을 보이고 있지만, 대부분의 방법들이 영상을 일정한 크기의 블록으로 나누어 문자분할을 하기 때문에 세밀한 부분에서는 성능이 어느 정도 한계를 보인다. 따라서 본 논문에서는 블록의 크기에 제한을 갖지 않는 새로운 방법으로서, watershed 알고리즘을 이용한 문자분할 방법을 제시한다. 구체적으로, watershed 알고리즘을 이용하여 문서영상의 구조(docstrum)를 파악하고 이를 기반으로 문자를 분할한다. 제안하는 방법은 크게 엣지 검출, distance transform, watershed 알고리즘을 이용한 docstrum 분석, 문자 분할의 네 단계를 거친다. 실험 결과 블록에 기반한 기존의 방법들이 놓치는 세밀한 부분에서도 제안된 알고리즘은 올바른 분할결과를 얻을 수 있음을 확인하였다.

  • PDF

A HTML Document Transcoding Technique for Mobile Devices (이동 단말을 위한 HTML 문서의 변환 기법)

  • Shin, Hee-Sook;Cho, Soo-Sun;Lee, Dong-Woo;Mah, Pyeong-Soo
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2002.11c
    • /
    • pp.2347-2350
    • /
    • 2002
  • 본 논문에서는 일반 데스크탑 PC의 디스플레이 성능에 적합하도록 작성된 유선의 웹 문서를 무선 인터넷 환경의 핸드헬드 계열 소형 단말에서도 효율적으로 표현하기 위한 변환 기법을 제시한다. 이는 기존의 단순한 텍스트 위주의 추출 및 형식의 변환과는 달리, 분석 및 변환을 위한 최소 내용 단위를 설정하고, 이들의 재배치를 통하여 원본 웹 문서의 정보를 보다 정확히 반영한다. 또한 새로운 인덱스 형식으로의 재표현을 통하여 기존의 페이지 조각과 계층적 구조의 인덱스 링크를 이용한 인터페이스보다 편리한 검색 및 페이지 이동을 제공한다. 이 기법은 보다 많은 정보를 복잡한 구조로 표현하는 현재의 웹 문서 특징을 반영하고, 이동 단말들의 고성능화 추세와 함께 화려한 무선 인터넷을 요구하는 사용자들을 고려한 것이다. 전체 변환 과정은 Layout-Forming Tag Analysis Algorithm, Component Grouping Algorithm Component Block Classification Method, Index Generation Method로 구성된다. 변환 시스템의 구성 모듈별 설계와 프로토타입의 구현을 통하여 제안하는 알고리즘 및 변환 방법을 평가하였고, 실제 웹 문서에 대한 실험 결과에서 단말의 소형 화면에 적합하게 변환된 모습을 확인하였다.

  • PDF

Prototype Structure of integrated Document Forms for Construction PMIS based on Analysis (건설 PMIS 현황분석에 기반한 통합양식체계 프로토타입)

  • Kim, Myeong-Jin;Jung, Tae-Hwan;Noh, Gyu-Tae;Koo, Kyo-Jin
    • Korean Journal of Construction Engineering and Management
    • /
    • v.12 no.5
    • /
    • pp.3-11
    • /
    • 2011
  • The Project Management Information System (PMIS), the core of construction information, performs the function of handling large volumes of data and information in the actual layout and construction, and supports the manager in making a prompt decision. The purpose of this study is to develop a construction PMIS integrated form system prototype for process and quality control forms, submitted between project groups, as the method of data sharing and communication among an owner, supervisor, and site manager. This study of the integrated form classification system was carried out with limited case studies and interviews of professionals due to lack of data from the previous thesis and related research material From the results of this study, an Excel-based integrated form system was established for a complete site.