• Title/Summary/Keyword: 문서배열

Search Result 20, Processing Time 0.029 seconds

A Study on Records Filing Systems (문서기록물의 파일링시스템에 관한 연구)

  • Yoo, Jae-Ok
    • Journal of the Korean BIBLIA Society for library and Information Science
    • /
    • v.16 no.2
    • /
    • pp.5-24
    • /
    • 2005
  • This study reviews various kinds of records filing systems, which function as a basic fundamental to effective records management. The purposes, methods and characteristics of Alphabetic, geographic, numeric, subject, and combined filing systems are examined. The alphabetic filing method uses letters of the alphabet to determine the order of names of people and companies. In subject filing the subjects are filed in alphabetic order. In numeric filing, numbers representing names or subjects are used. When records are requested by place or location rather than by individual or business name, geographic filing is advantageous.

  • PDF

An Automatic Text Classification Model using Association Rules (데이타마이닝 기법을 이용한 문서 자동 분류 모델)

  • 김영인;이진용;문현정;우용태
    • Proceedings of the Korea Database Society Conference
    • /
    • 2000.11a
    • /
    • pp.101-108
    • /
    • 2000
  • 기업에서 보유한 전문 지식 정보가 급속도로 증가함에 따라 대량의 문서에 저장된 지식 정보를 효과적으로 탐색하여 기업 경영에 활용하기 위한 지식경영시스템 도입이 확산되고 있다. 이러한 지식경영시스템에서 핵심적인 구성 요소는 전문 분야의 지식 정보를 체계적으로 분류하고 효율적으로 검색하기 위한 지식 탐사 기법이다. 본 논문에서는 데이타마이닝 기법을 이용하여 문서를 자동적으로 분류하기 위한 새로운 모델을 제안하였다. 연관 규칙 탐사 알고리즘을 이용하여 학습용 문서 집합으로부터 세부 분야를 대표하는 색인어 집합을 구성하였다. 세부 분야별 색인어 집합에 대하여 전체 문서에 대한 비중에 따라 가중치 배열을 구성하여 문서를 자동으로 분류하기 위한 기준으로 삼았다. 임의의 문서를 자동적으로 분류하는 실험을 통하여 제안된 방법의 효율성을 검정하였다.

  • PDF

The Classification arranged from Protectorate period to the early Japanese Colonial rule period : for Official Documents during the period from Kabo Reform to The Great Han Empire - Focusing on Classification Stamp and Warehouse Number Stamp - (통감부~일제 초기 갑오개혁과 대한제국기 공문서의 분류 - 분류도장·창고번호도장을 중심으로 -)

  • Park, Sung-Joon
    • The Korean Journal of Archival Studies
    • /
    • no.22
    • /
    • pp.115-155
    • /
    • 2009
  • As Korea was merged into Japan, the official documents during Kabo Reform and The Great Han Empire time were handed over to the Government-General of Chosun and reclassified from section based to ministry based. However they had been reclassified before many times. The footprints of reclassification can be found in the classification stamps and warehouse number stamps which remained on the cover of official documents from Kabo Reform to The Great Han Empire. They classified the documents by Section in the classification system of Ministry-Department-Section, stamped and numbered them. It is consistent with the official document classification system in The Great Han Empire, which shows the section based classification was maintained. Although they stamped by Section and numbered the documents, there were differences in sub classification system by Section. In the documents of Land Tax Section, many institutions can be found. The documents of the same year can be found in different group and documents of similar characteristics are classified in the same group. Customs Section and Other Tax Section seemed to number their documents according to the year of documents. However the year and the order of 'i-ro-ha(イロハ) song' does not match. From Kabo Reform to The Great Han Empire the documents were grouped by Section. However they did not have classification rules for the sub units of Section. Therefore, it is not clear if the document grouping of classification stamps can be understood as the original order of official document classification system of The Great Han Empire. However, given the grouping method reflects the document classification system, the sub section classification system of the Great Han Empire can be inferred through the grouping method. In this inference, it is understood that the classification system was divided into two such as 'Section - Counterpart Institution' and 'Section - Document Issuance Year'. The Government-General of Chosun took over the official documents of The Great Han Empire, stored them in the warehouse and marked them with Warehouse Number Stamps. Warehouse Number Stamp contained the Institution that grouped those documents and the documents were stored by warehouse. Although most of the documents on the shelves in each warehouse were arranged by classification stamp number, some of them were mixed and the order of shelves and that of documents did not match. Although they arranged the documents on the shelves and gave the symbols in the order of 'i-ro-ha(イロハ) song', these symbols were not given by the order of number. During the storage of the documents by the Government-General of Chosun, the classification system according to the classification stamps was affected. One characteristic that can be found in warehouse number stamps is that the preservation period on each document group lost the meaning. The preservation period id decided according to the historical and administrative value. However, the warehouse number stamps did not distinguish the documents according to the preservation period and put the documents with different preservation period on one shelf. As Japan merged Korea, The Great Han Empire did not consider the official documents of the Great Han Empire as administrative documents that should be disposed some time later. It considered them as materials to review the old which is necessary for the colonial governance. As the meaning of the documents has been changed from general administrative documents to the materials that they would need to govern the colony, they dealt with all the official documents of The Great Han Empire as the same object regardless of preservation period. The Government-General of Chosun destroyed the classification system of the Great Han Empire which was based on Section and the functions in the Section by reclassifying them according to Ministry when they reclassified the official documents during Kobo Reform and the Great Han Empire in order to utilize them to govern the colony.

The Study on Cutting Characteristic according to a Shape, Size and Array of Cutter for Paper Shredder (문서세단기의 커터날 형상, 크기, 배열과 절단특성에 관한 연구)

  • Lee, Wi-Ro;Lee, Dong-Gyu;Kim, Min-Ho
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.23 no.1 s.178
    • /
    • pp.56-63
    • /
    • 2006
  • The aim of this study is to find the best cutting conditions as analyzing cutting process of paper shredder and shape of cutter. The test has been done variation of torque and cutting velocity according to load. When shape of cutter and distance between cutter and shaft are changed, The variation of cutting force according to cutting angle and load is geometrically analyzed. The result of geometrical analysis is presented that the radius and array of cutter is the method to improve torque of paper shredder. In this paper it is presented as basic method of design to improve cutting performance of paper shredder.

The Recognition of Vowels and Consonants in a Handwritten Hangul Text with Attributed Grammars (속성문법을 이용한 필기체 한글 문서 내의 자모인식)

  • Lyu, Sung-Pil;Kim, Tae-Kyun
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.26 no.3
    • /
    • pp.85-94
    • /
    • 1989
  • This paper proposes a method to recognize vowels and consonants in a handwritten Hangul text, in which the sizes of chracters and the spaces between characters are not uniform. In this method, all characters in the thinned image of a handwritten Hangul text are transformed into strokes, and the attributes which represent the relations between strokes are extracted from these strokes, and the attributes which represent the relations between strokes are extracted from these strokes. The vowels and consonants are recognized by applying attributed grammars to the strokes and attributes.

  • PDF

A Method for Thresholding and Correction of Skew in Camera Document Images (카메라 문서 영상의 이진화 및 기울어짐 보정 방법)

  • Jang Dae-Geun;Chun Byung-Tae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.10 no.3 s.35
    • /
    • pp.143-150
    • /
    • 2005
  • Camera image is very sensitive to illumination that result in difficulties for recognizing character. Also Camera captured document images have not only skew but also vignetting effect and geometric distortion. Vignetting effect make it difficult to separate characters from the document images. Geometric distortion, occurred by the mismatch of angle and center position between the document image and the camera, make the shape of characters to be distorted, so that the character recognition is more difficult than the case of using scanner. In this paper, we propose a method that can increase the performance of character recognition by correcting the geometric distortion of document images using a linear approximation which changes the quadrilateral region to the rectangle one. The proposed method also determine the quadrilateral transform region automatically, using the alignment of character lines and the skewed angles of characters located in the edges of each character line. Proposed method, therefore, can correct the geometric distortion without getting positional information from camera.

  • PDF

The Study for the Recognition System of Finger Languages (자화 인식 시스템에 관한 연구)

  • 강민지;최은숙;손영선
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2003.09b
    • /
    • pp.151-154
    • /
    • 2003
  • 본 논문에서는 흑백 CCD 카메라를 이용하여 청각 장애인의 의사전달 수단인 지화 동작을 동영상으로 입력받아 인식하여, 편집 가능한 텍스트 문서로 변환하는 시스템을 구현하였다. 일련의 입력 영상들 중에서 흐린 영상과 선명한 영상의 구분은 영상의 잔상을 이용하였고, 촬영된 연속 영상들의 배열로부터 문자 자소를 구하고, 오토마타를 적용하여 완성된 문자를 문서 편집기에 출력시켰다 획득된 선명한 영상 데이터 중 변화가 심한 손목 부분을 제거한 후, 최대 원형 이동법을 이용하여 손의 무게 중심점을 구하고, 원형 패턴 벡터 알고리즘을 적용하여 지화 해석에 필요한 손을 인식하였다. 손 중심으로부터 거리 스펙트럼을 이용하여 지화 인식에 사용되는 손 모양의 특징 벡터를 추출하고, 퍼지추론을 적용하여 표준 패턴과 입력 패턴의 특징벡터를 비교, 지화 동작을 인식하였다.

  • PDF

Detecting and Interpreting Terms: Focusing Korean Medical Terms (전문용어 탐지와 해석 모델: 한국어 의학용어 중심으로 )

  • Haram-Yeom;Jae-Hoon Kim
    • Annual Conference on Human and Language Technology
    • /
    • 2022.10a
    • /
    • pp.407-411
    • /
    • 2022
  • 최근 COVID-19로 인해 대중의 의학 분야 관심이 증가하고 있다. 대부분의 의학문서는 전문용어인 의학용어로 구성되어 있어 대중이 이를 보고 이해하기에 어려움이 있다. 의학용어를 쉬운 뜻으로 풀이하는 모델을 이용한다면 대중이 의학 문서를 쉽게 이해할 수 있을 것이다. 이런 문제를 완화하기 위해서 본 논문에서는 Transformer 기반 번역 모델을 이용한 의학용어 탐지 및 해석 모델을 제안한다. 번역 모델에 적용하기 위해 병렬말뭉치가 필요하다. 본 논문에서는 다음과 같은 방법으로 병렬말뭉치를 구축한다: 1) 의학용어 사전을 구축한다. 2) 의학 드라마의 자막으로부터 의학용어를 찾아서 그 뜻풀이로 대체한다. 3) 원자막과 뜻풀이가 포함된 자막을 나란히 배열한다. 구축된 병렬말뭉치를 이용해서 Transformer 번역모델에 적용하여 전문용어를 찾아서 해석하는 모델을 구축한다. 각 문장은 음절 단위로 나뉘어 사전학습 된 KoCharELECTRA를 이용해서 임베딩한다. 제안된 모델은 약 69.3%의 어절단위 BLEU 점수를 보였다. 제안된 의학용어 해석기를 통해 대중이 의학문서를 좀 더 쉽게 접근할 수 있을 것이다.

  • PDF

Speaker classification and prediction with language model (언어모델을 활용한 문서 내 발화자 예측 분류 모델)

  • Kim, Gyeongmin;Han, Seunggyu;Seo, Jaehyung;Lee, Chanhee;Lim, Heuiseok
    • Annual Conference on Human and Language Technology
    • /
    • 2020.10a
    • /
    • pp.317-320
    • /
    • 2020
  • 연설문은 구어체와 문어체 두 가지 특성을 모두 갖고 있는 복합적인 데이터 형태이다. 발화자의 문장 표현, 배열, 그리고 결합에 따라 그 구조가 다르기 때문에, 화자 별 갖는 문체적 특성 또한 모두 다르다. 국정을 다루는 정치인들의 연설문은 국정 현황을 포함한 다양한 주요 문제점을 다룬다. 그러면 발화자의 문서 내 문체적 특성을 고려할 경우, 해당 문서가 어느 정치인의 연설문인지 파악 할 수 있는가? 본 연구에서는 대한민국 정책 브리핑 사이트로부터 한국어 기반 사전 학습된 언어 모델을 활용하여 연설문에 대한 미세조정을 진행함으로써 발화자 예측 분류 모델을 생성하고, 그 가능성을 입증하고자 한다. 본 연구는 5-cross validation으로 모델 성능을 평가하였고 KoBERT, KoGPT2 모델에서 각각 90.22%, 84.41% 정확도를 보였다.

  • PDF

Character Shape Distortion Correction of Camera Acquired Document Images (카메라 획득 문서영상에서의 글자모양 왜곡보정)

  • Jang Dae-Geun;Kim Eui-Jeong
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.10 no.4
    • /
    • pp.680-686
    • /
    • 2006
  • Document images captured by scanners have only skewing distortion. But camera captured document images have not only skew but also vignetting effect and geometric distortion. Vignetting effect, which makes the border areas to be darker than the center of the image, make it difficult to separate characters from the document images. But this effect has being decreased, as the lens manufacturing skill is developed. Geometric distortion, occurred by the mismatch of angle and center position between the document image and the camera, make the shape of characters to be distorted, so that the character recognition is more difficult than the case of using scanner. In this paper, we propose a method that can increase the performance of character recognition by correcting the geometric distortion of document images using a linear approximation which changes the quadrilateral region to the rectangle one. The proposed method also determine the quadrilateral transform region automatically, using the alignment of character lines and the skewed angles of characters located in the edges of each character line. Proposed method, therefore, can correct the geometric distortion without getting positional information from camera.