• Title/Summary/Keyword: document image processing

Search Result 105, Processing Time 0.022 seconds

A Hierarchical Index Technique for Moving Image Retrieval System based on MPEG-7 (MPEG-7에 기반한 동영상 검색 시스템을 위한 계층형 인덱스 기법)

  • Kim Tack gon;Kim Woo saeng
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.29 no.10C
    • /
    • pp.1444-1450
    • /
    • 2004
  • MPEG-7 based on XML represents various information of multimedia data's contents. and it support search and browsing by user's wants. But, MPEG-7 standard don't support retrieval method and Many XML Indexing is not compatible to retrieval MPEG-7 documents. So Much research activity and interest has emerged recently in retrieval MPEG-7 documents. In our paper, we suppose a hierarchical index based on MPEG-7 document's structural information, and review how to query processing based on high level feature description.

The Construction and Common Use of Old Document DB in the Foreign Countries (해외 소장 고문헌의 DB구축과 공동활용 방안)

  • Kang, Soon-Ae
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.42 no.3
    • /
    • pp.61-79
    • /
    • 2008
  • The purpose of this paper is to investigate the three aspects of the construction and common use of old document DB in the foreign countries: i) the processing of old documents, ii) the problem and improvement of DB systems of old documents. and iii) the common use of old document DB. Results from this research are as follows: The National Library of Korea(NLK) copied old documents in the foreign countries from 1982 to 2006 and published the brief catalog. The Reogang Publishing company issued four volumes catalogs of old document in Japan. The National Research Institute of Cultural Heritage(NRICH) investigated old books and published some catalogs of several organizations in Japan. America. France. and all. The National Institute of Korean History(NIKH) investigated old archives and published some catalogs of several organizations in Japan. The characteristics of the Korean Old and Rare Collection Information System(KORCIS) of the NLK, the Old Books Cultural Heritage in Overseas System of the NRICH. and the Korea History DB System and MF Catalog/ Image System of NIKH were described in the DB systems of old documents, the problems of DB systems were checked over and some alternatives were suggested. In the common use of old document DB, KORMARC format and description rules(draft) for archives should be revised to adopt a new standard such as KS editions. and all the institutes involved should thoroughly follow the standards. when creating bibliographic records and digitizing texts. It is necessary to educate and train the specialists of old documents. A government organization should be established to supervise all the procedures of developing technology for sharing digitized resources. using contents. and cooperating with the related internationl organizations and institutes.

Optical Music Score Recognition System for Smart Mobile Devices

  • Han, SeJin;Lee, GueeSang
    • International Journal of Contents
    • /
    • v.10 no.4
    • /
    • pp.63-68
    • /
    • 2014
  • In this paper, we propose a smart system that can optically recognize a music score within a document and can play the music after recognition. Many historic handwritten documents have now been digitalized. Converting images of a music score within documents into digital files is particularly difficult and requires considerable resources because a music score consists of a 2D structure with both staff lines and symbols. The proposed system takes an input image using a mobile device equipped with a camera module, and the image is optimized via preprocessing. Binarization, music sheet correction, staff line recognition, vertical line detection, note recognition, and symbol recognition processing are then applied, and a music file is generated in an XML format. The Music XML file is recorded as digital information, and based on that file, we can modify the result, logically correct errors, and finally generate a MIDI file. Our system reduces misrecognition, and a wider range of music score can be recognized because we have implemented distortion correction and vertical line detection. We show that the proposed method is practical, and that is has potential for wide application through an experiment with a variety of music scores.

A study on development of simulation model of Underwater Acoustic Imaging (UAI) system with the inclusion of underwater propagation medium and stepped frequency beam-steering acoustic array

  • L.S. Praveen;Govind R. Kadambi;S. Malathi;Preetham Shankpal
    • Ocean Systems Engineering
    • /
    • v.13 no.2
    • /
    • pp.195-224
    • /
    • 2023
  • This paper proposes a method for the acoustic imaging wherein the traditional requirement of the relative movement between the transmitter and target is overcome. This is facilitated through the beamforming acoustic array in the transmitter, in which the target is illuminated by the array at various azimuth and elevation angles without the physical movement of the acoustic array. The concept of beam steering of the acoustic array facilitates the formation of the beam at desired angular positions of azimuth and elevation angles. This paper substantiates that the combination of illumination of the target from different azimuth and elevation angles with respect to the transmitter (through the beam steering of beam forming acoustic array) and the beam steering at multiple frequencies (through SF) results in enhanced reconstruction of images of the target in the underwater scenario. This paper also demonstrates the possibility of reconstruction of the image of a target in underwater without invoking the traditional algorithms of Digital Image Processing (DIP). This paper comprehensively and succinctly presents all the empirical formulae required for modelling the acoustic medium and the target to facilitate the reader with a comprehensive summary document incorporating the various parameters of multi-disciplinary nature.

Image Based Text Matching Using Local Crowdedness and Hausdorff Distance (지역 밀집도 및 Hausdorff 거리를 이용한 영상기반 텍스트 매칭)

  • Son, Hwa-Jeong;Kim, Ji-Soo;Park, Mi-Seon;Yoo, Jae-Myeong;Kim, Soo-Hyung
    • The Journal of the Korea Contents Association
    • /
    • v.6 no.10
    • /
    • pp.134-142
    • /
    • 2006
  • In this paper, we investigate a Hausdorff distance, which is used for the measurement of image similarity, to see whether it is also effective for document retrieval. The proposed method uses a local crowdedness and a Hausdorff distance to locate text images by determining whether a pair of images scanned at different time comes from the same text or not. To reduce the processing time, which is one of the disadvantages of a Hausdorff distance algorithm, we adopt a local crowdedness for feature point extraction. We apply the proposed method to 190 pairs of the same class and 190 pairs of the different class collected from postal envelop images. The results show that the modified Hausdorff distance proposed in this paper performed well in locating the tort region and calculating the degree of similarity between two images. An improvement of accuracy by 2.7% and 9.0% has been obtained, compared to a binary correlation method and the original Hausdorff distance method, respectively.

  • PDF

A Study on Natural Language Document and Query Processor for Information Retrieval in Digital Library (디지털 도서관 환경에서의 정보 검색을 위한 자연어 문서 및 질의 처리기에 관한 연구)

  • 윤성희
    • Journal of the Korea Computer Industry Society
    • /
    • v.2 no.12
    • /
    • pp.1601-1608
    • /
    • 2001
  • Digital library is the most important database system that needs information retrieval engine for natural language documents and multimedia data. This paper describes the experimental results of information retrieval engine and browser based on natural language processing. It includes lexical analysis, syntax processing, stemming, and keyword indexing for the natural language text. With the experimental database ‘Earth and Space Science’ that has lots of images and titles and their descriptive text in natural language, text-based search engine was tested. Combined with content-based image search engine, it is expected to be a multimedia information retrieval system in digital library

  • PDF

Improved Edge Detection Algorithm Using Ant Colony System (개미 군락 시스템을 이용한 개선된 에지 검색 알고리즘)

  • Kim In-Kyeom;Yun Min-Young
    • The KIPS Transactions:PartB
    • /
    • v.13B no.3 s.106
    • /
    • pp.315-322
    • /
    • 2006
  • Ant Colony System(ACS) is easily applicable to the traveling salesman problem(TSP) and it has demonstrated good performance on TSP. Recently, ACS has been emerged as the useful tool for the pattern recognition, feature extraction, and edge detection. The edge detection is wifely utilized in the area of document analysis, character recognition, and face recognition. However, the conventional operator-based edge detection approaches require additional postprocessing steps for the application. In the present study, in order to overcome this shortcoming, we have proposed the new ACS-based edge detection algorithm. The experimental results indicate that this proposed algorithm has the excellent performance in terms of robustness and flexibility.

Convolutional Neural Network-based Malware Classification Method utilizing Local Feature-based Global Image (로컬 특징 기반 글로벌 이미지를 사용한 CNN 기반의 악성코드 분류 방법)

  • Jang, Sejun;Sung, Yunsick
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2020.05a
    • /
    • pp.222-223
    • /
    • 2020
  • 최근 악성코드로 인한 피해가 증가하고 있다. 악성코드는 악성코드가 속한 종류에 따라서 대응하는 방법도 다르기 때문에 악성코드를 종류별로 분류하는 연구도 중요하다. 기존에는 악성코드 시각화 과정을 통해서 생성된 악성코드의 글로벌 이미지를 사용해 악성코드를 각 종류별로 분류한다. 글로벌 이미지를 악성코드로부터 추출한 바이너리 정보를 사용해서 생성한다. 하지만, 글로벌 이미지만을 사용해서 악성코드를 각 종류별로 분류하는 경우 악성코드의 종류별로 중요한 특징을 고려하기 않기 때문에 분류 정확도가 떨어진다. 본 논문에서는 악성코드의 글로벌 이미지에 악성코드의 종류별 특징을 나타내기 위한 로컬 특징 기반 글로벌 이미지를 사용한 악성코드 분류 방법을 제안한다. 첫 번째, 악성 코드로부터 바이너리를 추출하고 추출된 바이너리를 사용해서 글로벌 이미지를 생성한다. 두 번째, 악성 코드로부터 로컬 특징을 추출하고 악성코드의 종류별 핵심 로컬 특징을 단어-역문서 빈도(Term Frequency Inverse Document Frequency, TFIDF) 알고리즘을 사용해 선택한다. 세 번째, 생성된 글로벌 이미지에 악성코드의 패밀리별 핵심 특징을 픽셀화해서 적용한다. 네 번째, 생성된 로컬 특징 기반 글로벌 이미지를 사용해서 컨볼루션 모델을 학습하고, 학습된 컨볼루션 모델을 사용해서 악성코드를 각 종류별로 분류한다.

A Study on Data Management Systems for Spatial Assessments of Road Visibilities at Night (야간도로 시인성에 대한 공간적 평가를 위한 자료관리체계 연구)

  • Woo, Hee Sook;Kwon, Kwang Seok;Kim, Byung Guk;Yoon, Chun Joo;Kim, Young Rok
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.22 no.4
    • /
    • pp.107-115
    • /
    • 2014
  • Visibility of the road influence the safe driving because it recognizes the obstacle on the road. In this paper, we propose a mobile data acquisition and processing system for evaluating road visibility at night. And it was converted efficiently with mobile images and archived for spatial analysis of road-visibilities at night. This was applied to the following techniques to the system. Low-power computing units, open an image processing library, GPU-based acceleration techniques and document database techniques, etc. And converting the RGB image to the YUV color system, which was integrated the brightness component and the spatial information. High performance Android devices were used to collect brightness data on roads and it was confirmed whether this prototype was to determine the spatial distribution of such acquisition and management systems for spatial-assessments of road visibility at night.

Considerations for Applying Korean Natural Language Processing Technology in Records Management (기록관리 분야에서 한국어 자연어 처리 기술을 적용하기 위한 고려사항)

  • Haklae, Kim
    • Journal of Korean Society of Archives and Records Management
    • /
    • v.22 no.4
    • /
    • pp.129-149
    • /
    • 2022
  • Records have temporal characteristics, including the past and present; linguistic characteristics not limited to a specific language; and various types categorized in a complex way. Processing records such as text, video, and audio in the life cycle of records' creation, preservation, and utilization entails exhaustive effort and cost. Primary natural language processing (NLP) technologies, such as machine translation, document summarization, named-entity recognition, and image recognition, can be widely applied to electronic records and analog digitization. In particular, Korean deep learning-based NLP technologies effectively recognize various record types and generate record management metadata. This paper provides an overview of Korean NLP technologies and discusses considerations for applying NLP technology in records management. The process of using NLP technologies, such as machine translation and optical character recognition for digital conversion of records, is introduced as an example implemented in the Python environment. In contrast, a plan to improve environmental factors and record digitization guidelines for applying NLP technology in the records management field is proposed for utilizing NLP technology.