• Title/Summary/Keyword: 자동정보 추출 (automatic information extraction)

Search Result 1,996, Processing Time 0.029 seconds

Fontface Recognition Using the Font Density Function (폰트 밀도함수를 이용한 폰트 타입의 인식)

  • 진성아;주문원
    • Proceedings of the Korea Multimedia Society Conference / 2001.06a / pp.189-191 / 2001
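  • Fonts are a basic element for describing textual information, and each typeface carries its own distinctive affective information. As a preprocessing stage for a system that automatically extracts affective information from the distribution of English fonts in a document, this study introduces a module that recognizes specific fonts in a document. The shape of most fonts created by font designers is determined by 2D boundary coordinate values called glyph data. Each font can be identified from a linear combination of the font density function defined from this data and the general probability with which each character occurs.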

Direction of RFID Policy Promotion (RFID 정책 추진 방향)

  • 조규조
    • The Proceeding of the Korean Institute of Electromagnetic Engineering and Science / v.15 no.2 / pp.5-11 / 2004
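  • RFID technology attaches a very small electronic tag to goods and other objects to be managed and uses radio waves to automatically extract and manage each object's identification information and information about its surrounding environment. It is a promising technology expected to lead the future IT market. The Ministry of Information and Communication has framed RFID-based informatization under the concept of the u-Sensor Network (USN: Ubiquitous-Sensor Network). Through technology development and pilot projects, it plans to activate RFID services, build u-sensor networks, and actively pursue an IT industry promotion policy aimed at achieving a national income of USD 20,000.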

A DVS System based on Process Monitoring Technique (프로세스 모니터링 기법에 기반한 DVS 시스템)

  • 이준희;차호정
    • Proceedings of the Korean Information Science Society Conference / 2004.04a / pp.103-105 / 2004
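  • This paper proposes a DVS system based on a process monitoring technique. An ideal DVS system should run automatically without modification of application programs and should take the QoS of processes into account. To this end, the proposed system builds on a technique presented in the authors' previous paper, which monitors the Kernel Control Path to extract QoS-related information about periodic processes. The proposed DVS system was implemented on the Linux operating system, and related work was also implemented and tested for comparison. The results show that the proposed DVS system minimizes power consumption while guaranteeing the QoS of periodic processes.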
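
As a rough illustration of the kind of decision a DVS system makes for a periodic process, the sketch below picks the lowest CPU frequency that still completes the work within one period. The `pick_frequency` helper, the frequency table, and the cycle counts are hypothetical; the paper's Kernel Control Path monitoring and its actual policy are not reproduced here.

```python
from typing import Sequence


def pick_frequency(cycles_per_period: float, period_s: float,
                   freqs_hz: Sequence[float]) -> float:
    """Return the lowest frequency that finishes the work within one period.

    Hypothetical helper: assumes execution time = cycles / frequency and
    ignores voltage-switching overhead and memory-bound stalls.
    """
    required_hz = cycles_per_period / period_s
    for f in sorted(freqs_hz):
        if f >= required_hz:
            return f
    # No setting meets the deadline, so fall back to the fastest one.
    return max(freqs_hz)


# Illustrative values only (not taken from the paper).
FREQS = [200e6, 400e6, 600e6, 800e6]
chosen = pick_frequency(cycles_per_period=9e6, period_s=0.033, freqs_hz=FREQS)
print(f"{chosen / 1e6:.0f} MHz")
```

Under these assumptions, running slower than the chosen setting would miss the period, while running faster would spend power without improving QoS.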

An Automatically Extracting Formal Information from Unstructured Security Intelligence Report (비정형 Security Intelligence Report의 정형 정보 자동 추출)

  • Hur, Yuna;Lee, Chanhee;Kim, Gyeongmin;Jo, Jaechoon;Lim, Heuiseok
    • Journal of Digital Convergence / v.17 no.11 / pp.233-240 / 2019
  • To predict and respond to cyber attacks, many security companies quickly identify the methods, types, and characteristics of attack techniques and publish Security Intelligence Reports (SIRs) on them. However, the SIRs distributed by each company are large and unstructured. In this paper, we propose a framework that uses five analysis techniques to structure a report and extract its key information, reducing the time required to extract information from large unstructured SIRs. Since the SIR data have no ground-truth labels, four of the techniques (Keyword Extraction, Topic Modeling, Summarization, and Document Similarity) rely on unsupervised learning. Finally, we build a dataset for extracting threat information from SIRs and apply Named Entity Recognition (NER) to recognize words belonging to the IP, Domain/URL, Hash, and Malware types and to determine which type each word belongs to. In total, the proposed framework applies five analysis techniques, including NER.
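
To make the entity-typing step concrete, the sketch below assigns tokens to the IP, Domain/URL, and Hash types. It deliberately swaps the paper's trained NER model for plain regular expressions, so it is only a stand-in: the patterns, `classify_token`, and `tag_text` are hypothetical, and Malware names are omitted because they would need a model or lexicon.

```python
import re
from typing import List, Optional, Tuple

# Simplified, illustrative patterns; real indicators need stricter validation.
PATTERNS = [
    ("IP", re.compile(r"^(?:\d{1,3}\.){3}\d{1,3}$")),
    ("Hash", re.compile(r"^(?:[0-9a-fA-F]{32}|[0-9a-fA-F]{40}|[0-9a-fA-F]{64})$")),
    ("Domain/URL", re.compile(
        r"^(?:https?://)?(?:[a-z0-9-]+\.)+[a-z]{2,}(?:/\S*)?$", re.IGNORECASE)),
]


def classify_token(token: str) -> Optional[str]:
    """Return the first matching type label, or None if nothing matches."""
    for label, pattern in PATTERNS:
        if pattern.match(token):
            return label
    return None


def tag_text(text: str) -> List[Tuple[str, str]]:
    """Split on whitespace, trim surrounding punctuation, and label tokens."""
    tagged = []
    for raw in text.split():
        token = raw.strip(".,;()[]")
        label = classify_token(token)
        if label is not None:
            tagged.append((token, label))
    return tagged


# Made-up sentence for illustration; not taken from any real report.
sample = ("The dropper contacted 192.168.10.5 and http://example.com/payload, "
          "hash d41d8cd98f00b204e9800998ecf8427e.")
for token, label in tag_text(sample):
    print(label, token)
```

A learned NER model, as used in the paper, can also use context to label tokens that no fixed pattern captures, which is why the regex version should be read only as a baseline.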

A Keyphrase Extraction Model for Each Conference or Journal (학술대회 및 저널별 기술 핵심구 추출 모델)

  • Jeong, Hyun Ji;Jang, Gwangseon;Kim, Tae Hyun;Sin, Donggu
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2022.10a / pp.81-83 / 2022
  • Understanding research trends is necessary for selecting research topics and exploring related work. Most researchers search for representative keywords of the domains or technologies they are interested in to understand research trends. However, some conferences in the artificial intelligence and data mining fields now publish hundreds to thousands of papers each year, which makes it difficult for researchers to follow the trends in their domains of interest. In this paper, we propose an automatic technology keyphrase extraction method that helps researchers understand the research trends of each conference or journal. Keyphrase extraction, which extracts important terms or phrases from a text, is a fundamental technology for natural language processing tasks such as summarization and search. Previous keyphrase extraction methods based on pretrained language models extract keyphrases from long texts, so their performance degrades on short texts such as paper titles. We therefore propose a technology keyphrase extraction model that is robust on short texts and considers the importance of each word.

Automatic Extraction of Opinion Words from Korean Product Reviews Using the k-Structure (k-Structure를 이용한 한국어 상품평 단어 자동 추출 방법)

  • Kang, Han-Hoon;Yoo, Seong-Joon;Han, Dong-Il
    • Journal of KIISE:Software and Applications / v.37 no.6 / pp.470-479 / 2010
  • Most methods for extracting opinion words proposed in English-language studies are difficult to apply directly to Korean. The manual methods suggested by studies in Korea are time-consuming. Extracting Korean opinion words with an English thesaurus suffers from reduced precision because Korean and English words do not map one to one. Studies based on Korean phrase analyzers may fail because they select opinion words that occur with low frequency. This study therefore proposes the k-Structure (k=5 or 8) method, which complements existing Korean studies and may improve precision, for automatically extracting opinion words from simple sentences in Korean product reviews. A simple sentence is defined as one composed of at least 3 words that includes an opinion word within a distance of $\pm 2$ from the attribute name (e.g., the 'battery' of a camera) of an evaluated product (e.g., a 'camera'). In the performance experiment, opinion words for 8 predefined attribute names were automatically extracted with k-Structure from 1,868 product reviews collected from major domestic shopping malls, and their precision was estimated. With k=5, recall was 79.0% and precision was 87.0%; with k=8, recall was 92.35% and precision was 89.3%. A test using PMI-IR (Pointwise Mutual Information - Information Retrieval), one of the methods proposed in English-language studies, yielded a recall of 55% and a precision of 57%.
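
The $\pm 2$ window and the recall/precision figures in this abstract can be illustrated with a small sketch. It only shows the windowing idea and the metric definitions; the k-Structure patterns themselves are not described in the abstract and are not reproduced. The helper names, the English tokens, and the gold set are hypothetical.

```python
from typing import List, Set, Tuple


def window_candidates(tokens: List[str], attribute: str,
                      distance: int = 2) -> List[str]:
    """Collect tokens within +/- `distance` positions of each attribute mention."""
    found = []
    for i, tok in enumerate(tokens):
        if tok != attribute:
            continue
        lo = max(0, i - distance)
        hi = min(len(tokens), i + distance + 1)
        found.extend(t for j, t in enumerate(tokens[lo:hi], start=lo) if j != i)
    return found


def precision_recall(predicted: Set[str], gold: Set[str]) -> Tuple[float, float]:
    """Precision = hits / predicted, recall = hits / gold."""
    hits = len(predicted & gold)
    precision = hits / len(predicted) if predicted else 0.0
    recall = hits / len(gold) if gold else 0.0
    return precision, recall


# Toy English example; the paper works on Korean product reviews.
tokens = ["the", "battery", "lasts", "long", "but", "is", "heavy"]
candidates = window_candidates(tokens, "battery")
gold = {"long", "heavy"}  # hypothetical hand-labelled opinion words
precision, recall = precision_recall(set(candidates), gold)
print(candidates, round(precision, 3), round(recall, 3))
```

In this toy case the window catches 'long' but misses 'heavy', which lies more than two tokens from the attribute name. Every window token counts as a candidate here, which is the kind of noise a pattern-based filter would be expected to reduce.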

Automatic Extraction of Buildings using Aerial Photo and Airborne LIDAR Data (항공사진과 항공레이저 데이터를 이용한 건물 자동추출)

  • 조우석;이영진;좌윤석
    • Korean Journal of Remote Sensing / v.19 no.4 / pp.307-317 / 2003
  • This paper presents an algorithm that automatically extracts buildings from among the many different features on the earth's surface by fusing LIDAR data with panchromatic aerial images. The proposed algorithm consists of three stages: point-level processing, polygon-level processing, and parameter-space-level processing. In the first stage, we eliminate gross errors and apply a local maxima filter to detect building candidate points in the raw laser scanning data. A grouping procedure then segments the raw LIDAR data, and the segmented data are polygonized by the encasing polygon algorithm developed in this research. In the second stage, we eliminate non-building polygons using several constraints such as area and circularity. In the last stage, all the polygons generated in the second stage are projected onto the aerial stereo images through collinearity condition equations. Finally, we fuse the projected encasing polygons with edges detected by image processing to refine the building segments. The experimental results showed that the RMSEs of building corners in X, Y, and Z were 8.1 cm, 24.7 cm, and 35.9 cm, respectively.
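
The second-stage filter can be sketched as follows, assuming the common definition of circularity, $4\pi A / P^2$ (1 for a circle, smaller for elongated shapes). The abstract does not state the paper's exact definition or thresholds, so `min_area` and `min_circularity` below are hypothetical values chosen for illustration.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]


def polygon_area(poly: List[Point]) -> float:
    """Shoelace formula for a simple polygon given as ordered vertices."""
    twice_area = 0.0
    for (x1, y1), (x2, y2) in zip(poly, poly[1:] + poly[:1]):
        twice_area += x1 * y2 - x2 * y1
    return abs(twice_area) / 2.0


def polygon_perimeter(poly: List[Point]) -> float:
    """Sum of edge lengths, closing the ring back to the first vertex."""
    return sum(math.hypot(x2 - x1, y2 - y1)
               for (x1, y1), (x2, y2) in zip(poly, poly[1:] + poly[:1]))


def circularity(poly: List[Point]) -> float:
    """4*pi*A / P^2: equals 1 for a circle and pi/4 for a square."""
    perimeter = polygon_perimeter(poly)
    return 4.0 * math.pi * polygon_area(poly) / perimeter ** 2 if perimeter else 0.0


def filter_building_polygons(polys: List[List[Point]], min_area: float = 50.0,
                             min_circularity: float = 0.3) -> List[List[Point]]:
    """Keep polygons that are large and compact enough (assumed thresholds)."""
    return [p for p in polys
            if polygon_area(p) >= min_area and circularity(p) >= min_circularity]


square = [(0.0, 0.0), (10.0, 0.0), (10.0, 10.0), (0.0, 10.0)]  # compact, 100 m^2
strip = [(0.0, 0.0), (100.0, 0.0), (100.0, 1.0), (0.0, 1.0)]   # long and thin
small = [(0.0, 0.0), (2.0, 0.0), (2.0, 2.0), (0.0, 2.0)]       # too small
kept = filter_building_polygons([square, strip, small])
print(len(kept))
```

The thin strip has the same area as the square but a circularity of about 0.03, so it is rejected by the circularity constraint rather than the area constraint.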

Texture Analysis Algorithm and its Application to Leather Automatic Classification Inspection System (텍스처 분석 알고리즘과 피혁 자동 선별 시스템에의 응용)

  • 김명재;이명수;권장우;김광섭;길경석
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2001.10a / pp.363-366 / 2001
  • The current process of grading leather quality by the naked eye is not reliable, because eye strain over long periods makes grading inconsistent and can produce incorrect results. It is therefore necessary to automate leather grading on the basis of an objective standard. In this paper, the automatic leather classification system consists of a process that obtains information about the leather and a process that grades leather quality from that information. Leather is graded by information such as texture density and the types and distribution of defects. This paper proposes an algorithm that extracts leather information such as texture density and defects from gray-level images obtained with a digital camera. The density information is extracted from the distribution of the Fourier spectrum obtained after the original image is converted to the frequency domain. The defect information is obtained from the statistics of the pixels within a search window, after boundary lines are extracted from the preprocessed images. The information for the entire piece of leather is used as the standard for grading leather quality, and the proposed algorithm is applied in practice to a machine vision system.

Automatic Extraction of Fractures and Their Characteristics in Rock Masses by LIDAR System and the Split-FX Software (LIDAR와 Split-FX 소프트웨어를 이용한 암반 절리면의 자동추출과 절리의 특성 분석)

  • Kim, Chee-Hwan;Kemeny, John
    • Tunnel and Underground Space / v.19 no.1 / pp.1-10 / 2009
  • Site characterization for structural stability in rock masses mainly involves collecting joint property data, and in current practice much of this data is collected by hand directly at exposed slopes and outcrops. Collecting this data in the field raises many issues, including safety, slope access, field time, insufficient data quantity, reusability of data, and human bias. This paper shows that information on joint orientation, spacing, and roughness in rock masses can be automatically extracted from LIDAR (light detection and ranging) point clouds using the currently available Split-FX point cloud processing software, thereby reducing processing time and mitigating safety and human-bias issues.

The Facial Area Extraction Using Multi-Channel Skin Color Model and The Facial Recognition Using Efficient Feature Vectors (Multi-Channel 피부색 모델을 이용한 얼굴영역추출과 효율적인 특징벡터를 이용한 얼굴 인식)

  • Choi Gwang-Mi;Kim Hyeong-Gyun
    • Journal of the Korea Institute of Information and Communication Engineering / v.9 no.7 / pp.1513-1517 / 2005
  • In this paper, I use a Multi-Channel skin color model based on Hue, Cb, and Cg to extract the facial area. The model draws on the Red, Green, and Blue channels together and removes the brightness component, in consideration of the characteristics of skin color, so that facial skin color is modeled more effectively. I used an efficient HOLA (higher-order local autocorrelation function) with 26 feature vectors to obtain feature vectors from both the facial area and the edge image extracted with the Haar wavelet from the segmented facial area. The calculated feature vectors are used as data for facial recognition through neural network learning. Simulation demonstrates that the proposed algorithm improves both the recognition rate and the speed.