• Title/Summary/Keyword: 자동정보 추출

Search Result 1,995, Processing Time 0.032 seconds

Boolean Query Formulation From Korean Natural Language Queries using Syntactic Analysis (구문분석에 기반한 한글 자연어 질의로부터의 불리언 질의 생성)

  • Park, Mi-Hwa;Won, Hyeong-Seok;Lee, Geun-Bae
    • Journal of KIISE:Software and Applications
    • /
    • v.26 no.10
    • /
    • pp.1219-1229
    • /
    • 1999
  • 일반적으로 AND, OR, NOT과 같은 연산자를 사용하는 불리언 질의는 사용자의 검색의도를 정확하게 표현할 수 있기 때문에 검색 전문가들은 불리언 질의를 사용하여 높은 검색성능을 얻는다고 알려져 있지만, 일반 사용자는 자신이 원하는 정보를 불리언 형태로 표현하는데 익숙하지 않다. 본 논문에서는 검색성능의 향상과 사용자 편의성을 동시에 만족하기 위하여 사용자의 자연어 질의를 확장 불리언 질의로 자동 변환하는 방법론을 제안한다. 먼저 자연어 질의를 범주문법에 기반한 구문분석을 수행하여 구문트리를 생성하고 연산자 및 키워드 정보를 추출하여 구문트리를 간략화한다. 다음으로 간략화된 구문트리로부터 명사구를 합성하고 키워드들에 대한 가중치를 부여한 후 불리언 질의를 생성하여 검색을 수행한다. 또한 구문분석의 오류로 인한 검색성능 저하를 최소화하기 위하여 상위 N개 구문트리에 대해 각각 불리언 질의를 생성하여 검색하는 N-BEST average 방법을 제안하였다. 정보검색 실험용 데이타 모음인 KTSET2.0으로 실험한 결과 제안된 방법은 수동으로 추출한 불리언 질의보다 8% 더 우수한 성능을 보였고, 기존의 벡터공간 모델에 기반한 자연어질의 시스템에 비해 23% 성능향상을 보였다. Abstract There have been a considerable evidence that trained users can achieve a good search effectiveness through a boolean query because a structural boolean query containing operators such as AND, OR, and NOT can make a more accurate representation of user's information need. However, it is not easy for ordinary users to construct a boolean query using appropriate boolean operators. In this paper, we propose a boolean query formulation method that automatically transforms a user's natural language query into a extended boolean query for both effectiveness and user convenience. First, a user's natural language query is syntactically analyzed using KCCG(Korean Combinatory Categorial Grammar) parser and resulting syntactic trees are structurally simplified using a tree-simplifying mechanism in order to catch the logical relationships between keywords. Next, in a simplified tree, plausible noun phrases are identified and added into the same tree as new additional keywords. Finally, a simplified syntactic tree is automatically converted into a boolean query using some mapping rules and linguistic heuristics. We also propose an N-BEST average method that uses top N syntactic trees to compensate for bad effects of single incorrect top syntactic tree. In experiments using KTSET2.0, we showed that a proposed method outperformed a traditional vector space model by 23%, and surprisingly manually constructed boolean queries by 8%.

A Study on Optical Condition and preprocessing for Input Image Improvement of Dented and Raised Characters of Rubber Tires (고무타이어 문자열 입력영상 개선을 위한 전처리와 광학조건에 관한 연구)

  • 류한성;최중경;권정혁;구본민;박무열
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.6 no.1
    • /
    • pp.124-132
    • /
    • 2002
  • In this paper, we present a vision algorithm and method for input image improvement and preprocessing of dented and raised characters on the sidewall of tires. we define optical condition between reflect coefficient and reflectance by the physical vector calculate. On the contrary this work will recognize the engraved characters using the computer vision technique. Tire input images have all most same grey levels between the characters and backgrounds. The reflectance is little from a tire surface. therefore, it's very difficult segment the characters from the background. Moreover, one side of the character string is raised and the other is dented. So, the captured images are varied with the angle of camera and illumination. For optimum Input images, the angle between camera and illumination was found out to be with in 90$^{\circ}$. In addition, We used complex filtering with low-pass and high-pass band filters to improve input images, for clear input images. Finally we define equation reflect coefficient and reflectance. By doing this, we obtained good images of tires for pattern recognition.

Escape Route Prediction and Tracking System using Artificial Intelligence (인공지능을 활용한 도주경로 예측 및 추적 시스템)

  • Yang, Bum-suk;Park, Dea-woo
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.225-227
    • /
    • 2022
  • Now In Seoul, about 75,000 CCTVs are installed in 25 district offices. Each ward office in Seoul has built a control center for CCTV control and is building information such as people, vehicle types, license plate recognition and color classification into big data through 24-hour artificial intelligence intelligent image analysis. Seoul Metropolitan Government has signed MOUs with the Ministry of Land, Infrastructure and Transport, the National Police Agency, the Fire Service, the Ministry of Justice, and the military base to enable rapid response to emergency/emergency situations. In other words, we are building a smart city that is safe and can prevent disasters by providing CCTV images of each ward office. In this paper, the CCTV image is designed to extract the characteristics of the vehicle and personnel when an incident occurs through artificial intelligence, and based on this, predict the escape route and enable continuous tracking. It is designed so that the AI automatically selects and displays the CCTV image of the route. It is designed to expand the smart city integration platform by providing image information and extracted information to the adjacent ward office when the escape route of a person or vehicle related to an incident is expected to an area other than the relevant jurisdiction. This paper will contribute as basic data to the development of smart city integrated platform research.

  • PDF

Service Identification Method for Encrypted Traffic Based on SSL/TLS (SSL/TLS 기반 암호화 트래픽의 서비스 식별 방법)

  • Kim, Sung-Min;Park, Jun-Sang;Yoon, Sung-Ho;Kim, Jong-Hyun;Choi, Sun-Oh;Kim, Myung-Sup
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.40 no.11
    • /
    • pp.2160-2168
    • /
    • 2015
  • The SSL/TLS, one of the most popular encryption protocol, was developed as a solution of various network security problem while the network traffic has become complex and diverse. But the SSL/TLS traffic has been identified as its protocol name, not its used services, which is required for the effective network traffic management. This paper proposes a new method to generate service signatures automatically from SSL/TLS payload data and to classify network traffic in accordance with their application services. We utilize the certificate publication information field in the certificate exchanging record of SSL/TLS traffic for the service signatures, which occurs when SSL/TLS performs Handshaking before encrypt transmission. We proved the performance and feasibility of the proposed method by experimental result that classify about 95% SSL/TLS traffic with 95% accuracy for every SSL/TLS services.

Hierarchical Neural Network for Real-time Medicine-bottle Classification (실시간 약통 분류를 위한 계층적 신경회로망)

  • Kim, Jung-Joon;Kim, Tae-Hun;Ryu, Gang-Soo;Lee, Dae-Sik;Lee, Jong-Hak;Park, Kil-Houm
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.23 no.3
    • /
    • pp.226-231
    • /
    • 2013
  • In The matching algorithm for automatic packaging of drugs is essential to determine whether the canister can exactly refill the suitable medicine. In this paper, we propose a hierarchical neural network with the upper and lower layers which can perform real-time processing and classification of many types of medicine bottles to prevent accidental medicine disaster. A few number of low-dimensional feature vector are extracted from the label images presenting medicine-bottle information. By using the extracted feature vectors, the lower layer of MLP(Multi-layer Perceptron) neural networks is learned. Then, the output of the learned middle layer of the MLP is used as the input to the upper layer of the MLP learning. The proposed hierarchical neural network shows good classification performance and real- time operation in the test of up to 30 degrees rotated to the left and right images of 100 different medicine bottles.

A Retrieval System of Environment Education Contents using Method of Automatic Annotation and Histogram (자동 주석 및 히스토그램 기법을 이용한 환경 교육 컨텐츠 검색 시스템)

  • Lee, Keun-Wang;Kim, Jin-Hyung
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.9 no.1
    • /
    • pp.114-121
    • /
    • 2008
  • In order to process video data effectively, it is required that the content information of video data is loaded in database and semantic- based retrieval method can be available for various query of users. In this paper, we propose semantic-based video retrieval system for Environment Education Contents which support semantic retrieval of various users by feature-based retrieval and annotation-based retrieval of massive video data. By user's fundamental query and selection of image for key frame that extracted form query, the agent gives the detail shape for annotation of extracted key frame. Also, key frame selected by user become query image and searches the most similar key frame through feature based retrieval method that propose. From experiment, the designed and implemented system showed high precision ratio in performance assessment more than 90 percents.

GIS Application Model for Temporal and Spatial Simulation of Surface Runoff from a small watershed (소유역 지표유출의 시간적 . 공간적 재현을 위한 GIS응용모형)

  • 정하우;김성준;최진용;김대식
    • Spatial Information Research
    • /
    • v.3 no.2
    • /
    • pp.135-146
    • /
    • 1995
  • The purpose of this study is to develop a GIS application and interface model (GISCELWAB) for the temporal and spatial simulation of surface runoff from a small watershed. The model was constituted by three sub - models : The input data extraction model (GISINDATA) which prepares cell-based input data automatically for a given watershed, the cell water balance model(CELWAB) which calculates the water balance for a cell and simulates surface runoff of watershed simultaneously by the interaction of cells, and the output data management model(GISOUTDISP) which visualize the results of temporal and spatial variation of surface runoff. The input data extraction model was developed to solve the time-consuming problems for the input-data preparation of distributed hydrologic model. The input data for CELWAB can be obtained by extracting ASCII data from a vector map. The output data management model was developed to convert the storage depth and discharge of cell into grid map. This model ean-bles to visualize the temporal and spatial formulation process of watershed storage depth and surface runoff wholly with time increment.

  • PDF

Image Analysis for Discrimination of Neoplastic Cellis in Spatial Frequency Domain (종양세포식별을 위한 공간주파수영역에서의 화상해석)

  • 나철훈;김창원;김현재
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.18 no.3
    • /
    • pp.385-396
    • /
    • 1993
  • In this paper, a improved method of digital image analysis required in basic medical science for diagnosis of cells was proposed. The object image was the thyroid gland cell image, and the purpose was automatic discrimination of three classes cells(normal cell, follicular neoplastic cells, and papillary neoplastic cells) by difference of chromatin patterns. To segment the cell nucleus from background, the region segmentation algorithm by edge tracing was proposed. And feature parameter was obtained from discrete Fourier transformation of image. After construct a feature sample group of each cells, experiment of discrimination was executed with any verification cells. As a consequency of using features proposed in this paper, get a better recognition rate(70-90%) than previously reported papers, and this method give shape to get objectivity and fixed quantity in diagnosis of cells, The methods described in this paper be used immediately for discrimination of neoplastic cells.

  • PDF

A Recognition Algorithm for Handwritten Logic Circuit Diagrams Using Neural Network (신경회로망을 이용한 손으로 작성된 논리회로 도면 인식 알고리듬)

  • Kim, Dug-Ryung;Park, Sung-Han
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.27 no.10
    • /
    • pp.68-77
    • /
    • 1990
  • In this paper, a neural patten recognition method for the automatic circuit diagram reading system is proposed. The proposed procedure to recognize a deformed logic symbols is composed of three stages: feature detection, log mapping, and pattern classification. In the feature detection stage, a modified competitive learning algorithm where each pattern has the inhibition weight as well as the activation weight is developed. The global information of hand-written logic symbols is obtained by the feature detection neural network having both the inhibition and activation weights. The obtained global data is then transformed into a log space by the conformal mapping where according to the Schwartz's theory about the human visual signal process-ing, the degree of rotation and the scale change are mapped into the translation change. Logic symbols are finally classified by a three layer perceptron trained by the error back propagation algorithm. The computer simulation demonstrates that the proposed multistage neural network system can recognize well the deformed patterns of hand-written logic circuit diagrams.

  • PDF

A Proposal of a Shape Matching and Geo-referencing method for Building Features in Construction CAD Data to Digital Map using a Vertex Attributed String Matching algorithm (VASM 알고리즘을 이용한 건축물 CAD 자료의 수치지도 건물 객체와의 형상 정합 및 지도좌표 부여 방법의 제안)

  • Huh, Yong;Yu, Ki-Yun;Kim, Hyung-Tae
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.26 no.4
    • /
    • pp.387-396
    • /
    • 2008
  • An integration between construction CAD data and GIS data needs geo-referencing processes of construction CAD data whose coordinate systems are their own native or even unknown. Generally, these processes are based on manually detected conjugate-vertices. In this study, we proposed an semi-automated conjugate -vertices detection method for building features between construction CAD data and a digital map using a vertex attributed string matching algorithm. A geo-referencing function for construction CAD data based on the similarity transform could be derived with those conjugate-vertices. Using our proposed method, we overlaid geo-referenced CAD data to a digital map of the College of Engineering, Seoul National University and evaluated our method.