• Title/Summary/Keyword: Automatic Information Extraction

Search Result 592, Processing Time 0.026 seconds

SEMI-AUTOMATIC 3D BUILDING EXTRACTION FROM HIGH RESOLUTION SATELLITE IMAGES

  • Javzandulam, Tsend-Ayush;Rhee, Soo-Ahm;Kim, Tae-Jung;Kim, Kyung-Ok
    • Proceedings of the KSRS Conference
    • /
    • v.2
    • /
    • pp.606-609
    • /
    • 2006
  • Extraction of building is one of essential issues for the 3D city models generation. In recent years, high-resolution satellite imagery has become widely available, and this shows an opportunity for the urban mapping. In this paper, we have developed a semi-automatic algorithm to extract 3D buildings in urban settlements areas from high-spatial resolution panchromatic imagery. The proposed algorithm determines building height interactively by projecting shadow regions for a given building height onto image space and by adjusting the building height until the shadow region and actual shadow in the image match. Proposed algorithm is tested with IKONOS images over Deajeon city and the algorithm showed promising results.┌阀؀䭏佈䉌ᔀ鳪떭臬隑駭验耀

  • PDF

Dynamic Expansion of Semantic Dictionary for Topic Extraction in Automatic Summarization (자동요약의 주제어 추출을 위한 의미사전의 동적 확장)

  • Choo, Kyo-Nam;Woo, Yo-Seob
    • Journal of IKEEE
    • /
    • v.13 no.2
    • /
    • pp.241-247
    • /
    • 2009
  • This paper suggests the expansion methods of semantic dictionary, taking Korean semantic features account. These methods will be used to extract a practical topic word in the automatic summarization. The first is the method which is constructed the synonym dictionary for improving the performance of semantic-marker analysis. The second is the method which is extracted the probabilistic information from the subcategorization dictionary for resolving the syntactic and semantic ambiguity. The third is the method which is predicted the subcategorization patterns of the unregistered predicate, for the resolution of an affix-derived predicate.

  • PDF

Automatic Payload Signature Update System for the Classification of Dynamically Changing Internet Applications

  • Shim, Kyu-Seok;Goo, Young-Hoon;Lee, Dongcheul;Kim, Myung-Sup
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.3
    • /
    • pp.1284-1297
    • /
    • 2019
  • The network environment is presently becoming very increased. Accordingly, the study of traffic classification for network management is becoming difficult. Automatic signature extraction system is a hot topic in the field of traffic classification research. However, existing automatic payload signature generation systems suffer problems such as semi-automatic system, generating of disposable signatures, generating of false-positive signatures and signatures are not kept up to date. Therefore, we provide a fully automatic signature update system that automatically performs all the processes, such as traffic collection, signature generation, signature management and signature verification. The step of traffic collection automatically collects ground-truth traffic through the traffic measurement agent (TMA) and traffic management server (TMS). The step of signature management removes unnecessary signatures. The step of signature generation generates new signatures. Finally, the step of signature verification removes the false-positive signatures. The proposed system can solve the problems of existing systems. The result of this system to a campus network showed that, in the case of four applications, high recall values and low false-positive rates can be maintained.

Design of Arrhythmia Automatic Diagnostic System Using Decision Table (판정테이블을 이용한 부정맥 자동진단 시스템 설계에 관한 연구)

  • 정기삼;이재준
    • Journal of Biomedical Engineering Research
    • /
    • v.12 no.1
    • /
    • pp.63-70
    • /
    • 1991
  • Design of Arrhythmia Automatic Diagnostic System Using Decision Table We have developed an arrhythmia automatic diagnostic system using decision table which is based on the criteria of Minnesota code. This system is divided into two Parts. One is wave detection algorithm using significant point extraction method, the other is arrhythmia diag- nostic algorthm. The proposed system allows physicians to diagnose more accurately by pro- viding the objective information about a lot of computer -processed ECG data.

  • PDF

A Distance Approach for Open Information Extraction Based on Word Vector

  • Liu, Peiqian;Wang, Xiaojie
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.6
    • /
    • pp.2470-2491
    • /
    • 2018
  • Web-scale open information extraction (Open IE) plays an important role in NLP tasks like acquiring common-sense knowledge, learning selectional preferences and automatic text understanding. A large number of Open IE approaches have been proposed in the last decade, and the majority of these approaches are based on supervised learning or dependency parsing. In this paper, we present a novel method for web scale open information extraction, which employs cosine distance based on Google word vector as the confidence score of the extraction. The proposed method is a purely unsupervised learning algorithm without requiring any hand-labeled training data or dependency parse features. We also present the mathematically rigorous proof for the new method with Bayes Inference and Artificial Neural Network theory. It turns out that the proposed algorithm is equivalent to Maximum Likelihood Estimation of the joint probability distribution over the elements of the candidate extraction. The proof itself also theoretically suggests a typical usage of word vector for other NLP tasks. Experiments show that the distance-based method leads to further improvements over the newly presented Open IE systems on three benchmark datasets, in terms of effectiveness and efficiency.

Feature Extraction of Non-proliferative Diabetic Retinopathy Using Faster R-CNN and Automatic Severity Classification System Using Random Forest Method

  • Jung, Younghoon;Kim, Daewon
    • Journal of Information Processing Systems
    • /
    • v.18 no.5
    • /
    • pp.599-613
    • /
    • 2022
  • Non-proliferative diabetic retinopathy is a representative complication of diabetic patients and is known to be a major cause of impaired vision and blindness. There has been ongoing research on automatic detection of diabetic retinopathy, however, there is also a growing need for research on an automatic severity classification system. This study proposes an automatic detection system for pathological symptoms of diabetic retinopathy such as microaneurysms, retinal hemorrhage, and hard exudate by applying the Faster R-CNN technique. An automatic severity classification system was devised by training and testing a Random Forest classifier based on the data obtained through preprocessing of detected features. An experiment of classifying 228 test fundus images with the proposed classification system showed 97.8% accuracy.

A Protein-Protein Interaction Extraction Approach Based on Large Pre-trained Language Model and Adversarial Training

  • Tang, Zhan;Guo, Xuchao;Bai, Zhao;Diao, Lei;Lu, Shuhan;Li, Lin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.3
    • /
    • pp.771-791
    • /
    • 2022
  • Protein-protein interaction (PPI) extraction from original text is important for revealing the molecular mechanism of biological processes. With the rapid growth of biomedical literature, manually extracting PPI has become more time-consuming and laborious. Therefore, the automatic PPI extraction from the raw literature through natural language processing technology has attracted the attention of the majority of researchers. We propose a PPI extraction model based on the large pre-trained language model and adversarial training. It enhances the learning of semantic and syntactic features using BioBERT pre-trained weights, which are built on large-scale domain corpora, and adversarial perturbations are applied to the embedding layer to improve the robustness of the model. Experimental results showed that the proposed model achieved the highest F1 scores (83.93% and 90.31%) on two corpora with large sample sizes, namely, AIMed and BioInfer, respectively, compared with the previous method. It also achieved comparable performance on three corpora with small sample sizes, namely, HPRD50, IEPA, and LLL.

An Automatic Extraction of Blood Flow Contour from Cardiac MRI (심장 MRI 영상에서 혈류 윤곽선의 자동 추출)

  • Lee, Hyeong-Jik;Jo, Sang-Hyeon;Choe, Heung-Mun
    • Journal of the Institute of Electronics Engineers of Korea SC
    • /
    • v.37 no.5
    • /
    • pp.56-62
    • /
    • 2000
  • In this paper, an automatic extraction of the blood flow contour from cardiac MRI is proposed. By using the GVF snake which has wider capture range than the conventional snake, and by automatically generating the initial points along the outside of the contour of the zero GVF field in the edge image of the cardiac MRI, the blood flow contour can be automatically extracted, even when the contours have boundary concavities due to the papillary muscles, without any manual initialization of the experts. Experiments are conducted on the various real cardiac MRIs including noise and papillary muscles, and the proposed method is proved to be efficient in automatic extraction of the blood contours even if they have the boundary concavities.

  • PDF

Measurement Criteria for Ontology Extraction Tools (온톨로지 자동추출도구의 기능적 성능 평가를 위한 평가지표의 개발 및 적용)

  • Park, Jin-Soo;Cho, Won-Chin;Rho, Sang-Kyu
    • Journal of Intelligence and Information Systems
    • /
    • v.14 no.4
    • /
    • pp.69-87
    • /
    • 2008
  • The Web is evolving toward the Semantic Web. Ontologies are considered as a crucial component of the Semantic Web since it is the backbone of knowledge representation for this Web. However, most of these ontologies are still built manually. Manual building of an ontology is time-consuming activity which requires many resources. Consequently, the need for automatic ontology extraction tools has been increased for the last decade, and many tools have been developed for this purpose. Yet, there is no comprehensive framework for evaluating such tools. In this paper, we proposed a set of criteria for evaluating ontology extraction tools and carried out an experiment on four popular ontology extraction tools (i.e., OntoLT, Text-To-Onto, TERMINAE, and OntoBuilder) using our proposed evaluation framework. The proposed framework can be applied as a useful benchmark when developers want to develop ontology extraction tools.

  • PDF

Theory and Practice of Automatic Indexing (자동색인의 이론과 실제)

    • Journal of Korean Library and Information Science Society
    • /
    • v.30 no.3
    • /
    • pp.27-51
    • /
    • 1999
  • This paper deals with the methods as well as the problems associated with automatic extraction indexing and assignment indexing, expert systems for indexing, and major approaches currently used to index the Internet resources. It also briefly reviews basic methods for establishing hypertext/hypermedia links automatically. The methods used in much of text processing today are not particularly new. Most of the them were used, perhaps in a more rudimentary form, 30 or more years ago by Luhn and many other investigators. Better results can be achieved today because much greater bodies of electronic text are now avaliable and the power of present-day computers allows the processing of such text with reasonable efficiency.

  • PDF