• Title/Summary/Keyword: Automatic Data Extraction

Search Result 310, Processing Time 0.029 seconds

An Algorithm for Translation from RDB Schema Model to XML Schema Model Considering Implicit Referential Integrity (묵시적 참조 무결성을 고려한 관계형 스키마 모델의 XML 스키마 모델 변환 알고리즘)

  • Kim, Jin-Hyung;Jeong, Dong-Won;Baik, Doo-Kwon
    • Journal of KIISE:Databases
    • /
    • v.33 no.5
    • /
    • pp.526-537
    • /
    • 2006
  • The most representative approach for efficient storing of XML data is to store XML data in relational databases. The merit of this approach is that it can easily accept the realistic status that most data are still stored in relational databases. This approach needs to convert XML data into relational data or relational data into XML data. The most important issue in the translation is to reflect structural and semantic relations of RDB to XML schema model exactly. Many studies have been done to resolve the issue, but those methods have several problems: Not cover structural semantics or just support explicit referential integrity relations. In this paper, we propose an algorithm for extracting implicit referential integrities automatically. We also design and implement the suggested algorithm, and execute comparative evaluations using translated XML documents. The proposed algorithm provides several good points such as improving semantic information extraction and conversion, securing sufficient referential integrity of the target databases, and so on. By using the suggested algorithm, we can guarantee not only explicit referential integrities but also implicit referential integrities of the initial relational schema model completely. That is, we can create more exact XML schema model through the suggested algorithm.

Automation of Information Extraction from IFC-BIM for Indoor Air Quality Certification (IFC-BIM을 활용한 실내공기질 인증 요구정보 생성 자동화)

  • Hong, Simheee;Yeo, Changjae;Yu, Jungho
    • Korean Journal of Construction Engineering and Management
    • /
    • v.18 no.3
    • /
    • pp.63-73
    • /
    • 2017
  • In contemporary society, it is increasingly common to spend more time indoors. As such, there is a continually growing desire to build comfortable and safe indoor environments. Along with this trend, however, there are some serious indoor-environment challenges, such as the quality of indoor air and Sick House Syndrome. To address these concerns the government implements various systems to supervise and manage indoor environments. For example, green building certification is now compulsory for public buildings. There are three categories of green building certification related to indoor air in Korea: Health-Friendly Housing Construction Standards, Green Standard for Energy & Environmental Design(G-SEED), and Indoor Air Certification. The first two types of certification, Health-Friendly Housing Construction Standards and G-SEED, evaluate data in a drawing plan. In comparison, the Indoor Air Certification evaluates measured data. The certification using data from a drawing requires a considerable amount of time compared to other work. A 2D tool needs to be employed to measure the area manually. Thus, this study proposes an automatic assessment process using a Building Information Modeling(BIM) model based on 3D data. This process, using open source Industry Foundation Classes(IFC), exports data for the certification system, and extracts the data to create an Excel sheet for the certification. This is expected to improve the work process and reduce the workload associated with evaluating indoor air conditions.

Automatic Extraction of Individual Tree Height in Mountainous Forest Using Airborne Lidar Data (항공 Lidar 데이터를 이용한 산림지역의 개체목 자동 인식 및 수고 추출)

  • Woo, Choong-Shik;Yoon, Jong-Suk;Shin, Jung-Il;Lee, Kyu-Sung
    • Journal of Korean Society of Forest Science
    • /
    • v.96 no.3
    • /
    • pp.251-258
    • /
    • 2007
  • Airborne Lidar (light detection and ranging) can be an effective alternative in forest inventory to overcome the limitations of conventional field survey and aerial photo interpretation. In this study, we attempt to develop methodologies to identify individual trees and to estimate tree height from airborne Lidar data. Initially, digital elevation model (DEM) data representing the exact ground surface were generated by removing non-ground returns from the multiple-return laser point clouds, obtained over the coniferous forest site of rugged terrain. Based on the canopy height model (CHM) data representing non-ground layer, individual tree heights are extracted through pseudo-grid method and moving window filtering algorithm. Comparing with field survey data and aerial photo interpretation on sample plots, the number of trees extracted from Lidar data show over 90% accuracy and tree heights were underestimated within 1.1m in average at two plantation stands of pine (Pinus koraiensis) and larch (Larix leptolepis).

Temporal Classification Method for Forecasting Power Load Patterns From AMR Data

  • Lee, Heon-Gyu;Shin, Jin-Ho;Park, Hong-Kyu;Kim, Young-Il;Lee, Bong-Jae;Ryu, Keun-Ho
    • Korean Journal of Remote Sensing
    • /
    • v.23 no.5
    • /
    • pp.393-400
    • /
    • 2007
  • We present in this paper a novel power load prediction method using temporal pattern mining from AMR(Automatic Meter Reading) data. Since the power load patterns have time-varying characteristic and very different patterns according to the hour, time, day and week and so on, it gives rise to the uninformative results if only traditional data mining is used. Also, research on data mining for analyzing electric load patterns focused on cluster analysis and classification methods. However despite the usefulness of rules that include temporal dimension and the fact that the AMR data has temporal attribute, the above methods were limited in static pattern extraction and did not consider temporal attributes. Therefore, we propose a new classification method for predicting power load patterns. The main tasks include clustering method and temporal classification method. Cluster analysis is used to create load pattern classes and the representative load profiles for each class. Next, the classification method uses representative load profiles to build a classifier able to assign different load patterns to the existing classes. The proposed classification method is the Calendar-based temporal mining and it discovers electric load patterns in multiple time granularities. Lastly, we show that the proposed method used AMR data and discovered more interest patterns.

A Study on BIM Implementation Process Model through Importing Vertex Coordinate Data for Customized Curtain Wall Panel - Focusing on importing Vertex Coordinate data to Revit from Rhino - (맞춤형 커튼월 패널의 꼭짓점 좌표데이터 전이를 통한 BIM 형태 구축 프로세스 모델 연구 - 라이노에서 레빗으로의 좌표데이터 전이를 중심으로 -)

  • Ko, Sung Hak
    • Journal of the Architectural Institute of Korea Planning & Design
    • /
    • v.35 no.11
    • /
    • pp.69-78
    • /
    • 2019
  • The purpose of this study is to propose a modeling methodology through the exchange of coordinate data of a three-dimensional custom curtain wall panel between Rhino and Revit, and to examine the validity of the model implemented in the drawing. Although the modeling means and method are different, a fundamental principle is that all three-dimensional modeling begins by defining the position of the points, the most primitive element of geometry, in the XYZ coordinate space. For the BIM modeling methodology proposal based on this geometry basic concept, the functions and characteristics associated with the points of Rhino and Revit programs are identified, and then BIM implementation process model is organized and systemized through the setting of the interoperability process algorithm. The BIM implementation process model proposed in this study is (1) Modeling and panelizing surface into individual panels using Rhino and Grasshopper; (2) Extraction of vertex coordinate data from individual panels and create CSV file; (3) Curtain wall modeling through Adaptive Component Family in Revit and (4) Automatic creation of Revit curtain wall panels through API. The proposed process model is expected to help reduce design errors and improve component and construction quality by automatically converting general elements into architectural meaningful information, automating a set of processes that build them into BIM data, and enabling consistent and integrated design management.

Automatic Extraction of Buildings using Aerial Photo and Airborne LIDAR Data (항공사진과 항공레이저 데이터를 이용한 건물 자동추출)

  • 조우석;이영진;좌윤석
    • Korean Journal of Remote Sensing
    • /
    • v.19 no.4
    • /
    • pp.307-317
    • /
    • 2003
  • This paper presents an algorithm that automatically extracts buildings among many different features on the earth surface by fusing LIDAR data with panchromatic aerial images. The proposed algorithm consists of three stages such as point level process, polygon level process, parameter space level process. At the first stage, we eliminate gross errors and apply a local maxima filter to detect building candidate points from the raw laser scanning data. After then, a grouping procedure is performed for segmenting raw LIDAR data and the segmented LIDAR data is polygonized by the encasing polygon algorithm developed in the research. At the second stage, we eliminate non-building polygons using several constraints such as area and circularity. At the last stage, all the polygons generated at the second stage are projected onto the aerial stereo images through collinearity condition equations. Finally, we fuse the projected encasing polygons with edges detected by image processing for refining the building segments. The experimental results showed that the RMSEs of building corners in X, Y and Z were 8.1cm, 24.7cm, 35.9cm, respectively.

The Design and Implementation of OWL Ontology Construction System through Information Extraction of Unstructured Documents (비정형 문서의 정보추출을 통한 OWL 온톨로지 구축 시스템의 설계 및 구현)

  • Jo, Dae Woong;Choi, Ji Woong;Kim, Myung Ho
    • Journal of the Korea Society of Computer and Information
    • /
    • v.19 no.10
    • /
    • pp.23-33
    • /
    • 2014
  • The development of the information retrieval field is evolving to the research field searching accurately for the information from thing finding rapidly a large amount of information. Personalization and the semantic web technology is a key technology. The automatic indexing technology about the web document and throughput go beyond the research stage and show up as the practical service. However, there is a lack of research on the document information retrieval field about the attached document type of except the web document. In this paper, we illustrate about the method in which it analyzed the text content of the unstructured documents prepared in the text, word, hwp form and it how to construction OWL ontology. To build TBox of the document ontology and the resources which can be obtained from the document is selected, and we implement with the system in order to utilize as the instant of the constructed document ontology. It is effectually usable in the information retrieval and document management system using the semantic technology of the correspondence document as the ontology automatic construction of this kind of the unstructured documents.

Automatic Leather Quality Inspection and Grading System by Leather Texture Analysis (텍스쳐 분석에 의한 피혁 등급 판정 및 자동 선별시스템에의 응용)

  • 권장우;김명재;길경석
    • Journal of Korea Multimedia Society
    • /
    • v.7 no.4
    • /
    • pp.451-458
    • /
    • 2004
  • A leather quality inspection by naked eyes has known as unreliable because of its biological characteristics like accumulated fatigue caused from an optical illusion and biological phenomenon. Therefore it is necessary to automate the leather quality inspection by computer vision technique. In this paper, we present automatic leather qua1ity classification system get information from leather surface. Leather is usually graded by its information such as texture density, types and distribution of defects. The presented algorithm explain how we analyze leather information like texture density and defects from the gray-level images obtained by digital camera. The density data is computed by its ratio of distribution area, width, and height of Fourier spectrum magnitude. And the defect information of leather surface can be obtained by histogram distribution of pixels which is Windowed from preprocessed images. The information for entire leather could be a standard for grading leather quality. The proposed leather inspection system using machine vision can also be applied to another field to substitute human eye inspection.

  • PDF

Topic Automatic Extraction Model based on Unstructured Security Intelligence Report (비정형 보안 인텔리전스 보고서 기반 토픽 자동 추출 모델)

  • Hur, YunA;Lee, Chanhee;Kim, Gyeongmin;Lim, HeuiSeok
    • Journal of the Korea Convergence Society
    • /
    • v.10 no.6
    • /
    • pp.33-39
    • /
    • 2019
  • As cyber attack methods are becoming more intelligent, incidents such as security breaches and international crimes are increasing. In order to predict and respond to these cyber attacks, the characteristics, methods, and types of attack techniques should be identified. To this end, many security companies are publishing security intelligence reports to quickly identify various attack patterns and prevent further damage. However, the reports that each company distributes are not structured, yet, the number of published intelligence reports are ever-increasing. In this paper, we propose a method to extract structured data from unstructured security intelligence reports. We also propose an automatic intelligence report analysis system that divides a large volume of reports into sub-groups based on their topics, making the report analysis process more effective and efficient.

A Study on Implementation of 4D and 5D Support Algorithm Using BIM Attribute Information - Focused on Process Simulation and Quantity Calculation - (BIM 속성정보를 활용한 4D, 5D 설계 지원 알고리즘 구현 및 검증에 관한 연구 - 공정시뮬레이션과 물량산출을 중심으로 -)

  • Jeong, Jae-Won;Seo, Ji-Hyo;Park, Hye-Jin;Choo, Seung-Yeon
    • Journal of the Regional Association of Architectural Institute of Korea
    • /
    • v.21 no.4
    • /
    • pp.15-26
    • /
    • 2019
  • In recent years, researchers are increasingly trying to use BIM-based 3D models for BIM nD design such as 4D (3D + Time) and 5D (4D + Cost). However, there are still many problems in efficiently using process management based on the BIM information created at each design stage. Therefore, this study proposes a method to automate 4D and 5D design support in each design stage by using BIM-based Dynamo algorithm. To do this, I implemented an algorithm that can automatically input the process information needed for 4D and 5D by using Revit's Add-in program, Dynamo. In order to support the 4D design, the algorithm was created to enable automatic process simulation by synchronizing process simulation information (Excel file) through the Navisworks program, BIM software. The algorithm was created to automatically enable process simulation. And to support the 5D design, the algorithm was developed to enable automatic extraction of the information needed for mass production from the BIM model by utilizing the dynamo algorithm. Therefore, in order to verify the 4D and 5D design support algorithms, we verified the applicability through consultation with related workers and experts. As a result, it has been demonstrated that it is possible to manage information about process information and to quickly extract information from design and design changes. In addition, BIM data can be used to manage and input the necessary process information in 4D and 5D, which is advantageous for shortening construction time and cost. This study will make it easy to improve design quality and manage design information, and will be the foundation for future building automation research.