• Title/Summary/Keyword: Data-based model

Search Result 20,759, Processing Time 0.044 seconds

A Novel Data Prediction Model using Data Weights and Neural Network based on R for Meaning Analysis between Data (데이터간 의미 분석을 위한 R기반의 데이터 가중치 및 신경망기반의 데이터 예측 모형에 관한 연구)

  • Jung, Se Hoon;Kim, Jong Chan;Sim, Chun Bo
    • Journal of Korea Multimedia Society
    • /
    • v.18 no.4
    • /
    • pp.524-532
    • /
    • 2015
  • All data created in BigData times is included potentially meaning and correlation in data. A variety of data during a day in all society sectors has become created and stored. Research areas in analysis and grasp meaning between data is proceeding briskly. Especially, accuracy of meaning prediction and data imbalance problem between data for analysis is part in course of something important in data analysis field. In this paper, we proposed data prediction model based on data weights and neural network using R for meaning analysis between data. Proposed data prediction model is composed of classification model and analysis model. Classification model is working as weights application of normal distribution and optimum independent variable selection of multiple regression analysis. Analysis model role is increased prediction accuracy of output variable through neural network. Performance evaluation result, we were confirmed superiority of prediction model so that performance of result prediction through primitive data was measured 87.475% by proposed data prediction model.

Designing and Implementing XML DBMS based on Generic Data Model (Generic Data Model 기반의 XML DBMS 설계 및 구현)

  • 임종선;주경수
    • The Journal of Society for e-Business Studies
    • /
    • v.8 no.1
    • /
    • pp.103-111
    • /
    • 2003
  • Nowadays XML is used for exchanging information in e-Commerce, especially B2B. Necessity of XML DBMS has being increased to efficiently process XML data. So a lots of database products for supporting XML are rapidly appeared in the market. In this paper, we made an XML DBMS system based on Generic Data Model. First we developed XML Adaptor based on Generic Data Model and added it on relational DBMS for developing XML DBMS. XML Adaptor is composed of Query Convertor and XML Repository System. The Query Convertor parse commands that are for XML data manipulation and then call the relevant component of XML Repository System for relational database operation. The XML Repository System handles relational database operations such as create, delete, store, and etc. In this way we can use a relational DBMS for manipulation XML data. Therefor we can build more economically XML DBMS.

  • PDF

N-point modified exponential model for household projections in Korea using multi-point register-based census data

  • Saebom Jeon;Tae Yeon Kwon
    • Communications for Statistical Applications and Methods
    • /
    • v.31 no.4
    • /
    • pp.377-391
    • /
    • 2024
  • Accurate household projections are essential for sectors such as housing supply and tax policy planning, given the rapid social changes like declining birthrates, an aging population, and a rise in single-person households that impact household size and type. Korea introduced its first register-based census in 2015, transitioning from five-year general survey-based approach to an annual administrative data-based census. This change in census allows for more frequent and effective capturing the rapid demographic shifts and trends. However, this change in census has caused challenges in future projection by the existing household projection model due to the rapid dynamics. This paper proposes a new household projection method, the N-point Modified Exponential Model (MEM), that accurately reflects register-based census data and mitigates the impact of rapid demographic changes, in three types: the Weighted N-point MEM, the Regression-based N-point MEM, and the Rolling Weighted N+point MEM. Using register-based census data from 2016 to 2020 to forecast household headship rates by age, household size, and household type to 2051, the N-point modified exponential model outperformed the existing model in both long- and short-term forecast accuracy, suggesting its suitability as a future household projection model for Korea.

Relationships Between the Characteristics of the Business Data Set and Forecasting Accuracy of Prediction models (시계열 데이터의 성격과 예측 모델의 예측력에 관한 연구)

  • 이원하;최종욱
    • Journal of Intelligence and Information Systems
    • /
    • v.4 no.1
    • /
    • pp.133-147
    • /
    • 1998
  • Recently, many researchers have been involved in finding deterministic equations which can accurately predict future event, based on chaotic theory, or fractal theory. The theory says that some events which seem very random but internally deterministic can be accurately predicted by fractal equations. In contrast to the conventional methods, such as AR model, MA, model, or ARIMA model, the fractal equation attempts to discover a deterministic order inherent in time series data set. In discovering deterministic order, researchers have found that neural networks are much more effective than the conventional statistical models. Even though prediction accuracy of the network can be different depending on the topological structure and modification of the algorithms, many researchers asserted that the neural network systems outperforms other systems, because of non-linear behaviour of the network models, mechanisms of massive parallel processing, generalization capability based on adaptive learning. However, recent survey shows that prediction accuracy of the forecasting models can be determined by the model structure and data structures. In the experiments based on actual economic data sets, it was found that the prediction accuracy of the neural network model is similar to the performance level of the conventional forecasting model. Especially, for the data set which is deterministically chaotic, the AR model, a conventional statistical model, was not significantly different from the MLP model, a neural network model. This result shows that the forecasting model. This result shows that the forecasting model a, pp.opriate to a prediction task should be selected based on characteristics of the time series data set. Analysis of the characteristics of the data set was performed by fractal analysis, measurement of Hurst index, and measurement of Lyapunov exponents. As a conclusion, a significant difference was not found in forecasting future events for the time series data which is deterministically chaotic, between a conventional forecasting model and a typical neural network model.

  • PDF

Basic Study on Safety Accident Prediction Model Using Random Forest in Construction Field (랜덤 포레스트 기법을 이용한 건설현장 안전재해 예측 모형 기초 연구)

  • Kang, Kyung-Su;Ryu, Han-Guk
    • Proceedings of the Korean Institute of Building Construction Conference
    • /
    • 2018.11a
    • /
    • pp.59-60
    • /
    • 2018
  • The purpose of this study is to predict and classify the accident types based on the KOSHA (Korea Occupational Safety & Health Agency) and weather data. We also have an effort to suggest an important management method according to accident types by deriving feature importance. We designed two models based on accident data and weather data (model(a)) and only weather data (model(b)). As a result of random forest method, the model(b) showed a lack of accuracy in prediction. However, the model(a) presented more accurate prediction results than the model(b). Thus we presented safety management plan based on the results. In the future, this study will continue to carry out real time prediction to occurrence types to prevent safety accidents by supplementing the real time accident data and weather data.

  • PDF

Design of Low Complexity Human Anxiety Classification Model based on Machine Learning (기계학습 기반 저 복잡도 긴장 상태 분류 모델)

  • Hong, Eunjae;Park, Hyunggon
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.66 no.9
    • /
    • pp.1402-1408
    • /
    • 2017
  • Recently, services for personal biometric data analysis based on real-time monitoring systems has been increasing and many of them have focused on recognition of emotions. In this paper, we propose a classification model to classify anxiety emotion using biometric data actually collected from people. We propose to deploy the support vector machine to build a classification model. In order to improve the classification accuracy, we propose two data pre-processing procedures, which are normalization and data deletion. The proposed algorithms are actually implemented based on Real-time Traffic Flow Measurement structure, which consists of data collection module, data preprocessing module, and creating classification model module. Our experiment results show that the proposed classification model can infers anxiety emotions of people with the accuracy of 65.18%. Moreover, the proposed model with the proposed pre-processing techniques shows the improved accuracy, which is 78.77%. Therefore, we can conclude that the proposed classification model based on the pre-processing process can improve the classification accuracy with lower computation complexity.

A Study of an AI-Based Content Source Data Generation Model using Folk Paintings and Genre Paintings (민화와 풍속화를 이용한 AI 기반의 콘텐츠 원천 데이터 생성 모델의 연구)

  • Yang, Seokhwan;Lee, Young-Suk
    • Journal of Korea Multimedia Society
    • /
    • v.24 no.5
    • /
    • pp.736-743
    • /
    • 2021
  • Due to COVID-19, the non-face-to-face content market is growing rapidly. However, most of the non-face-to-face content such as webtoons and web novels are produced based on the traditional culture of other countries, not Korean traditional culture. The biggest cause of this situation is the lack of reference materials for creating based on Korean traditional culture. Therefore, the need for materials on traditional Korean culture that can be used for content creation is emerging. In this paper, we propose a generation model of source data based on traditional folk paintings through the fusion of traditional Korean folk paintings and AI technology. The proposed model secures basic data based on folk tales, analyzes the style and characteristics of folk tales, and converts historical backgrounds and various stories related to folk tales into data. In addition, using the built data, various new stories are created based on AI technology. The proposed model is highly utilized in that it provides a foundation for new creation based on Korean traditional folk painting and AI technology.

Statistical analysis of KNHANES data with measurement error models

  • Hwang, Jinseub
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.3
    • /
    • pp.773-779
    • /
    • 2015
  • We study a statistical analysis about the fifth wave data of the Korea National Health and Nutrition Examination Survey based on linear regression models with measurement errors. The data is obtained from a national population-based complex survey. To demonstrate the availability of measurement error models, two results between the general linear regression model and measurement error model are compared based on the model selection criteria which are Akaike information criterion and Bayesian information criterion. For our study, we use the simulation extrapolation algorithm for measurement error model and the jackknife method for the estimation of standard errors.

Generative Model of Acceleration Data for Deep Learning-based Damage Detection for Bridges Using Generative Adversarial Network (딥러닝 기반 교량 손상추정을 위한 Generative Adversarial Network를 이용한 가속도 데이터 생성 모델)

  • Lee, Kanghyeok;Shin, Do Hyoung
    • Journal of KIBIM
    • /
    • v.9 no.1
    • /
    • pp.42-51
    • /
    • 2019
  • Maintenance of aging structures has attracted societal attention. Maintenance of the aging structure can be efficiently performed with a digital twin. In order to maintain the structure based on the digital twin, it is required to accurately detect the damage of the structure. Meanwhile, deep learning-based damage detection approaches have shown good performance for detecting damage of structures. However, in order to develop such deep learning-based damage detection approaches, it is necessary to use a large number of data before and after damage, but there is a problem that the amount of data before and after the damage is unbalanced in reality. In order to solve this problem, this study proposed a method based on Generative adversarial network, one of Generative Model, for generating acceleration data usually used for damage detection approaches. As results, it is confirmed that the acceleration data generated by the GAN has a very similar pattern to the acceleration generated by the simulation with structural analysis software. These results show that not only the pattern of the macroscopic data but also the frequency domain of the acceleration data can be reproduced. Therefore, these findings show that the GAN model can analyze complex acceleration data on its own, and it is thought that this data can help training of the deep learning-based damage detection approaches.

Proposal of Process Model for Research Data Quality Management (연구데이터 품질관리를 위한 프로세스 모델 제안)

  • Na-eun Han
    • Journal of the Korean Society for information Management
    • /
    • v.40 no.1
    • /
    • pp.51-71
    • /
    • 2023
  • This study analyzed the government data quality management model, big data quality management model, and data lifecycle model for research data management, and analyzed the components common to each data quality management model. Those data quality management models are designed and proposed according to the lifecycle or based on the PDCA model according to the characteristics of target data, which is the object that performs quality management. And commonly, the components of planning, collection and construction, operation and utilization, and preservation and disposal are included. Based on this, the study proposed a process model for research data quality management, in particular, the research data quality management to be performed in a series of processes from collecting to servicing on a research data platform that provides services using research data as target data was discussed in the stages of planning, construction and operation, and utilization. This study has significance in providing knowledge based for research data quality management implementation methods.