• Title/Summary/Keyword: 데이터 분석론

Search Result 1,379, Processing Time 0.029 seconds

Stability Analysis of Landslides using a Probabilistic Analysis Method in the Boeun Area (확률론적 해석기법을 이용한 보은지역의 사면재해 안정성분석)

  • Jeong, Nam-Soo;You, Kwang-ho;Park, Hyuck-Jin
    • The Journal of Engineering Geology
    • /
    • v.21 no.3
    • /
    • pp.247-257
    • /
    • 2011
  • In this study the infinite slope model, one of the physical landslide models has been suggested to evaluate the susceptibility of the landslide. However, applying the infinite slope model in regional study area can be difficult or impossible because of the difficulties in obtaining and processing of large spatial data sets. With limited site investigation data, uncertainties were inevitably involved with. Therefore, the probabilistic analysis method such as Monte Carlo simulation and the GIS based infinite slope stability model have been used to evaluate the probability of failure. The proposed approach has been applied to practical example. The study area in Boeun area been selected since the area has been experienced tremendous amount of landslide occurrence. The geometric characteristics of the slope and the mechanical properties of soils like to friction angle and cohesion were obtained. In addition, coefficient of variation (COV) values in the uncertain parameters were varied from 10% to 30% in order to evaluate the effect of the uncertainty. The analysis results showed that the probabilistic analysis method can reduce the effect of uncertainty involved in input parameters.

Considering Customer Buying Sequences to Enhance the Quality of Collaborative Filtering (구매순서를 고려한 개선된 협업필터링 방법론)

  • Cho, Yeong-Bin;Cho, Yoon-Ho
    • Journal of Intelligence and Information Systems
    • /
    • v.13 no.2
    • /
    • pp.69-80
    • /
    • 2007
  • The preferences of customers change over time. However, existing collaborative filtering (CF) systems are static, since they only incorporate information regarding whether a customer buys a product during a certain period and do not make use of the purchase sequences of customers. Therefore, the quality of the recommendations of the typical CF could be improved through the use of information on such sequences. In this study, we propose a new methodology for enhancing the quality of CF recommendation that uses customer purchase sequences. The proposed methodology is applied to a large department store in Korea and compared to existing CF techniques. Various experiments using real-world data demonstrate that the proposed methodology provides higher quality recommendations than do typical CF techniques with better performance.

  • PDF

A Study of improving reliability on prediction model by analyzing method Big data (빅데이터 분석방법을 이용한 예측모형의 신뢰도 향상에 관한 연구)

  • Song, Min-Gu;Kim, Sun-Bae
    • Journal of Digital Convergence
    • /
    • v.11 no.6
    • /
    • pp.103-112
    • /
    • 2013
  • Traditional method of establishing prediction model is usually using formal data stored in Data Base. However, nowadays advent of "smart" era brought by ground-breaking development of communication system makes informal data to dominate overall data, such 80% in total. Therefore, conventional method using formal data as establishing predicting model would be untrustworthy means in present. In other words, it is indispensible to make prediction model credible including informal data(SNS, image, video) and semi-formal data(log data). In this study, we increase credibility of predicting model adapting Bigdata method and comparing reliability of conventional measurement to real-data.

A Study on Travel Pattern Analysis and Political Application using Transportation Card Data: In Gyeonggi-Do Case (교통카드자료를 이용한 통행패턴분석과 정책활용방안 연구 -경기도를 중심으로-)

  • Bin, Miyoung;Moon, Juback;Joh, Chang-Hyeon
    • Journal of the Economic Geographical Society of Korea
    • /
    • v.15 no.4
    • /
    • pp.615-627
    • /
    • 2012
  • This study analyzed the travel pattern with respect to use of public transportation by using transportation card data and presented the measures that can be used in a traffic policy. Transportation card data targeted Gyeonggi-Do area and as a utilization plan, a scenario that when a traffic policy decision maker improves bus stop facilities, the person selects a target site by using several variables that can be obtained from transportation card data was set and analyzed. The analysis result showed that K means cluster analysis which is decision making methodology and CHAID(Chi-squared automatic interaction detection) were used and it can be used usefully in policies in significance level of p <0.01. Also, based on these results, this study presented policy implications to be improved to actually use transportation card data in policies.

  • PDF

Design of Optimized Radial Basis Function Neural Networks Classifier with the Aid of Principal Component Analysis and Linear Discriminant Analysis (주성분 분석법과 선형판별 분석법을 이용한 최적화된 방사형 기저 함수 신경회로망 분류기의 설계)

  • Kim, Wook-Dong;Oh, Sung-Kwun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.22 no.6
    • /
    • pp.735-740
    • /
    • 2012
  • In this paper, we introduce design methodologies of polynomial radial basis function neural network classifier with the aid of Principal Component Analysis(PCA) and Linear Discriminant Analysis(LDA). By minimizing the information loss of given data, Feature data is obtained through preprocessing of PCA and LDA and then this data is used as input data of RBFNNs. The hidden layer of RBFNNs is built up by Fuzzy C-Mean(FCM) clustering algorithm instead of receptive fields and linear polynomial function is used as connection weights between hidden and output layer. In order to design optimized classifier, the structural and parametric values such as the number of eigenvectors of PCA and LDA, and fuzzification coefficient of FCM algorithm are optimized by Artificial Bee Colony(ABC) optimization algorithm. The proposed classifier is applied to some machine learning datasets and its result is compared with some other classifiers.

The Development of a Human Reliability Analysis System for Safety Assessment of a Nuclear Power Plants (원자력 발전소 안전성 평가를 위한 인간 신뢰도 분석 방법론 개발 및 지원 시스템 구축)

  • Kim, Seung-Hwn;Jung, Won-Dea
    • Journal of the Korea Society of Computer and Information
    • /
    • v.11 no.6 s.44
    • /
    • pp.261-267
    • /
    • 2006
  • In order to perform a probabilistic safety assessment (PSA), it requires a large number of data for various fields. And the quality of a PSA results have become more important thing of the risk assessment. As part of enhancing the PSA qualify, Korea Atomic Energy Research Institute is developing a full power Human Reliability Analysis (HRA) calculator to manage human failure events (HFEs) and to calculate the diagnosis human error probabilities and execution human error probabilities. This paper introduces the development process and an overview of a standard HRA method for nuclear power plants. The study was carried out in three stages; 1) development of the procedures and rules for a standard HRA method. 2) design of a system structure, 3) development of the HRA calculator.

  • PDF

Estimating System for Responsible Days of Schedule Delay for Construction Projects through Time Impact Analysis (건설공사 공기지연 영향분석을 통한 책임일수 산정체계 구축방안)

  • Kang, Leen-Seok;Kwon, Jung-Hee
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.28 no.5D
    • /
    • pp.685-694
    • /
    • 2008
  • Recently, construction projects are being gradually changed to large projects with complicated work processes. As a result, claims are increasing as well. Of these, the claims by delayed construction duration are given much weight in the general construction claims. To solve this problem, researches to estimate damages due to delay are carried out in research institutes by analyzing the causes of schedule delay. A system to easily estimate responsible days by analyzing the complicated processes needs for project manager. This study suggests a methodology based on the time impact analysis method, which can utilize the consequential analytical method with the consideration of reduced construction period. The suggested methodology is verified by comparing with actual data of case study by the computerized system for estimating the responsible days.

Analyzing Spatial Correlation between Location-Based Social Media Data and Real Estates Price Index through Rasterization (격자기반 분석을 통한 위치기반 소셜 미디어 데이터와 부동산 가격지수 간의 공간적 상관성 분석 연구)

  • Park, Woo Jin;Eo, Seung Won;Yu, Ki Yun
    • Journal of Korean Society for Geospatial Information Science
    • /
    • v.23 no.1
    • /
    • pp.23-29
    • /
    • 2015
  • In this study, the spatial relevance between the regional housing price data and the spatial distribution of the location-based social media data is explored. The spatial analysis with rasterization was applied to this study, because the both data have a different form to analyze. The geo-tagged Twitter data had been collected for a month and the regional housing price index about sales and lease were used. The spatial range of both data includes Seoul and the some parts of the metropolitan area. 2,000m grid was constructed to consider the different spatial measure between two data, and they were combined into the constructed grids. The Hotspot Analysis was operated using the combined dataset to see the comparison of spatial distribution, and the bivariate spatial correlation coefficients between two data were measured for the quantitative analysis. The result of this study shows that Seocho-gu area is detected as a common hotspot of tweet and housing sales price index data. though the spatial relevance is not detected between tweet and housing lease price index data.

A Study on Elemental Technology Identification of Sound Data for Audio Forensics (오디오 포렌식을 위한 소리 데이터의 요소 기술 식별 연구)

  • Hyejin Ryu;Ah-hyun Park;Sungkyun Jung;Doowon Jeong
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.34 no.1
    • /
    • pp.115-127
    • /
    • 2024
  • The recent increase in digital audio media has greatly expanded the size and diversity of sound data, which has increased the importance of sound data analysis in the digital forensics process. However, the lack of standardized procedures and guidelines for sound data analysis has caused problems with the consistency and reliability of analysis results. The digital environment includes a wide variety of audio formats and recording conditions, but current audio forensic methodologies do not adequately reflect this diversity. Therefore, this study identifies Life-Cycle-based sound data elemental technologies and provides overall guidelines for sound data analysis so that effective analysis can be performed in all situations. Furthermore, the identified elemental technologies were analyzed for use in the development of digital forensic techniques for sound data. To demonstrate the effectiveness of the life-cycle-based sound data elemental technology identification system presented in this study, a case study on the process of developing an emergency retrieval technology based on sound data is presented. Through this case study, we confirmed that the elemental technologies identified based on the Life-Cycle in the process of developing digital forensic technology for sound data ensure the quality and consistency of data analysis and enable efficient sound data analysis.

A Study of Safety Accident Prediction Model (Focusing on Military Traffic Accident Cases) (안전사고 예측모형 개발 방안에 관한 연구(군 교통사고 사례를 중심으로))

  • Ki, Jae-Sug;Hong, Myeong-Gi
    • Journal of the Society of Disaster Information
    • /
    • v.17 no.3
    • /
    • pp.427-441
    • /
    • 2021
  • Purpose: This study proposes a method for developing a model that predicts the probability of traffic accidents in advance to prevent the most frequent traffic accidents in the military. Method: For this purpose, CRISP-DM (Cross Industry Standard Process for Data Mining) was applied in this study. The CRISP-DM process consists of 6 stages, and each stage is not unidirectional like the Waterfall Model, but improves the level of completeness through feedback between stages. Results: As a result of modeling the same data set as the previously constructed accident investigation data for the entire group, when the classification criterion was 0.5, Significant results were derived from the accuracy, specificity, sensitivity, and AUC of the model for predicting traffic accidents. Conclusion: In the process of designing the prediction model, it was confirmed that it was difficult to obtain a meaningful prediction value due to the lack of data. The methodology for designing a predictive model using the data set was proposed by reorganizing and expanding a data set capable of rational inference to solve the data shortage.