• Title/Summary/Keyword: Data Analyze

Search Result 19,032, Processing Time 0.048 seconds

Analysis of Traffic Accident using Association Rule Model

  • Ihm, Sun-Young;Park, Young-Ho
    • Journal of Multimedia Information System
    • /
    • v.5 no.2
    • /
    • pp.111-114
    • /
    • 2018
  • Traffic accident analysis is important to reduce the occurrence of the accidents. In this paper, we analyze the traffic accident with Apriori algorithm to find out an association rule of traffic accident in Korea. We first design the traffic accident analysis model, and then collect the traffic accidents data. We preprocessed the collected data and derived some new variables and attributes for analyzing. Next, we analyze based on statistical method and Apriori algorithm. The result shows that many large-scale accident has occurred by vans in daytime. Medium-scale accident has occurred more in day than nighttime, and by cars more than vans. Small-scale accident has occurred more in night time than day time, however, the numbers were similar. Also, car-human accident is more occurred than car-car accident in small-scale accident.

The Management of Medical Information Quality Utilizing Big Data (빅 데이터를 활용한 의료정보 질 관리)

  • Cho, Young-bok;Woo, Sung-Hee;Lee, Sang-Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2014.05a
    • /
    • pp.728-731
    • /
    • 2014
  • Today, the quality of medical service has become a major concern because that sustainable development of IT technology and extending people's life expectancy. This paper, it is used as a tool for the medical information quality management that analyze tweets big data form generated by individual's daily. The result of the analyze big data offers improvement medical information based evidence based medicine. Also it has been possible for a trace observation of chronic disease and can reduce additional other complications of patients. Therefore, effective treatment of disease and prevention is possible.

  • PDF

Dose Sol Raises Consumer Prices? (음력설이 소비자물가에 영향을 미치는가?)

  • Lee, Geung Hui
    • The Korean Journal of Applied Statistics
    • /
    • v.12 no.2
    • /
    • pp.387-387
    • /
    • 1999
  • The traditional holiday, Sol which is based on a lunar calendar, falls in January orFebruary and makes it difficult to analyze time series data accurately. To analyze whetherSol raises consumer prices or not, RegARIMA models and paired t tests are used. It isfound that Sol raises consumer prices of food products significantly, but So1's effects onconsumer prices of all items are not significant.

Big Data Activation Plan for Digital Transformation of Agriculture and Rural (농업·농촌 디지털 전환을 위한 빅데이터 활성화 방안 연구)

  • Lee, Won Suk;Son, Kyungja;Jun, Daeho;Shin, Yongtae
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.9 no.8
    • /
    • pp.235-242
    • /
    • 2020
  • In order to promote digital transformation of our agricultural and rural communities in the wake of the fourth industrial revolution and prepare for the upcoming artificial intelligence era, it is necessary to establish a system and system that can collect, analyze and utilize necessary quality data. To this end, we will investigate and analyze problems and issues felt by various stakeholders such as farmers and agricultural officials, and present strategic measures to revitalize big data, which must be decided in order to promote digital transformation of our agricultural and rural communities, such as expanding big data platforms for joint utilization, establishing sustainable big data governance, and revitalizing the foundation for big data utilization based on demand.

Hurdle Model for Longitudinal Zero-Inflated Count Data Analysis (영과잉 경시적 가산자료 분석을 위한 허들모형)

  • Jin, Iktae;Lee, Keunbaik
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.6
    • /
    • pp.923-932
    • /
    • 2014
  • The Hurdle model can to analyze zero-inflated count data. This model is a mixed model of the logit model for a binary component and a truncated Poisson model of a truncated count component. We propose a new hurdle model with a general heterogeneous random effects covariance matrix to analyze longitudinal zero-inflated count data using modified Cholesky decomposition. This decomposition factors the random effects covariance matrix into generalized autoregressive parameters and innovation variance. The parameters are modeled using (generalized) linear models and estimated with a Bayesian method. We use these methods to carefully analyze a real dataset.

An Analysis of Key Elements for FinTech Companies Based on Text Mining: From the User's Review (텍스트 마이닝 기반의 자산관리 핀테크 기업 핵심 요소 분석: 사용자 리뷰를 바탕으로)

  • Son, Aelin;Shin, Wangsoo;Lee, Zoonky
    • The Journal of Information Systems
    • /
    • v.29 no.4
    • /
    • pp.137-151
    • /
    • 2020
  • Purpose Domestic asset management fintech companies are expected to grow by leaps and bounds along with the implementation of the "Data bills." Contrary to the market fever, however, academic research is insufficient. Therefore, we want to analyze user reviews of asset management fintech companies that are expected to grow significantly in the future to derive strengths and complementary points of services that have been provided, and analyze key elements of asset management fintech companies. Design/methodology/approach To analyze large amounts of review text data, this study applied text mining techniques. Bank Salad and Toss, domestic asset management application services, were selected for the study. To get the data, app reviews were crawled in the online app store and preprocessed using natural language processing techniques. Topic Modeling and Aspect-Sentiment Analysis were used as analysis methods. Findings According to the analysis results, this study was able to derive the elements that asset management fintech companies should have. As a result of Topic Modeling, 7 topics were derived from Bank Salad and Toss respectively. As a result, topics related to function and usage and topics on stability and marketing were extracted. Sentiment Analysis showed that users responded positively to function-related topics, but negatively to usage-related topics and stability topics. Through this, we were able to extract the key elements needed for asset management fintech companies.

Development of data analysis and experiment evaluation supporting system(DAEXESS) (실험데이타 분석 및 평가지원시스템(DAEXESS) 개발)

  • 이현철;오인석;심봉식
    • Journal of the Ergonomics Society of Korea
    • /
    • v.16 no.1
    • /
    • pp.119-126
    • /
    • 1997
  • Most of human factors experiments in nuclear industry domain produe lots of experimental data, thus much time is reauired to analyze the data. DAEXESS was developed to reduce resource demands necessary for the analysis work through systematic data analysis requirements and automated data processing based on computer technology. Physilolgical data, human behavior recording data, system log data and verbal protocl can be collected, synthesized and easily analyzed with with respect to time domain in DAEXESS so that analyser is able to look into inte- grated information on operating context. DAEXESS assists analyser to carry out qualitative and quantitative data analysis easily.

  • PDF

Generic Multidimensional Model of Complex Data: Design and Implementation

  • Khrouf, Kais;Turki, Hela
    • International Journal of Computer Science & Network Security
    • /
    • v.21 no.12spc
    • /
    • pp.643-647
    • /
    • 2021
  • The use of data analysis on large volumes of data constitutes a challenge for deducting knowledge and new information. Data can be heterogeneous and complex: Semi-structured data (Example: XML), Data from social networks (Example: Tweets) and Factual data (Example: Spreading of Covid-19). In this paper, we propose a generic multidimensional model in order to analyze complex data, according to several dimensions.

Finding associations between genes by time-series microarray sequential patterns analysis

  • Nam, Ho-Jung;Lee, Do-Heon
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2005.09a
    • /
    • pp.161-164
    • /
    • 2005
  • Data mining techniques can be applied to identify patterns of interest in the gene expression data. One goal in mining gene expression data is to determine how the expression of any particular gene might affect the expression of other genes. To find relationships between different genes, association rules have been applied to gene expression data set [1]. A notable limitation of association rule mining method is that only the association in a single profile experiment can be detected. It cannot be used to find rules across different condition profiles or different time point profile experiments. However, with the appearance of time-series microarray data, it became possible to analyze the temporal relationship between genes. In this paper, we analyze the time-series microarray gene expression data to extract the sequential patterns which are similar to the association rules between genes among different time points in the yeast cell cycle. The sequential patterns found in our work can catch the associations between different genes which express or repress at diverse time points. We have applied sequential pattern mining method to time-series microarray gene expression data and discovered a number of sequential patterns from two groups of genes (test, control) and more sequential patterns have been discovered from test group (same CO term group) than from the control group (different GO term group). This result can be a support for the potential of sequential patterns which is capable of catching the biologically meaningful association between genes.

  • PDF

Analysis of Elderly Traffic Accidents Using Public Data (공공데이터를 활용한 노인교통사고 발생유형 분석연구)

  • Lee, Jeongwon;Lee, Choong Ho
    • Journal of Industrial Convergence
    • /
    • v.17 no.4
    • /
    • pp.53-58
    • /
    • 2019
  • It is important to collect and analyze the data from the traffic accident analysis system and the National Statistical Office to reduce the traffic accident rate of the elderly, who are the weakest. In particular, it is more important to analyze the data in areas where the elderly population is large and where accidents occur frequently. This paper visualizes and analyzes the data of elderly traffic accidents that occurred in recent 5 years in the area where many elderly people live in Buyeo-gun. The elderly traffic accident type, accident area, and location data of the elderly can be useful for the improvement measures and related decision making to reduce the elderly traffic accidents.