Tendency and Network Analysis of Diet Using Big Data (빅데이터를 활용한 다이어트 현황 및 네트워크 분석)

  • Jung, Eun-Jin;Chang, Un-Jae
    • Journal of the Korean Dietetic Association, v.22 no.4, pp.310-319, 2016
  • Widely used questionnaire surveys are limited by time and cost, small numbers of participants, biased confidence intervals, and unreliable results. To overcome these limitations, we performed tendency and network analysis of diet among Koreans using big data. Diet-related keywords were collected from the portal site Naver from January 1, 2015 to December 31, 2015, and the collected data were analyzed by simple frequency analysis, N-gram analysis, keyword network analysis, and seasonality analysis. The results showed that "diet menu" appeared most frequently in the N-gram analysis, even though "exercise" had the highest frequency in the simple frequency analysis. In addition, the keyword network analysis yielded four groups: a diet group, an exercise group, a commercial diet program company group, and a commercial diet food group. The seasonality analysis showed that interest in diet increased steadily from February 2015 and peaked in July. These results suggest that the best strategies for weight loss are based on diet menu and on starting a diet before July. As people are especially sensitive to diet trends, annual analyses of big data are needed.
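
The contrast between simple frequency analysis and N-gram analysis described in this abstract can be illustrated with a minimal sketch. The queries below are invented for illustration and are not the Naver data used in the paper; whitespace tokenization and the helper name `ngram_counts` are likewise assumptions.

```python
from collections import Counter


def ngram_counts(documents, n):
    """Count n-grams (tuples of n consecutive tokens) over whitespace-tokenized text."""
    counts = Counter()
    for doc in documents:
        tokens = doc.lower().split()
        # zip over n shifted copies of the token list yields consecutive n-tuples
        counts.update(zip(*(tokens[i:] for i in range(n))))
    return counts


# Hypothetical search queries, invented for this example only.
queries = [
    "diet menu exercise",
    "diet menu recipe",
    "exercise routine",
    "home exercise",
    "diet menu exercise plan",
]

unigrams = ngram_counts(queries, 1)  # simple frequency analysis
bigrams = ngram_counts(queries, 2)   # N-gram analysis with n = 2

print(unigrams.most_common(1))  # single most frequent word
print(bigrams.most_common(1))   # single most frequent two-word phrase
```

In this toy data the most frequent single word is "exercise", while the most frequent two-word phrase is "diet menu", which shows how the two analyses can rank topics differently.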

Information Visualization for the Manufacturing Process Optimization Based on Design of Experiment and Data Analysis (실험계획법과 데이터 분석 기반의 제조공정 최적화를 위한 정보 시각화)

  • Kim, Jae Chun;Jin, Seon A;Park, Young Hee;Noh, Seong Yeo;Lee, Hyun Dong
    • KIPS Transactions on Software and Data Engineering, v.4 no.9, pp.393-402, 2015
  • Data visualization technology helps people easily understand various data and analysis results, so it is expected to be useful at real industrial manufacturing sites. The large amount of data generated at manufacturing sites can play a very important role in improving the manufacturing process. In this paper, we propose an information visualization for manufacturing process optimization based on design of experiments and data analysis. By visualizing data analysis results, the approach provides an easy way to analyze and understand the operation site, which may improve the manufacturing process and reduce the causes of defects.

Analysis of Business Performance of Local SMEs Based on Various Alternative Information and Corporate SCORE Index

  • HWANG, Sun Hee;KIM, Hee Jae;KWAK, Dong Chul
    • The Journal of Economics, Marketing and Management, v.10 no.3, pp.21-36, 2022
  • Purpose: The purpose of this study is to compare and analyze enterprise score indices calculated from atypical data and corrected data. Research design, data, and methodology: In this study, news articles, which are non-financial, qualitative data, were collected for 2,432 SMEs extracted by "square proportional stratification" from 18,910 enterprises with fixed data, and each enterprise's score index was compared and analyzed through text mining. Result: The analysis showed that qualitative data can be quantitatively evaluated by region, industry, and period by collecting news about SMEs, and that there are concerns that it could be an element of alternative credit evaluation. Conclusion: News data cannot be collected when a small business is self-employed or has little or no news coverage. Data normalization or standardization should be considered to overcome the difference in scores due to the amount of reference. Furthermore, since keyword sentiment analysis may yield different results depending on the researcher's point of view, it is also necessary to consider deep learning sentiment analysis conducted at the sentence level.
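
The normalization and standardization mentioned in the conclusion can be sketched as follows. This is only an illustration of the two generic rescaling techniques, not the authors' scoring procedure; the raw scores are made-up values and the function names are assumptions.

```python
from statistics import mean, pstdev


def z_scores(values):
    """Standardize values to zero mean and unit (population) standard deviation."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        # All values are identical, so there is no spread to rescale.
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def min_max(values):
    """Normalize values linearly onto the interval [0, 1]."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical raw score indices for four enterprises (illustrative only).
raw_scores = [2.0, 4.0, 6.0, 8.0]

standardized = z_scores(raw_scores)
rescaled = min_max(raw_scores)

print(standardized)
print(rescaled)
```

Standardization keeps relative distances in units of standard deviation, while min-max normalization bounds every score between 0 and 1; which one suits score comparison across enterprises with different amounts of coverage would depend on the data.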

A Profile Analysis about Thermal Life Data of Electrical insulating materials at Accelerated Life Test

  • Bark, Shim-Kyu
    • Journal of Korea Multimedia Society, v.13 no.12, pp.1814-1819, 2010
  • Since 1987, when a statistical analysis guide for thermal life tests in Accelerated Life Testing (ALT) was proposed as ANSI/IEEE Std 101, the guide has been widely used for many experimental data sets. Shim (2004) performed a Monte Carlo simulation to compare the lives of two different systems or materials, based on statistics obtained from ANSI/IEEE Std 101 data. In this study, a profile analysis is proposed for comparing the lives of two different systems or materials, and some examples using pre-existing data are given.

Receiver Operating Characteristic Analysis by Data Mining

  • Rhee Seong-Won;Lee Jea-Young
    • Proceedings of the Korean Statistical Society Conference, 2001.11a, pp.195-197, 2001
  • Data mining is used to discover patterns and relationships in huge amounts of data. Researchers in many different fields have shown great interest in data mining analysis. Using the classification technique of data mining analysis, we present an available model for the Receiver Operating Characteristic (ROC) method. We suggest that this may help analyze the results of data mining techniques.
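
As a rough illustration of how ROC analysis evaluates a classifier, the sketch below computes ROC points and the area under the curve from labels and scores. It is a generic textbook construction, not the model presented in the paper; the labels, scores, and function names are assumptions made for the example.

```python
def roc_points(labels, scores):
    """Return ROC points (false positive rate, true positive rate).

    Sweeps the decision threshold from the highest score downward.
    Assumes labels are 0 or 1 and that both classes are present.
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    points = [(0.0, 0.0)]
    tp = fp = 0
    i = 0
    while i < len(ranked):
        threshold = ranked[i][0]
        # Items with tied scores cross the threshold together.
        while i < len(ranked) and ranked[i][0] == threshold:
            if ranked[i][1] == 1:
                tp += 1
            else:
                fp += 1
            i += 1
        points.append((fp / negatives, tp / positives))
    return points


def auc(points):
    """Area under the ROC curve by the trapezoidal rule."""
    return sum(
        (x1 - x0) * (y0 + y1) / 2
        for (x0, y0), (x1, y1) in zip(points, points[1:])
    )


# Hypothetical classifier output: 1 = positive class, 0 = negative class.
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]

curve = roc_points(labels, scores)
print(curve)
print(auc(curve))  # 8 of the 9 positive/negative pairs are ranked correctly
```

An area of 1.0 would mean every positive case is scored above every negative case, and 0.5 corresponds to random ranking.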


Considerations on gene chip data analysis

  • Lee, Jae-K.
    • Proceedings of the Korean Society for Bioinformatics Conference, 2001.08a, pp.77-102, 2001
  • Different high-throughput chip technologies are available for genome-wide gene expression studies. Quality control and prescreening analysis are important for rigorous analysis of each type of gene expression data. Statistical significance evaluation of differential expression patterns is needed. Major genome institutes develop databases and analysis systems for sharing precious expression data.


The Difference Analysis between Maturity Stages of Venture Firms by Classification Techniques of Big Data (빅데이터 분류 기법에 따른 벤처 기업의 성장 단계별 차이 분석)

  • Jung, Byoungho
    • Journal of Korea Society of Digital Industry and Information Management, v.15 no.4, pp.197-212, 2019
  • The purpose of this study is to identify the maturity stages of venture firms through classification analysis, which is widely used as a big data technique. Venture companies should develop a competitive advantage in the market, and the maturity of a company can be classified into five stages. I analyze the difference in the growth stage of venture firms between survey responses and statistical classification methods. Firm growth was divided into five stages, ranging from start-up to decline. Popular big data classification methods include k-means cluster analysis, hierarchical cluster analysis, artificial neural networks, and decision tree analysis. The variables used were asset increase, capital increase, sales increase, operating profit increase, R&D investment increase, operation period, and number of retirements. The results showed that group sample sizes differed greatly across the big data analysis techniques. In particular, the decision tree and neural network methods produced three groups rather than five. The group sizes differed for every classification method. Furthermore, the results may differ depending on the variables selected and the sample size. Each classified group also showed a number of competitive differences. The research implication is that analysts need to interpret statistics through management theory in order to interpret big data classification results correctly. In addition, the choice of classification analysis should consider not only management theory but also practical experience. Finally, the growth of venture firms needs to be examined by time-series analysis and monitored closely for individual firms. Future research will need to include significant variables of a company's maturity stages.

Analysis of Performance of Creative Education based on Twitter Big Data Analysis (트위터 빅데이터 분석을 통한 창의적 교육의 성과요인 분석)

  • Joo, Kilhong
    • Journal of Creative Information Culture, v.5 no.3, pp.215-223, 2019
  • The information age is accelerating, and fusion analysis solutions that can exploit knowledge data are increasing as various forms of big data accumulate, such as large-volume text, sound, and video. Reduced data storage costs and the development of social network services (SNS) have expanded data both quantitatively and qualitatively. This situation makes it possible to use data that previously went unused, and the potential value and influence of data are increasing. Research is actively being conducted to present future-oriented education systems by applying these fusion analysis systems to the improvement of the educational system. In this research, we conducted a big data analysis on Twitter, performing natural language analysis and word frequency analysis of the data to quantitatively measure how the problems and outcomes of domestic creative education were discussed there.

Statistical approach for development of objective evaluation method on tobacco smoke

  • Hwang, Keon-Joong;Rhee, Moon-Soo;Ra, Do-Young
    • Journal of the Korean Society of Tobacco Science, v.22 no.2, pp.184-189, 2000
  • This study was conducted to develop an objective evaluation method for tobacco smoke. The evaluation used data on cut or blended tobacco components, smoke components, an electronic nose system (ENS), and sensory tests. Using statistical methods such as cluster analysis, discriminant analysis, factor analysis, correlation analysis, and multiple regression analysis, the relationships among the tobacco, smoke, ENS, and sensory evaluation data were studied. In the cluster analysis, the data from GC smoke analysis and ENS were able to distinguish differences in tobacco leaf characteristics. In the discriminant analysis, grouping by the components of tobacco leaves and smoke was possible, and the GC analysis results for smoke could be used to discriminate tobacco leaves. In the factor analysis, nicotine, tar, CO, puff number, and pH in the smoke were the factors affecting tobacco leaf characteristics. In the correlation analysis, aroma, taste, irritation, and smoke volume in the sensory test were highly related to tar, p-cresol threonolatone, levoglucosane, and quinic acid-$\gamma$-lactone in the smoke. The ENS data were highly effective for discriminant analysis and cluster analysis but not for factor analysis or correlation analysis. It was possible to estimate the characteristics of tobacco leaves and their blends from the analytical data of tobacco leaves, smoke, ENS, and sensory test results. The multiple regression analysis revealed some correlations between selected chemical components and sensory evaluation. This study strongly indicates that some chemical analysis data can be used for the objective evaluation of tobacco sensory attributes.


Customer Classification Method for Household Appliances Industries with a Large Number of Incomplete Data (다수의 결측치가 존재하는 가전업 고객 데이터 활용을 위한 고객분류기법의 개발)

  • Chang, Young-Soon;Seo, Jong-Hyen
    • IE interfaces, v.19 no.1, pp.86-96, 2006
  • Customer data in manufacturing industries often contain a large number of incomplete records due to customers' infrequent purchasing behavior and the limited customer profile data gathered by sales representatives. As a result, most sophisticated data analysis methods cannot be applied directly. This paper proposes a heuristic data analysis method to classify customers in household appliance industries. The proposed PD (percent of difference) method can be used for the discriminant analysis of incomplete customer data with simple mathematical calculations. The method consists of a variable distribution estimation step, PD measure and cluster score evaluation steps, a variable impact construction step, and a segment assignment step. A real example is also presented.