• Title/Abstract/Keywords: Data Analyze

Frequency and Social Network Analysis of the Bible Data using Big Data Analytics Tools R

  • 반재훈;하종수;김동현
    • 한국정보통신학회논문지
    • /
    • Vol. 24, No. 2
    • /
    • pp.166-171
    • /
    • 2020
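  • Big data processing technology, which stores and analyzes data to obtain new knowledge, is increasingly emphasized across many sectors of society, and interest in related technologies is growing as it emerges as a core issue in information and communication technology. R, a tool for analyzing such big data, is a language and environment that enables statistics-based information analysis. This paper uses R to analyze Bible data, specifically the four Gospels of the New Testament. The Bible data is first collected and filtered for analysis. R is then used to run a frequency analysis of how the text is distributed and, for a more accurate analysis, a social network analysis in which the words occurring in a single sentence are expressed as pairs and the relationships between words are analyzed.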
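
The two steps named in this abstract, word frequency counting and pairing the words that share a sentence, can be sketched as follows. This is a minimal illustration in Python rather than the R workflow the paper uses; the function names and the toy sentences are hypothetical, and whitespace tokenization is an assumption (the abstract does not describe the filtering or tokenization applied to the Gospel text).

```python
from collections import Counter
from itertools import combinations


def word_frequencies(sentences):
    """Count how often each word appears across all sentences."""
    counts = Counter()
    for sentence in sentences:
        counts.update(sentence.lower().split())
    return counts


def cooccurrence_pairs(sentences):
    """Count unordered pairs of distinct words that share a sentence.

    Each pair can serve as a weighted edge in a word network.
    """
    pairs = Counter()
    for sentence in sentences:
        # Deduplicate and sort so each pair has one canonical form.
        words = sorted(set(sentence.lower().split()))
        pairs.update(combinations(words, 2))
    return pairs


# Hypothetical toy sentences, not taken from the paper's data.
sample = ["light shines in darkness", "the light is true light"]
freq = word_frequencies(sample)
edges = cooccurrence_pairs(sample)
```

The pair counts in `edges` are the kind of input a network library would take as weighted edges; which network measures the authors computed is not stated in the abstract.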

Frequency and Social Network Analysis of the Bible Data using Big Data Analytics Tools R

  • 반재훈;하종수
    • 한국정보통신학회:학술대회논문집
    • /
    • 한국정보통신학회 2018 Fall Conference
    • /
    • pp.93-96
    • /
    • 2018
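  • Big data processing technology, which stores and analyzes data to obtain new knowledge, is increasingly emphasized across many sectors of society, and interest in related technologies is growing as it emerges as a core issue in information and communication technology. R, a tool for analyzing such big data, is a language and environment that enables statistics-based information analysis. This paper uses R to analyze Bible data: it runs a frequency analysis of how the text is distributed and analyzes the Bible through social network analysis.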

Analysis of Market Trajectory Data using k-NN

  • Park, So-Hyun;Ihm, Sun-Young;Park, Young-Ho
    • Journal of Multimedia Information System
    • /
    • Vol. 5, No. 3
    • /
    • pp.195-200
    • /
    • 2018
  • Recently, as sensor and big data analysis technologies have developed, much research has analyzed purchase-related data such as trajectory information and stay time. Such data is useful for predicting purchase patterns and purchase times. Because it is difficult to find periodic patterns in large-scale human data, it is necessary to look at actual data sets, find various feature patterns, and then apply a machine learning algorithm appropriate to the pattern and purpose. Although existing papers have analyzed data using various machine learning methods, they lack statistical analysis, such as finding feature patterns, before the machine learning algorithm is applied. Therefore, we analyze the purchasing data of Songjeong Maeil Market, where the data was gathered, and find some characteristic patterns through statistical data analysis. Based on the results of this first step, we derive meaningful conclusions by applying the machine learning algorithm and present future research directions. The data analysis confirmed that the number of visits differed according to the regional characteristics around Songjeong Maeil Market, and revealed the distribution of time spent by consumers.

Big Data Analytics for Social Responsibility of ESG: The Perspective of the Transport for Person with Disabilities

  • 서창갑;김종기;정대현
    • 한국정보시스템학회지:정보시스템연구
    • /
    • Vol. 32, No. 2
    • /
    • pp.137-152
    • /
    • 2023
  • Purpose: The purpose of this study is to analyze big data related to DURIBAL, generated from the operation of taxis reserved for the disabled, to identify issues and suggest solutions. ESG management should be understood as "environmental factors, social responsibilities, and transparent management." Therefore, the study used big data analysis to examine the factors affecting the standby time of taxis reserved for the disabled and the related problems, with implications for the convenience of the socially vulnerable. Design/methodology/approach: The analysis used R, Excel, Power BI, QGIS, and SPSS. We offer several suggestions, covering problems with managing cancellation data, minimization of dark data, the need to develop an integrated database for scattered data, and system upgrades for additional analysis. Findings: The results showed that the total duration of standby was 34 minutes 29 seconds. The reasons recorded in the cancellation data were mostly the use of other modes of transportation or delayed arrival. The study suggests developing an integrated database for scattered data. Finally, follow-up studies may discuss government-initiated big data analysis that compares the use of taxis reserved for the disabled nationwide to create new social value.

Qualitative Data Analysis using Computers

  • 이명선
    • 기본간호학회지
    • /
    • Vol. 6, No. 3
    • /
    • pp.570-582
    • /
    • 1999
  • Although computers cannot analyze textual data in the same way as they analyze numerical data, they can nevertheless be of great assistance to qualitative researchers. Thus, the use of computers in analyzing qualitative data has increased since the 1980s. The purpose of this article was to explore the advantages and disadvantages of using computers to analyze textual data and to suggest strategies to prevent the problems of using computers. In addition, it illustrated the characteristics and functions of software designed to analyze qualitative data, to help researchers choose a program wisely. It also demonstrated the precise functions and procedures of the NUDIST program, which was designed to develop a conceptual framework or grounded theory from unstructured data. A major advantage of using computers in qualitative research is the management of huge amounts of unstructured data. By managing overloaded data, the researcher can keep track of emerging ideas, arguments, and theoretical concepts and can organize these tasks more efficiently than with the traditional 'cut-and-paste' technique. Additional advantages are the ability to increase the trustworthiness of research, the transparency of the research process, and the intuitive creativity of the researcher, and to facilitate team and secondary research. On the other hand, the disadvantages of using computers were identified as worries that the machine could conquer human understanding and as the possibility of orthodoxy in the analytical process. To overcome these problems, it suggested strategies such as 1) deep understanding of the philosophical and theoretical background of qualitative research methods, 2) deep understanding of the data as a whole before using software, 3) use of software only after becoming familiar with it, 4) continuous evaluation of software and feedback from it, and 5) continuous awareness of the limitations of the machine, that is, the computer, in interpretive analysis.

A Study of Non-parametric Statistical Tests to Analyze Trend in Water Quality Data

  • 이상훈
    • 환경영향평가
    • /
    • Vol. 4, No. 2
    • /
    • pp.93-103
    • /
    • 1995
  • This study was carried out to suggest the best statistical test for analyzing the trend in monthly water quality data. Traditional parametric tests such as the t-test and regression analysis assume that the underlying population has a normal distribution, and regression analysis additionally assumes that residual errors are independent. In an analysis of nine years of monthly COD data collected at Paldang on the Han River, the underlying population was found to be neither normal nor independent. Therefore, parametric tests are invalid for trend detection. Four kinds of nonparametric statistical tests, namely the Run test, the Daniel test, the Mann-Kendall test, and Time Series Residual Analysis, were applied to analyze the trend in the COD data. The Daniel test and the Mann-Kendall test indicated an upward trend in the COD data. The Daniel test was suggested as the best nonparametric test because it is simple to compute and intuitively easy to understand.
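
As an illustration of one of the tests named in this abstract, the Mann-Kendall statistic can be sketched in a few lines. This is a minimal version under stated assumptions, not the paper's procedure: it uses the standard normal approximation with continuity correction, applies no correction for tied values, and ignores seasonality, which matters for monthly series. The function name `mann_kendall` is hypothetical.

```python
import math


def mann_kendall(values):
    """Return (S, Z) for the Mann-Kendall trend test.

    S sums the signs of all pairwise differences (later minus earlier).
    Z is the normal approximation with continuity correction.
    Assumption: no tie correction is applied to the variance.
    """
    n = len(values)
    s = 0
    for i in range(n - 1):
        for j in range(i + 1, n):
            diff = values[j] - values[i]
            s += (diff > 0) - (diff < 0)  # sign of the difference
    var_s = n * (n - 1) * (2 * n + 5) / 18
    if s > 0:
        z = (s - 1) / math.sqrt(var_s)
    elif s < 0:
        z = (s + 1) / math.sqrt(var_s)
    else:
        z = 0.0
    return s, z
```

A positive `Z` points to an upward trend and a negative `Z` to a downward one; comparing `abs(Z)` with 1.96 gives a two-sided test at roughly the 5% level, with the approximation becoming more reliable as the series gets longer.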

The Development of CD Trend Recording System

  • 장미혜;윤갑구;최항섭;이승재
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 1997 Summer Conference Proceedings B
    • /
    • pp.465-467
    • /
    • 1997
  • The CD Trend Recorder, developed as a substitute for the existing SCR (Strip Chart Recorder), uses a PC to acquire up to 32 different outputs simultaneously (e.g., frequency, voltage, current, power, temperature, pressure), and displays and analyzes them just like the existing SCR. It stores the data on CD-ROM so that various data can be stored permanently. this system has built-in MMI program and to monitor and analyze the data in real-t expert system link the data to the do where needs the data to display, analyze, in the new medium at the same time.

Towards Effective Analysis and Tracking of Mozilla and Eclipse Defects using Machine Learning Models based on Bugs Data

  • Hassan, Zohaib;Iqbal, Naeem;Zaman, Abnash
    • Soft Computing and Machine Intelligence
    • /
    • Vol. 1, No. 1
    • /
    • pp.1-10
    • /
    • 2021
  • Analysis and tracking of bug reports is a challenging field in software repository mining. It is one of the fundamental ways to explore the large amount of data acquired from defect tracking systems and to discover patterns and valuable knowledge about the bug triaging process. Bug data is publicly accessible from systems such as Bugzilla and JIRA. Moreover, robust machine learning (ML) techniques make it possible to process and analyze massive amounts of data to extract underlying patterns, knowledge, and insights. It is therefore an interesting area in which to propose innovative and robust solutions for analyzing and tracking bug reports originating from different open source projects, including Mozilla and Eclipse. This research study presents an ML-based classification model to analyze and track bug defects in order to enhance software engineering management (SEM) processes. In this work, Artificial Neural Network (ANN) and Naive Bayesian (NB) classifiers are implemented using open-source bug datasets from Mozilla and Eclipse. Different evaluation measures are employed to analyze and evaluate the experimental results, and a comparative analysis compares the results of ANN with those of NB. The experimental results indicate that the ANN achieved higher accuracy than the NB. The proposed research study will enhance SEM processes and contribute to the body of knowledge of the data mining field.

Reliability Analysis of Air-conditioner with Service Data

  • 윤원영;성문현;정석주
    • 산업공학
    • /
    • Vol. 12, No. 1
    • /
    • pp.1-9
    • /
    • 1999
  • This paper presents a method for reliability analysis of air-conditioners using service data. We explain how to acquire and analyze the service data and discuss some problems in the data analysis. We propose two procedures to analyze the reliability of air-conditioners using the operating time concept and to predict operating times from temperature and failure frequency. Finally, the prediction method for future service is studied through a numerical example.

Network-based Microarray Data Analysis Tool

  • Park, Hee-Chang;Ryu, Ki-Hyun
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 17, No. 1
    • /
    • pp.53-62
    • /
    • 2006
  • DNA microarray data analysis is a new technology for investigating the expression levels of thousands of genes simultaneously. Since DNA microarray data structures are varied and complicated, the data are generally stored in databases so that they can be accessed and managed effectively. However, the data are difficult to analyze and manage when they are stored in several database management systems or in file formats. The existing analysis tools for DNA microarray data pose many difficulties because of complicated instructions and dependency on data types and operating systems. In this paper, we design and implement a network-based analysis tool for obtaining useful information from DNA microarray data. With this tool, DNA microarray data can be analyzed effectively without special knowledge of or training in data types and analytical methods.
