• Title/Summary/Keyword: Data Analyze

Search Result 18,922, Processing Time 0.043 seconds

Frequency and Social Network Analysis of the Bible Data using Big Data Analytics Tools R (빅데이터 분석도구 R을 이용한 성경 데이터의 빈도와 소셜 네트워크 분석)

  • Ban, ChaeHoon;Ha, JongSoo;Kim, Dong Hyun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.24 no.2
    • /
    • pp.166-171
    • /
    • 2020
  • Big data processing technology that can store and analyze data and obtain new knowledge has been adjusted for importance in many fields of the society. Big data is emerging as an important problem in the field of information and communication technology, but the mind of continuous technology is rising. the R, a tool that can analyze big data, is a language and environment that enables information analysis of statistical bases. In this paper, we use this to analyze the Bible data. We analyze the four Gospels of the New Testament in the Bible. We collect the Bible data and perform filtering for analysis. The R is used to investigate the frequency of what text is distributed and analyze the Bible through social network analysis, in which words from a sentence are paired and analyzed between words for accurate data analysis.

Frequency and Social Network Analysis of the Bible Data using Big Data Analytics Tools R (R을 이용한 성경 데이터의 빈도와 소셜 네트워크 분석)

  • Ban, ChaeHoon;Ha, JongSoo
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2018.10a
    • /
    • pp.93-96
    • /
    • 2018
  • Big datatics technology that can store and analyze data and obtain new knowledge has been adjusted for importance in many fields of the society. Big data is emerging as an important problem in the field of information and communication technology, but the mind of continuous technology is rising. R, a tool that can analyze big data, is a language and environment that enables information analysis of statistical bases. In this thesis, we use this to analyze the Bible data. R is used to investigate the frequency of what text is distributed and analyze the Bible through analysis of social network.

  • PDF

Analysis of Market Trajectory Data using k-NN

  • Park, So-Hyun;Ihm, Sun-Young;Park, Young-Ho
    • Journal of Multimedia Information System
    • /
    • v.5 no.3
    • /
    • pp.195-200
    • /
    • 2018
  • Recently, as the sensor and big data analysis technology have been developed, there have been a lot of researches that analyze the purchase-related data such as the trajectory information and the stay time. Such purchase-related data is usefully used for the purchase pattern prediction and the purchase time prediction. Because it is difficult to find periodic patterns in large-scale human data, it is necessary to look at actual data sets, find various feature patterns, and then apply a machine learning algorithm appropriate to the pattern and purpose. Although existing papers have been used to analyze data using various machine learning methods, there is a lack of statistical analysis such as finding feature patterns before applying the machine learning algorithm. Therefore, we analyze the purchasing data of Songjeong Maeil Market, which is a data gathering place, and finds some characteristic patterns through statistical data analysis. Based on the results of 1, we derive meaningful conclusions by applying the machine learning algorithm and present future research directions. Through the data analysis, it was confirmed that the number of visits was different according to the regional characteristics around Songjeong Maeil Market, and the distribution of time spent by consumers could be grasped.

Big Data Analytics for Social Responsibility of ESG: The Perspective of the Transport for Person with Disabilities (ESG 사회적책임 제고를 위한 빅데이터 분석: 장애인 콜택시 운영 효율성 관점)

  • Seo, Chang Gab;Kim, Jong Ki;Jung, Dae Hyun
    • The Journal of Information Systems
    • /
    • v.32 no.2
    • /
    • pp.137-152
    • /
    • 2023
  • Purpose The purpose of this study is to analyze big data related to DURIBAL from the operation of taxis reserved for the disabled to identify the issues and suggest solutions. ESG management should be translated into "environmental factors, social responsibilities, and transparent management." Therefore, the current study used Big Data analysis to analyze the factors affecting the standby of taxis reserved for the disabled and relevant problems for implications on convenience of social weak. Design/methodology/approach The analysis method used R, Excel, Power BI, QGIS, and SPSS. We proposed several suggestions included problems with managing cancellation data, minimization of dark data, needs to develop an integrated database for scattered data, and system upgrades for additional analysis. Findings The results showed that the total duration of standby was 34 minutes 29 seconds. The reasons for cancellation data were mostly use of other modes of transportation or delayed arrival. The study suggests development of an integrated database for scattered data. Finally, follow-up studies may discuss government-initiated big data analysis to comparatively analyze the use of taxis reserved for the disabled nationwide for new social value.

Qualitative Data Analysis using Computers (컴퓨터를 이용한 질적 자료 분석)

  • Yi Myung-Sun
    • Journal of Korean Academy of Fundamentals of Nursing
    • /
    • v.6 no.3
    • /
    • pp.570-582
    • /
    • 1999
  • Although computers cannot analyze textual data in the same way as they analyze numerical data. they can nevertheless be of great assistance to qualitative researchers. Thus, the use of computers in analyzing qualitative data has increased since the 1980s. The purpose of this article was to explore advantages and disadvanteges of using computers to analyze textual data and to suggest strategies to prevent problems of using computers. In additon, it illustrated characteristics and functions of softwares designed to analyze qualitative data to help researchers choose the program wisely. It also demonstrated precise functions and procedures of the NUDIST program which was designed to develop a conceptual framework or grounded theory from unstructured data. Major advantage of using computers in qualitative research is the management of huge amount of unstructured data. By managing overloaded data, researcher can keep track of the emerging ideas, arguments and theoretical concepts and can organize these tasks mope efficiently than the traditional method of 'cut-and-paste' technique. Additional advantages are the abilities to increase trustworthiness of research, transparency of research process, and intuitional creativity of the researcher, and to facilitate team and secondary research. On the other hand, disvantages of using computers were identified as worries that the machine could conquer the human understanding and as probability of these problems. it suggested strategies such as 1) deep understanding of orthodoxy in analytical process. To overcome philosophical and theoretical background of qualitative research method, 2) deep understanding of the data as a whole before using software, 3) use of software after familiarity with it, 4) continuous evaluation of software and feedback from them, and 5) continuous awareness of the limitation of the machine, that is computer, in the interpretive analysis.

  • PDF

A Study of Non-parametric Statistical Tests to Analyze Trend in Water Quality Data (수질자료의 추세분석을 위한 비모수적 통계검정에 관한 연구)

  • Lee, Sang-Hoon
    • Journal of Environmental Impact Assessment
    • /
    • v.4 no.2
    • /
    • pp.93-103
    • /
    • 1995
  • This study was carried out to suggest the best statistical test to analyze the trend in monthly water quality data. Traditional parametric tests such as t-test and regression analysis are based on the assumption that the underlying population has a normal distribution and regression analysis additionally assumes that residual errors are independent. Analyzing 9-years monthly COD data collected at Paldang in Han River, the underlying population was found to be neither normal nor independent. Therefore parametric tests are invalid for trend detection. Four Kinds of nonparametric statistical tests, such as Run Test, Daniel test, Mann-Kendall test, and Time Series Residual Analysis were applied to analyze the trend in the COD data, Daniel test and Mann-Kendall test indicated upward trend in COD data. The best nonparametric test was suggested to be Daniel test, which is simple in computation and easy to understand the intuitive meaning.

  • PDF

The Development of CD Trend Recording System (CD 동향기록 시스템 개발)

  • Jang, Me-Hea;Yoon, Kap-Koo;Choe, Hang-Soeb;Lee, Seung-Jae
    • Proceedings of the KIEE Conference
    • /
    • 1997.07b
    • /
    • pp.465-467
    • /
    • 1997
  • The CD Trend Recorder, developed as a substitution of the existing SCR (Stript Chart Recorder), acquires various outputs (e.g. frequency voltage, current, power, temperature, pressure, etc) simultaneously using PC up to 32 different outputs, display, and analyze them just like the existing SCR. It stores the data in CD-ROM so th various data can be stored permanently this system has built-in MMI program and to monitor and analyze the data in real-t expert system link the data to the do where needs the data to display, analyze, in the new medium at the same time.

  • PDF

Towards Effective Analysis and Tracking of Mozilla and Eclipse Defects using Machine Learning Models based on Bugs Data

  • Hassan, Zohaib;Iqbal, Naeem;Zaman, Abnash
    • Soft Computing and Machine Intelligence
    • /
    • v.1 no.1
    • /
    • pp.1-10
    • /
    • 2021
  • Analysis and Tracking of bug reports is a challenging field in software repositories mining. It is one of the fundamental ways to explores a large amount of data acquired from defect tracking systems to discover patterns and valuable knowledge about the process of bug triaging. Furthermore, bug data is publically accessible and available of the following systems, such as Bugzilla and JIRA. Moreover, with robust machine learning (ML) techniques, it is quite possible to process and analyze a massive amount of data for extracting underlying patterns, knowledge, and insights. Therefore, it is an interesting area to propose innovative and robust solutions to analyze and track bug reports originating from different open source projects, including Mozilla and Eclipse. This research study presents an ML-based classification model to analyze and track bug defects for enhancing software engineering management (SEM) processes. In this work, Artificial Neural Network (ANN) and Naive Bayesian (NB) classifiers are implemented using open-source bug datasets, such as Mozilla and Eclipse. Furthermore, different evaluation measures are employed to analyze and evaluate the experimental results. Moreover, a comparative analysis is given to compare the experimental results of ANN with NB. The experimental results indicate that the ANN achieved high accuracy compared to the NB. The proposed research study will enhance SEM processes and contribute to the body of knowledge of the data mining field.

Reliability Analysis of Air-conditioner with Service Data (에어컨 서비스데이터 신뢰도분석)

  • Yun, Won-Young;Sung, Mun-Hyun;Choung, Seock-Joo
    • IE interfaces
    • /
    • v.12 no.1
    • /
    • pp.1-9
    • /
    • 1999
  • This paper presents a method for reliability analysis of the air-conditioner with service data. We explain how to acquire and analyze the service data and some problems in data analysis. We propose two procedures to analyze reliability of air-conditioner using operating time concept and predict the operating times by temperature and failure frequency. Finally, the prediction method for future service is studied by numerical example.

  • PDF

Network-based Microarray Data Analysis Tool

  • Park, Hee-Chang;Ryu, Ki-Hyun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.1
    • /
    • pp.53-62
    • /
    • 2006
  • DNA microarray data analysis is a new technology to investigate the expression levels of thousands of genes simultaneously. Since DNA microarray data structures are various and complicative, the data are generally stored in databases for approaching to and controlling the data effectively. But we have some difficulties to analyze and control the data when the data are stored in the several database management systems or that the data are stored to the file format. The existing analysis tools for DNA microarray data have many difficult problems by complicated instructions, and dependency on data types and operating system. In this paper, we design and implement network-based analysis tool for obtaining to useful information from DNA microarray data. When we use this tool, we can analyze effectively DNA microarray data without special knowledge and education for data types and analytical methods.

  • PDF