• Title/Summary/Keyword: basic statistics analysis

Search Result 861, Processing Time 0.041 seconds

A Study on AI basic statistics Education for Non-majors (비전공자를 위한 AI기초통계 교육의 고찰)

  • Yoo, Jin-Ah
    • Journal of Integrative Natural Science
    • /
    • v.14 no.4
    • /
    • pp.176-182
    • /
    • 2021
  • We live in the age of artificial intelligence, and big data and artificial intelligence education are no longer just for majors, but are required to be able to handle non-majors as well. Software and artificial intelligence education for non-majors is not just a general education, it creates talents who can understand and utilize them, and the quality of education is increasingly important. Through such education, we can nurture creative talents who can create and use new values by fusion with various fields of computing technology. Since 2015, many universities have been implementing software-oriented colleges and AI-oriented colleges to foster software-oriented human resources. However, it is not easy to provide AI basic statistics education of big data analysis deception to non-majors. Therefore, we would like to present a big data education model for non-majors in big data analysis so that big data analysis can be directly applied.

Data Technology: New Interdisciplinary Science & Technology (데이터 기술: 지식창조를 위한 새로운 융합과학기술)

  • Park, Sung-Hyun
    • Journal of Korean Society for Quality Management
    • /
    • v.38 no.3
    • /
    • pp.294-312
    • /
    • 2010
  • Data Technology (DT) is a new technology which deals with data collection, data analysis, information generation from data, knowledge generation from modelling and future prediction. DT is a newly emerged interdisciplinary science & technology in this 21st century knowledge society. Even though the main body of DT is applied statistics, it also contains management information system (MIS), quality management, process system analysis and so on. Therefore, it is an interdisciplinary science and technology of statistics, management science, industrial engineering, computer science and social science. In this paper, first of all, the definition of DT is given, and then the effects and the basic properties of DT, the differences between IT and DT, the 6 step process for DT application, and a DT example are provided. Finally, the relationship among DT, e-Statistics and Data Mining is explained, and the direction of DT development is proposed.

Regression and Correlation Analysis via Dynamic Graphs

  • Kang, Hee Mo;Sim, Songyong
    • Communications for Statistical Applications and Methods
    • /
    • v.10 no.3
    • /
    • pp.695-705
    • /
    • 2003
  • In this article, we propose a regression and correlation analysis via dynamic graphs and implement them in Java Web Start. For the polynomial relations between dependent and independent variables, dynamic graphics are implemented for both polynomial regression and spline estimates for an instant model selection. The results include basic statistics. They are available both as a web-based service and an application.

On the Negative Quadrant Dependence in Three Dimensions

  • Ko, Mi-Hwa;Kim, Tae-Sung
    • Honam Mathematical Journal
    • /
    • v.25 no.1
    • /
    • pp.117-127
    • /
    • 2003
  • In this note we perform an extreme point analysis on two natural definitions of negative quadrant dependence of three random variables and examine how different these two notions of dependence. We also characterize some special distributions which are both negatively lower orthant dependent and negatively upper orthant dependent.

  • PDF

Application of SOM for the Detection of Spatial Distribution considering the Analysis of Basic Statistics for Water Quality and Runoff Data (수질 및 유량자료의 기초통계량 분석에 따른 공간분포 파악을 위한 SOM의 적용)

  • Jin, Young-Hoon;Kim, Yong-Gu;Roh, Kyong-Bum;Park, Sung-Chun
    • Journal of Korean Society on Water Environment
    • /
    • v.25 no.5
    • /
    • pp.735-741
    • /
    • 2009
  • In order to support the basic information for planning and performing the environment management such as Total Maximum Daily Loads (TMDLs), it is highly recommended to understand the spatial distribution of water quality and runoff data in the unit watersheds. Therefore, in the present study, we applied Self-Organizing Map (SOM) to detect the characteristics of spatial distribution of Biological Oxygen Demand (BOD) concentration and runoff data which have been measured in the Yeongsan, Seomjin, and Tamjin River basins. For the purpose, the input dataset for SOM was constructed with the mean, standard deviation, skewness, and kurtosis values of the respective data measured from the stations of 22-subbasins in the rivers. The results showed that the $4{\times}4$ array structure of SOM was selected by the trial and error method and the best performance was revealed when it classified the stations into three clusters according to the basic statistics. The cluster-1 and 2 were classified primarily by the skewness and kurtosis of runoff data and the cluster-3 including the basic statistics of YB_B, YB_C, and YB_D stations was clearly decomposed by the mean value of BOD concentration showing the worst condition of water quality among the three clusters. Consequently, the methodology based on the SOM proposed in the present study can be considered that it is highly applicable to detect the spatial distribution of BOD concentration and runoff data and it can be used effectively for the further utilization using different water quality items as a data analysis tool.

An Analysis of a Working Diary Log of Public Libraries Considering the Library Evaluation (도서관 운영 평가를 고려한 공공도서관 업무(운영)일지 항목 분석)

  • Kang, Yoon-Ho;Park, Young-Ae
    • Journal of the Korean Society for information Management
    • /
    • v.26 no.3
    • /
    • pp.417-434
    • /
    • 2009
  • Statistics items required by National Library Statistics System are based on data for library evaluation and policy making of library management from a theoretical standpoint. However, It is realized there are differences between Statistics items required by National Library Statistics System and those can be collected at the field of public libraries. In accordance with this point, this paper contained an analysis of a working diary log of public libraries as a basic material able to collect data at the field of public libraries and also surveyed the present usage situation of Library Management Program able to automatically collect data to recognize that National Library Statistics is reliably or validly objective data. An analysis data of this research will be a basic material to plan the standard guide of a working diary log of public libraries from now on.

Introductory Statistics textbooks: crisis or opportunity? (교양 통계학 교재: 위기인가? 기회인가?)

  • Choi, Sookhee;Han, Kyungsoo
    • The Korean Journal of Applied Statistics
    • /
    • v.35 no.1
    • /
    • pp.105-117
    • /
    • 2022
  • Recently, the number of students taking basic statistics in liberal arts courses at universities nationwide has been increasing significantly. Students who learn statistics only for one semester are more likely to live as consumers than producers of statistical analysis in the future. What consumers need is statistical literacy and thinking skills rather than statistical methods. This paper deals with what points should be considered in order to develop textbooks that improve statistical thinking.

Implementation of On-Line Survey and Analysis System based on Database Structure

  • Park, Hee-Chang;Nam, Ki-Seong;Kim, Hee-Jae;Song, Gum-Min;Myung, Ho-Min
    • Journal of the Korean Data and Information Science Society
    • /
    • v.13 no.1
    • /
    • pp.1-16
    • /
    • 2002
  • In this paper, we suggest and implement an on-line survey and analysis system based on the database structure. We can do trends analysis in this system and reduce the number of function because we can treat basic algorithms on the database.

  • PDF

A Review of the Statistical Analysis used in Clinical Articles Published on Journal of Korean Neurosurgical Society

  • Kang, Wee-Chang
    • Journal of Korean Neurosurgical Society
    • /
    • v.40 no.4
    • /
    • pp.304-308
    • /
    • 2006
  • Statistical analyses used in clinical articles published on the Journal of Korean Neurosurgical Society were identified and appropriateness of statistical aspects in reporting results was assessed. Forty seven clinical articles were selected in this study, which were published from February, 2005 to February, 2006 on the journal. The frequency of statistical analysis was as follows : descriptive statistics only 24 [51.1%]. one type of statistical method 10 [21.3%], two or more methods 13 [27.6%]. An assessment of statistical aspects was performed in 24 clinical articles reporting inferential statistics. Ten articles [41.7%] did not adequately describe or reference all statistical methods used. There were six articles [25.0%] not reporting the confidence level used as the critical criteria of the statistical significance. In thirteen articles [54.2%] it seems more appropriate to implement multivariate analyses in addition to univariate analyses. We recommend that the journal readers should concentrate on improving their knowledge of basic statistics and statistical review for manuscripts submitted should be sought from professionals in the fields of biostatistics and epidemiology.

A Study on Efficient Cluster Analysis of Bio-Data Using MapReduce Framework

  • Yoo, Sowol;Lee, Kwangok;Bae, Sanghyun
    • Journal of Integrative Natural Science
    • /
    • v.7 no.1
    • /
    • pp.57-61
    • /
    • 2014
  • This study measured the stream data from the several sensors, and stores the database in MapReduce framework environment, and it aims to design system with the small performance and cluster analysis error rate through the KMSVM algorithm. Through the KM-SVM algorithm, the cluster analysis effective data was used for U-health system. In the results of experiment by using 2003 data sets obtained from 52 test subjects, the k-NN algorithm showed 79.29% cluster analysis accuracy, K-means algorithm showed 87.15 cluster analysis accuracy, and SVM algorithm showed 83.72%, KM-SVM showed 90.72%. As a result, the process speed and cluster analysis effective ratio of KM-SVM algorithm was better.