• Title/Summary/Keyword: Statistics Korea


A Trend Analysis on the Educational Research of the Probability and Statistics - Focused on Papers Published in The Mathematical Education, the Journal of Korea Society of Mathematical Education - (확률.통계 연구에 대한 수학교육학적 고찰 -<수학교육>에 게재된 논문을 중심으로-)

  • 이영하;심효정
    • The Mathematical Education / v.42 no.2 / pp.203-218 / 2003
  • The purpose of this study is to identify the essential characteristics of teaching probability and statistics among the various fields of mathematics. We also tried to connect the study of probability and statistics education with what a discipline needs in order to have its own identity as a unique research field. To find a future direction for pedagogical research in probability and statistics, we first selected the papers on probability and statistics published in The Mathematical Education (Series A), the Journal of Korea Society of Mathematical Education, and established the following research question: what characteristics emerge when these papers are classified into the following subcategories: contents of probability and statistics education, research methods of mathematics education, methods of learning and teaching, and measurement and evaluation? We classified the papers into two kinds. One is related to the educational contents, and includes the methods of learning and teaching and the measurement and evaluation. The other is related to the methods of research, which are not part of the educational curriculum but are essential for establishing the identity of mathematics education. By period, papers on curricular contents in the 1960s were influenced by the New Mathematics, and those in the 1980s by 'back to basics'. In the 1990s, papers on methods of learning and teaching and on measurement and evaluation increased in number. The Mathematical Education (Series A) covers contents, methods of learning and teaching, and measurement and evaluation. When we examined the papers on the probability and statistics contents of junior high school textbooks and on methods of learning and teaching, we found that those papers account for 1.84% of the papers in the journal.
When it comes to the methods of learning and teaching, most of the studies in The Mathematical Education (Series A) concern the use of concrete tools such as experiments and the practical application of computer programs. Through this study, we found that broader and more active research on probability and statistics is required, and that studies of methods of learning and teaching must proceed in diverse directions. Research is also needed on how students understand probability and statistics, and on connection, communication, and representation in probability and statistics contexts. The Mathematical Education (Series A) has no papers on methods of research. Mathematics pedagogy is a mixture of various fields: mathematical psychology, mathematical philosophy, the history of mathematics, and mathematics itself. If no research method suited to mathematics education exists, the issues of mathematics pedagogy may be taken over by those other fields. We must search for a research method unique to mathematics education so that mathematics pedagogy has its own identity as a discipline. Further study of this aspect is needed.


Imputation Methods for the Population and Housing Census 2000 in Korea

  • Kim, Young-Won;Ryu, Jeabok;Park, Jinwoo;Lee, Jaewon
    • Communications for Statistical Applications and Methods / v.10 no.2 / pp.575-583 / 2003
  • We propose imputation strategies for the Population and Housing Census 2000 in Korea. The total area of floor space and marital status, which have relatively high non-response rates in the Census, are used to develop effective missing value imputation procedures. The Classification and Regression Tree (CART) is employed to construct the imputation cells for hot-deck imputation, as well as to predict missing values by a model-based approach. We compare three imputation methods: CART model-based imputation, hot-deck imputation based on CART, and the logical hot-deck imputation proposed by the Korea National Statistical Office. The results suggest that the proposed hot-deck imputation based on CART is very efficient and strongly recommended.
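
As a rough illustration of hot-deck imputation within imputation cells, the sketch below copies a donor value from a respondent in the same cell. It is only a minimal sketch: the function names, the sample records, and the hand-written `cell_of` rule are assumptions made for illustration. In the paper the cells come from a fitted CART, which is not reproduced here.

```python
import random
from collections import defaultdict


def hot_deck_impute(records, target, cell_of, seed=0):
    """Fill missing `target` values with a random donor value from the same cell."""
    rng = random.Random(seed)

    # Collect observed values (donors) per imputation cell.
    donors = defaultdict(list)
    for rec in records:
        if rec[target] is not None:
            donors[cell_of(rec)].append(rec[target])

    imputed = []
    for rec in records:
        rec = dict(rec)  # copy so the input records stay untouched
        if rec[target] is None:
            pool = donors.get(cell_of(rec))
            if pool:  # leave the value missing when the cell has no donor
                rec[target] = rng.choice(pool)
        imputed.append(rec)
    return imputed


# Hypothetical cell rule: a stand-in for the leaves of a fitted CART.
def cell_of(rec):
    return "under_30" if rec["age"] < 30 else "30_plus"


# Hypothetical records, not census data.
records = [
    {"age": 22, "marital_status": "single"},
    {"age": 25, "marital_status": None},
    {"age": 41, "marital_status": "married"},
    {"age": 58, "marital_status": None},
]
completed = hot_deck_impute(records, "marital_status", cell_of)
```

Because donors are drawn only from the same cell, the quality of the imputation depends on how homogeneous the cells are, which is the role the abstract assigns to CART.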

Effect of Outliers on Sample Correlation Coefficient

  • Kim, Choongrak;Park, Byeong U.;Park, Kook L.;Whasoo Bae
    • Journal of the Korean Statistical Society / v.29 no.3 / pp.285-294 / 2000
  • In analyzing bivariate data, the sample correlation coefficient is often used, and it is quite sensitive to one or a few isolated cases. In this article we derive a formula for the effect of $k$ observations on the sample correlation coefficient by the deletion method. To give a reference value for the isolated cases, the asymptotic distribution of the formula is derived. We also give some interpretations of several types of isolated cases and an example based on a real data set.
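
The deletion idea can be illustrated by brute force: compute the sample correlation with all cases, recompute it with $k$ cases removed, and take the difference. This is only a sketch of the concept. It does not reproduce the closed-form formula or the asymptotic reference value derived in the paper, and the data and function names are assumptions.

```python
import math


def pearson_r(xs, ys):
    """Sample correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def deletion_effect(xs, ys, drop):
    """r computed on all cases minus r computed after deleting the cases in `drop`."""
    dropped = set(drop)
    keep = [i for i in range(len(xs)) if i not in dropped]
    r_full = pearson_r(xs, ys)
    r_deleted = pearson_r([xs[i] for i in keep], [ys[i] for i in keep])
    return r_full - r_deleted


# Hypothetical data: the last case is an isolated point far from the others.
xs = [1, 2, 3, 4, 5, 10]
ys = [2, 1, 4, 3, 5, -5]
effect = deletion_effect(xs, ys, drop=[5])  # k = 1 deleted observation
```

In this toy data set the single isolated case flips the sign of the correlation, which is the kind of sensitivity the abstract describes.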


A Stratified Randomized Response Technique (층화 확률화 응답 기법)

  • Ki Hak Hong;Jun Keun Yum;Hwa Young Lee
    • The Korean Journal of Applied Statistics / v.7 no.1 / pp.141-147 / 1994
  • In the present paper an attempt has been made to develop a stratified randomized response technique for the cases where respondents are selected using simple random sampling without replacement (SRSWOR) as well as simple random sampling with replacement (SRSWR). The conditions under which the proposed technique is more efficient than Warner's technique have been obtained.
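
For orientation, Warner's estimator and a weighted combination across strata can be sketched as follows. Under Warner's design each respondent answers the sensitive question with probability $P$ and its complement with probability $1-P$, so the probability of a "yes" is $\lambda = P\pi + (1-P)(1-\pi)$. The stratified combination below, a weighted sum of per-stratum estimates with known stratum weights, is an assumption about the general form and not necessarily the exact estimator of the paper. The function names and the numbers are hypothetical.

```python
def warner_estimate(yes_count, n, p):
    """Warner's estimate of the sensitive proportion from n randomized responses."""
    if p == 0.5:
        raise ValueError("p = 0.5 makes the sensitive proportion unidentifiable")
    lam = yes_count / n  # observed proportion of 'yes' answers
    return (lam - (1 - p)) / (2 * p - 1)


def stratified_warner(strata):
    """Weighted sum of per-stratum Warner estimates.

    strata: iterable of (weight, yes_count, n, p); weights must sum to 1.
    """
    strata = list(strata)
    if abs(sum(w for w, _, _, _ in strata) - 1.0) > 1e-9:
        raise ValueError("stratum weights must sum to 1")
    return sum(w * warner_estimate(y, n, p) for w, y, n, p in strata)


# Hypothetical two-stratum example.
overall = stratified_warner([(0.6, 40, 100, 0.7), (0.4, 55, 100, 0.7)])
```

The design probability `p` may differ between strata, which is why it is passed per stratum and not as a single argument.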


Detecting Genetic Association and Gene-Gene Interaction using Network Analysis in Case-Control Study

  • Jin, Seo-Hoon;Lee, Min-Hee;Lee, Hyo-Jung;Park, Mi-Ra
    • The Korean Journal of Applied Statistics / v.25 no.4 / pp.563-573 / 2012
  • Various methods of analysis have been proposed to understand gene-disease relations and gene-gene interaction effects for a disease by comparing genotypes in case-control studies. In this study, we propose a method for detecting genetic association and gene-gene interaction using a network graph and the centrality measures used in social network analysis. The applicability of the proposed method was studied through an analysis of real genetic data.
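
A centrality measure on a network graph can be illustrated with degree centrality, one of the standard measures from social network analysis. The abstract does not say how edges are defined or which measures are used, so the edge list, the node names, and the choice of degree centrality below are all assumptions for illustration.

```python
from collections import defaultdict


def degree_centrality(edges):
    """Degree centrality: number of neighbors divided by (number of nodes - 1)."""
    neighbors = defaultdict(set)
    for a, b in edges:
        if a == b:
            continue  # ignore self-loops
        neighbors[a].add(b)
        neighbors[b].add(a)
    n = len(neighbors)
    if n < 2:
        return {}
    return {node: len(nb) / (n - 1) for node, nb in neighbors.items()}


# Hypothetical edges, e.g. pairs of genes flagged as interacting by some test.
edges = [("G1", "G2"), ("G1", "G3"), ("G1", "G4"), ("G2", "G3")]
centrality = degree_centrality(edges)
hub = max(centrality, key=centrality.get)
```

A node with high centrality, here `G1`, would be a candidate for closer inspection under this kind of approach.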

SUPPORT VECTOR MACHINE USING K-MEANS CLUSTERING

  • Lee, S.J.;Park, C.;Jhun, M.;Koo, J.Y.
    • Journal of the Korean Statistical Society / v.36 no.1 / pp.175-182 / 2007
  • The support vector machine has been successful in many applications because of its flexibility and high accuracy. However, when a training data set is large or imbalanced, the support vector machine may suffer from significant computational problems or loss of accuracy in predicting minority classes. We propose a modified version of the support vector machine using K-means clustering that exploits the information in class labels during the clustering process. For large data sets, our method can save computation time by reducing the number of data points without significant loss of accuracy. Moreover, our method can deal with imbalanced data sets effectively by alleviating the influence of the dominant class.

A New Method of Yielding the GDP of Korea Small Business: Conversion of the Statistics of Workplace Units to Industrial Units (중소 서비스 기업 부가가치 산출을 위한 새로운 방법: 사업체 통계를 기업체 통계로 전환)

  • Jeong, Hyeong-Chul;Chung, Yeon-Seung
    • The Korean Journal of Applied Statistics / v.20 no.1 / pp.11-22 / 2007
  • In this study, we propose new statistical methods to convert statistics based on workplace units into statistics based on industrial units, using the industrial survey data published by the National Statistical Office in 2001. This could help to assess the weight of the service sectors of Korean small businesses in terms of the Gross Domestic Product (GDP).

Ensemble variable selection using genetic algorithm

  • Seogyoung, Lee;Martin Seunghwan, Yang;Jongkyeong, Kang;Seung Jun, Shin
    • Communications for Statistical Applications and Methods / v.29 no.6 / pp.629-640 / 2022
  • Variable selection is one of the most crucial tasks in supervised learning, such as regression and classification. Best subset selection is straightforward and optimal but not practically applicable unless the number of predictors is small. In this article, we propose directly solving the best subset selection problem via the genetic algorithm (GA), a popular stochastic optimization algorithm based on the principle of Darwinian evolution. To further improve variable selection performance, we propose running multiple GAs to solve the best subset selection problem and then synthesizing the results, which we call ensemble GA (EGA). The EGA significantly improves variable selection performance. In addition, the proposed method is essentially best subset selection and is hence applicable to a variety of models with different selection criteria. We compare the proposed EGA to existing variable selection methods under various models, including linear regression, Poisson regression, and Cox regression for survival data. Both simulation and real data analysis demonstrate the promising performance of the proposed method.

A Study on the Acquisition of Usage Statistics based on SUSHI Project (SUSHI 기반 학술정보 이용통계 수집 모델 연구)

  • Kim, Sun-Tae;Lim, Seok-Jong
    • Proceedings of the Korea Contents Association Conference / 2007.11a / pp.35-39 / 2007
  • Recently, usage statistics have become widely available from online content providers. However, the statistics are not yet available in a consistent data container, and the administrative cost of individual provider-by-provider downloads is high. The Standardized Usage Statistics Harvesting Initiative (SUSHI) is developing an automated request and response protocol for moving Project COUNTER (Counting Online Usage of Networked Electronic Resources) Code of Practice usage statistics from providers to library electronic repositories. SUSHI will help libraries make better decisions by reducing the administrative overhead of using Project COUNTER statistics. Project COUNTER serves libraries and publishers in the recording and exchange of usage statistics for electronic resources, initially journals and databases. By following COUNTER's Code of Practice, vendors can provide library customers with Excel or CSV (comma delimited) files of usage data using COUNTER's standardized formats and data elements. The result is a consistent, credible, and compatible set of usage data from multiple content providers. In this study, we propose an acquisition model for usage data based on SUSHI for KESLI, a consortium in Korea for overseas electronic journals.


Asymptotic Distribution of the LM Test Statistic for the Nested Error Component Regression Model

  • Jung, Byoung-Cheol;Myoungshic Jhun;Song, Seuck-Heun
    • Journal of the Korean Statistical Society / v.28 no.4 / pp.489-501 / 1999
  • In this paper, we consider the panel data regression model in which the disturbances have a nested error component structure. We derive a Lagrange Multiplier (LM) test that jointly tests for the presence of random individual effects and nested effects under the normality assumption on the disturbances. This test extends the earlier work of Breusch and Pagan (1980) and Baltagi and Li (1991). Further, it is shown that this LM test has the same asymptotic distribution without the normality assumption on the disturbances.
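
As background, the classical one-way LM statistic of Breusch and Pagan (1980), which the paper extends, can be computed from pooled OLS residuals in a balanced panel as $LM = \frac{NT}{2(T-1)}\left[\frac{\sum_i (\sum_t e_{it})^2}{\sum_i \sum_t e_{it}^2} - 1\right]^2$, which is asymptotically $\chi^2(1)$ under the null of no random individual effects. The sketch below implements only this earlier statistic, not the joint test for nested effects derived in the paper. The function name and the input layout are assumptions.

```python
def breusch_pagan_lm(residuals):
    """One-way Breusch-Pagan LM statistic for a balanced panel.

    residuals: list of N rows, each holding the T pooled-OLS residuals
    of one individual.
    """
    n = len(residuals)
    t = len(residuals[0]) if n else 0
    if n == 0 or t < 2 or any(len(row) != t for row in residuals):
        raise ValueError("need a balanced panel with at least two periods")

    # Sum over individuals of the squared within-individual residual totals.
    num = sum(sum(row) ** 2 for row in residuals)
    # Total sum of squared residuals.
    den = sum(e * e for row in residuals for e in row)
    return n * t / (2 * (t - 1)) * (num / den - 1) ** 2


# Hypothetical residuals for N = 2 individuals observed over T = 2 periods.
lm = breusch_pagan_lm([[1.0, -1.0], [1.0, -1.0]])
```

The resulting value would be compared with a $\chi^2(1)$ critical value. The joint test in the paper has its own statistic and degrees of freedom, which the abstract does not state.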
