• Title/Summary/Keyword: 모란 통계량

Search Result 545, Processing Time 0.031 seconds

A Study of Temporal Characteristics From Multi-Dimensional Precipitation Model (다차원 강우모형의 시간적인 특성 연구)

  • Kim, Sangdan;Yoo, Chulsang;Kim, Joong-Hoon;Yoon, Yong Nam
    • Journal of Korea Water Resources Association
    • /
    • v.33 no.6
    • /
    • pp.783-791
    • /
    • 2000
  • A multidimensional representation for precipitation, given In the theory proposed by E. Waymire et al. (1984), is used for simulating rainfall in space and time. The model produces moving storms with realistic meso-scale meteorological features in time and space. The first- and second-order statistics derived from observed JX)int gauge data were used to estimate the model parameters based on the Nelder-Mead algorithm of optimization. Then twelve-year traces of rainfall intensities at fixed gage stations were generated at intervals of 1 hours. First- and second-order statistics are evaluated from the above series, which are used for estimating the parameters of one dimensional model of temporal rainfall at a point. As a result from the comparisons of one dimensional model parameters used observed and generated data from multidimensional model, we found that the multidimensional rainfall model generated visually realistic spatial patterns of rainfall as well as realistic temporal hyetographs of rainfall at a point. point.

  • PDF

Features of sample concepts in the probability and statistics chapters of Korean mathematics textbooks of grades 1-12 (초.중.고등학교 확률과 통계 단원에 나타난 표본개념에 대한 분석)

  • Lee, Young-Ha;Shin, Sou-Yeong
    • Journal of Educational Research in Mathematics
    • /
    • v.21 no.4
    • /
    • pp.327-344
    • /
    • 2011
  • This study is the first step for us toward improving high school students' capability of statistical inferences, such as obtaining and interpreting the confidence interval on the population mean that is currently learned in high school. We suggest 5 underlying concepts of 'discretion of contingency and inevitability', 'discretion of induction and deduction', 'likelihood principle', 'variability of a statistic' and 'statistical model', those are necessary to appreciate statistical inferences as a reliable arguing tools in spite of its occasional erroneous conclusions. We assume those 5 concepts above are to be gradually developing in their school periods and Korean mathematics textbooks of grades 1-12 were analyzed. Followings were found. For the right choice of solving methodology of the given problem, no elementary textbook but a few high school textbooks describe its difference between the contingent circumstance and the inevitable one. Formal definitions of population and sample are not introduced until high school grades, so that the developments of critical thoughts on the reliability of inductive reasoning could not be observed. On the contrary of it, strong emphasis lies on the calculation stuff of the sample data without any inference on the population prospective based upon the sample. Instead of the representative properties of a random sample, more emphasis lies on how to get a random sample. As a result of it, the fact that 'the random variability of the value of a statistic which is calculated from the sample ought to be inherited from the randomness of the sample' could neither be noticed nor be explained as well. No comparative descriptions on the statistical inferences against the mathematical(deductive) reasoning were found. Few explanations on the likelihood principle and its probabilistic applications in accordance with students' cognitive developmental growth were found. It was hard to find the explanation of a random variability of statistics and on the existence of its sampling distribution. It is worthwhile to explain it because, nevertheless obtaining the sampling distribution of a particular statistic, like a sample mean, is a very difficult job, mere noticing its existence may cause a drastic change of understanding in a statistical inference.

  • PDF

A study on the properties of sensitivity analysis in principal component regression and latent root regression (주성분회귀와 고유값회귀에 대한 감도분석의 성질에 대한 연구)

  • Shin, Jae-Kyoung;Chang, Duk-Joon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.2
    • /
    • pp.321-328
    • /
    • 2009
  • In regression analysis, the ordinary least squares estimates of regression coefficients become poor, when the correlations among predictor variables are high. This phenomenon, which is called multicollinearity, causes serious problems in actual data analysis. To overcome this multicollinearity, many methods have been proposed. Ridge regression, shrinkage estimators and methods based on principal component analysis (PCA) such as principal component regression (PCR) and latent root regression (LRR). In the last decade, many statisticians discussed sensitivity analysis (SA) in ordinary multiple regression and same topic in PCR, LRR and logistic principal component regression (LPCR). In those methods PCA plays important role. Many statisticians discussed SA in PCA and related multivariate methods. We introduce the method of PCR and LRR. We also introduce the methods of SA in PCR and LRR, and discuss the properties of SA in PCR and LRR.

  • PDF

Sample size using response rate on repeated surveys (계속조사에서 응답률을 반영한 표본크기)

  • Park, Hyeonah;Na, Seongryong
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.5
    • /
    • pp.587-597
    • /
    • 2018
  • Procedures, such as sampling technique, survey method, and questionnaire preparation, are required in order to obtain sample data in accordance with the purpose of a survey. An important procedure is the decision of the sample size formula. The sample size formula is determined by setting the target error and total cost according to the sampling method. In this paper, we propose a sample size formula using population changes over time, estimation error of the previous time and response rate of past data when the target error and the expected response rate are given in the simple random sampling. In actual research, we use estimators that apply complex weights in addition to design-based weights. Therefore, we induce a sample size formula for estimators using design-based weights and nonresponse adjustment coefficients, that can be a formula that reflects differences in response rates when survey methods are changed over time. In addition, we use simulations to compare the proposed formula with the existing sample size formula.

Independence tests using coin package in R (coin 패키지를 이용한 독립성 검정)

  • Kim, Jinheum;Lee, Jung-Dong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.5
    • /
    • pp.1039-1055
    • /
    • 2014
  • The distribution of a test statistic under a null hypothesis depends on the unknown distribution of the data and thus is unknown as well. Conditional tests replace the unknown null distribution by the conditional null distribution, that is, the distribution of the test statistic given the observed data. This approach is known as permutation tests and was developed by Fisher (Fisher, 1935). Theoretical framework for permutation tests was given by Strasser and Weber(1999). The coin package developed by Hothon et al. (2006, 2008) implements a unified approach for conditional inference via the generic independence test. Because convenient functions for the most prominent problems are available, users will not have to use the extremely flexible procedure. In this article we briefly review the underlying theory from Strasser and Weber (1999) and explain how to transform the data to perform the generic function independence test. Finally it was illustrated with a few real data sets.

Yearly Variation of Genetic Parameters for Yielding Characters of Tea Tree(Lycium chinense Miller) Varieties (구기자 품종의 수량형질에 대한 유전통계량의 년차간변동)

  • 권병선;이유식;이종일;이상래;박희진
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.31 no.2
    • /
    • pp.179-185
    • /
    • 1986
  • This study was to compare year variations of heritability, phenotypic, genotypic and environmental correlations as well as pathway coefficients for main characters of tea tree to provide useful selection information for improving tea tree. The data collected from the performance yield trials from 1979 to 1981 were used in this study. I. The genetic variance of fresh fruit yielding, dryed fruit yielding and fresh weight of root was not only greatly varied with year, but also the largest among all characteristies studied. Other characteristics showed higher genetic variance than environmental variances, and year variances were not large. 2. Both year and variety x year interaction were highly significant sources of variation for all yield characteristics, and year variances were not large. 3. All characteristics showed high broad sense heritabilities, and the broad sense heritability was not varied with year. 4. The genetic correlation coefficients between fresh fruit yielding and dryed fruit yielding, fresh weight of root and Gigolpi, dryed weight of root and Gigolpi were positive, and the year variation was not large. S. The pathway coefficients of the character was not only greatly varied with year and the fresh weight of root affected directly on the Gigolpi.

  • PDF

Hierarchically penalized sparse principal component analysis (계층적 벌점함수를 이용한 주성분분석)

  • Kang, Jongkyeong;Park, Jaeshin;Bang, Sungwan
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.1
    • /
    • pp.135-145
    • /
    • 2017
  • Principal component analysis (PCA) describes the variation of multivariate data in terms of a set of uncorrelated variables. Since each principal component is a linear combination of all variables and the loadings are typically non-zero, it is difficult to interpret the derived principal components. Sparse principal component analysis (SPCA) is a specialized technique using the elastic net penalty function to produce sparse loadings in principal component analysis. When data are structured by groups of variables, it is desirable to select variables in a grouped manner. In this paper, we propose a new PCA method to improve variable selection performance when variables are grouped, which not only selects important groups but also removes unimportant variables within identified groups. To incorporate group information into model fitting, we consider a hierarchical lasso penalty instead of the elastic net penalty in SPCA. Real data analyses demonstrate the performance and usefulness of the proposed method.

Development of Adsorption Process with UiO-66 Particles for Hydrogen Purification Using Statistical Design of Experiment (통계학적 실험계획법을 이용한 수소정제용 UiO-66 흡착제 개발)

  • Lee, Hyun Sik;Kim, Da Som;Park, Ji Won;Yoo, Kye Sang
    • Korean Chemical Engineering Research
    • /
    • v.56 no.6
    • /
    • pp.784-791
    • /
    • 2018
  • UiO-66 particles were synthesized under various synthesis conditions to study the adsorption of carbon dioxide for hydrogen purification. For the purpose, the design and analysis of experiments was performed using statistical design of experiment method. As the synthesis time, temperature and acetic acid amount increased, the crystallinity of UiO-66 particles increased. Especially, the amount of acetic acid was confirmed as an important factor in determining the crystallinity of the particles. The specific surface area of the particles measured by the nitrogen adsorption method also showed a similar tendency. Using the general factor analysis in the experimental design method, the main effects and interactions of major factors were analyzed. In addition, the carbon dioxide adsorption capacity was predicted using a nonlinear regression method. Then, the adsorption performance was shown through surface and contour maps for all ranges.

International Patent Classificaton Using Latent Semantic Indexing (잠재 의미 색인 기법을 이용한 국제 특허 분류)

  • Jin, Hoon-Tae
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2013.11a
    • /
    • pp.1294-1297
    • /
    • 2013
  • 본 논문은 기계학습을 통하여 특허문서를 국제 특허 분류(IPC) 기준에 따라 자동으로 분류하는 시스템에 관한 연구로 잠재 의미 색인 기법을 이용하여 분류의 성능을 높일 수 있는 방법을 제안하기 위한 연구이다. 종래 특허문서에 관한 IPC 자동 분류에 관한 연구가 단어 매칭 방식의 색인 기법에 의존해서 이루어진바가 있으나, 현대 기술용어의 발생 속도와 다양성 등을 고려할 때 특허문서들 간의 관련성을 분석하는데 있어서는 단어 자체의 빈도 보다는 용어의 개념에 의한 접근이 보다 효과적일 것이라 판단하여 잠재 의미 색인(LSI) 기법에 의한 분류에 관한 연구를 하게 된 것이다. 실험은 단어 매칭 방식의 색인 기법의 대표적인 자질선택 방법인 정보획득량(IG)과 카이제곱 통계량(CHI)을 이용했을 때의 성능과 잠재 의미 색인 방법을 이용했을 때의 성능을 SVM, kNN 및 Naive Bayes 분류기를 사용하여 분석하고, 그중 가장 성능이 우수하게 나오는 SVM을 사용하여 잠재 의미 색인에서 명사가 해당 용어의 개념적 의미 구조를 구축하는데 기여하는 정도가 어느 정도인지 평가함과 아울러, LSI 기법 이용시 최적의 성능을 나타내는 특이값의 범위를 실험을 통해 비교 분석 하였다. 분석결과 LSI 기법이 단어 매칭 기법(IG, CHI)에 비해 우수한 성능을 보였으며, SVM, Naive Bayes 분류기는 단어 매칭 기법에서는 비슷한 수준을 보였으나, LSI 기법에서는 SVM의 성능이 월등이 우수한 것으로 나왔다. 또한, SVM은 LSI 기법에서 약 3%의 성능 향상을 보였지만 Naive Bayes는 오히려 20%의 성능 저하를 보였다. LSI 기법에서 명사가 잠재적 의미 구조에 미치는 영향은 모든 단어들을 내용어로 한 경우 보다 약 10% 더 향상된 결과를 보여주었고, 특이값의 범위에 따른 성능 분석에 있어서는 30% 수준에 Rank 되는 범위에서 가장 높은 성능의 결과가 나왔다.

Hydrologic Disaggregation Model using Neural Networks Technique (신경망기법을 이용한 수문학적 분해모형)

  • Kim, Sung-Won
    • Journal of Wetlands Research
    • /
    • v.12 no.3
    • /
    • pp.79-97
    • /
    • 2010
  • The purpose of this research is to apply the neural networks models for the hydrologic disaggregation of the yearly pan evaporation(PE) data in Republic of Korea. The neural networks models consist of multilayer perceptron neural networks model(MLP-NNM) and support vector machine neural networks model(SVM-NNM), respectively. And, for the evaluation of the neural networks models, they are composed of training and test performances, respectively. The three types of data such as the historic, the generated, and the mixed data are used for the training performance. The only historic data, however, is used for the testing performance. The application of MLP-NNM and SVM-NNM for the hydrologic disaggregation of nonlinear time series data is evaluated from results of this research. Four kinds of the statistical index for the evaluation are suggested; CC, RMSE, E, and AARE, respectively. Homogeneity test using ANOVA and Mann-Whitney U test, furthermore, is carried out for the observed and calculated monthly PE data. We can construct the credible monthly PE data from the hydrologic disaggregation of the yearly PE data, and the available data for the evaluation of irrigation and drainage networks system can be suggested.