• Title/Summary/Keyword: 최소표본수

Search Result 120, Processing Time 0.031 seconds

Unrelated question model with quantitative attribute by simple cluster sampling (단순집락추출법에 의한 양적속성의 무관질문모형)

  • 이기성;홍기학
    • The Korean Journal of Applied Statistics
    • /
    • v.11 no.1
    • /
    • pp.141-150
    • /
    • 1998
  • In this paper, we developed one-stage cluster randomized response model for obtaining quantitative data by using the Greenberg et al. model(1971) when the population was made up of sensitive quantitative clusters. We obtained the minimum variance by calculating the cluster's size and the optimum number of sample clusters under the some given constant cost. We compared the efficiency of our model with the Greenberg et al. model by simple random sampling.

  • PDF

Local Linear Logistic Classification of Microarray Data Using Orthogonal Components (직교요인을 이용한 국소선형 로지스틱 마이크로어레이 자료의 판별분석)

  • Baek, Jang-Sun;Son, Young-Sook
    • The Korean Journal of Applied Statistics
    • /
    • v.19 no.3
    • /
    • pp.587-598
    • /
    • 2006
  • The number of variables exceeds the number of samples in microarray data. We propose a nonparametric local linear logistic classification procedure using orthogonal components for classifying high-dimensional microarray data. The proposed method is based on the local likelihood and can be applied to multi-class classification. We applied the local linear logistic classification method using PCA, PLS, and factor analysis components as new features to Leukemia data and colon data, and compare the performance of the proposed method with the conventional statistical classification procedures. The proposed method outperforms the conventional ones for each component, and PLS has shown best performance when it is embedded in the proposed method among the three orthogonal components.

A Study on the Estimation of Diameter Distribution and Volumetric Frequency of Joint Discs Using the Least Square Method (최소자승법을 이용한 원판형 절리의 직경분포와 체적빈도 추정에 관한 연구)

  • Song Jae-Joon
    • Tunnel and Underground Space
    • /
    • v.15 no.2 s.55
    • /
    • pp.137-144
    • /
    • 2005
  • An estimation technique of the joint diameter distribution using the least square method is suggested. When utilizing the technique by Song and Lee, the diameter distribution would be obtained only from the trace length distribution defined in an infinite window after the trace length distribution is estimated from the contained trace length distribution. With the new technique, however, the diameter distribution can be directly obtained from the sample histogram of the contained trace lengths. Compared with the previous technique, it shows a more accurate result for small sizes of joint samples and provides the joint geometry parameter of volumetric frequency. Verification of this new technique was completed by using Monte Carlo simulations.

Uncertainty Estimation of AR Model Parameters Using a Bayesian technique (Bayesian 기법을 활용한 AR Model 매개변수의 불확실성 추정)

  • Park, Chan-Young;Park, Jong-Hyeon;Park, Min-Woo;Kwon, Hyun-Han
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2016.05a
    • /
    • pp.280-280
    • /
    • 2016
  • 특정 자료의 시간의 흐름에 따른 예측치를 추정하는 방법으로 AR Model 즉, 자기회귀모형이 많이 사용되고 있다. AR Model은 변수의 현재 값을 과거 값의 함수로 나타내게 되는데, 이런 시계열 분석 모델을 사용할 때 매개변수의 추정 과정이 필수적으로 요구된다. 일반적으로 매개변수를 추정하는 방법에는 확률적근사법(stochastic approximation), 최소제곱법(method of least square), 자기상관법(method of autocorrelation method), 최우도법(method of maximum likelihood) 등이 있다. AR Model에서 가장 많이 사용되는 최우도법은 표본크기가 충분히 클 때 가장 효율적인 방법으로 평가되지만 수치적으로 해를 구하는 과정이 복잡한 경우가 많으며, 해를 구하지 못하는 어려움이 따르기도 한다. 또한 표본 크기가 작을 때 일반적으로 잘 일치하지 않은 결과를 얻게 된다. 우리나라의 강우, 유량 등의 자료는 자료의 수가 적은 경우가 많기 때문에 최우도법을 통한 매개변수 추정 시 불확실성이 내재되어있지만 그것을 정량적으로 제시하는데 한계가 있다. 본 연구에서는 AR Model의 매개변수 추정 시 Bayesian 기법으로 매개변수의 사후분포(posterior distribution)를 제공하여 매개변수의 불확실성 구간을 정량적으로 표현하게 됨으로써, 시계열 분석을 통해 보다 신뢰성 있는 예측치를 얻을 수 있으리라 판단된다.

  • PDF

Plot Size for Investigating Forest Community Structure (IV) - Adequate Number of Plots for Shrub Stratum in a Mixed Forest Community of Abies holophylla and Broad-leaved Trees at Odaesan National Park - (삼림군집구조 조사를 위한 조사구 크기에 관한 연구(IV) - 오대산 국립공원지역 젓나무-활엽수 혼효림군집 관목층의 적정 조사구수 -)

  • 박인협;문광선
    • Korean Journal of Environment and Ecology
    • /
    • v.9 no.2
    • /
    • pp.197-201
    • /
    • 1996
  • A mixed forest community of Abies holophylla and broad-leaved trees in Odaesan National Park was studied to determine the adequate number of plots of shrub stratum for investigating forest community structure. Thirty 5m*5m plots were set up in the shrub stratum, and species-area curve and performance curve were made out. The minimum number of plots where a given percentage increase in number of plots produce in number of plots produced less than the same percentage increase in number of species was six. The minimum number of plots where a given percentage increase in number of plots produced less than the half of the percentage increase in number of plots was eleven. The minimum number of plots where the dominant species was distinguished from the subdominant species was five. The minimum numver of plots where the first subdominant species was distinguished from other subdominant species was ten. The diffrence of species diversity(H') between five or more plots and total thirty plots was less than 0.05. Similarity index was more than 70% between five or more plots and total thirty plots, and more than 80% between ten or more plots and total thirty plots. The conclusion is that the adequate number of 5m*5m plots for the shrub stratum was about 5 in general case and about 10 in case of requiring more accuracy.

  • PDF

A Study on Improving the Reliability of DSRC Traffic Information Considering Traffic and Road Characteristics - Focusing on Busan Urban Expressway - (교통 및 도로특성을 고려한 DSRC 교통정보 신뢰성 향상에 관한 연구)

  • Jeong, Yeon Tak;Jung, Hun Young
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.34 no.5
    • /
    • pp.1535-1545
    • /
    • 2014
  • This study aims at improving the Reliability of DSRC Traffic information considering Traffic and Road Characteristics. First of all, this study analyzed the characteristics of DSRC data on urban expressway and problems of outlier data occurrence. After then, this study produced reliable traffic information by using an optimal method of the Outlier-Filtering. After Outlier-Filtering, this study performed accuracy evaluation and appropriateness check for the number of samples per confidence level. As a result, it showed that the MAPE was between 2.2% and 9.7% and RSME was between 2.2 and 7.5 which are very similar figures to the actual average traffic speed. Also, The samples of both Am peak and Pm peak periods were analyzed to be appropriate at the confidence level of 95%, and 90% within the allowable error range of 5kph.

Forest Thematic Maps and Forest Statistics Using the k-Nearest Neighbor Technique for Pyeongchang-Gun, Gangwon-Do (kNN 기법을 이용한 강원도 평창군의 산림 주제도 작성과 산림통계량 추정)

  • Yim, Jong-Su;Kong, Gee Su;Kim, Sung Ho;Shin, Man Yong
    • Journal of Korean Society of Forest Science
    • /
    • v.96 no.3
    • /
    • pp.259-268
    • /
    • 2007
  • This study was conducted to produce forest thematic maps and estimate forest statistics for Pyeongchang Gun using the kNN technique, which has been applied to produce thematic maps of variables of interest including unobserved plots by combining field plot data, remotely sensed data and other digital map data in forest inventories. The estimation errors for three horizontal reference areas (HRAs), whose radii are 20, 40 and 60 km respectively, were compared. Although the precision for the 40 km radius was lower compared to that for the 60 km radius, the 40 km radius was found to be an efficient HRA because their difference in precision was modest. At a value of k=5 nearest neighbors for the selected HRA, the overall accuracy was high. As a result, using the k=5 neighbors within the HRA of 40 km radius, thematic maps of number of trees, basal area, and growing stock per hectare were generated. As compared to the forest statistics based on field sample plots, the estimated means of each parameter from the produced maps were underestimated.

Analysis of internet addiction in Korean adolescents using sparse partial least-squares regression (희소 부분 최소 제곱법을 이용한 우리나라 청소년 인터넷 중독 자료 분석)

  • Han, Jeongseop;Park, Soobin;Lee, onghwan
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.2
    • /
    • pp.253-263
    • /
    • 2018
  • Internet addiction in adolescents is an important social issue. In this study, sparse partial least-squares regression (SPLS) was applied to internet addiction data in Korean adolescent samples. The internet addiction score and various clinical and psychopathological features were collected and analyzed from self-reported questionnaires. We considered three PLS methods and compared the performance in terms of prediction and sparsity. We found that the SPLS method with the hierarchical likelihood penalty was the best; in addition, two aggression features, AQ and BSAS, are important to discriminate and explain latent features of the SPLS model.

A Parameter Estimation Method using Nonlinear Least Squares (비선형 최소제곱법을 이용한 모수추정 방법론)

  • Oh, Suna;Song, Jongwoo
    • The Korean Journal of Applied Statistics
    • /
    • v.26 no.3
    • /
    • pp.431-440
    • /
    • 2013
  • We consider the problem of estimating the parameters of heavy tailed distributions. In general, maximum likelihood estimation(MLE) is the most preferred method of parameter estimation because it has good properties such as asymptotic consistency, normality and efficiency. However, MLE is not always the best solution because MLE is unstable or does not exist in some cases. This paper proposes another parameter estimation method, non-linear least squares(NLS) and compares its performance to MLE. The NLS estimator is achieved by minimizing sum of squared difference between empirical cumulative distribution function(CDF) and a theoretical distribution function. In this article, we compare the NLS method to MLE using simulated data from heavy tailed distributions. The NLS method is shown to perform better than MLE in Burr distribution when the sample size is small; in addition, it performs well in a Frechet distribution.

THE Multiensemble Sampling Method (다중앙상블 표본추출 방법)

  • Han, Kyu-Kwang
    • The Journal of Natural Sciences
    • /
    • v.18 no.1
    • /
    • pp.1-8
    • /
    • 2007
  • An efficient sampling method of computer simulation is reviewed. Using the method, several thermodynamic states can be investigated at a time in a single simulation. It is due to the ability of the method to explore the relevant parts of configuration space equally for every state being investigated. The method can be used in simulations of complex systems such as biopolymers which are still greatly hampered by the multi-minima problem. In this article I present a brief theoretical review of the method and illustrate how to realize it in the simulations.

  • PDF