• 제목/요약/키워드: Sample Sizes

검색결과 683건 처리시간 0.022초

Effective Sample Sizes for the Test of Mean Differences Based on Homogeneity Test

  • Heo, Sunyeong
    • 통합자연과학논문집
    • /
    • 제12권3호
    • /
    • pp.91-99
    • /
    • 2019
  • Many researchers in various study fields use the two sample t-test to confirm their treatment effects. The two sample t-test is generally used for small samples, and assumes that two independent random samples are selected from normal populations, and the population variances are unknown. Researchers often conduct F-test, the test of equality of variances, before testing the treatment effects, and the test statistic or confidence interval for the two sample t-test has two formats according to whether the variances are equal or not. Researchers using the two sample t-test often want to know how large sample sizes they need to get reliable test results. This research gives some guidelines for sample sizes to them through simulation works. The simulation had run for normal populations with the different ratios of two variances for different sample sizes (${\leq}30$). The simulation results are as follows. First, if one has no idea equality of variances but he/she can assume the difference is moderate, it is safe to use sample size at least 20 in terms of the nominal level of significance. Second, the power of F-test for the equality of variances is very low when the sample sizes are small (<30) even though the ratio of two variances is equal to 2. Third, the sample sizes at least 10 for the two sample t-test are recommendable in terms of the nominal level of significance and the error limit.

Sample Size Determination and Evaluation of Form Errors

  • Chang, Sung Ho;Kim, Sunn Ho
    • 품질경영학회지
    • /
    • 제22권3호
    • /
    • pp.85-98
    • /
    • 1994
  • In current coordinate measuring machine practice, there are no commonly accepted sample sizes for estimating form errors which have a statistical confidence. Practically, sample size planning is important for the geometrical tolerance inspection using a coordinate measuring machine. We determine and validate appropriate sample sizes for form error estimation. Also, we develop form error estimation methods with certain confidence levels based on the obtained sample sizes in various form errors: straightness, flatness, circularity, and cylindericity.

  • PDF

An elaboration on sample size determination for correlations based on effect sizes and confidence interval width: a guide for researchers

  • Mohamad Adam Bujang
    • Restorative Dentistry and Endodontics
    • /
    • 제49권2호
    • /
    • pp.21.1-21.8
    • /
    • 2024
  • Objectives: This paper aims to serve as a useful guide for sample size determination for various correlation analyses that are based on effect sizes and confidence interval width. Materials and Methods: Sample size determinations are calculated for Pearson's correlation, Spearman's rank correlation, and Kendall's Tau-b correlation. Examples of sample size statements and their justification are also included. Results: Using the same effect sizes, there are differences between the sample size determination of the 3 statistical tests. Based on an empirical calculation, a minimum sample size of 149 is usually adequate for performing both parametric and non-parametric correlation analysis to determine at least a moderate to an excellent degree of correlation with acceptable confidence interval width. Conclusions: Determining data assumption(s) is one of the challenges to offering a valid technique to estimate the required sample size for correlation analyses. Sample size tables are provided and these will help researchers to estimate a minimum sample size requirement based on correlation analyses.

Approximate Confidence Limits for the Ratio of Two Binomial Variates with Unequal Sample Sizes

  • Cho, Hokwon
    • Communications for Statistical Applications and Methods
    • /
    • 제20권5호
    • /
    • pp.347-356
    • /
    • 2013
  • We propose a sequential method to construct approximate confidence limits for the ratio of two independent sequences of binomial variates with unequal sample sizes. Due to the nonexistence of an unbiased estimator for the ratio, we develop the procedure based on a modified maximum likelihood estimator (MLE). We generalize the results of Cho and Govindarajulu (2008) by defining the sample-ratio when sample sizes are not equal. In addition, we investigate the large-sample properties of the proposed estimator and its finite sample behavior through numerical studies, and we make comparisons from the sample information view points.

이변량 효능과 안전성 이항변수의 표본수 결정방법 (Determination of Sample Sizes of Bivariate Efficacy and Safety Outcomes)

  • 이현학;송혜향
    • 응용통계연구
    • /
    • 제22권2호
    • /
    • pp.341-353
    • /
    • 2009
  • 두 군의 처리를 비교하는 임상시험에서 효능(efficacy)과 안전성(safety)이 동일하게 중요한 변수로 취급되는 경우에 이변량(bivariate) 반응변수로서 분석되고 연구계획의 단계에서도 이변량 표본수 결정방법이 사용되어야 한다. Thall과 Cheng (1999)은 효능과 안전성의 반응값이 이변량 이항(bivariate binary) 변수인 경우의 표본수 결정방법을 제시하였으며, 본 연구에서는 목표모수 설정과정은 기존의 연구와 같으나 월콕슨-만-휘트니(Wilcoxon-Mann-Whitney: WMW) 통계량에 근거한 검정법과 표본수 결정방법을 제시한다. Thall과 Cheng (1999)의 검정통계량은 변수 변환시킨 비율의 근사 정규성에 근거하는 반면에, WMW 통계량은 확률에 근거한 비모수적 방법으로 이변량 이항변수 뿐만 아니라 이변량 순위변수로 측정된 반응값에도 적용시킬 수 있다 Thall과 Cheng (1999)에 제시한 항암치료 임상연구의 두 예제에 위의 두 다른 방법으로 계산된 표본수를 비교한 결과, Thall과 Cheng (1999)의 첫째 예제에서는 이변량 WMW 방법에 의한 표본수가 더욱 작았으나 둘째 예제에서는 더욱 큰 것으로 나타났다.

고차원 자료의 재현성과 표본 수 (Reproducibility and Sample Size in High-Dimensional Data)

  • 서원석;최지아;정형철;조형준
    • 응용통계연구
    • /
    • 제23권6호
    • /
    • pp.1067-1080
    • /
    • 2010
  • 임상시험을 위한 표본 수 산정방법에 대해 지금까지 많은 방법이 개발되었고 현재 국내외 임상시험 기관에서 이 방법들을 토대로 표본 수를 산정하고 있다. 하지만 마이크로어레이칩 을 이용한 실험에 필요한 표본 수 산정에 대한 연구는 아직 미비하여 제대로 이용되지 않고 있다. 본 연구의 목적은 마이크로어레이 실험에 필요한 표본 수를 산정하는 데 있어 실제 마이크로어레이 자료의 재현성에 대한 정보를 이용하여 그 지침을 제공하는데 있다. 재현성 비교에서는 5가지 검정방법 즉, Fold change, Two-sample t-test, Wilcoxon rank-sum test, SAM, LPE 방법 별로 재현성을 측정하였다. 발현 값의 표준화 방법에 있어서는 MAS5, RMA 두 가지로 세분화 하였으며 반복수에 따라 상위 20개 또는 100개 유전자에 대한 일치성도 측정하였다. 또한, 표본수를 산정하는데 있어 기존에 제시한 방법에 현실적인 정보를 이용하여 좀 더 세분화하여 실험에 필요한 표본수를 산정해 보았다.

와이블분포 하에서 베이지안 기법과 전통적 기법 간의 신뢰도 추정 정확도 비교 (A Comparison of the Reliability Estimation Accuracy between Bayesian Methods and Classical Methods Based on Weibull Distribution)

  • 조형준;임준형;김용수
    • 대한산업공학회지
    • /
    • 제42권4호
    • /
    • pp.256-262
    • /
    • 2016
  • The Weibull is widely used in reliability analysis, and several studies have attempted to improve estimation of the distribution's parameters. least squares estimation (LSE) or Maximum likelihood estimation (MLE) are often used to estimate distribution parameters. However, it has been proven that Bayesian methods are more suitable for small sample sizes than LSE and MLE. In this work, the Weibull parameter estimation accuracy of LSE, MLE, and Bayesian method are compared for sample sets with 3 to 30 data points. The Bayesian method was most accurate for sample sizes under 25, and the accuracy of the Bayesian method was similar to LSE and MLE as the sample size increased.

시뮬레이션 최적화 문제 해결을 위한 이산 입자 군집 최적화에서 샘플수와 개체수의 효과 (The Effect of Sample and Particle Sizes in Discrete Particle Swarm Optimization for Simulation-based Optimization Problems)

  • 임동순
    • 산업경영시스템학회지
    • /
    • 제40권1호
    • /
    • pp.95-104
    • /
    • 2017
  • This paper deals with solution methods for discrete and multi-valued optimization problems. The objective function of the problem incorporates noise effects generated in case that fitness evaluation is accomplished by computer based experiments such as Monte Carlo simulation or discrete event simulation. Meta heuristics including Genetic Algorithm (GA) and Discrete Particle Swarm Optimization (DPSO) can be used to solve these simulation based multi-valued optimization problems. In applying these population based meta heuristics to simulation based optimization problem, samples size to estimate the expected fitness value of a solution and population (particle) size in a generation (step) should be carefully determined to obtain reliable solutions. Under realistic environment with restriction on available computation time, there exists trade-off between these values. In this paper, the effects of sample and population sizes are analyzed under well-known multi-modal and multi-dimensional test functions with randomly generated noise effects. From the experimental results, it is shown that the performance of DPSO is superior to that of GA. While appropriate determination of population sizes is more important than sample size in GA, appropriate determination of sample size is more important than particle size in DPSO. Especially in DPSO, the solution quality under increasing sample sizes with steps is inferior to constant or decreasing sample sizes with steps. Furthermore, the performance of DPSO is improved when OCBA (Optimal Computing Budget Allocation) is incorporated in selecting the best particle in each step. In applying OCBA in DPSO, smaller value of incremental sample size is preferred to obtain better solutions.

Effect of Positively Skewed Distribution on the Two sample t-test: Based on Chi-square Distribution

  • Heo, Sunyeong
    • 통합자연과학논문집
    • /
    • 제14권3호
    • /
    • pp.123-129
    • /
    • 2021
  • This research examines the effect of positively skewed population distribution on the two sample t-test through simulation. For simulation work, two independent samples were selected from the same chi-square distributions with 3, 5, 10, 15, 20, 30 degrees of freedom and sample sizes 3, 5, 10, 15, 20, 30, respectively. Chi-square distribution is largely skewed to the right at small degrees of freedom and getting symmetric as the degrees of freedom increase. Simulation results show that the sampled populations are distributed positively skewed like chi-square distribution with small degrees of freedom, the F-test for the equality of variances shows poor performances even at the relatively large degrees of freedom and sample sizes like 30 for both, and so it is recommended to avoid using F-test. When two population variances are equal, the skewness of population distribution does not affect on the t-test in terms of the confidence level. However even though for the highly positively skewed distribution and small sample sizes like three or five the t-test achieved the nominal confidence level, the error limits are very large at small sample size. Therefore, if the sampled population is expected to be highly skewed to the right, it will be recommended to use relatively large sample size, at least 20.

범주형 반복측정자료를 위한 일반화 추정방정식의 소표본 특성 (Small Sample Characteristics of Generalized Estimating Equations for Categorical Repeated Measurements)

  • 김동욱;김재직
    • 응용통계연구
    • /
    • 제15권2호
    • /
    • pp.297-310
    • /
    • 2002
  • Liang과 Zeger는 이산형 혹은 연속형 반복측정자료를 분석하기 위한 일반화 추정방정식 (GEE)을 제안하였다 GEE모형은 범주형 반복측정자료의 모형으로 확장될 수 있으며, 이 GEE추정량은 대표본인 경우 다변량 정규분포를 따른다. 그러나 GEE는 대표본근사이론에 기초한다. 본 논문에서는 소표본인 경우 반복 측정된 순서자료에 대한 GEE추정량의 성질을 연구한다. 우리는 두가지 방법을 사용하여 두그룹의 반복 측정된 순서자료를 생성하며 모의실험을 통하여 소표본인 경우 여러 개 범주를 갖는 순서반응 자료에 대하여 GEE추정량의 1종 오류율, 검정력, 상대효율, 두 그룹의 표본크기가 다를 경우 효과, 그리고 분산 추정량의 성질등을 연구한다.