• Title/Summary/Keyword: sample of random size

Search Result 201, Processing Time 0.02 seconds

Saddlepoint approximations for the ratio of two independent sequences of random variables

  • Cho, Dae-Hyeon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.9 no.2
    • /
    • pp.255-262
    • /
    • 1998
  • In this paper, we study the saddlepoint approximations for the ratio of independent random variables. In Section 2, we derive the saddlepoint approximation to the probability density function. In Section 3, we represent a numerical example which shows that the errors are small even for small sample size.

  • PDF

The Convergence Characteristics of The Time- Averaged Distortion in Vector Quantization: Part I. Theory Based on The Law of Large Numbers (벡터 양자화에서 시간 평균 왜곡치의 수렴 특성 I. 대수 법칙에 근거한 이론)

  • 김동식
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.33B no.7
    • /
    • pp.107-115
    • /
    • 1996
  • The average distortio of the vector quantizer is calcualted using a probability function F of the input source for a given codebook. But, since the input source is unknown in geneal, using the sample vectors that is realized from a random vector having probability function F, a time-average opeation is employed so as to obtain an approximation of the average distortion. In this case the size of the smple set should be large so that the sample vectors represent true F reliably. The theoretical inspection about the approximation, however, is not perfomed rigorously. Thus one might use the time-average distortion without any verification of the approximation. In this paper, the convergence characteristics of the time-average distortions are theoretically investigated when the size of sample vectors or the size of codebook gets large. It has been revealed that if codebook size is large enough, then small sample set is enough to obtain the average distortion by approximatio of the calculated tiem-averaged distortion. Experimental results on synthetic data, which are supporting the analysis, are also provided and discussed.

  • PDF

Two-stage Sampling for Estimation of Prevalence of Bovine Tuberculosis (이단계표본추출을 이용한 소결핵병 유병률 추정)

  • Pak, Son-Il
    • Journal of Veterinary Clinics
    • /
    • v.28 no.4
    • /
    • pp.422-426
    • /
    • 2011
  • For a national survey in which wide geographic region or an entire country is targeted, multi-stage sampling approach is widely used to overcome the problem of simple random sampling, to consider both herd- and animallevel factors associated with disease occurrence, and to adjust clustering effect of disease in the population in the calculation of sample size. The aim of this study was to establish sample size for estimating bovine tuberculosis (TB) in Korea using stratified two-stage sampling design. The sample size was determined by taking into account the possible clustering of TB-infected animals on individual herds to increase the reliability of survey results. In this study, the country was stratified into nine provinces (administrative unit) and herd, the primary sampling unit, was considered as a cluster. For all analyses, design effect of 2, between-cluster prevalence of 50% to yield maximum sample size, and mean herd size of 65 were assumed due to lack of information available. Using a two-stage sampling scheme, the number of cattle sampled per herd was 65 cattle, regardless of confidence level, prevalence, and mean herd size examined. Number of clusters to be sampled at a 95% level of confidence was estimated to be 296, 74, 33, 19, 12, and 9 for desired precision of 0.01, 0.02, 0.03, 0.04, 0.05, and 0.06, respectively. Therefore, the total sample size with a 95% confidence level was 172,872, 43,218, 19,224, 10,818, 6,930, and 4,806 for desired precision ranging from 0.01 to 0.06. The sample size was increased with desired precision and design effect. In a situation where the number of cattle sampled per herd is fixed ranging from 5 to 40 with a 5-head interval, total sample size with a 95% confidence level was estimated to be 6,480, 10,080, 13,770, 17,280, 20.925, 24,570, 28,350, and 31,680, respectively. The percent increase in total sample size resulting from the use of intra-cluster correlation coefficient of 0.3 was 22.2, 32.1, 36.3, 39.6, 41.9, 42.9, 42,2, and 44.3%, respectively in comparison to the use of coefficient of 0.2.

Confidence Interval for the Variance Component in a Unbalanced One-way Random Effects Model

  • Song, Gyu-Moon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.13 no.2
    • /
    • pp.329-340
    • /
    • 2002
  • Two methods are proposed for constructing a confidence interval on the among group variance component in a unbalanced one-way random effects model. Computer simulation is used to compare these methods with alternative procedures. The results indicate that the method1 and methods2 perform well over small group size and large sample size respectively.

  • PDF

On Estimating the Distributional Parameter and the Complete Sample Size from Incomplete Samples

  • Yeo, Sung-chil
    • Journal of the Korean Statistical Society
    • /
    • v.20 no.2
    • /
    • pp.118-138
    • /
    • 1991
  • Given a random sample of size N(unknown) with density f(x $\theta$), suppose that only n observations which lie outside a region R are recorded. On the basis of n observations, the Bayes estimators of $\theta$ and N are considered and their asymptotic expansions are developed to compare their second order asymptotic properties with those of the maximum likelihood estimators and the Bayes modal estimators. Corrections to bias and median bias of these estimators are made. An example is given to illustrate the results obtained.

  • PDF

Nonparametric Test for Ordered Alternatives on Multiple Ranked-Set Samples

  • Kim, Dong HeeKim,;Hyung Gee;Park, Hae Kyung
    • Communications for Statistical Applications and Methods
    • /
    • v.7 no.2
    • /
    • pp.563-573
    • /
    • 2000
  • In this thesis, we propose the test statistic for ordered alternatives on c-sample ranked set samples(RSS). The proposed test statistic JRSS is Jonckheere type statistic using the median of the i-th samples in each cycle. We obtained the asymptotic property of the proposed test statistic and the asymptotic relative efficiencies of the proposed test statistic with respect to J SRS which Jonckheere type statistic on simple random samples(SRS). From the simulation works, J RSS is superior to J SRS. We compared the empirical powers of J RSS with respect to U RSS on ranked set sample and U SRS on simple random sample using all samples, which are proposed by Kim, Kim and Lee(1999). The powers of J RSS are nearly the same values when entire sample size is large. J RSS is superior to U RSS. J RSS is simpler than U RSSon calculating process.

  • PDF

Empirical Bayes Problem With Random Sample Size Components

  • Jung, Inha
    • Journal of the Korean Statistical Society
    • /
    • v.20 no.1
    • /
    • pp.67-76
    • /
    • 1991
  • The empirical Bayes version involves ″independent″ repetitions(a sequence) of the component decision problem. With the varying sample size possible, these are not identical components. However, we impose the usual assumption that the parameters sequence $\theta$=($\theta$$_1$, $\theta$$_2$, …) consists of independent G-distributed parameters where G is unknown. We assume that G $\in$ g, a known family of distributions. The sample size $N_i$ and the decisin rule $d_i$ for component i of the sequence are determined in an evolutionary way. The sample size $N_1$ and the decision rule $d_1$$\in$ $D_{N1}$ used in the first component are fixed and chosen in advance. The sample size $N_2$and the decision rule $d_2$ are functions of *see full text($\underline{X}^1$equation omitted), the observations in the first component. In general, $N_i$ is an integer-valued function of *see full text(equation omitted) and, given $N_i$, $d_i$ is a $D_{Ni}$/-valued function of *see full text(equation omitted). The action chosen in the i-th component is *(equation omitted) which hides the display of dependence on *(equation omitted). We construct an empirical Bayes decision rule for estimating normal mean and show that it is asymptotically optimal.

  • PDF

Sample size using response rate on repeated surveys (계속조사에서 응답률을 반영한 표본크기)

  • Park, Hyeonah;Na, Seongryong
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.5
    • /
    • pp.587-597
    • /
    • 2018
  • Procedures, such as sampling technique, survey method, and questionnaire preparation, are required in order to obtain sample data in accordance with the purpose of a survey. An important procedure is the decision of the sample size formula. The sample size formula is determined by setting the target error and total cost according to the sampling method. In this paper, we propose a sample size formula using population changes over time, estimation error of the previous time and response rate of past data when the target error and the expected response rate are given in the simple random sampling. In actual research, we use estimators that apply complex weights in addition to design-based weights. Therefore, we induce a sample size formula for estimators using design-based weights and nonresponse adjustment coefficients, that can be a formula that reflects differences in response rates when survey methods are changed over time. In addition, we use simulations to compare the proposed formula with the existing sample size formula.

How Should We Randomly Sample Marine Fish Landed at Korea Ports to Represent a Length Frequency Distribution of Those Fish? (한국 연근해 어업에서 수집되는 어류 개체군 체장자료의 표집(sampling) 방법 제안)

  • Park, Min Gyou;Hyun, Saang-Yoon
    • Korean Journal of Fisheries and Aquatic Sciences
    • /
    • v.54 no.1
    • /
    • pp.80-89
    • /
    • 2021
  • In Korea, marine fish landed at ports are randomly sampled on a periodic basis (e.g., daily or weekly), and body sizes (e.g., lengths and weights) of those sampled fish are measured. The motivation for our study is whether or not such measurements reflect the size distribution, especially the length distribution of fish landed (= a population), because such length measurements are key data for a length-based assessment model. The current sampling method is to sample fish landed at ports by body size group (e.g., very small, small, medium, large, very large), using the sampling weights as the number of boxes by body size group. In this study, we showed that length composition data about fish sampled by the current method did not represent the length frequency distribution of the fish landed, and suggested that an alternative sampling method should be applied of using the sampling weights as the number of fish landed by body size group. We also introduced a method for determining an appropriate sample size.