• Title/Summary/Keyword: Under-Sampling

Understanding Complex Design Features via Design Effect Models (설계효과모형을 통한 설계요소의 유용성 이해)

  • Park, Inho
    • The Korean Journal of Applied Statistics / v.28 no.6 / pp.1217-1225 / 2015
  • In survey research, data are commonly collected through a sample design with complex design features. The relative efficiency of such a design, in terms of the precision of an estimator, can be measured by the design effect, which takes simple random sampling as the reference design. This concept is most useful when the design effect can be expressed as a function of the various design features. We propose a design effect formula suited to stratified multistage sampling by generalizing the approaches of Gabler et al. (1999, 2006) for multistage sampling. It can either guide improvements in design efficiency at the design stage or enable evaluation of the adopted design features afterwards.
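
The design effect mentioned in this abstract is the ratio of an estimator's variance under the actual design to its variance under simple random sampling. As a rough illustration only, the sketch below uses Kish's classic approximation for equal-sized clusters, deff = 1 + (b - 1) * rho. This is not the stratified multistage formula proposed in the paper; the function names and the example numbers are assumptions chosen for illustration.

```python
def kish_design_effect(mean_cluster_size: float, icc: float) -> float:
    """Kish's approximation deff = 1 + (b - 1) * rho.

    mean_cluster_size (b) and icc (rho, the intracluster correlation)
    are illustrative inputs, not values from the paper.
    """
    return 1.0 + (mean_cluster_size - 1.0) * icc


def effective_sample_size(n: int, deff: float) -> float:
    """Size of a simple random sample with the same precision."""
    return n / deff


if __name__ == "__main__":
    deff = kish_design_effect(mean_cluster_size=20, icc=0.05)
    print("design effect:", deff)
    print("effective sample size for n=2000:", effective_sample_size(2000, deff))
```

A design effect above 1 means the complex design is less precise than a simple random sample of the same size, so the effective sample size shrinks accordingly.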

Comparison of Nano Particle Size Distributions by Different Measurement Techniques

  • Bae, Min-Suk;Oh, Joon-Seok
    • Journal of Korean Society for Atmospheric Environment / v.26 no.2 / pp.219-233 / 2010
  • Nano-sized particles are of great interest because of their chemical and physical characteristics, such as composition, size distribution, and number concentration. Accurate measurement of the size distributions and number concentrations of ultrafine particles is therefore increasingly required, yet expected losses (such as diffusion) within the instrument system between the ambient inlet and the detector pose a significant challenge. In this study, we examine data using computed settling, impaction, and diffusion losses for the sampling lines (exploring different line diameters, horizontal lengths, numbers of bends, line angles, and flow rates with and without a bypass) as well as diffusion losses for the Scanning Mobility Particle Sizers. As expected, settling and impaction losses are very minor below 100 nm; however, diffusion loss corrections for the sampling lines and the sizing instrument make a large difference under any measurement condition with high numbers of particles of small mobility size. Size distributions and number concentrations are described both with and without the loss corrections. First, 80% or more of the smallest particles (less than 10 nm) can be lost at a flow rate of 0.3 liters per minute and a sampling line length of 1.0 m. Second, the total number concentrations are significantly affected, and the mode structure of the size distribution changes dramatically after the loss corrections are applied. Comparison across the different measurements shows that diffusion loss correction is a statistically necessary step in determining ambient particle concentrations. As an implication of the current study, the possibility of establishing direct revelation mechanisms is suggested.

A Bayesian Sampling Algorithm for Evolving Random Hypergraph Models Representing Higher-Order Correlations (고차상관관계를 표현하는 랜덤 하이퍼그래프 모델 진화를 위한 베이지안 샘플링 알고리즘)

  • Lee, Si-Eun;Lee, In-Hee;Zhang, Byoung-Tak
    • Journal of KIISE:Software and Applications / v.36 no.3 / pp.208-216 / 2009
  • A number of estimation of distribution algorithms have been proposed that do not explicitly use the crossover and mutation operators of traditional genetic algorithms but instead estimate the distribution of the population for more efficient search. However, because higher-order correlations among variables are not easy to discover, in most cases only lower-order correlations are estimated under various constraints. In this paper, we propose a new estimation of distribution algorithm that represents higher-order correlations in the data and finds the global optimum more efficiently. The proposed algorithm represents the higher-order correlations among variables by building a random hypergraph model composed of hyperedges consisting of variables that are expected to be correlated, and generates the next population by a Bayesian sampling algorithm. Experimental results show that the proposed algorithm can find the global optimum and outperforms the simple genetic algorithm and BOA (Bayesian Optimization Algorithm) on decomposable functions with deceptive building blocks.
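
To make the general idea of an estimation of distribution algorithm concrete, here is a minimal sketch of the simplest variant, which models each bit independently (a univariate marginal model) and is run on the OneMax toy problem. It deliberately ignores correlations between variables, so it is not the hypergraph-based algorithm of the paper. The function name, parameter values, and probability clamping are all assumptions made for illustration.

```python
import random


def umda_onemax(n_bits=20, pop_size=100, n_select=50, generations=30, seed=0):
    """Univariate EDA: sample a population, select, re-estimate bit marginals."""
    rng = random.Random(seed)
    probs = [0.5] * n_bits  # initial model: every bit is a fair coin
    best = None
    for _ in range(generations):
        # Sample a new population from the current probability model.
        population = [
            [1 if rng.random() < p else 0 for p in probs]
            for _ in range(pop_size)
        ]
        # OneMax fitness is the number of ones; keep the fittest individuals.
        population.sort(key=sum, reverse=True)
        selected = population[:n_select]
        if best is None or sum(population[0]) > sum(best):
            best = population[0]
        # Re-estimate each marginal from the selected individuals,
        # clamped (an illustrative choice) to keep some diversity.
        probs = [
            min(0.95, max(0.05, sum(ind[i] for ind in selected) / n_select))
            for i in range(n_bits)
        ]
    return best


if __name__ == "__main__":
    solution = umda_onemax()
    print("best fitness found:", sum(solution))
```

No crossover or mutation operator appears anywhere: the search is driven entirely by sampling from, and re-fitting, the probability model, which is the property the abstract describes.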

Double Demodulation of a Ring Laser Dither Signal for Reducing the Dynamic Error of an Inertial Navigation System (관성항법장치의 동적오차 개선을 위한 링레이저 각진동 신호의 이중 복조방법)

  • Shim, Kyu-Min
    • Journal of the Korean Society for Aeronautical & Space Sciences / v.42 no.1 / pp.82-89 / 2014
  • This paper discusses methods for reducing the sampling time quantization error of a body-dither-type ring laser gyroscope. A ring laser gyroscope has an angle quantization error, generated by the frequency counting method applied to the laser beat signal, and a sampling time quantization error, generated by the demodulation method used to eliminate body dithering, in which the sampling periods are fitted to the dither periods. Generally, because the dither periods are longer than the calculation periods of the inertial navigation system, navigation errors arise from attitude updates being missed for long intervals while the vehicle undergoes highly dynamic motion. In this paper, a double demodulation method is proposed for reducing the sampling time quantization error, and its effects under dynamic conditions are confirmed by simulation.

Sampling efficiencies of the activated carbon fiber and 3M diffusive samplers for organic vapors (공기중 유기용제 측정을 위한 활성탄섬유 확산포집기와 3M 확산포집기의 시료포집효율에 대한 연구)

  • Byeon, Sang-Hoon;Oh, Se-Min;Lee, Chang-Ha
    • Journal of Korean Society of Occupational and Environmental Hygiene / v.7 no.1 / pp.21-31 / 1997
  • Activated carbon fiber (ACF) and 3M (Model 3500) diffusive samplers were exposed to toluene, n-hexane, and methyl ethyl ketone (MEK) under low and high humidity levels. To evaluate the two samplers, their sampling capacity, sampling rate, reverse diffusion, and storage stability were determined. At the low humidity level ($8 \pm 3\%$ RH), the amount of each of the three organic vapors adsorbed by the ACF diffusive sampler showed a positive linear relationship with time up to 8 hours. At the high humidity level ($90 \pm 5\%$ RH), however, n-hexane and MEK maintained a positive linear relationship only up to 1.5 hours, after which the adsorbed amounts decreased. In contrast, the amounts of n-hexane, MEK, and toluene adsorbed by the 3M diffusive sampler showed an almost linear positive relationship up to 8 hours at both humidity levels. At the low humidity level, there was almost no reverse diffusion for either the 3M or the ACF diffusive sampler. However, when the ACF diffusive sampler was used at the high humidity level, about 52.63% of the MEK sample and about 92.59% of the n-hexane sample were lost. The storage stabilities of the ACF and 3M diffusive samplers were both relatively good except for MEK. For MEK, the difference between analysis immediately after sampling and analysis after 3 weeks at room temperature was 45% for the ACF diffusive sampler and 18% for the 3M diffusive sampler. Since samples stored in a refrigerator were relatively stable, samples need to be refrigerated until the analysis is done.

An Additive Stratified Quantitative Attribute Randomized Response Model (층화 가법 양적속성 확률화응답모형)

  • Lee, Gi-Sung;Ahn, Seung-Chul;Hong, Ki-Hak;Son, Chang-Kyoon
    • The Korean Journal of Applied Statistics / v.27 no.2 / pp.239-247 / 2014
  • For a sensitive survey in which the population is composed of several strata with quantitative attributes, we present an additive stratified quantitative attribute randomized response model that applies stratified random sampling, instead of simple random sampling, to Himmelfarb-Edgell's additive quantitative attribute model and to Gjestvang-Singh's model. We also establish theoretical grounds for estimating the stratum means of the sensitive quantitative attribute as well as the overall mean. We deal with the proportional and optimal allocation problems in each suggested model and compare the relative efficiency of the two suggested models; the comparison shows that Himmelfarb-Edgell's model is more efficient than Gjestvang-Singh's model under stratified random sampling.
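
The additive randomized response idea can be sketched as follows: each respondent reports the sum of the true sensitive value and a random scrambling value whose mean is known, so the analyst can recover means without seeing any true value. The sketch below assumes this basic additive form with a single known scrambling mean and known stratum weights. The function names and numbers are hypothetical, and the allocation and efficiency results of the paper are not reproduced.

```python
from statistics import mean


def scrambled_report(true_value: float, scramble_draw: float) -> float:
    """What a respondent reports: true value plus a random scrambling value."""
    return true_value + scramble_draw


def estimate_stratum_mean(reports, scramble_mean: float) -> float:
    """Unbiased estimate of a stratum mean: mean report minus known scramble mean."""
    return mean(reports) - scramble_mean


def estimate_overall_mean(stratum_reports, stratum_weights, scramble_mean: float) -> float:
    """Weighted sum of stratum estimates; weights are population shares (assumed known)."""
    return sum(
        w * estimate_stratum_mean(reports, scramble_mean)
        for reports, w in zip(stratum_reports, stratum_weights)
    )


if __name__ == "__main__":
    # Illustrative reports from two strata, scrambling variable with mean 10.
    reports = [[12, 14, 16], [20, 22, 24]]
    print(estimate_overall_mean(reports, [0.5, 0.5], scramble_mean=10))
```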

Robust Bayesian Inference in Finite Population Sampling under Balanced Loss Function

  • Kim, Eunyoung;Kim, Dal Ho
    • Communications for Statistical Applications and Methods / v.21 no.3 / pp.261-274 / 2014
  • In this paper, we develop Bayes and empirical Bayes estimators of the finite population mean under the balanced loss function, assuming posterior linearity rather than normality of the superpopulation. We compare the performance of the optimal Bayes estimator with those of the classical sample mean and the usual Bayes estimator under squared error loss, with respect to posterior expected losses, risks, and Bayes risks, when the underlying distribution is normal as well as when it is binomial or Poisson.

Empirical Statistical Power for Testing Multilocus Genotypic Effects under Unbalanced Designs Using a Gibbs Sampler

  • Lee, Chae-Young
    • Asian-Australasian Journal of Animal Sciences / v.25 no.11 / pp.1511-1514 / 2012
  • Epistasis, which may explain a large portion of the phenotypic variation in complex economic traits of animals, has been ignored in many genetic association studies. A Bayesian method was introduced to draw inferences about multilocus genotypic effects based on their marginal posterior distributions obtained by a Gibbs sampler. A simulation study was conducted to estimate statistical power under various unbalanced designs using this method. Data were simulated under combinations of number of loci, within-genotype variance, and sample size, in unbalanced designs with or without null combined genotype cells. Mean empirical statistical power was estimated for testing the posterior mean estimate of the combined genotype effect. A practical example of obtaining empirical statistical power estimates for a given sample size under unbalanced designs is provided. The empirical statistical power estimates would be useful for determining an optimal design when interactive associations of multiple loci with complex phenotypes are examined.

Bootstrap Confidence Intervals for an Adjusted Survivor Function under the Dependent Censoring Model

  • Lee, Seung-Yeoun;Sok, Yong-U
    • Communications for Statistical Applications and Methods / v.8 no.1 / pp.127-135 / 2001
  • In this paper, we consider a simple method for testing the assumption of independent censoring on the basis of a Cox proportional hazards regression model with a time-dependent covariate. This method involves two-stage sampling in which a random subset of censored observations is selected and followed up until their true survival times are observed. Lee and Wolfe (1998) proposed an adjusted estimate of the survivor function for dependent censoring under a proportional hazards alternative. This paper extends their result to obtain a bootstrap confidence interval for the adjusted survivor function under dependent censoring. The proposed procedure is illustrated with an example from a lung cancer clinical trial analysed in Lee and Wolfe (1998).
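
As a generic illustration of how a bootstrap confidence interval is formed, the sketch below implements the percentile bootstrap for an arbitrary statistic, using the sample mean as a stand-in. It does not implement the adjusted survivor function or the two-stage sampling scheme of the paper; the function name, defaults, and data are assumptions.

```python
import random
from statistics import mean


def percentile_bootstrap_ci(data, statistic=mean, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval: resample with replacement, take quantiles."""
    rng = random.Random(seed)
    n = len(data)
    # Recompute the statistic on each resample and sort the results.
    estimates = sorted(
        statistic([data[rng.randrange(n)] for _ in range(n)])
        for _ in range(n_resamples)
    )
    lower_index = int((alpha / 2) * n_resamples)
    upper_index = min(n_resamples - 1, int((1 - alpha / 2) * n_resamples))
    return estimates[lower_index], estimates[upper_index]


if __name__ == "__main__":
    sample = list(range(1, 21))  # illustrative data only
    low, high = percentile_bootstrap_ci(sample)
    print("95% bootstrap interval for the mean:", low, high)
```

In a survival setting, the `statistic` argument would be replaced by a function that evaluates the survivor-function estimate at a chosen time point on each resample.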

Comparison of the Efficiencies of Variable Sampling Intervals Charts for Simultaneous Monitoring the means of multivariate Quality Variables

  • Chang, Duk-Joon
    • Journal of Integrative Natural Science / v.9 no.3 / pp.215-222 / 2016
  • When the linear correlations among quality variables are considerably high, multivariate control charts may be more effective than univariate charts, which monitor the quality variables and process parameters individually. We consider the performance and efficiency of multivariate control charts for a multivariate normal process. Numerical results are presented for small shifts in the process to show the efficiency gains of the EWMA and CUSUM charts, which use past quality information, over the Shewhart chart, which does not. The results indicate that the choice of the optimal smoothing constant in the EWMA structure, or of the reference value in the CUSUM structure, is very important regardless of whether the combine-accumulate or the accumulate-combine technique is adopted under the given process conditions.
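
The roles of the smoothing constant and the reference value can be seen in the standard univariate recursions, sketched below. This is a single-variable illustration with fixed sampling intervals, not the multivariate variable-sampling-interval charts compared in the paper; the function names, the zero starting values, and the example data are assumptions.

```python
def ewma_statistics(observations, smoothing, start=0.0):
    """EWMA recursion z_t = lambda * x_t + (1 - lambda) * z_{t-1}."""
    z = start
    path = []
    for x in observations:
        z = smoothing * x + (1.0 - smoothing) * z
        path.append(z)
    return path


def upper_cusum_statistics(observations, reference_value, target=0.0):
    """One-sided CUSUM recursion C_t = max(0, C_{t-1} + (x_t - target) - k)."""
    c = 0.0
    path = []
    for x in observations:
        c = max(0.0, c + (x - target) - reference_value)
        path.append(c)
    return path


if __name__ == "__main__":
    data = [0.2, -0.1, 0.9, 1.1, 1.3, 0.8]  # illustrative: small upward shift
    print("EWMA :", ewma_statistics(data, smoothing=0.2))
    print("CUSUM:", upper_cusum_statistics(data, reference_value=0.5))
```

Both statistics accumulate information from past samples, which is why they react to small sustained shifts faster than a chart that looks only at the current observation.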